Last modified: 2014-07-30 01:22:47 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T49867, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 47867 - use mediawiki's HTML output
use mediawiki's HTML output
Status: RESOLVED FIXED
Product: MediaWiki extensions
Classification: Unclassified
Collection (Other open bugs)
unspecified
All All
: Low normal (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks: 47575
  Show dependency treegraph
 
Reported: 2013-04-30 11:09 UTC by Ralf Schmitt
Modified: 2014-07-30 01:22 UTC (History)
8 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Ralf Schmitt 2013-04-30 11:09:06 UTC
the collection extension should generate PDFs from mediawiki's HTML
output instead of using a custom parser and relying on mediawiki's
broken expand templates feature.
That would fix all bugs related to parsing mediawiki markup and expanding templates.
Comment 1 Brion Vibber 2013-04-30 17:25:56 UTC
I recommended this a few years ago, but we went with the second parser solution as it was already in development.

Using a headless WebKit browser to generate PDFs is fairly straightforward, but I'm not sure how best to handle combining multiple articles together etc. This wouldn't be a trivial project, but would be nice to investigate.
Comment 2 Gabriel Wicke 2013-08-22 17:04:44 UTC
Parsoid HTML with RDFa might be a better starting point for print-specific customization as it contains a lot of semantic information. See 
http://www.mediawiki.org/wiki/Parsoid/MediaWiki_DOM_spec.

Another potentially relevant project would be http://bookjs.net/, a JS library that prepares HTML content for printing in WebKit. It did not work in my testing and seems to be pretty cutting-edge, but there are probably ways to make it work as it is used by the booktype project.
Comment 3 Gabriel Wicke 2013-08-22 17:56:28 UTC
Update re book.js: The demo at http://bookjs.net/data/body.html does not work for me, but the demos in the git checkout work both in Chromium 28 and Chrome 29:

git clone https://github.com/sourcefabric/BookJS.git
Comment 4 Helder 2014-01-21 18:49:58 UTC
Is this fixed by the new PDF renderer?
* Re-implementing PDF support
http://lists.wikimedia.org/pipermail/wikitech-l/2013-November/073059.html
* Status update on new Collections PDF Renderer
http://lists.wikimedia.org/pipermail/wikitech-l/2013-November/073238.html
Comment 5 Gabriel Wicke 2014-02-03 19:18:47 UTC
(In reply to comment #4)
> Is this fixed by the new PDF renderer?

I'd say yes. I'll leave the pleasure of closing this bug to the PDF team though ;)
Comment 6 C. Scott Ananian 2014-02-06 04:49:40 UTC
The new PDF renderer isn't deployed yet; maybe we should wait until then?
Comment 7 Matt Walker 2014-07-30 01:22:47 UTC
With the 'public' release of the OCG renderer; I'm going to close this bug.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links