Last modified: 2014-07-30 01:22:47 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T49867, the corresponding Phabricator task for complete and up-to-date bug report information.

Bug 47867 - use mediawiki's HTML output


Summary:	use mediawiki's HTML output

Status:	RESOLVED FIXED

Product:	MediaWiki extensions
Classification:	Unclassified
Component:	Collection (Other open bugs)
Version:	unspecified
Hardware:	All All

Importance:	Low normal (vote)
Target Milestone:	---
Assigned To:	Nobody - You can work on this!

URL:
Whiteboard:
Keywords:

Depends on:
Blocks:	47575
	Show dependency tree / graph

Reported:	2013-04-30 11:09 UTC by Ralf Schmitt
Modified:	2014-07-30 01:22 UTC (History)
CC List:	8 users (show)

See Also:
Web browser:	---
Mobile Platform:	---
Assignee Huggle Beta Tester:	---

Attachments
Add an attachment (proposed patch, testcase, etc.)

Description Ralf Schmitt 2013-04-30 11:09:06 UTC

the collection extension should generate PDFs from mediawiki's HTML
output instead of using a custom parser and relying on mediawiki's
broken expand templates feature.
That would fix all bugs related to parsing mediawiki markup and expanding templates.

Comment 1 Brion Vibber 2013-04-30 17:25:56 UTC

I recommended this a few years ago, but we went with the second parser solution as it was already in development.

Using a headless WebKit browser to generate PDFs is fairly straightforward, but I'm not sure how best to handle combining multiple articles together etc. This wouldn't be a trivial project, but would be nice to investigate.

Comment 2 Gabriel Wicke 2013-08-22 17:04:44 UTC

Parsoid HTML with RDFa might be a better starting point for print-specific customization as it contains a lot of semantic information. See 
http://www.mediawiki.org/wiki/Parsoid/MediaWiki_DOM_spec.

Another potentially relevant project would be http://bookjs.net/, a JS library that prepares HTML content for printing in WebKit. It did not work in my testing and seems to be pretty cutting-edge, but there are probably ways to make it work as it is used by the booktype project.

Comment 3 Gabriel Wicke 2013-08-22 17:56:28 UTC

Update re book.js: The demo at http://bookjs.net/data/body.html does not work for me, but the demos in the git checkout work both in Chromium 28 and Chrome 29:

git clone https://github.com/sourcefabric/BookJS.git

Comment 4 Helder 2014-01-21 18:49:58 UTC

Is this fixed by the new PDF renderer?
* Re-implementing PDF support
http://lists.wikimedia.org/pipermail/wikitech-l/2013-November/073059.html
* Status update on new Collections PDF Renderer
http://lists.wikimedia.org/pipermail/wikitech-l/2013-November/073238.html

Comment 5 Gabriel Wicke 2014-02-03 19:18:47 UTC

(In reply to comment #4)
> Is this fixed by the new PDF renderer?

I'd say yes. I'll leave the pleasure of closing this bug to the PDF team though ;)

Comment 6 C. Scott Ananian 2014-02-06 04:49:40 UTC

The new PDF renderer isn't deployed yet; maybe we should wait until then?

Comment 7 Matt Walker 2014-07-30 01:22:47 UTC

With the 'public' release of the OCG renderer; I'm going to close this bug.

Wikimedia Bugzilla is closed!

Search

Personal tools

Navigation

Links