Last modified: 2014-09-12 09:34:28 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T59813, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 57813 - Google Books > Internet Archive > Commons upload cycle
Google Books > Internet Archive > Commons upload cycle
Status: ASSIGNED
Product: MediaWiki extensions
Classification: Unclassified
Extensions requests (Other open bugs)
unspecified
All All
: Low enhancement with 3 votes (vote)
: ---
Assigned To: Rohit Dua
:
Depends on:
Blocks: Wikisource
  Show dependency treegraph
 
Reported: 2013-12-01 16:17 UTC by vladjohn2013
Modified: 2014-09-12 09:34 UTC (History)
9 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description vladjohn2013 2013-12-01 16:17:55 UTC
Google Books > Internet Archive > Commons upload cycle

Wikisources all around the world use heavily GB digitizations for transcription and proofreading. As GB provides just the PDF, the usual cycle is:

    go to Google Books and look for a book
    check if the book is already in IA
    if it's not, upload it there
    get the djvu from IA
    upload it on Commons
    use it on Wikisource

For point 4, we have this awesome tool: https://toolserver.org/~tpt/iaUploadBot/step1.php What we miss right now is a tool for point 2.1, that would serve many other users outside the Wikimedia movement too. Eventually, we could think of a bot/script which would do all the work altogether, notifying the user when their help is needed (eg metadata polishing, Commons categories, etc.) Mentors: Aubrey is available for "design" mentorship, paired with a technical expert. We can maybe ask help from a IA expert.

URL:https://www.mediawiki.org/wiki/Mentorship_programs/Possible_projects#Google_Books_.3E_Internet_Archive_.3E_Commons_upload_cycle
Comment 1 vladjohn2013 2013-12-01 16:18:13 UTC
This proposal has been listed at https://www.mediawiki.org/wiki/Mentorship_programs/Possible_projects and we are filing a report to gather community feedback and share updates.
Comment 2 Rohit Dua 2014-03-03 18:13:43 UTC
(In reply to vladjohn2013 from comment #0)
> Google Books > Internet Archive > Commons upload cycle
> 
> Wikisources all around the world use heavily GB digitizations for
> transcription and proofreading. As GB provides just the PDF, the usual cycle
> is:
> 
>     go to Google Books and look for a book
>     check if the book is already in IA
>     if it's not, upload it there
>     get the djvu from IA
>     upload it on Commons
>     use it on Wikisource
> 
> For point 4, we have this awesome tool:
> https://toolserver.org/~tpt/iaUploadBot/step1.php What we miss right now is
> a tool for point 2.1, that would serve many other users outside the
> Wikimedia movement too. Eventually, we could think of a bot/script which
> would do all the work altogether, notifying the user when their help is
> needed (eg metadata polishing, Commons categories, etc.) Mentors: Aubrey is
> available for "design" mentorship, paired with a technical expert. We can
> maybe ask help from a IA expert.
> 
> URL:https://www.mediawiki.org/wiki/Mentorship_programs/
> Possible_projects#Google_Books_.3E_Internet_Archive_.3E_Commons_upload_cycle


Hi

This is to inform that I am working on Bug 57813 - Google Books > Internet Archive > Commons upload cycle, via GSOC-2014 project.
I'm ready with with the outline of google-books download script.

--
Rohit Dua
8ohit.dua
New Delhi,India
Comment 3 Rohit Dua 2014-03-12 14:24:25 UTC
I have selected a mentor-ship project<https://www.mediawiki.org/wiki/Mentorship_programs/Possible_projects#Google_Books_.3E_Internet_Archive_.3E_Commons_upload_cycle> which corresponds to this Bug 57813.

I have proposed a rough proposal (after discussing the steps with mentors/community).

Link to proposal: https://www.mediawiki.org/wiki/User:8ohit.dua/GSoC_proposal_2014

It would be great if we could get the community's valuable suggestions/ feedback for the proposal/project.

Thanks a lot.
Rohit Dua
(8ohit.dua)
Delhi, India
Comment 4 Quim Gil 2014-03-13 14:44:16 UTC
Rohit, Paolo (is he following this report?), your proposals are still missing in Google Melange. Please submit them there as a draft linking to your wiki pages. In any case, we will evaluate your proposals in mediawiki.org. Thank you!
Comment 5 Yann Forget 2014-03-16 13:40:09 UTC
FYI, I accept to mentor Rohit Dua.
Comment 6 Quim Gil 2014-03-17 21:59:03 UTC
(In reply to Yann Forget from comment #5)
> FYI, I accept to mentor Rohit Dua.

Tyhank you! Instructions: https://www.mediawiki.org/wiki/Mentorship_programs/Possible_mentors
Comment 7 Quim Gil 2014-09-12 09:34:28 UTC
Rohit's GSoC project was marked as PASSED by his mentors, but the required final blog post is still missing and, in general, it is unclear what we have and what is missing to resolve this report as FIXED.

Please wrap up your project properly.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links