Last modified: 2012-08-23 22:33:51 UTC

Wikimedia Bugzilla is closed!

Wikimedia has migrated from Bugzilla to Phabricator. Bug reports should be created and updated in Wikimedia Phabricator instead. Please create an account in Phabricator and add your Bugzilla email address to it.
Wikimedia Bugzilla is read-only. If you try to edit or create any bug report in Bugzilla you will be shown an intentional error message.
In order to access the Phabricator task corresponding to a Bugzilla report, just remove "static-" from its URL.
You could still run searches in Bugzilla or access your list of votes but bug reports will obviously not be up-to-date in Bugzilla.
Bug 22392 - Revisions existing in both archive and revision tables
Revisions existing in both archive and revision tables
Status: NEW
Product: Wikimedia
Classification: Unclassified
General/Unknown (Other open bugs)
unspecified
All All
: Low trivial (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks: 16660 revdel
  Show dependency treegraph
 
Reported: 2010-02-05 09:49 UTC by OverlordQ
Modified: 2012-08-23 22:33 UTC (History)
6 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments
Example dupe (1.19 KB, text/plain)
2010-02-05 09:49 UTC, OverlordQ
Details

Description OverlordQ 2010-02-05 09:49:02 UTC
Created attachment 7076 [details]
Example dupe

When doing some investigating for FT2 for bug 18104, I took a dump of rev_id's from revision and ar_rev_id, sorted then for giggles ran uniq -d on it to find some duplicates. Out of my ~17M row sample (~16M from rev, ~550k from archive), I came across about 2000 instances where a given revision id was found in both rev_id and ar_rev_id.

From the example it seems either the row wasn't cleared from revision on delete, or it wasn't remove from archive on restore since the information is the same.
Comment 1 OverlordQ 2010-02-05 16:27:03 UTC
I forgot to include the link to the duplicate rev_id's in my original post, this output[1] was only generated up to revision id ~17.7M and change. And from a cursory glance appears to be mostly attributable to [[Wikipedia:Village_pump_(assistance)]]

1 - http://toolserver.org/~overlordq/duplicate_revisions.txt
Comment 2 Graham87 2010-02-06 04:35:22 UTC
Yeah, the Village pump (assistance) problem should be cleand up at some point. See section 16 of:
http://en.wikipedia.org/wiki/Wikipedia:Administrators%27_noticeboard/IncidentArchive327
Comment 3 Happy-melon 2010-02-08 20:05:34 UTC
I managed to restore the old revisions... ish.  They now appear in the live history of the page (and I used RevDelete to clear up the privacy issue that prompted the incident in the first place); but the revisions haven't been deleted from the archive table, and I didn't get a deletion log entry.  So in principle, there are now 10,000 duplicates from VPA, not 2000...  However, I notice that the old revisions universally have a different ar_page_id to the current VPA page (1099394 instead of 14276593), which may be why they've become disconnected from the live page.  Every deleted revision of VPA now has a corresponding live revision; the deleted duplicates can be safely deleted.
Comment 4 Graham87 2010-02-20 16:05:17 UTC
The page history for Wikipedia:Village pump (assistance) doesn't show any duplicate revisions. Were they deleted by the sysadmins, or by some other method? Are they automatically filtered out by MediaWiki, or did something else happen to them?
Comment 5 Happy-melon 2010-02-20 16:26:17 UTC
Yes, they seem to have gone (from the toolserver db as well as live server).  Interesting.
Comment 6 Graham87 2010-04-19 15:05:42 UTC
Are there any remaining examples of this bug?
Comment 7 Happy-melon 2010-05-17 16:47:57 UTC
(In reply to comment #6)
> Are there any remaining examples of this bug?

OverlordQ, do you mind running your original analysis again?

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links