Last modified: 2010-05-15 15:54:44 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T19779, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 17779 - Abnormal URL or Bad Web-Crawler (Spider) overloads CPU
Status: RESOLVED INVALID
Product: MediaWiki
Classification: Unclassified
Component: General/Unknown
Version: 1.12.x
Hardware: All All
Importance: Normal enhancement
Target Milestone: ---
Assigned To: Nobody
Reported: 2009-03-04 02:59 UTC by player
Modified: 2010-05-15 15:54 UTC

Description player 2009-03-04 02:59:00 UTC
I am running MediaWiki 1.12.0 (language configured as zh-tw).
Sometimes an abnormal URL or a bad web crawler (spider) causes high CPU load.
I have noticed this repeatedly over the past month.

Example (on 2009-03-01):
Top Process %CPU 7.3 httpd [player.idv.tw] [/prog/index.php?diff1687&oldid1668&titleJoomla_1.0.12%2B]
Top Process %CPU 6.6 httpd [player.idv.tw] [/prog/index.php?days1&from20080924164000&hideliu1&title]
Top Process %CPU 6.5 httpd [player.idv.tw] [/prog/index.php?actionhistory&titleGridView%E5%9C%A8%E8%B]

Are these not valid URLs?

I tried to fix it by editing index.php:

-----------------------------
# Query string fields
$action = $wgRequest->getVal( 'action', 'view' );
$title = $wgRequest->getVal( 'title' );
 
# Fix bug? ("%E9%A6%96%E9%A0%81" = "首頁", the name of the main page in zh-tw)
# Redirect to the main page when no title is given.
if (is_null( $title ) or ($title == ""))
{
	header("Location: index.php/%E9%A6%96%E9%A0%81"); exit;
}
-----------------------------

	
Perhaps you have a better way to fix it?

If you can read Chinese, you can also see my report:

http://zh.wikipedia.org/w/index.php?title=Special%3A%E6%90%9C%E7%B4%A2&search=%E5%90%84%E7%89%88%E6%9C%AC%E5%8F%AF%E8%83%BD%E9%9A%B1%E5%90%ABCPU%E8%B3%87%E6%BA%90%E8%80%97%E7%9B%A1%E7%9A%84%E6%BC%8F%E6%B4%9E&ns4=1&fulltext=%E6%90%9C%E5%B0%8B
Comment 1 player 2009-03-06 03:32:40 UTC
# Query string fields
$action = $wgRequest->getVal( 'action', 'view' );
$title = $wgRequest->getVal( 'title' );
 
#Fix Bug? Ver2
if (is_null( $title ) or ($title == "") or (substr($_SERVER['QUERY_STRING'],0,7)=='action='))
{
	header("Location: index.php/%E9%A6%96%E9%A0%81"); exit;
}
Comment 2 player 2009-03-10 03:02:22 UTC
# Query string fields
$action = $wgRequest->getVal( 'action', 'view' );
$title = $wgRequest->getVal( 'title' );
 
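#Fix Bug? Ver3
# Leave AJAX requests alone ("rs" parameter or action=ajax); otherwise redirect
# when the title is missing or the query string starts with "action=".
if (!isset($_GET["rs"]) and ((is_null( $title ) or ($title == "")) or
	((substr($_SERVER['QUERY_STRING'],0,7)=='action=') and ($_GET["action"]!="ajax"))))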
{
	header("Location: index.php/%E9%A6%96%E9%A0%81"); exit;
}
Comment 3 Mike.lifeguard 2009-07-02 14:15:43 UTC
Is this even a bug in MediaWiki?
Comment 4 Chad H. 2009-07-15 18:49:36 UTC
Robots can be blocked using robots.txt. Misbehaving robots that don't respect robots.txt can be blocked in other ways. Neither is a MediaWiki issue; marking INVALID.
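
As a rough illustration of the robots.txt approach mentioned above, a minimal sketch could look like the following. It assumes the wiki script is served at /prog/index.php (taken from the reporter's log lines) and that ordinary page views use index.php/Title paths (as in the reporter's redirect), so only query-string URLs such as diffs and history listings are disallowed. The path is an assumption and would need adjusting for any other install.

-----------------------------
# robots.txt at the web root (hypothetical example; adjust /prog/ to your script path)
User-agent: *
# Keep crawlers off dynamic query-string URLs (diffs, history, recent changes)
Disallow: /prog/index.php?
-----------------------------

This only helps with crawlers that honor robots.txt; as the comment notes, robots that ignore it have to be blocked by other means.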
