Last modified: 2010-05-15 15:54:47 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T17027, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 15027 - Searching for IP Addresses in page content unsuccessful
Searching for IP Addresses in page content unsuccessful
Status: RESOLVED FIXED
Product: MediaWiki
Classification: Unclassified
Search (Other open bugs)
1.12.x
All Linux
: Normal normal with 1 vote (vote)
: ---
Assigned To: Brion Vibber
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2008-08-04 00:18 UTC by Cameron Moody
Modified: 2010-05-15 15:54 UTC (History)
4 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Cameron Moody 2008-08-04 00:18:20 UTC
We are using Mediawiki for technical documentation. As part of this we have device info contained in a table and part of this is the IP Addresses configured.
 
Unfortunately searches pick up the ip addresses entered which do definately exist on pages. If I enter the ip then the search finds no hits.

We are running MediaWiki 1.12.0, PHP 5.1.2, MySQL 5.0.26
 
Below is an exert of part of a table in use (IPs changed). EG if "10.10.10.1" is entered in the search box and search clicked, it will find no results.

{| style="width:75%; text-align:left" border="1"
! width="30%" bgcolor="#EEEEEE"| ''Configuration Item''
! bgcolor="#EEEEEE"| ''Setting''
|-
! Device Name
| {{PAGENAMEE}}
|-
| Management IP Address 
| 10.10.10.1
|-
|}
Comment 1 Brion Vibber 2008-08-04 06:04:59 UTC
The index field markup munging probably needs some love to make sure common constructs like this are handled cleanly.
Comment 2 Daniel Friesen 2008-08-04 11:34:12 UTC
Yes, but in the meantime this kind of use is actually the point of Semantic MediaWiki. Things like that with special importance should be tagged. The information gathered from it would be far more useful than that of a simple search.
Comment 3 Brion Vibber 2008-08-04 21:04:33 UTC
That's nice, but irrelevant to the real world where people use straight text. :)
Comment 4 Siebrand Mazeland 2008-08-10 23:56:14 UTC
Take the liberty to assign to Brion. He obviously knows what this is about.
Comment 5 Tathagata 2008-08-18 01:27:39 UTC
This bug had put me in a faux pas, some weeks back after I had evangelised moving to mediawiki for migrating all system maintenance documentations. As I was demonstrating the migrated documents to my senior, a similar request for searching IP of a server or name of a feed file returned no results - even though they were very well in the page, and the older documenting methods (abominable .doc files) worked fine.
Nobody showed any interest in wikifying "everything" after this :(
Comment 6 Cameron Moody 2008-10-20 00:42:55 UTC
Also occurs on 1.13 since upgrading. 

Is anybody aware of any method that we can use for the searches to work. I have tried putting the IP in "quotes" etc but made no difference.

Otherwise some form of realistic time before it can/will be fixed? Sorry I just need to give justification to the higher up people...
Comment 7 Brion Vibber 2008-12-19 01:23:35 UTC
Ok, the problem here appears to be that with the default MySQL search backend the search terms get broken up at the periods...

Thus "192.168.1.1" would become the boolean mode search: "+192 +168 +1 +1"
All the terms would be too short for the default, and fail to return any results.

In current 1.14 development trunk, we now pad the short words in the index, so they can be found:
'+192U800 +168U800 +1U800 +1U800''

This is a slight improvement, in that the search will work and return matches. But it's still pretty crappy, since the loss of the periods loses you context; you can easily end up with false positives, and the result highlighting doesn't show you what you wanted either.

If those periods don't get stripped out of the input, it might work a little better, but the index may need adjustment as well...
Comment 8 Brion Vibber 2008-12-19 01:50:32 UTC
Ok, fixed on 1.14 dev trunk in r44791.

Now allowing periods through to the indexed text, and encoding periods that appear within a compound word so they get caught more cleanly.

Also made a tweak so highlighting works a bit better on word boundaries -- eg "192.168.1.1" no longer hits a highlight match for "192.168.1.100". However it's still not 100% handling some cases with the periods. Sigh.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links