Talk:Wiki.pl

From Organic Design wiki
Revision as of 12:10, 18 June 2008 by Sven (talk | contribs) (http://wikisum.com/w/User:Adam/Creating_MediaWiki_bots_in_PHP)




Todo
  • RSS and general URL matching are not working properly, probably because every item is being given a 00:00 time. Items are showing up in the cache files but not appearing in the changes. --Nad 11:55, 8 Oct 2006 (NZST)

Wiki version

I couldn't see any functions that determine the version of the wiki that wikid is accessing; this could be useful. --Sven 15:09, 7 Jun 2006 (NZST)

I don't think we should need it; I think they're all similar enough that the regexps can be generalised to cover them all. --Nad 12:08, 26 Jun 2006 (NZST)
I will continue documenting URL changes between versions, partly for my own understanding and partly to make the regexes easier to create. --Sven 12:54, 26 Jun 2006 (NZST)
Well, differences in URLs would require a version check like you say, but they have nothing to do with the regexps; those match the HTML content, not the URLs. --Nad 13:26, 26 Jun 2006 (NZST)
Bugger, that's true. --Sven 13:30, 26 Jun 2006 (NZST)

Modifications to work with MediaWiki 1.6.5

I've installed the latest MediaWiki on a site at the Uni and was using some of your automated wikiLogon and wikiEdit code. I discovered that the login form has changed slightly, so the regexp provided in this article no longer works. Here is the modified line of code, which I have tested and which works. I'm not sure how best to integrate it; I have a feeling the regexp could be generalised to handle both cases. I was tripped up by the layout of the source HTML, where the value comes before the key when matching forwards, but making the last part greedy fixed it, provided the token is the last thing in the form that we want to grab.

/^<input type='hidden' value="(.*?)" name="wpSection".+?value="(\d*?)"
name="wpEdittime".+<input type='hidden' value="(.*?)" name="wpEditToken"/sm

--Rob 14:00, 15 May 2006 (NZST)
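
For anyone wanting to check the pattern above outside of wiki.pl, here is a minimal Python sketch. It is not the wiki.pl code: the sample form HTML and the function name extract_edit_fields are made up for illustration, and the line break in the posted regexp is assumed to stand for whitespace (written as \s+ here).

```python
import re

# Rob's pattern; the line break in the post is treated as whitespace (assumption).
EDIT_FORM_RE = re.compile(
    r'''^<input type='hidden' value="(.*?)" name="wpSection".+?value="(\d*?)"\s+'''
    r'''name="wpEdittime".+<input type='hidden' value="(.*?)" name="wpEditToken"''',
    re.S | re.M,  # same flags as /sm in the Perl version
)

# Hypothetical edit form in the value-before-name style described above.
SAMPLE_FORM = r'''<form id="editform" method="post">
<input type='hidden' value="" name="wpSection" />
<input type='hidden' value="20060515020000" name="wpStarttime" />
<input type='hidden' value="20060514235959" name="wpEdittime" />
<textarea name="wpTextbox1">Some text</textarea>
<input type='hidden' value="0123abcd+\" name="wpEditToken" />
</form>
'''


def extract_edit_fields(html):
    """Return (section, edittime, token), or None if the form is not found."""
    match = EDIT_FORM_RE.search(html)
    return match.groups() if match else None


if __name__ == "__main__":
    print(extract_edit_fields(SAMPLE_FORM))
```

The greedy `.+` before the last input is what lets the token be picked up even when other hidden fields sit between wpEdittime and wpEditToken, which only holds while the token is the last hidden field of interest.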

See also

Examples

Edit

{{{1}}}

Delete

{{{1}}}

Querying categories

The code to fetch a bullet list doesn't appear to be available in the form of a function. The query string can be used to fetch most of the content efficiently: using the action=render argument (see MW:Manual:Parameters_to_index.php) in the query string returns a small amount of structured HTML in the form of a bullet list. Unfortunately, content from the summary itself is also present, so the last bullet list in the output is the most relevant one to parse.

For example, http://www.organicdesign.co.nz/wiki/index.php?title=Category:Nad/Todo&action=render will output a bullet list wrapped in HTML.
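
Picking out the last bullet list from the rendered output could look roughly like the Python sketch below. The class and function names and the sample HTML (including its page titles) are hypothetical, and nested lists are not handled; it only shows the idea of keeping one list of items per ul element and taking the final one.

```python
from html.parser import HTMLParser


class BulletListParser(HTMLParser):
    """Collects the text of every <li>, grouped by the <ul> it belongs to."""

    def __init__(self):
        super().__init__()
        self.lists = []       # one list of item texts per <ul> seen
        self._in_li = False

    def handle_starttag(self, tag, attrs):
        if tag == "ul":
            self.lists.append([])
        elif tag == "li" and self.lists:
            self.lists[-1].append("")
            self._in_li = True

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_li = False

    def handle_data(self, data):
        if self._in_li:
            self.lists[-1][-1] += data


def last_bullet_list(html):
    """Return the item texts of the last <ul> in html (empty list if none)."""
    parser = BulletListParser()
    parser.feed(html)
    parser.close()
    if not parser.lists:
        return []
    return [item.strip() for item in parser.lists[-1]]


# Hypothetical action=render output: a summary list followed by the members.
SAMPLE_RENDER = """<p>Summary text for the category</p>
<ul><li>A bullet from the summary</li></ul>
<ul><li><a href="/Hypothetical_page_A" title="Hypothetical page A">Hypothetical page A</a></li>
<li><a href="/Hypothetical_page_B" title="Hypothetical page B">Hypothetical page B</a></li></ul>
"""

if __name__ == "__main__":
    print(last_bullet_list(SAMPLE_RENDER))
```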

In the MediaWiki MW:API, there is an [http://en.wikipedia.org/w/api.php?action=query&list=categorymembers&cmtitle=Category:Physics&cmlimit=30 example] query to fetch category content. The maximum number of members that can be fetched per request is 500 for humans and 5000 for bots. This is documented at http://en.wikipedia.org/w/api.php. --Sven 16:26, 11 June 2008 (NZST)
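
As a rough illustration of the API route, the Python sketch below builds the same categorymembers query and pulls the titles out of a JSON reply. The function names and the sample response are made up for illustration, format=json is added on the assumption that JSON output is wanted, and no request is actually sent.

```python
import json
from urllib.parse import urlencode

API_URL = "http://en.wikipedia.org/w/api.php"


def category_members_url(category, limit=30):
    """Build a list=categorymembers query URL (limit: up to 500, or 5000 for bots)."""
    params = {
        "action": "query",
        "list": "categorymembers",
        "cmtitle": category,
        "cmlimit": limit,
        "format": "json",   # assumption: JSON output is wanted
    }
    return API_URL + "?" + urlencode(params)


def member_titles(response_text):
    """Extract the page titles from a categorymembers JSON response."""
    data = json.loads(response_text)
    members = data.get("query", {}).get("categorymembers", [])
    return [member["title"] for member in members]


# Hypothetical response body, shaped like the API's categorymembers output.
SAMPLE_RESPONSE = """{"query": {"categorymembers": [
    {"pageid": 1, "ns": 0, "title": "Hypothetical page A"},
    {"pageid": 2, "ns": 0, "title": "Hypothetical page B"}
]}}"""

if __name__ == "__main__":
    print(category_members_url("Category:Physics"))
    print(member_titles(SAMPLE_RESPONSE))
```

Compared with scraping action=render, this avoids having to guess which bullet list holds the members, at the cost of requiring a wiki new enough to have api.php.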

Debugging

Output the form content and run the Perl one-liner on it:

{{{1}}}

Pointing the automated requests at an info.php page is also useful for comparing the header and POST request information they send:

<PHP>
<?php phpinfo(); ?>
</PHP>

--Sven 22:07, 17 June 2008 (NZST)