Difference between revisions of "Reencoding MediaWiki pages"
imported>Cov (Created page with 'The following script is a work in progress. The end goal is to produce [w:WikiMarkup] from the HTML of a downloaded [w:MediaWiki] page. <pre> sed -rn -e '/<!-- start content -->…') |
imported>Cov |
||
Line 1: | Line 1: | ||
− | The following script is a work in progress. The end goal is to produce [w:WikiMarkup] from the HTML of a downloaded [w:MediaWiki] page. | + | The following script is a work in progress. The end goal is to produce [[w:WikiMarkup]] from the HTML of a downloaded [[w:MediaWiki]] page. |
<pre> | <pre> |
Revision as of 03:07, 9 November 2009
The following script is a work in progress. The end goal is to produce w:WikiMarkup from the HTML of a downloaded w:MediaWiki page.
sed -rn -e '/<!-- start content -->/,/<!--/p' page.html | \ sed -r -e '/<!--/d' -e 's%</?p>%%g' \ -e 's%<br>%<br />%' \ -e 's|<a href="([^"]*)" class="external text" title="[^"]*" rel="nofollow">([^<]*)</a>|[\1 \2]|'