Fullscreen
[Show/Hide Left Column]
[Show/Hide Right Column]

Print

GSOC 2009: MediaWiki export formats

There are more than one way to export MediaWiki content. In this page I will describe the methods I studied as possibilities to use in the MediaWiki - TikiWiki importer.

MediaWiki XML export feature

MediaWiki has a built-in XML feature (external link) to export all wiki page content. It does not export users and the XML contains the MediaWiki syntax (no wiki syntax parsing is done).

It has a easy to use command line script called dumpBackup.php (external link) that output all wiki pages with history but also accept a lot of different arguments to export only the last version of each page and so on.

MediaWiki XML Bridge

MediaWiki XML Bridge (external link) extension is another tool to export wiki pages to XML format (or in this case also XHTML). It uses the mwlib (external link), a python library to parse MediaWiki articles.

Nelson question to help evaluate XML Bridge:
  1. Is XML Bridge any good?
    Rodrigo: I'm not confident that XML Bridge is something interesting for our project. Apparently they use a non standard and MediaWiki specific XML representation called mwxml. I wasn't able to find the format specification. Also, mwlib is oriented to fetch through HTTP only the last revision of an article. mwlib is developed by pediapress.com, they print books from MediaWiki sites. Maybe that is why they are not concerned with wiki page history.
    As XML Bridge doesn't export the page history I don't think it might be useful for the MediaWiki to TikiWiki importer.
  2. What should we write to convert this XML to Tiki? (maybe we can write a PHP XML bridge in reverse to Tiki or maybe stick with Python)
    Rodrigo: A mwxml parser :-)
  3. Is the XML representation a standard to wiki conversion?
    Rodrigo: No, XML Bridge use mwxml a XML representation specific for the MediaWiki syntax. I wasn't able to find the format specification.
  4. Is XML Bridge to MW a two way bridge? I suppose it is. Is it lossy? Are some syntax lost?
    Rodrigo: I'm not sure if XML Bridge is two way, I didn't found in the documentation any way to insert content in a wiki page using the mwxml format. Also, I didn't found any reference to be sure if mwxml support 100% of the MediaWiki syntax or if there is syntax loss. Probably there no significant syntax loss as XML Bridge uses mwlib which is the official way supported by the MediaWiki foundation to export MediaWiki articles to formats such as PDF or OpenDocument.

Contributors to this page: rodrigo615 points  .
Page last modified on Wednesday 03 June, 2009 14:01:21 UTC by rodrigo615 points .

Main Menu [toggle]


Bugs and Wishes
  1. Report a Bug (or suggest a feature enhancement)

  2. Search Bugs

  3. List yours



About Development

Mailing lists

Extra Stuff

Teams

External Links

Full list of Wiki Pages

TikiWiki on Social Networks


To register [toggle]

To have an account at this site, please register at Tikiwiki.org (external link), and then use that user name and password to log in here.

Search a Wiki Page [toggle]

Exact match

Search Tracker Items Subject [toggle]

Keywords

The following is a list of keywords that should serve as hubs for navigation within the Tiki development and should correspond to documentation keywords.

Each feature in Tiki has a wiki page which regroups all the bugs, requests for enhancements, etc. It is somewhat a form of wiki-based project management. You can also express your interest in a feature by adding it to your profile. You can also try out the Dynamic filter.

Accessibility (WAI – 508)
Action log 2.x
Administration
Ajax 2.x
Alert 3.x
Articles & Submissions
Backlinks
Banner
Blog
Bookmark
Browser Compatibility
Calendar
Category
Chat
Comment
Communication Center
Consistency
Contacts Address book
Contact us
Content template
Contribution 2.x
Cookie
Copyright
Custom Home (and Group Home Page)
Database independence
Database MySQL
Date and Time
Debugger Console
Directory (of hyperlinks)
Documentation link from Tiki to doc.tikiwiki.org (Help System)
DogFood
Dynamic Content
Dynamic Variable
External Authentication
FAQ
Featured links
File Gallery
Forum
Friendship Network (Community)
Gmap Google maps
Group
Help System
Hotword
HTML Page
i18n (Multilingual, l10n, Babelfish)
Image Gallery
Import-Export
Install
Integrator
Interaction
Inter-User Messages
InterTiki
jQuery
Karma
Live Support
Lost edit protection
Mail-in
Map with Mapserver
Menu
Meta Tag
Missing features
MindMap 3.x
Mobile Tiki and Voice Tiki
Mods
Module
MultiTiki
MyTiki
Newsletter
Notepad
OS independence (Non-Linux, Windows/IIS, Mac, BSD)
Payment 5.x
Performance Speed / Load / Compression / Cache
Permission
Poll
Profile Manager
Quiz
Rating
RSS
Score
Search engine optimization (SEO)
Search
Security
Semantic links 3.x
Shoutbox
Site Identity
Slideshow
Smarty Template
Spam protection (Anti-bot CATPCHA)
Spellcheck
Spreadsheet
Staging and Approval
Stats
Survey
System log
Tags 2.x
Task
Tell a Friend + Social Bookmarking 2.x
TikiTests 2.x
Theme
Toolbar (Quicktags)
Trackers
TRIM
User Administration
User Files
User Menu
Watch
WebHelp
Webmail and Groupmail
WebServices 3.x
Wiki 3D
Wiki History, page rename, etc
Wiki plugins extends basic syntax
Wiki syntax text area, parser, etc
Wiki structure (book and table of content)
Workspace Ideas 4.x
WYSIWTSN 4.x
WYSIWYCA
WYSIWYG 2.x
XMLRPC

Last Comments [toggle]

  1. Working on it !
  2. Hack attached doesn´t work
  3. Possible reason and solution
  4. Another URL