|
Disclaimer: I assume you're running linux. If not, you may have to change some things. Unfortunatey, I ( BradMarshall ) am not of much help on other platforms.
Disclaimer2: This isn't really a tutorial for newbies. It's more some pointers for experienced users. I hope to correct this in the near future.
First, you'll need an xml parser - I strongly suggest the apache project's xerces. I believe this is currently the best xml parser available. Unfortunately, as of this writing it only supports java and c++ (and perl for windows), but the perl for linux version is on its way! If you want python support, check out http://www.fourthought.com. Anyway, xerces is a full-featured free software xml parser that supports sax2. sax2 is fast and supports namespaces.
The easiest way to test a document against the bioxml dtds is using the example programs distributed with xerces. From the command line (assuming your ClassPath? is correctly set), just type:
java sax.SAXParser -vN xmlfile
The -v turns on validation and the N turns OFF namespaces (so prefix:name is seen as prefix:name rather than "http://www.bioxml.org/prefix name").
Some sample xml files can be found under the cvs repository at bioxml-html/samples . Or try the web versions at http://www.bioxml.org/samples/dna.xml, http://www.bioxml.org/samples/feature.xml, http://www.bioxml.org/samples/comp.xml, http://www.bioxml.org/samples/dbxref.xml.
Related pages: Unclassified?
|