[bxmlt] Nicole Natho – mArachna, an OWL-based mathematical knowledge base

In this short talk, Nicole describes a system that extracts knowledge from mathematical texts in order to build an intelligent encyclopedia.

In mathematical texts, there are sections with a fairly regular structure: conditions, conclusions, properties. “Let x be … Y is a Z iff …” These can be extracted with text analysis.
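
To make the idea of a "regular structure" concrete, here is a minimal sketch of pulling the parts out of a sentence shaped like the pattern quoted above. It is only an illustration: the talk's system uses a full grammar analysis of German text, whereas this uses a single English regular expression, and the names `DEFINITION` and `extract_definition` are made up for this example.

```python
import re

# Hypothetical pattern for sentences of the form
# "A <term> is a <kind> iff <condition>."
# An illustrative English-only regex, not the grammar used by the project.
DEFINITION = re.compile(
    r"^(?:An?|The)\s+(?P<term>.+?)\s+is\s+(?:an?|the)\s+(?P<kind>.+?)\s+"
    r"(?:iff|if and only if)\s+(?P<condition>.+?)\.?$",
    re.IGNORECASE,
)


def extract_definition(sentence):
    """Return a dict with term, kind and condition, or None if no match."""
    match = DEFINITION.match(sentence.strip())
    if match is None:
        return None
    return {name: value.strip() for name, value in match.groupdict().items()}


if __name__ == "__main__":
    print(extract_definition(
        "A group G is an abelian group iff its operation is commutative."
    ))
```

A single regex like this breaks down quickly on real prose (nested clauses, formulas, anaphora), which is presumably why a proper grammar analyser is needed.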

The goal is an ontology of mathematical texts, not of mathematics itself.

They use the TRALE system, a grammar analyser for the German language. The implementation uses Jena. They used to output topic maps, but changed to OWL.
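
The talk did not show what the OWL output looks like, so the following is only a rough sketch of how an extracted "X is a Y" relation might be written down as OWL classes in Turtle syntax. Everything specific here is an assumption: the `http://example.org/math#` namespace, the class naming, and the choice of `rdfs:subClassOf` are mine, and the code is plain Python string formatting rather than Jena.

```python
# Hypothetical namespace; the real mArachna vocabulary is not given in the talk.
BASE = "http://example.org/math#"


def to_local_name(label):
    """Turn a label such as 'abelian group' into a CamelCase local name.

    Assumes the label consists of plain words separated by whitespace.
    """
    return "".join(word.capitalize() for word in label.split())


def definition_to_turtle(term, kind):
    """Emit Turtle declaring two OWL classes, with `term` a subclass of `kind`."""
    term_name = to_local_name(term)
    kind_name = to_local_name(kind)
    lines = [
        "@prefix owl: <http://www.w3.org/2002/07/owl#> .",
        "@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .",
        f"@prefix : <{BASE}> .",
        "",
        f":{kind_name} a owl:Class .",
        f":{term_name} a owl:Class ;",
        f"    rdfs:subClassOf :{kind_name} .",
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    print(definition_to_turtle("abelian group", "group"))
```

In a real pipeline an RDF library would build and serialize the model; the string version is just enough to show the shape of the data.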

Q: (Rainer Eckstein) So how much effort is this?

A: A lot. We had to build a lexicon for the TRALE parser first, which took about a year.

Q: Implementation?

A: Convert input texts to TEI, then a lot of Java, some Python scripts. TRALE is Prolog.
