In this short talk, Nicole describes a system that extracts knowledge from mathematical texts in order to build an intelligent encyclopedia.
In mathematical texts, there are sections with a fairly regular structure: conditions, conclusions, properties. “Let x be … Y is a Z ifff …” These can be extracted with text analysis.
The goal is an ontology of mathematical texts, not mathematics.
They use the TRALE system, a grammar analyser for the German language. The implementation uses Jena. They used to output topic maps, but changed to OWL.
Q: (Rainer Eckstein) So how much effort is this?
A: A lot. We had to build a lexicon for the TRALE parser first, that took about a year.
Q: Implementation?
A: Convert input texts to TEI, then a lot of Java, some Python scripts. TRALE is Prolog.