@Web: software for annotating heterogeneous scientific data sources, guided by a termino-ontological resource
@Web is a semantic web application that relies on a predefined vocabulary (called the ontology in the following) organized as a taxonomy. It is a semi-automatic tool designed to help domain experts annotate data found in scientific documents (publications, data sheets, ...). The @Web project is a joint collaboration between UMR INRA MIA AgroParisTech/INRA, UMR INRA IATE, INRIA GraphiK, UMR CNRS Heudiasyc, UMR INRA Mistea and the Plastic platform (INRA CEPIA software platform).
We focus on data tables, as they often synthesize the experimental results reported in scientific publications. The user downloads a scientific document in HTML format; data tables are then semi-automatically identified and extracted from it. A graphical interface helps the user annotate the data tables with terms from the ontology. Annotators may suggest changes to the ontology (new candidate terms for the vocabulary), which are managed by the ontology administrator. Metadata are associated with documents in order to assess their reliability. Relevant annotated information from the data tables can be queried through a semantic browser that uses the ontology.
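
The table extraction and annotation steps can be illustrated with a minimal sketch. It is not @Web's implementation, whose procedures are not detailed here. It assumes a toy ontology stored as a Python dictionary of concept names and labels, and a naive substring match between column headers and labels. The names `TableExtractor`, `ONTOLOGY`, `suggest_annotations` and `annotate_document` are all hypothetical, and nested tables are not handled.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect each <table> of an HTML document as a list of rows of cell text."""

    def __init__(self):
        super().__init__()
        self.tables = []    # finished tables
        self._rows = None   # rows of the table currently being read
        self._row = None    # cells of the row currently being read
        self._cell = None   # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._rows = []
        elif tag == "tr" and self._rows is not None:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join the fragments and normalize whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self._rows.append(self._row)
            self._row = None
        elif tag == "table" and self._rows is not None:
            self.tables.append(self._rows)
            self._rows = None


# Hypothetical toy ontology: concept name -> lowercase labels (assumption).
ONTOLOGY = {
    "Packaging": ["packaging", "film", "material"],
    "Temperature": ["temperature", "temp"],
    "O2Permeability": ["o2 permeability", "oxygen permeability"],
}


def suggest_annotations(header_row, ontology):
    """Map each column header to the first concept whose label it contains."""
    suggestions = {}
    for header in header_row:
        key = header.lower()
        match = None
        for concept, labels in ontology.items():
            if any(label in key for label in labels):
                match = concept
                break
        suggestions[header] = match  # None means no candidate was found
    return suggestions


def annotate_document(html, ontology):
    """Extract all tables and propose a concept for each header of each table."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    return [suggest_annotations(t[0], ontology) for t in parser.tables if t]
```

In this sketch, a header mapped to `None` stands for a column the expert would annotate by hand, or for which a new candidate term could be proposed to the ontology administrator. The suggestions are only a starting point for the annotator to validate, in keeping with the semi-automatic approach.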