Data is is usually structured withing a logical/conceptual schema, such as a relational or object-oriented databases. Unfortunately, some textual documents are unstructured since they do not have a logical pattern. To resolve this problem, the Biomedical Informatics Group has created a 4-stage method using Text Mining and Natural Language Processing techniques. The tool that supports this method is called «OntoAnnotator» and it allows the automatically extraction of a logical schema that describes an unstructured source.

Over the last decade, the GIB has been involved in a large number of text mining and information extraction/retrieval projects. We have been active in accessing and extracting knowledge from various unstructured sources, and from the biomedical literature available in PubMed. Bringing together structured and text-based sources is an exciting challenge for biomedical informaticians, since the most relevant biomedical sources belong to one of these categories. Unfortunately, the methods and tools provided by state-of-the-art database integration tools cannot be reused to bridge together structured and non-structured (text-based) sources, since all of them require the individual sources to be equipped with a logical schema. To address this issue, we created various approaches based on text mining techniques to automatically create a logical schema for non-structured sources. As seen in other sections, we have widely used text mining techniques in a large number of areas. 

Related Projects

The completion of the Human Genome Project sparked the development of many new tools for today’s biomedical researcher to use in finding the mechanism behind disease. More...

Achieving semantic interoperability among EHR and clinical trial systems is at the core of the project, as it is the basis for enabling many of the software services and tools. More...

The main objective of the project is the creation of centres of excellence to promote health research, education and practice in Africa. The creation of these centres will be based on four main pillars: e-learning, knowledge sharing, "know-how" and information technologies. More...

One of the objectives of ACTION Grid is to create a common infrastructure in Europe and to promote grid technologies, nanoinformatics and biomedical informatics research. More...



OntoMineBase

OntoMineBase project has two main goals: To standardize database information in order to improve the processes of Data Mining; To research and develop new methods of information retrieval. More...

Related News

LAST CALL: «Clinical and Research Databases» Community of Practice!

The «Clinical and Research Databases» Community of Practice (CoP) starts in 5 days through the AFRICA BUILD PORTAL!This is a Community …

LAST CALL: «Data Mining in Biomedicine» Community of Practice!

The «Data Mining in Biomedicine» Community of Practice (CoP) starts in 12 days through the AFRICA BUILD PORTAL! The outline of …

Professor Victor Maojo, AFRICA BUILD Coordinator, exchanging views with two ministers from Cameroon

AFRICA BUILD 1st International Conference, Yaoundé Cameroon.Professor Victor Maojo, AFRICA BUILD Coordinator, exchanging views with Health Minister of Cameroon André Mama …

AFRICA BUILD Portal: Connecting Health Researchers in Africa

The AFRICA BUILD Portal (ABP) is an Open and Free Social and Professional Network for supporting collaborative links and access to a broad variety …