Knowledge Discovery –
The exploitation of knowledge
Roman Kern, Area Head
Do you ever feel like a treasure hunter searching for hidden information and much-needed knowledge? Research activities and handling large amounts of data do indeed consume much of our working time. In the area "Knowledge Discovery" we develop automated methods for analyzing, enriching, and linking complex data sources. We never run out of ideas!
Our ongoing project with our business partner Hyperwave is dedicated to developing an enterprise search solution for content management systems. Many of the search technologies we have developed are being integrated into the partner organization's existing solution. Ultimately, all Hyperwave customers benefit by receiving a content management system with a search solution tailored to their respective domain.
For Lexis-Nexis, we developed a solution that automatically assigns documents to various legal areas and groups them.
In all areas of our work, one concept is especially important: Big Data. All of the above-mentioned methods can be applied both to small and to massive numbers of documents, which can be analyzed using multiple, distributed machines. Over the past few years, the Knowledge Discovery team has acquired the necessary engineering skills and has developed methods for dealing with constantly growing data sets.
An important part of our work is to provide our project partners with technologies that promote innovation. One example is our joint project with Mendeley, a provider of software for managing scientific publications. Within this project, existing approaches for extracting information from unstructured documents were developed further and new methods were created, e.g., automated table extraction and reconstruction of the table of contents. We reached and went beyond the state of the art in recognizing entities in the domains of biomedicine and computer science. These techniques will ultimately help Mendeley users, for example by speeding up navigation within publications.