Meshin, an Outlook sidebar, boosts your productivity using semantic technologies to find the right information at the right time about people, companies, email communications and even documents.
Our goal is to develop a probabilistic knowledge base that mirrors the content of the web. We are developing a system that uses semi-supervised learning methods to learn to extract symbolic knowledge from unstructured text and HTML. We are exploring methods of continuous learning, in which our system runs 24x7, continuously learning to read better and continuously extracting facts from the web.
Webstemmer is a web crawler and HTML layout analyzer that automatically extracts the main text of a news site, without banners, ads, or navigation links mixed in.
Semantic MediaWiki (SMW) is a free extension of MediaWiki that helps to search, organise, tag, browse, evaluate, and share the wiki's content. While traditional wikis contain only texts which computers can neither understand nor evaluate, SMW adds semantic annotations that bring the power of the Semantic Web to the wiki.
MuNPEx is a multi-lingual noun phrase (NP) extraction component developed for the GATE architecture, implemented in JAPE. It currently supports English, German, French, and Spanish (in beta).
MuNPEx requires a part-of-speech (POS) tagger to work and can additionally use detected named entities (NEs) to improve chunking performance. Please read the documentation (or source code) for more details.
H. Kroll, J. Pirklbauer, and W. Balke. ACM/IEEE Joint Conference on Digital Libraries, JCDL 2021, Champaign, IL, USA, September 27-30, 2021, pages 21-30. IEEE, (2021)
H. Kroll, J. Al-Chaar, and W. Balke. Proceedings of the Workshop on Digital Infrastructures for Scholarly Content Objects (DISCO 2021) co-located with ACM/IEEE Joint Conference on Digital Libraries 2021 (JCDL 2021), Online (Due to the Global Pandemic), September 30, 2021, volume 2976 of CEUR Workshop Proceedings, pages 14-18. CEUR-WS.org, (2021)
J. Lafferty, A. McCallum, and F. Pereira. Proceedings of the Eighteenth International Conference on Machine Learning, pages 282-289. San Francisco, CA, USA, Morgan Kaufmann Publishers Inc., (2001)
P. Kluegl, M. Atzmueller, and F. Puppe. Proceedings of the Biennial GSCL Conference 2009, 2nd UIMA@GSCL Workshop, pages 233-240. Gunter Narr Verlag, (2009)
R. Basili and M. Pazienza. Information Extraction: A Multidisciplinary Approach to an Emerging Information Technology, volume 1299 of Lecture Notes in Computer Science. Springer Berlin Heidelberg, (1997)