Do some heavy duty Data-Mining on your iPad and Android
EMM is an entirely free news monitoring platform that first came online in 2002. It currently gathers an average of 175,000 news articles per day in 75 languages and applies a whole range of fully automatic multilingual natural language processing tools to structure and analyse the news. The user-friendly interface gives readers a quick and clear overview of what is happening around the world and how the top stories are developing. Users can browse the news by subject and they can follow specific entities such as persons, organisations and countries. Timelines give a birds’ eye overview of recent developments. Quotations by and about people highlight the more social aspect of the news, giving the reader condensed access to criticism or support towards persons and their statements.
The newly developed EMM App goes beyond the big-screen version in that users can personalise their home pages by selecting those themes, entities and languages they want to follow. The current Apps have been developed for the iPad and for selected products running the Android operating system. Dedicated Apps for even smaller screens such as the iPhone will follow soon.
The mobile version of the JRC-developed Europe Media Monitor (EMM) – available for both iOS and Android platforms and covering 75 languages. After its launch, it was even briefly Germany’s top-ranked free news App and ranked ninth out of hundreds of thousands of all free Apps.
BACKGROUND AND READING
EMM is a freely available advert-free family of news monitoring and analysis applications developed by the OPTIMA Team at the Joint Research Centre (JRC), which is the European Commission’s in-house science service.
You find literature (scientific publications) on the Europe Media Monitor EMM and its individual text mining tools at the URL http://langtech.jrc.ec.europa.eu/JRC_Publications.html, including papers giving an introduction and a generic overview of the Europe Media Monitor EMM.
FREELY AVAILABLE LINGUISTIC RESOURCES
The JRC has also released a large volume of freely available multilingual linguistic resources that can be used to develop or test a variety of multilingual and cross-lingual Natural Language Processing tools. These include parallel corpora in up to 26 languages; readily trained automatic document categorisation software in 22 languages; dictionaries of names and their variant spellings, including across languages and scripts; and more. See http://ipsc.jrc.ec.europa.eu/index.php?id=61 for details and to download the data and the tools.