Under the CCCT umbrella, researchers from the humanities, the social and behavioral sciences, and the natural sciences collaborate in a multidisciplinary setting on information-rich research topics. CCCT is organizing monthly seminars in which speakers report on research activities that are of shared interest.
Speaker: Laura Hollink (Web & Media group at VU Amsterdam)
Connecting political events to media data – the ‘PoliMedia’ and ‘Talk of Europe’ projects.
In this presentation I will talk about our work on connecting data about political events to media archives. By doing to, we aim to facilitate a large-scale, quantitative analysis of the coverage of political events in the media, over long time spans and across media-outlets.
In the PoliMedia project, we focus on the meetings of the Dutch parliament, and provide automatically generated links between the transcripts of those meetings and radio bulletins & newspaper articles at the National Library. Via a web application users can search through the debates and find related media coverage in various media outlets. Furthermore, the generated links are available via a SPARQL endpoint, allowing quantitative analyses with complex, structured queries. Ultimately, we aim to extend this work to include other types of events as well as other media sources, including social media. One step into this direction is the new project ‘Talk of Europe / Traveling Clarin Campus’ in which we focus on the meetings of the European parliament. The fact that these are available in 23 European languages provides new opportunities to link to data in various countries.
Speaker: Paul Groth (Web & Media group at VU Amsterdam)
Transparency in the Data Supply Chain
Domains such as drug discovery, data science, and policy studies increasing rely on the combination of complex analysis pipelines with integrated data sources to come to conclusions. A key question then arises is what are these conclusions based upon? Thus, there is a tension between integrating data for analysis and understanding where that data comes from (its provenance). In this talk, I describe recent work that is attempting to facilitate transparency by combining provenance tracked within databases with the data integration and analytics pipelines that feed them. I discuss this with respect to use cases from public policy as well as drug discovery.
Moderator: Maarten de Rijke (Informatics Institute, UvA)
Date and Time: Friday 21 February 2014, 16.00-17.00 (followed by drinks)
Science Park, room: B0.201
Science Park 904
1098 XH Amsterdam
Laura Hollink is an assistant professor in the Web & Media group at the VU University Amsterdam. Laura’s main research themes are modeling, linking and enrichment of data, including studies of the usage of data. She is the initiator of the USEWOD workshop series on usage analysis of linked open data, organizer of the DeRiVE workshop series on modeling events, and organizer of a track in the Ontology Matching workshop on linking large and multi-lingual resources to each other. Laura has been work-package leader in two European FP7 projects on semantics and data integration. In the Extended Semantic Web Conference, she has chaired the PhD symposium, and is now co-chair of the Linked Data track.
Paul Groth is an assistant professor in the Web & Media Group at the VU University of Amsterdam and a member of its Network Institute. He holds a Ph.D. in Computer Science from the University of Southampton (2007) and has done research at the University of Southern California. His research focuses on dealing with large amounts of diverse contextualized knowledge with a particular focus on the web and e-Science applications. This includes research in data provenance, web science, knowledge integration and knowledge sharing. Paul was co-chair of the W3C Provenance Working Group developing a standard for provenance interchange and is co-author of Provenance: an Introduction to PROV. Currently, he is a key contributor to Open PHACTS (http://www.openphacts.org) – a project to develop a provenance-enabled platform for large scale pharmacological information. He blogs at http://thinklinks.wordpress.com and you can find him on twitter: @pgroth