Finding Better Ways of Mining Scientific Publications

Mendeley is supporting the 3^rd edition of the International Workshop on Mining Scientific Publications, which will take place on the 12^th September 2014 in London. The event will bring together researchers and practitioners from across industry, government, digital libraries and academia to address the latest challenges in the field of mining data from scientific publications.

Kris Jack, Chief Data Scientist at Mendeley, is part of the organizing Committee, which also includes The Open University and The European Library. Following a very successful call for papers, he is now looking forward to a very busy and productive day of presentations and discussions:

“We’ve had a record number of high-quality submissions this year, so were really spoiled for choice in putting together the agenda, which combines long papers, short papers, demonstrations and various presentations. We also worked with Elsevier to engage directly with the research community, which is really fantastic.”

As part of that ongoing outreach, Gemma Hersh, Policy Director at Elsevier, will be giving a brief presentation and answering questions from the participants regarding the company’s recently updated Text and Data Mining policy, and how it can best support the evolving needs of the research community.

As in previous years, this workshop is run in conjunction with the Digital Libraries conference – DL 2014 – and participants can register on the City University London website to attend the entire conference or just the workshops/tutorials.

See the full programme below, and for the latest updates be sure to follow @WOSP2014 or send any questions to @_krisjack or @alicebonasio on Twitter

PROGRAM

09:00-09:10	Introduction
09:10-09:45	Keynote talk Information Extraction and Data Mining for Scholarly Big Data Dr. C. Lee Giles
09:45-10:10	Long paper A Comparison of two Unsupervised Table Recognition Methods from Digital Scientific Articles Stefan Klampfl, Kris Jack and Roman Kern
10:10-10:30	Short paper A Keyquery-Based Classification System for CORE Michael Völske, Tim Gollub, Matthias Hagen and Benno Stein
10:30-10:50	Short paper Discovering and visualizing interdisciplinary content classes in scientific publications Theodoros Giannakopoulos, Ioannis Foufoulas, Eleftherios Stamatogiannakis, Harry Dimitropoulos, Natalia Manola and Yannis Ioannidis
10:50-11:10	Break
11:10-11:35	Long paper Efficient blocking method for a large scale citation matching Mateusz Fedoryszak and Łukasz Bolikowski
11:35-12:00	Long paper Extracting Textual Descriptions of Mathematical Expressions in Scientific Papers Giovanni Yoko Kristianto, Goran Topic and Akiko Aizawa
12:00-12:20	Short paper Towards a Marketplace for the Scientific Community: Accessing Knowledge from the Computer Science Domain Mark Kröll, Stefan Klampfl and Roman Kern
12:20-12:40	Short paper Experiments on Rating Conferences with CORE and DBLP Irvan Jahja, Suhendry Effendy and Roland Yap
12:40-13:00	Short paper A new semantic similarity based measure for assessing research contribution Petr Knoth and Drahomira Herrmannova
13:00-13:10	Presentation Elsevier’s Text and Data Mining Policy Gemma Hersh
13:10-14:00	Lunch
14:00-14:35	Keynote talk Developing benchmark datasets of scholarly documents and investigating the use of anchor text physics retrieval Birger Larsen
14:35-14:50	Demo paper AMI-diagram: Mining Facts from Images Peter Murray-Rust, Richard Smith-Unna and Ross Mounce
14:50-15:05	Demo paper Annota: Towards Enriching Scientific Publications with Semantics and User Annotations Michal Holub, Róbert Móro, Jakub Ševcech, Martin Lipták and Maria Bielikova
15:05-15:20	Demo paper The ContentMine scraping stack: literature-scale content mining with community maintained collections of declarative scrapers Richard Smith-Unna and Peter Murray-Rust
15:20-15:35	Break
15:35-16:00	Long paper GROTOAP2 – The methodology of creating a large ground truth dataset of scientific articles Dominika Tkaczyk, Pawel Szostek and Lukasz Bolikowski
16:00-16:25	Long paper The Architecture and Datasets of Docear’s Research Paper Recommender System Joeran Beel, Stefan Langer, Bela Gipp, and Andreas Nürnberger
16:25-16:50	Long paper Social, Political and Legal Aspects of Text and Data Mining Michelle Brook, Peter Murray-Rust and Charles Oppenheim
16:50-17:00	Closing

One thought on “Finding Better Ways of Mining Scientific Publications”

Joeran says:

September 3, 2014 at 07:15

I am sure it will be a great event 🙂 For those, who are interested in the pre-print of our paper “The Architecture and Datasets of Docear’s Research Paper Recommender System”: http://docear.org/papers/The%20Architecture%20and%20Datasets%20of%20Docear's%20Research%20Paper%20Recommender%20System.pdf

Comments are closed.

	pelorustech on Ditch those duplicates with Me…
	womotayo on Ditch those duplicates with Me…
	womotayo on Ditch those duplicates with Me…
	Rachel on Ditch those duplicates with Me…
	FK on Ditch those duplicates with Me…

PROGRAM

Share this:

One thought on “Finding Better Ways of Mining Scientific Publications”