One million articles uploaded to Mendeley!

articles-uploadedWe passed a landmark today: As of 16.50h GMT, our users have uploaded one million articles to their Mendeley accounts! Including the cited references which Mendeley also extracts from research papers, we now have over 14 million metadata sets in our database. Even we were surprised by the speed in which this has happened!

For the record, the millionth article added to our database was “The somatic marker hypothesis: A critical evaluation by Dunn et al. (2006) in Neuroscience and Biobehavioral Reviews – as luck would have it, that’s a topic related to my personal research on the role of emotions in decision making! The closest publication by one of our users, Joaquin Rivera, was added at 17.01h – a maths paper titled “On the exact multiplicity of solutions for boundary-value problems via computing the direction of bifurcations”, available for download on Joaquin’s Mendeley profile.

90% of these one million articles have been uploaded since January 2009, and our database is currently doubling in size every 6 weeks. For comparison, venerable PubMed – the largest database of biomedical literature – contains 18,813,527 records as of today. Assuming we managed to keep up our growth, we could surpass the size of the PubMed database within the next 6 months!

Roughly 43% of the papers in our database are in the biological and medical sciences (even though only about 27% of users are working in these academic disciplines). Computer and information science comes in second with roughly 11% of all papers, followed by engineering with 7%, and chemistry, physics, psychology and other social sciences with 4-5% each.

As we’ve said before on this blog and elsewhere (e.g. see my talk at the Plugg Conference), we’re not hoarding all that data just because we can, no Sir! Our vision is to create the largest open, interdisciplinary and ontological database of research – as crazy as that sounds, remember that Last.fm (whose former chairman and COO are our co-founders and investors) pulled it off in the space of music within just three years, using the same user data-aggregation model that Mendeley is built on.

We’ve already begun to report real-time “usage-based” research trends – a nice discussion of Mendeley statistics showing the most-read journal in the biological sciences can be found here (we’ll be writing more about this soon!). Analogous to Last.fm, we will provide APIs to let others mash up the research statistics we’re generating. Moreover, our database will be the basis for our upcoming collaborative filtering recommendation engine: Based on the articles in your Mendeley library, we will be able to tell you about articles you don’t know yet, but which have been read and recommended by researchers with similar interests. You can read more about these plans in our recent IEEE e-Science paper.

A big thank you to our wonderful users who have been helping us improve Mendeley with their constant feedback. After celebrating the millionth article upload tonight, we’ll get back to work on our next two releases, packed full with exciting new features!

Release update

A couple of days ago we released our latest update of Mendeley. Details about feature improvements and bug fixes of Mendeley Desktop v0.6.4 can be found in my last blog entry, but I thought I just mention some of the improvements in Mendeley Web:

  • Improved grouping in online library
  • Introducing a simple document search
  • Support of two new sites for easy import (SpringerLink and WorldCat)
  • Duplicate detection for bookmarklet import

Next on the list for Mendeley Web is introducing group and network pages which are overdue for a long time. We are also working on a new site design which we’ll announce in a separate post soon.

Happy Easter! 🙂

Mendeley's new algorithm analyzes research behavior

Our team of coders have been working extremely hard to produce accurate and enlightening statistical results based on Mendeley usage and reading habits.

We are quite proud to announce a brand-spanking-new algorithm that analyzes not only the metadata extracted from the papers read by Mendeley users, but also research habits based on their interaction with Mendeley Desktop/Web.

This newly developed system has been named AGraSBAS – Automated Graduate Student Behavior Analysis System. We are currently in the process of patenting this groundbreaking technology.

The image embedded in this post presents preliminary results that we are pleased to showcase before implementing the algorithm live on our services.

agrasbas-analysis_small

Based on some of the replies from our AGraSBAS beta-testers, the complex algorithm produces results with incredible accuracy.

Paulo Nuin from Cold Town, Canada said the following:

“Not even my girlfriend knew that I recycled (aka reuse!) tea bags. I’ve mastered this technique and my guests rarely notice a difference. I don’t know how, but AGraSBAS got it right!”

Other statistics such as “Number of papers uploaded to Mendeley and NEVER read”, “Abstract/Full Paper reading ratio” and “Scroll speed”  have been quite interesting to look at.

In the hope of providing the academic research community with the best statistical information, we are very proud to have produced AGraSBAS.

We are keen to hear your opinions as soon as the system goes live. Stay tuned…

Mendeley Desktop v0.6.4 beta coming soon, work on v0.7.0 started already

Just to give you a quick update on our development. Unfortunately we didn’t manage to release an updated version of Mendeley Desktop last week as originally thought. There were some stability issues on Mac OS 10.5.6 when importing a large number of documents through folder monitoring. We thought we better fix this now since it’s one of the first things many users will do when starting to use Mendeley Desktop. It turned out not to be that easy though since it only happened on Mac OS Leopard only and the bug was well hidden deep down in the internals of the app.

Anyway, it’s fixed now and QA is doing its best to make sure that there are no regressions in the new version and everything works as expected. Here is preview of some of the things that will change in v0.6.4:

Improvements to Existing Features:
  • Full text search is now available for PDF files attached to documents manually or downloaded from Mendeley Web
  • Word plugin is now much faster with documents that have a large number of citations
  • Export issue, DOI, URLs and several other fields to Bibtex
  • Export file links to Bibtex
  • Import tags (‘links’) from EndNote XML
  • When importing documents which are already in your library, add the existing documents to the current group
Bug Fixes
  • Fix HTTP proxy support
  • Fix possible ‘UUID Clash’ error when syncing
  • File organiser forgot the root folder after Mendeley Desktop was restarted
  • Allow files to be removed from documents after they have been uploaded to Mendeley Web
  • Files were re-downloaded on each sync if they contained characters which are not permitted in file names
  • Fixed problem where organiser would re-process the same files several times
  • Bibtex exporter could generate invalid citation key names
  • Fix auto-complete in manual document entry dialog
  • Metadata form would sometimes prompt for the user to save changes twice
  • Fix possible ‘XML POST’ error when importing documents containing XML entities in the notes field

For v0.7.0 we will continue working on metadata extraction, focus more on the interface and usability as well as try to include the often asked for internal PDF viewer. There are still some cross-platform issues to solve with the PDF viewer but we’re confident that we will have something ready to ship for v0.7.0.