CiteULike-discussion forum Feature requests http://www.citeulike.org/groupforum/%f_group_forum_id CiteULike.org en-gb Copyright © 2004-2007 Oversity Limited Shared bibliographic info for a paper across all of citeulike http://www.citeulike.org/groupforum/63 Cases: * I add a paper to my library and mark it to read later. Several weeks later I run across the same paper again. I have forgotten that I already added it to my Citeulike library. I add it again, not realizing that there are now two entries for the same paper in my library. * I find a paper that I want to add. It's not from a supported website, so I enter the details of a paper manually. Sometime later I notice that the same paper is already in someone else's library in Citeulike and I could have saved time by copying to my own library. I also notice now that I have made several typos while doing the manual post. * I am researching a new area and am unfamiliar with the seminal, well-known works of the field. I see there are statistics on Citeulike of how many people share a paper. I think it could be useful for me to find popular papers in my field. But I soon find out that there are many different entries for the same paper. So the statistics of the paper do not reflect the real number of people who have the same article in their library. Feature suggestion: These problems could be solved if Citeulike used a single database entry for the static information of a paper (i.e. bibliographic data). A method like fuzzy-matching for the authors name and paper title could identify identical articles. A new article that matches one in the database would use the existing entry information. This way identical articles would not be posted twice to a library. The bibliographic data would also become more complete and accurate as several users would verify it as they added the paper to their personal library. Users could also save time during a manual post by first only entry the author and title, and checking whether it already exists in the Citeulike database. Finally, the statistics would accurately reflect the real number of people who hold the same paper. Of course the individuals-specific data like notes, priority, tags, etc. could be stored separately from the paper's static information. 2007-11-02T10:48:37+00:00 Comment posted by Flit http://www.citeulike.org/groupforum/63#530 <p> Sorry, that was me again (from a different library!) </p> 2007-11-17T18:43:43+00:00 Comment posted by flr http://www.citeulike.org/groupforum/63#504 <p> It would also help when importing a set of papers from bibtex (at least where the DOI field is present). A couple of times I have exported a set of papers to set up a new project library, and no relationship to the original record is retained. </p> 2007-11-15T19:30:20+00:00 Comment posted by Flit http://www.citeulike.org/groupforum/63#405 <p> The second point is the one I meant to address with the doi suggestion. I can sometimes add a doi to a reference that I can't find on a supported site, or that I have imported from a bibtex library. Citeulike doesn't at the moment associate this with other instances of that doi. It might even encourage doi use if it did! </p> 2007-11-02T21:30:16+00:00 Comment posted by cjhall http://www.citeulike.org/groupforum/63#400 <p> Thanks for your message. </p> <br/> <p> We try very hard to only store one 'master' record for an article. </p> <br/> <p> Here's an example: I searched <a rel="nofollow" href="http://www.ingentaconnect.com">Ingenta Connect</a> for the term <i>cancer</i>. I selected the first article in the returned list - title is: <i>Methodological issues in online data collection</i> and posted it to CiteULike. Note that the URL of the article is something horrible, in my case this <a rel="nofollow" href="http://www.ingentaconnect.com/search/article?title=cancer&amp;title_type=tka&amp;year_from=1998&amp;year_to=2007&amp;database=1&amp;pageSize=20&amp;index=18">useless and non-permanent URL</a> </p> <br/> <p> When you've posted it to CiteULike, you'll see, when you go to that individual article in your library that there are two links to the online article: </p> <ul> <li><a rel="nofollow" href="http://www.ingentaconnect.com/content/bsc/jan/2007/00000060/00000005/art00010">Ingenta Connect</a></li> <li><a rel="nofollow" href="http://dx.doi.org/10.1111/j.1365-2648.2007.04448.x">DOI</a></li> </ul> <p> Note that both of these links are canonical ('proper' and permanent) links, not subject to the vagaries of links which were the result of the search. They have both been derived from looking at the article whose link was posted. </p> <br/> <p> Now, if I go to <a rel="nofollow" href="http://www.blackwell-synergy.com">Blackwell Synergy</a> and find the same article (you can cheat and just use the DOI link on the article page if you like), you end up seeing <a rel="nofollow" href="http://www.blackwell-synergy.com/doi/abs/10.1111/j.1365-2648.2007.04448.x">this</a> page. </p> <br/> <p> If you now post that link to CiteULike, you will be taken straight to the article page for <b>the the same article</b> that you posted previously. Note that you will also see that the Online Article links have been augmented with the link direct to Blackwell Synergy: </p> <ul> <li><a rel="nofollow" href="http://www.ingentaconnect.com/content/bsc/jan/2007/00000060/00000005/art00010">Ingenta Connect</a></li> <li><a rel="nofollow" href="http://dx.doi.org/10.1111/j.1365-2648.2007.04448.x">DOI</a></li> <li><a rel="nofollow" href="http://www.blackwell-synergy.com/doi/abs/10.1111/j.1365-2648.2007.04448.x">Blackwell Synergy</a></li> </ul> <br/> <p> This works for articles which are posted via plugins for supported sites. Does this answer your first and third points? </p> <br/> <p> The second point, about manually posted articles, is more tricky -- we don't try to determine whether links point to the same content or not, and we don't try to match based on metadata. We intend to keep adding plugins for supported content. If you are reading a journal which isn't supported, please let us know and we'll try to write a plugin for it. Of course, you can always write your own plugins - details are <a rel="nofollow" href="http://svn.citeulike.org/svn/plugins/HOWTO.txt">here</a> </p> 2007-11-02T14:28:15+00:00 Comment posted by Flit http://www.citeulike.org/groupforum/63#391 <p> A shortcut to this might be to work with the doi. Admittedly coverage isn't universal, but it is a unique id associated with papers? </p> 2007-11-02T11:01:13+00:00