Connotea: Latest News and Blog

We are looking for people to fill in a survey for us.

Wednesday 24 June 2009

We are looking for students, undergrad, master’s or PhD students, who are working in the life sciences, to help us with a survey.

If you are based in the UK the survey is at http://www.insightexpress.com/s/Conn151043.

If you are based in the US the url of the survey is at http://www.insightexpress.com/s/Conn151021.

In fact anyone who is involved in any of the following subjects would be great: Agricultural sciences, Biological sciences, Psychology, "Clinical medicine or Health sciences.

We are trying to figure out what to build for the scientific community, and want to get some direction.

The survey is a little detailed, it could take up to about 15 mins to complete, so if you have any time to spare we would really appreciate it.

Connotea now supported by PLOS one

Friday 22 May 2009

Earlier this year PLOS one rolled out support for Connotea. Pete Binfield, the managing editor wrote to point out that they have a blog post
describing the features, which include supporting bookmarking of items as well as reporting on whether an article has been bookmarked in Connotea.

Its great to see publishers starting to adopt integration with tools such as Connotea and CiteULike. PLOS has really been at the forefront of introducing new technologies to the sphere of academic publishing. As more and more information on the real time reading habits of academics becomes available we will move more towards a situation in which it will be possible to get an accurate picture of the current state of science.

Of course that’s not to say that the instant view is the one that should be the most important one, given the requirement for science to measure itself against the harshest peer-reviewer, stern reality herself. It still has to be better than the coarse tools that are currently predominantly used to peer into the trends of scientific thought.

Connotea blog

Instructions for installing Connotea on OS X

Wednesday 20 May 2009

OK, so you have grabbed a copy of the source for Connotea and you want to install it on your nice shiny new mac book pro. How do you do it? I’ve just written up a preliminary set of instructions for installing Connotea on OS X. It’s not trivial, but it is possible. I had to reinstall the stack recently after a software update from Apple broke my machine :(.

I’ll leave these instructions out in the wild for a while, and any feedback would be great. I’ll roll them into the source code in a little while when I get a chance!

Connotea blog

Connotea public code now on GitHub.

Wednesday 20 May 2009

I recently place the public snapshot of Connotea onto GitHub. We continue to use darcs internally for managing the flow of code from dev to live machines, however as the codebase has become larger the speed of transfers across ssh has become a real pain under darcs. I had been playing around with making a snapshot of the public repo in Git and a request from some groups interested in the most recent version of the code prompted my to place the Git copy that I had been working on out into the open. You can now go over and have a look at the code at the connotea-public page on github.

Our old snapshots still live in sourceforge, however I’ve moved away from using sourceforge for a whole bunch of reasons.

At the same time that I started playing around with GitHub the main NPG codebase is moving over to mercurial, so a move to mercurial is not to be ruled out at some point in the future for Conntoea, but for the time being it’s going to remain with git. Thankfully there are lots of utilities that preserve change history and allow you to move from darcs to git and from git to mercurial so any such changes should be painless.

Connotea blog

Peter Sauber make a call for an Open Access Tracking Project (OATP)

Friday 17 April 2009

Peter Suber has made a call for people in the OA community to tag items on Connotea as a part of the Open Access Tracking Project. The aim of the project is to get people to converge on using a common tag for tracking open access news. The proposed tag is oa.new.

Great! I’m a fan of open data, open source and open knowledge and I’m thrilled that people can find Connotea useful in organising their information. He raises a number of points around using Connotea for this, and just wanted to quickly comment on some of them.




  • Connotea feeds only deliver the 10 most recently tagged items, and the project is already tagging more than 10 items per day.  Hence, use a feed reader which refreshes several times a day and stores past items until you’ve read or deleted them.  Bloglines stores the most recent 200 items, and Google Reader appears to store all past items until you’re ready to delete them.  There may be many other readers with this feature as well; I just haven’t had time to check.  Note that for now the email feed is stuck with the 10 item limitation.

By default we deliver the 10 most recent RSS items, however you can call a larger number by specifying the amount in the url like so: feed://www.connotea.org/rss/user/IanMulvany?num=100. I believe that in principle there is a limit of 1000 items that can be retrieved, but we would appreciate it if you restrict calls to a reasonable number.

One thing I’d like to call out. The links in our RSS feeds link to the bookmarked resources, and not to the Connotea page. We used to link to the Connotea page, but that didn’t seem right, so we changes this last year.

Something that is not well know, but might be of interest, the parsing engine can also display a library page in plain text using the following uri structure:

http://www.connotea.org/txt/user/IanMulvany

This can be handy if you want to quickly get data into a text document.




  • If two or more users tag the same item with the same tag (like oa.new), then the item will appear in the feed two or more times.  This doesn’t prevent the feed from becoming comprehensive, but it makes an already-large feed larger than necessary.  I welcome suggestions and work-arounds, including other tagging services that don’t create this problem.

I’m not sure I see the problem here. The RSS feeds provide a time ordered list of when items were tagged, the the newest items on top. Over time one would expect the same item to appear in a feed if it gets tagged again by a new person.

If you go to the item page in Connotea this is a different matter. We do display one instance per person who bookmarked an item. There are two reasons for this. The first is that you see which tags different use has used for the same resource. The second reason is just one of code efficiency, it was just easier to reuse the template for a single bookmark in this way. It may indeed not be optimal, and if people are having a really hard time with this I’m willing to consider a change.




  • Connotea users already use at least four different tags for OA-related sites:  open access, open_access, open-access, and openaccess.  The variant forms make it hard for users to find all the relevant feeds; they also prevent any single feed from taking full advantage of the collective tagging effort.  More to the point for this project, they are not limited to new developments and are often used to tag older developments.  Fortunately, OATP is fully compatible with existing tags.  I’m not asking anyone to stop using existing tags, but merely to start using oa.new for developments that are new within the last six months.

If Connotea users would like to automatically add the oa.new tag to items that they have already tagged with some other related tag then they can do this with the “Rename tag” feature. It’s a bit more powerful that just renaming. You can also use it to split a tag into multiple new tags. I’ve just renamed my “openaccess” tag to “openaccess” and “oa.new”, effectively rolling in the new tag to my previously tagged items, without losing the tag I had been using. We have discussed tag relations, and that is something I would like to introduce at some point in the future, but other commitments mean that we are unable to introduce this feature right now.

New Hardware up and running,

Monday 22 December 2008

Well, last night we switched over to the new hardware. There are still one or two little niggles to iron out, but I hope this will bring considerably more stability to the service for the next few months.

Have a great christmas everyone!.

Connotea downtime Sunday 21st of December, 21.00 GMT, 16.00 EST, new hardware.

Friday 19 December 2008

Well, after installing Connotea on our new hardware and testing it this week, we are going to transition the main site over to the new machines on Sunday.

We will be putting the service into read only mode for a while, and then the site will be down for a few moments as we switch over.

If the transition goes smoothly we will be back up with a much improved performance.

I know we have been having major problems over the last two months, but the new hardware has significantly more memory and so the query bottlenecks that we have been experiencing should ease off. It may take us a little while in the new year to get the most optimal server settings, but the new horsepower should help from the get go.

Conntoea Blog

Request for participation in research survey

Friday 19 December 2008

Markus Heckner from the university of Regensburg contacted me looking for volunteers to participate in a survey on the use of social bookmarking systems. They are looking for people who have at least 20 papers tagged in their Connotea library, if you are interested in helping out then head on over to http://pc51010.uni-r.de/limesurvey/index.php?sid=63888&lang=en. The full call for participation is below.

Easier import of bookmarks and tags.

Monday 10 November 2008

We rolled out a feature a few weeks ago, plain text import of bookmarks. When you go to the upload page ‘Plain Text (one URL and tags per line)’ is now a supported file type. It also works if you place a DOI or a PMID there instead of a URI. Tags, as usual, should be comma separated and multi-word tags delimited with `"`’s.

More feedback wanted, where do we go now?

Tuesday 19 August 2008

Connotea has been around for a few years now, and to be honest with you, not a huge amount has changed with it in that time. Our usage increases and more citations are being bookmarked in the site every day, but we have been putting on our thinking caps and trying to brain storm where we could take the site to make it more useful to scientists.

We have are developing a shortlist of suggestions for features that we hope to develop, but I’d like to hear any thoughts that any of you might have.

- Ian

We got a mention in The Guardian, Woot!

Tuesday 19 August 2008

With the release of the A Level results in the UK (almost like the SAT’s) last week, the Guardian paper ran a series of ‘get ready for university’ stories last week.

One of the stories The Art of Being Virtual talks about digital tools that come in handy for students and Connotea makes it as a recommendation!

List of tools and tips for librarians managing user generated content

Monday 2 June 2008

Fiona King just sent this list of tips for librarians dealing with user generated content. Connotea gets a mention, yay!

Connotea Blog

Update to remote.js script

Friday 16 May 2008

I noticed the other day that the little piece of javascript that we generated to allow you to show your Connotea bookmarks was showing some of my private bookmarks. Not good :(. It turns out this is a rather interesting issue:

Martin says:

This is a curiosity caused by the user’s own login cookie from
connotea.org still being active and being passed to connotea.org during
the loading of the web page when it requests the Javascript file that
contains the posts.

In other words, you see your private posts, but others visiting the
same web page do not.

Well, that’s not really obvious, so we decided to change the behavior of the widget. Updated info here.
The bottom line is that the url to call is now http://www.connotea.org/pub/jsw/user/username which will ignore the cookie, and if you want to see your own private bookmarks you can use
http://www.connotea.org/auth/jsw/user/username for HTTP authentication to see your private bookmarks.

http://www.connotea.org/blog

ECML PKDD Discovery Challenge 2008

Monday 12 May 2008

The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) is being held this year in Antwerp in September and a part of this conference is the
Discovery Challenge. This should be of interest to social bookmarking enthusiasts, as the challenge this year is being held by one of the other services that do very much the same kind of thing as Connotea, Bibsonomy. They are interested in looking at spam Detection in Social Bookmarking Systems and Tag Recommendation in Social Bookmark Systems. Connotea already has a rudimentary implementation of related tags. My own feeling is that recommendations need to go further (a nice review of recommendations is given in this MIT tech review) and we need to produce article recommendations, but nonetheless, I’ll be keeping an eye on the results of this challenge, and if anyone out there is interested in applying their methods on a complimentary data set then we can arrange to get access to the Connotea data.

Connotea Blog

Social Software for Libraries.

Wednesday 7 May 2008

Via the supernumerarypa blog I just found a book called

Social Software for Libraries by Meredith Farkas.

From the reviews it looks like a good nuts and bolts introduction to Web 2.0 tools that have a current place in Libraries. I believe Connotea is given a mention.

Of course being in a book format has advantages and disadvantages, and one of the people providing a review on Amazon sums it up nicely:

“If I had a criticism, it would only be “book versus web”, as the web is a river and a book is an island. Printing it ‘fixes’ it in time, and the highly dynamic web will outrun the content of this book in a few years, maybe sooner. Meantime, its succint, direct and practical nature recommend it as a map out of the bewildering tangle of what’s out there. Now is the time to buy it"

Connotea Blog

German Tutorial for using Connotea

Tuesday 6 May 2008

A student from the Danube University Krems has created a short Wiki page with a concise description of how to use Connotea in German. You can have a look at the page here. Many thanks!

Connotea Blog

Display your Connotea Bookmarks on your Site.

Friday 25 April 2008

We have developed a little piece of javascript so you can now show off your recent Connotea bookmarks on your site! You can check out how to do it here. Below is a screen shot, and you can see it live (but unstyled, cos I’m old skool like that) my own homepage

This is a bit experimental at the moment, so we might change things around over the next few weeks. As ever if you have any feedback let us know here or you can mail me at i.mluvany@nature.com

Connotea Blog

Improved import of RIS files to better handle PUBMED tag terms

Thursday 24 April 2008

Ben Good pointed out that Conntoea’s import mechanisim for RIS files generated from Endnote was magnling tags, and especially tag items like MESH terms.

We have now fixed this, but the fix is somewhat non-trivial.

The main reason for the problem is that we were usin the same tag compreshension code that is used in the “Add To Connotea” pop-up. This assumes that “mutliple words in quotes” are one tag, and lots, of, tag, separated, words, are individual tags. In addition, since you access tags in conntoea through the url we have to throw away forward slashes since a tag with a forward slash in it is going to confuse the url resover in Connotea.

Now when it comes to pub med records it looks like all of these rules are specifically chosen to break the way Pubmed records describe tags :/

Ben describes the problem very well:

In Pubmed records, and in the Endnote records, /’s are used to separate descriptors such as “Transcription Factors” from qualifiers such as “antagonists & inhibitors” and “metabolism”. For example, you might see a keyword listed as “Transcription Factors/antagonists & inhibitors/metabolism”. When imported, Connotea strips the slashes from the tag and thus adds the tag “Transcription Factorsantagonists & inhibitors metabolism” to the post.

So now we deal correctly with these tags, yay!

MeSH terms sometimes contain commas like “Models, Genetic”. When imported, these compound terms get split into multiple separate tags (Models and Genetic).

That’s because our comma separation parsing used to take precendence over our parsing of collecvie terms, but we have fixed that now.

In addition, it appears that quite a few people have managed to import the “Research Support” aspect of Pubmed Records as well. This is why you see more than a thousand bookmarks with the rather misleading tag “Non-U.S. Gov’t”, often also tagged with the seemingly contradictor “U.S. Gov’t”. (This happens when the research in the paper had both U.S. and non-U.S. funding).

We decided to leave this alone, as solving this problem requires understanding what the tags mean, and the context in which they appear. OK, so you can’t win every time. I guess we are just going to have to wait for the semantic web!

p.s. You will often see a ‘star’ appended to the beginning of tags imported in this manner such as “’star’Genes”. This indicates that the stared’d term is a major topic (as opposed to minor topic) in the manuscript according to MEDLINE indexing.

OK, so now we strip leading stars from tag names, so that ’’star’gene’ imported becomes the tag ‘gene’ and can connect to all of the other items that have been tagged with ‘gene’ by users.

In a way so far all of the above is pretty straigt forward, now things get a little itneresting,

Martin our developer points out the following behaviour:

“One of the annoying parts of import is that if the keywords are separated by newlines but a tag with commas was collapsed into two, it would likely merge with other tags on the first or second term and then be tedious for the user to pick out later no matter what the UI.

In the RIS importer I’ve added a heuristic test which allows splitting on commas, except where it sort of looks like newlines are being used to demarcate the tags.

Here are some examples:

(1)
KW – aaa, bbb, ccc, ddd, eee, fff, ggg, hhh,

iii, jjj, kkk, lll, mmm, nnn, ooo, ppp

(2)
KW – aaa

bbb ccc ddd Transcription, Genetic eee fff ggg hhh iii jjj kkk lll mmm nnn ooo ppp

(3)
KW – aaa, bbb

ccc, ddd eee, fff

(Describing this in this blog is a bit hard, you have to ignore the extra lines between text lines as the blog parser treats whitespace in this system in a funny way, but I hope you see what we mean)

So in (1), sixteen tags are evident. In (2), the same tags appear, and I’ve added a comma-containing tag to show how they appear in dang.txt from our friend Ben. Clearly in (2) the newline is supposed to be the separator, not the comma. However, if you eat commas as part of tag names then (1) will fail.

The heuristic I came up with is that if there are at least three lines and no line runs longer than 60 characters then it should be treated as newline-separated and include the commas in the tag names. Otherwise, separate on commas as well.

This makes (3) not do the right thing, so it’s up to you if you think this will help or hurt. (3) IMHO is not likely to be computer generated… a computer would either write one per KW line (avoiding all this), split on newlines, or fill up lines to ~80 chars split on commas and add in newlines to keep going. All of which work with my test."

And so that’s how we have left it for now.

Connotea Blog

Annual Reviews now supported

Wednesday 16 April 2008

We had a request recently from Annual reviews for Connotea to support their site.

Martin, our developer had a look at their site and came up with some options. He takes up the story:

“Although the DOI appears in the URL, and we could switch off to CrossRef
for citation data, we lose the authors and full publication date, so
that’s not as good.

They embed citation metadata using Dublin Core conventions inside HTML
meta tags, but here we’re missing journal, volume, and issue data.

I noticed that their site allows download in RIS, BibTeX, etc. format,
and in looking for a similar existing citation source module that does
such a download of a secondary RIS file, that Blackwell.pm was actually
a perfect module to do the work. Too perfect – it works with a domain
name change! So then I realized that these two domains, plus more, are
using software from Atypon (www.atypon.com) and that if we recognized
the pages served from this software, we could immediately support all
their current and future customers. Considering that their customer list
includes places like MIT Press that seemed like a good idea.

So I’ve produced Atypon.pm which is reasonably short and can replace
Blackwell.pm as well as supporting many of their other customers."

Atypon provide publishing software for the following (and so now we support the following too!:

Commercial publishers

  • Alexandrine Press
  • AOAC International
  • Australian Academic Press
  • Blackwell Publishing
  • eContent Management Pty Ltd.
  • Expert Opinion
  • FDI World Dental Press Ltd.
  • Future Drugs
  • Future Medicine
  • Guilford Publications, Inc.
  • IAHS Press
  • IFIS Publishing
  • Intellect Ltd.
  • Lawrence Erlbaum Associates, Inc.
  • Logos
  • Lynne Rienner Publishers
  • Mary Ann Liebert, Inc., publishers
  • Morgan & Claypool Publishers
  • Oldenbourg Wissenschaftsverlag, GmbH
  • Oxford Business Group
  • PNG Publications
  • S. Karger AG
  • Salem Press
  • Scientific Journal Publishers Ltd.
  • SCR Publishing Ltd.
  • Sheridan Press
  • Thomas Telford Publishing
  • Uitgeverij Boom
  • Vathek Publishing
  • VEETECH Ltd.
  • Walter de Gruyter

Not-for-profit / society publishers

  • AIS Educator Association
  • Aluka
  • American Academy of Periodontology
  • American Accounting Association
  • American Anthropological Association
  • American Association of Cereal Chemists
  • American Chemical Society
  • American Economic Association
  • American Marketing Association
  • American Phytopathological Society
  • American Society for Bone and Mineral Research
  • American Statistical Association
  • American Veterinary Medical Association
  • Annual Reviews
  • Association for Childhood Education International
  • BioOne
  • British Institute of Non-Destructive Testing
  • CFA Institute
  • Chartered Institute of Building
  • Commonwealth Forestry Association
  • Countertrade & Offset
  • CrossRef
  • Institute of Electrical and Electronics Engineers
  • Journal of Neurosurgery Publishing Group
  • JSTOR
  • Modern Language Association
  • Pharmacotherapy Publishing Inc.
  • Production and Operations Management Society
  • TASH

University presses

  • American School of Classical Studies in Athens
  • Cold Spring Harbor Laboratory Press
  • Edinburgh University Press
  • Govi Verlag
  • Indiana University Press
  • Johnson Graduate School, Cornell University
  • MIT Press
  • Monash University ePress
  • University of California Press
  • University of Chicago Press

Connotea Blog

Nice introduction to Connotea in the JLMA

Tuesday 8 April 2008

Melissa Rethlefsen has written a nice introduction to Conntoea in the JLMA.

If you are interested in press clippings about Connotea you can follow a list of them here.

Connotea Blog

Connotea hot 25

Thursday 6 March 2008

Mitch Andre Garcia has just built a page that shows the Top 25 bookmarks on Connotea from the past week. He is working on some other scripts on top of Connotea, so keep an ear to the ground.

Connotea Blog

Connotea is now OpenID enabled.

Friday 22 February 2008

We have added support for OpenID on Connotea.

If you don’t know what OpenID is then head over to the OpenID page to get an introduction. The short version is that it is a system for managing access to sites through a trusted ID provider.

Why are we doing this? We are hoping that the introduction of OpenID on Connotea will help you guy’s with managing your online personas, and in addition we are talking with some other groups about using it as a way of creating bridges between Connotea and some other services.

If you know what OpenID is already then just have a go and log in at http://www.connotea.org/openid. At the moment we are a relying party. This means that we don’t host or generate OpenID’s but if you have one you can use it to log in to Connotea.

Each Connotea account can have one OpenID associated with it. You can set this in the Advanced settings section of your account, now available in the Toolbox on the right hand side of the page.

If you don’t already have an account, one will be generated automatically for you when you log in with an OpenID.

http://www.connotea.org/blog

Short talk on Science and Web2.0

Wednesday 20 February 2008

I was in The Netherlands last week and gave a short presentation to some PhD students from Utrecht University on Science and Web2.0. I’ve uploaded the slides to slideshare.

Connotea Blog.

Using Connotea for CrystalEye data checking

Thursday 7 February 2008

This is another example of Connotea being used for open science. The group working on Crystaleye are encouraging people to report any problems with the data by bookmarking pages in Connotea and using the crystaleyeproblem tag. Jim Downings Blog has further details.

Getting just the tags from a group in Connotea.

Thursday 7 February 2008

Another user asked today if it was possible to extract the tags that were used by a group from an API call in Connotea. The answer is that yes it is.

A normal API call on a group will look like:
http://www.connotea.org/data/group/Human%20Mitochondrial%20Phylogeny

But if you want to get the tags used by the group directly then you can use the following call:

http://www.connotea.org/data/tags/group/Human%20Mitochondrial%20Phylogeny

Which will also tell you how many times each tag has been used, and by how many people!

Connotea Blog

Our archived news page is also available.