It was a while back that I caught the video on the PDB site which explained all the functionalities that its search interface has. Thanks to the screencast I became a much more efficient querier of the PDB, especially after they adopted the new ( now almost three years old) interface.
I strongly believe that screencasting can play a role in helping us all search better.
Since I work on crystallizing membrane proteins , I found the MPDB very useful and decided to screencast its features.
I sincerely hope that database creators, and users alike, take to this effective medium and screencast their tips and tricks for us all to benefit from.
Why are our bioinformatics workflows so complicated!
Last week to answer one question I had to resort to information from several sources . A lot of them contributed immense value to my “workflow” and were also either difficult to perform or very easy. For a start I have ranked them in terms of both Value ( 1 for no value to 10 for a lot of value) to ease of use ( 1 for very complicated to 10 for very easy)
# Assembling my sequences in DNAstar (Value 10 : Ease 7 )
# Compiling my sequences and pulling them into Jalview. Ran CLUSTALW web service on edited alignments and realized that all of my clones had basically two sequences for their CDRs . . Jalviews excellent web-service CLUSTALW interface allowed me to quickly edit the 32 sequences , align them interactively and realize they belonged to two types. This got me thinking that maybe the primers I used to clone my CDRs from my mouse kappa light chains were probably mis-priming ( Value 10 : Ease 9)
# Use pubmed to look at precedents i.e analyze all possible papers which had sequenced the mouse anitbody kappa light chain CDR region as I had attempted to do and derive the sequences of the primers they had used. It took forever to get the right keywords to query and I still have only three kappa light chain primer sequences. ANd they are all different! ( Value 10 : Ease 1 ),
# Use my primer sequences , compare them with the literature and figure out how I had misprimed and why my sequences were all either of two types ( Still in progress Value immense : Ease 1 i.e still difficult to do)
# Use pubmed / NCBI genome to understand the sequence space for mouse kappa light chains ( Value 10 , Ease 4 , )
# Use EBI to get the same sequence data ( Value 10 : Ease 8 )
This is still work in progress . But to summarize –
The pubmed steps were the most painful . Pubmed search has to improve!.
Jalview contributed the most value. For a free App its a must have in any bioinformatics toolkit!. DNAstar played its role ..but for its cost ( a few thousand dollars )! It sure gave a lot less value than Jalview
All of this begs the question! ..why are bioinformatics workflows so difficult! We are a long ways away from making these things easy to do for everyone!
I first caught this on Pierres blog.
NCBI it turns out can be queried along REST principles ( hence the RESTful in the title). Ever since learning about REST-based URLs , I always wished that many web APIs implemented the ideology in their design. I was excited to learn how easy and intuitive it becomes to query a database using REST principles.
Gone are queries that looked like
And here come queries that look like this
which look for genes that have homology to dystrophin.
Several of the web APIs like the one for connotea and del.icio.us are also implemented RESTfully, making them very easy to query. For eg to get all entries on connotea or del.icio.us with tag metagenomics you would query the URL
Or on del.icio.us the URL
I dont yet know how extensive the possibilities of such querying of the NCBI are, but it looks so much easier than understanding equery.
Ref: NCBI resource locator.
I have often blogged about my trials and tribulations with the NCBI database.This morning I was trying to locate all the kappa light chain genes from the NCBI database.
I tried the following search
Immunoglobulin kappa mouse in the Genome database subsection.
The results I got were a curious mix of microbe genomes ranging from Aspergillus Niger to Salmonella enterica. Maybe I left my search skills at home or my eyes are playing tricks on me.
Addendum: Eric Jane from Uniprot showed me how to do the same query on Uniprot beta. Uniprot really rocks. Not only could I do the query , but also downloaded the results in batch mode as fasta sequences and in the xml format.Thanks eric , I would definitely recommend uniprot beta to everyone. Isabelle phan from uniprot did post an excellent screencast detailing the features of uniprot beta at this link on Bioscreencast.com . Do check it out as well as Erics comments below.
Well I had talked about how Deepak went to SciFoo recently. It turns out that some of this years SciFoo alumni led by the indomitable Jean Claude Bradley (JCB or Horace Moody ) started the “metaverse” version of these sessions on the Nature Island on Second Life called Second Nature.
In keeping with the “non conference” format of the original, session themes at SciFoo Lives On are decided on by the attendants , in this case on the wiki that serves as its permanent home outside of Second Life. Yesterdays session was on the role of “Video in Science” and of course we were there with Deepak as Whitewizard Chemistry and myself as Vishwaroop Baroque.
As I awkwardly bumped into the attendees thanks to my terrible gaming skills , the whitewizard chemistry told the audience about bioscreencast.com. This was followed by a talk by JCB on “YouTube and the Sciences” and finally one from someone at the SciVee project.
This was my first time in Second Life. I entered as a skeptic, since I always thought second Life is just a toy for gaming geeks and uber nerds. But I must say the poster session was just like the real thing with some added benefits. Like in the real thing, the questions made the poster session come alive but this time you get a text transcript of all conversations that took place and an overall rich experience. Not to mention the fact that the poster lives on on the NPG island and does not end up in my lab storage area ( read trash can).
I came away convinced that activities like this have a great value in enriching the online scientific experience.
Bertalan, one of the attendees, live blogged the event. You can catch also read about the goings on at Deepaks bbgm blog and of course on the bioscreencast.com blog.
A full text transcript is available on Jean Claude Bradleys blog