I shake my head and suddenly realize a decade has almost passed since I first became a computational structural biologist. Feeling that I’ve earnt some kind of perspective, I hereby present to you a list of significant developments in computational structural biology of the naughties:
1. Rosetta.
There is no doubt in my mind that the greatest contribution to computational structural biology in this decade is the suite of programs known as Rosetta, written by David Baker and his lab.
Look at the accomplishments. Rosetta has consistently won the CASP competition for protein folding (see next item) in the new fold category. Nothing else comes close in deducing the structure of a protein from an arbitrary protein sequence. Rosetta was used to create a protein with the first artificial fold. Rosetta was used to design the first artificial enzymatic reaction in a protein.
Each of these accomplishments are staggering in themselves, but to find a computational core across all three, makes Rosetta the crowning jewel of computational structural biology. I’ve said it before and I’ll say it again, David Baker is on the fast-track to Stockholm.
2. Formally Organized Competitions
Every 2 years, a whole bunch of computational structural biology labs effectively shut down for business, and throw every man, woman and workstation together to attempt to crack a set of problems. This same set of problems is simultaneously being attempted in labs all around the world, as researchers race against a clock to predict the 3 dimensional atomic structure of protein sequences published at the CASP protein-folding competition website.
We often say that science is a competition but is is astonishing to me how the computational structural biology community has embraced formally organized competitions such as CASP. Here, we have pure naked competition, complete with a scoring system, judges, and rankings that determine winners and losers. It has all the drama that you’d expect from a reality TV show: recriminations, anger and tears. And it has taken the field of protein folding much farther than anyone would have imagined 10 years ago.
I still remember the late 90’s when the field of protein folding was an epistemological joke. Every odd month, some theorist would publish an article claiming that they had solved the protein folding “in principle” yet none of them actually predicted the structure of a sequence with no known structure. Of course, no one was actually stupid enough to make such a prediction in print. Others, more pessimistic, were even saying that it’s an unsolvable problem, like the theory of nuclear energy shells. Indeed, the results of the very first CASP showed just what a disaster that the protein folding field was, at the time. A lot of prominent theorists got egg on their face.
In hindsight, we can see that CASP worked along a very darwinian idea: that of defining a fitness landscape. In the field of genetic algorithms, you can use random mutations to find efficient algorithms, but only if you have a robust fitness function. The field of protein folding had been drifting along in some kind of crappy fitness valley and it wasn’t until CASP came along, that we could even define what protein folding was, in a concrete definitive way. In terms of protein folding, the targets of CASP could be used to define a good fitness function, which was enough to spur the field to scramble out of the valley and up a fitness peak. This approach has proven so successful that we have now embraced formal competitions for other problems of similar difficulty: CAPRI for protein-protein interaction, and SAMPL for ligand binding.
3. Distributed computation.
First there were Beowolf clusters on commodity linux boxes, and now we have cloud computing with Folding@Home and Amazon. Either way, it’s become standard practice for computational structural biologists to think beyond a single workstation, and fall in love with distributed computing.
As the hardware has gone parallel, so have our simulations. All the standard molecular-dynamic packages are locked in an arms race to squeeze the best performance out of parallel clusters. It is hoped one day that they will actually scale. As the capacity of our calculations has gone up, so has the quality of our simulations. It is virtually di rigeur these days to use explicit solvent in molecular-dynamics simulations.
Nevertheless, there is still a huge way to go before we can happily sample large protein systems to the point where we can merrily pluck out interesting but rare conformational fluctuations from an untainted molecular dynamics simulation. Until that time comes, be prepared to apply a whole cottage industry of thermodynamic techniques that trick the system into generating those precious rare conformational changes at a much faster simulation time.
These techniques break up into two groups. The first group of techniques, such as replica-exchange, are used to set up the simulations with weird temperature distributions so that large motions can be generated easily. The second group of techniques, such as thermodynamic averaging and master equations, tries to normalize these weird thermodynamic distributions back down to well-behaved free-energy equilibrium distributions, with which one can compare to experiments. Either way, it’s wise to brush up on your math before you wade into these extremely mathemagical waters.
4. Elastic Network Models
One running theme in computational structural biology is the search for simple models of proteins. These range from single-bead models to Gō models, to 2 dimensional lattice models. Ultimately, the litmus test of simple models is that they provide useful informational for researchers in other areas of computational structural biology.
Over the last decade, I’ve found that the only simple model that provides concrete information for researchers outside the field of simple models is the Elastic Network Model. It is the model where, given a protein structure, you replace all atomic contacts and covalent models with a simple harmonic spring. You will then obtain a network of springs. From this network, it is trivial to extract the dominant vibrational mode.
The motions deduced from these vibrational modes turn out to be exceeedingly useful, and these has been shown to reproduce the motions from much longer molecular-dynamics simulations. Elastic Network Models have been used to deduce virus capsid maturation and ribosome motions, systems that are way to large to study in any other way. The speed of the calculation and the fidelity of these results makes this an essential tool for computational structural biologists.
5. Fragment binding to proteins
Ligand binding is the great computational problem of the pharmaceutical industry. We already have enough solved structures to provide plentiful targets for drug activity, but without a way to calculate exactly how ligands bind, this doesn’t get us very far. Given the commercial nirvana lying on the other side of solving the ligand binding problem, there has been a ton of work in improving ligand-binding prediction systems. Still, the improvements in the field have proceeded at a glacial place, where the only novel ideas seems to be to get faster computers and to add more expensive dynamics to the simulations.
However, I’ve found one approach that seems to be genuinely different. This is the idea of using chemical fragments. Instead of studying how a ligand binds, you break up the ligand into its constituent chemical fragments, which are conformationally exceedingly simple. Then you study how the fragments bind to a protein. By studying the binding pattern of fragments, you can back-track to work out what ligand could have bound to it. Although this approach hasn’t yet lead to a radical overhaul of the ligand-binding problem, I am tipping this as a method to watch.
6. NMR measurements of residue dynamics
One of the reasons we do molecular-dynamics simulations is that these simulations provides the only way of seeing how proteins move on the atomic-detailed level.
Nevertheless it is a massive leap of faith to trust only in the simulations. Over the last decade, although NMR has fallen out of favor for solving protein structures, NMR methods have proven their worth in providing detailed information about the motions of proteins. These include the S order parameters for nanosecond motions, chemical shifts to detect secondary structure, and exchange parameters to measure microsecond transitions in chemical shifts.
Since careful NMR experiments can resolve signals to individual residues, these mesurements can give exceptionally precise dynamic measurements to proteins in solution. Recently, NMR Residual Dipolar Couplings were used to measure microsecond motions for every single residue in ubiquitin. We now have a very rich set of measurements that capture dynamic motions of proteins. These provide virtually the only checks on the wild world of long-time moleuclar-dynamics simulations.
7. The inability to go beyond partial charges.
In molecular-dynamics, we’ve been promised more sophisticated force-fields for a very long time now. Whilst there is no doubt that things such as dipoles and quadropoles can be programmed, it turns that there is a very steep speed tradeoff. It may be time to give up on the dream and accept that maybe, single partial charges may be sitting at some kind of sweet spot of simulation realism versus efficiency.
8. Conformational changes of single molecules
Experiment has raced far ahead of simulation in recent years. In particular, two techniques have been developed that directly measures conformational changes of individual protein molecules.
Flouresence Reseonance Energy Transfer (FRET) can measure the distance between chromophore pairs simply by the amount of fluoresence emitted by the chromophores. If you can attach the chromophore to two residues in a protein, you can measure their distance distributions in solution.
In Atomic Force Microscopy (AFM), sophisticated servo-motors can gently tease apart singles protein using exqusitely tiny forces. Incredibly detailed force-extension profiles of individual proteins can be measured.
These measurements have provided incredibly rich insights especially to systems where these large conformational changes can be linked to biologically-relevant processes.
9. Reconstruction of long dead ancestral proteins
For me, the most imaginative technique developed in the last decade is the black art of reviving ancestral proteins.
If we look at the spectrum of living things before us, we see only the furtherest leaves in the tree of life. However bioinformatics has given us techniques to extract the historical traces of evolution from our knowledge of protein sequences. By comparing homologous proteins across species, we can use sequence alignment comparisons to construct phylogeny trees between the species.
The principal insight of groups who study ancestral proteins is that, with our sophisticated sequence alignment programs, not only can we construct phylogeny trees, but we can construct the actual sequence of the last common ancestors at each fork of the tree.
These ancestral proteins can be cloned and crystallized. These groups have focused on proteins that bind to different ligands in different species. By carefully studying the changes in binding and in the structure over the phylogeny tree, we can figure out how nature has sculpted the structure of a protein to accomodate the speciation of different organisms over millions of years of evolution.
It’s sad to say this but most people I work with are just not that into science. It’s not that they’re not good at it – they clearly are – it’s just that they’re interested only in their little niche, and are thoroughly satisfied with all that it entails.
Case in point: recently, I was at work looking up a biology textbook on basic paleontology, spurred on by the intriguing hypothesis that the Cambrian explosion was induced by the evolution of eyes. My buddy S wanders by and notices that I am reading a first-year biology textbook. Being a curious guy, he asks me why. That’s when, whilst explaining to S the whole theory about eyes and the Cambrian explosion, we were interreputed by a third guy J, who burst out with “Ha ha, that’s funny. You never hear words like Cambrian used on the other side of the lab.”
This struck me as profoundly weird. First, the word “Cambrian” is not a particularly unusual word, especially in a biology environment. An average 6th grader interested in science would have heard of it. Yet, in a lab that does fundamental biology research, it struck the ears of J as strange. Second, J’s reaction was mystifyingly inert. Although the word Cambrian had piqued J’s curiosity, he wasn’t interested in finding out what we were talking about. Instead, he pointed out how strange it was to hear words of science spoken in a laboratory of science.
I think this is emblemic of a certain attitude in science. Many researchers become masters of their own research field but have minimal interest in areas outside their own. For such scientists, it is the qualification of knowing that counts, not the knowing itself. We do research so that we can publish prestigious research so that we can be recognized as prestigious scientists. Anything other than this is a sign of amateurism. Perhaps it’s a way for socially awkward people who played too much dungeons-and-dragons in high-school to claw back some kind of respect in a hostile world. There are many people in science like this, and they are not infrequently assholes (although J is a pretty nice guy).
This attitude induces a type of scientific myopia. To whit, a couple of weeks later, I found J sitting in the lunch room staring blankly at a draft of his article. He was stuck in writing a synopsis that would explain to lay readers why his research was important. J thought this was a pointless exercise dreamed up by PR lackeys at the journal he was going to send it. But it’s not. Being able to explain why our research is important reflects an important synthetic ability to see how different bits of science fit together. This requires a vision beyond our own little niche. It seemed to me that J knew everything about his research topic except for why it’s important.
In contrast, there are those of us who are in science because we are irresistibly curious about how the world works. This curiousity is a habit of mind and it’s not something you can switch off. I had a really busy year this year (worked on 4 articles, wrote a job application, released a piece of software) yet I read a programming textbook (Programming Language Pragmatics), a neuroscience pop-sci book (The Brain that Changes Itself), a particle physics pop-sci (In search of the Ultimate Building Blocks) and a stat-mech textbook (Noise and Fluctuations). I feel stuck if I am not finding about some area of science that is outside my comfort zone. I think you can tell which scientists have this kind of curiosity simply by whether they are reading popular science books. These scientists will find time to read whether they have time or not. They have no choice even though they’re running 9 hour experiments on their supercali-fragilistic-fractionator.
The difference to the other type scientist is stark. One of the grad students in my lab told me that once, when she was talking about this great pop-sci book she was reading, another postdoc D interrupted her with a withering look, “so you read populular science books do you?” As if it we were a bad thing. Yet, I have a feeling that non-curious scientists are in the majority. Doing a totally unscientific straw poll, out of about 17 postdocs and grad students in my lab, I can think of about 7 who I’ve talked to at some point about some kind of awesome science outside our lab’s principal interests.
Recently, I’ve read all sorts of reports that the scientific body is in general, not interested in social media, even amongst younger scientists. All sorts of explanations have been proffered as to why there is such a low adoption rate but the one explanation that hasn’t been offered is that perhaps, there are not that many engaged scientists in the scientific community. Perhaps, most people working in science are just not that into science.
William Faulkner’s “The Sound and the Fury” is one of most reader-unfriendly book I’ve ever read. Written in 1929, it’s clear that Faulkner was enamoured with the new innovations in prose flowing out of Europe such as that diarrhea of words known as stream of consciousness.
But there’s good stream of consciousnessness and there’s bad stream of consciousness. Good stream of consciousness conjures up a heightened state of mind. By writing from the inner psyche of the protaganist, we get a better sense of the character through free associations and displacements of languge. Still, stream of consciousness should flow, ratcheting tension through a logic of comprehensible symbolism.
Bad stream of consciousness grates on the nerves like an open-mike session in a second-rate college town. And the opening section of “The Sound and the Fury” is as bad as anything I’ve read. Written from the point of view of a retard, the thoughts are jumbled up with such a profusion of indistinct pronouns that it’s hard to figure out who is talking about what. Characters are introduced willy-nilly. They appear and disappear like a bad itch, and since we’re travelling backwards and forwards in time at will, most characters don’t even have the privilege of having the simple soliditiy of existence over a paragraph.
It’s the kind of chapter where book critics suggests that you read it several times with colored markers so you mark out what time period each paragraph belongs to. Meaning that this is an incoherently written piece of writing, where it’s up to you to figure out how to make it coherent. It’s like stumbling on the scribbled notes of an unfinished novel. Given the effort needed to figure out what is happening, the final result is stunningly not worth the effort. The characters are not that interesting. The language not particularly rivetting, and the dialogue is barely serviceable.
The chapters do get more comprehensible as we go along. And I was almost ready to forgive Faulkner when I got to the final chapter. Here, Faulkner introduces a new character called Quentin, who is in fact the niece of a man called Quentin from the second chapter. Given that it was written in that spasmodic style where pronouns are thrown around like confetti, and characters are described with as much care as pigs eat slop, this took me a good hour to figure out that Quentin was not actually Quentin. There’s no real good reason for this, and it’s just a sign of the sloppiness in the writing. And of course, like any piece of literary schlock, the denouement with Quentin involved incest.
Ever since I brought home my large 22” monitor, I’ve been trying to read more PDF files on the computer instead of printing them out on paper first. But there is something lacking in reading from a computer screen. The thing I miss most is that I lose a sense of place in a document. With the document printed on pages of paper, I get a physical sense of where I am simply from the feel of the thickness of the pages and from the physical act of turning a page. I can scan quickly through the document and find certain sections much faster than reading on the computer.
In comparison, a typical PDF viewer only gives you a very poor sense of place in a document via a scrollbar. This does not provide you with any way to scan ahead:

However, many PDF viewers also provide a thumbnail drawer, which allows you to scan ahead several pages:
![]()
However, this particular arrangement ends up with a rather convoluted user interface where you get 3 inconsistent markers for your position in the document: 1) the original scrollbar with full-view document, 2) the highlighted page in the thumbnail drawer and 3) the scrollbar for the thumbnails. Also, the thumbnail view only shows one highlighted page whereas the full view often falls between two pages.
Dragging the markers on the scrollbars provides a rather confusing and fragmentary experience. I have no idea where the thumbnail scrollbar is, or where the highlighted thumbnail will pop up, after dragging the main page scrollbar. This makes it impossible to use these scrollbars to give a visual sense of your place in the document
Then I realized that you can solve this problem by combining the scrollbars with the thumbnails together to give you something like this mockup:

The idea is that we can use the thumbnail view itself as the scrollbar, which I call the thumbnail bar. With this arrangement, the part of the thumbnail displayed in the full page view becomes the marker of the scrollbar. This provides a clean and simple integration between the full page view and the thumbnails.
Nevertheless, you don’t want the thumbnails to be too small, so you won’t be able to display a lot of thumbnails. To compensate for this, I add a third bar, which I call the document bar, which provides an ultra-compressed view of all the thumbnails in the document. By using appropriate coloring or shading, we can color the part of the document shown in the thumbnail bar and the full view. This combined-thumbnail view can become the marker for the document bar.
Finally, moving around the document is simply a matter of dragging the smaller versions of the page view around in the thumbnail bar, or dragging the combined-thumbnail view in the document bar.
In both the document bar and the thumbnail bar, we keep the contiguous vertical arrangement of the individual pages to convey the spatial sense of page in a nice linear fashion. And if we can animate all three together when dragging, then we can viscerally relate the document at all three levels of detail – page, thumbnail and document.
I recently went to the San Francisco Public Library Book Fair. In an aircraft hangar of a warehouse, billions and billions of used books were arranged over miles and miles of old folding tables. Readers from all over the bay area (and some beyond I might imagine) pored over the dessicated remains of someone else’s poor reading habits. Some even pushed trolley carts to carry all of their booty.
Over the years I’ve gotten picky with my reading and I prefer to buy wanted books instead of reading what lays at hand. But still after an afternoon of triage at the book fair, I found some out-of-print gems.
The first was a ratty old Dover book complete with a sickly green cover titled “Aquatic Mammals: Their Adaptations to Life in the Water” by a former president of the Society of Mammals, A. Brazier Howell. It’s a pleasant surprise to read a book by a scientist who has clearly studied his material over a lifetime of idosyncratic cogitation. So many aspects are considered in this book, from swimming mechanics, to breathing changes. The writing is clean and soberly unlike today’s flimsy attention-grabbing scientific chaff that cry out with breathless titles such as “Secret Life of This” and “Secret Life of That”. This is a straight up biology book, albeit softened a bit for the lay reader.
But I share in the author’s delight over aquatic mammals. My favorite part of zoos is still the sea mammal sections. If you have ever watched a walrus swim right up to the pyrex glass and reverse with a slap of the tail onto the glass, you will know the heft and majesty of aquatic mammal life. I must admit, one of the reasons I am reading this is in light of the Aquatic Ape theory of the origin of humans from the monkey line.
The second is a collection of short stories by one of China’s foremost women writers, Zhang Jie in a collection titled “Love Must Not Be Forgotten”. These are strange stories, telling of thwarted loves and ruined lives. It charts how the irrepressible tides of love course even under the bureacratic juggernaut of communist China. These are frigid stories illuminated with little shards of emotion, and for me, an alienable yet recognizable landscape.
I got a taste for French music living in Belgium almost ten years ago. Before living in francophile Brussels, I had formed, in my adolescent mind, a world populated by languorous women with a detached air of cultured sensuality, whispering an unending stream of soft slurred vowels running over each other like waves on a soft beach. The sound of French does not posses the cut and thrust of English, where the profane and the sexual are cleaved into the puritan and the lustful. In French, every act is a sensual act.
But they never tell you how difficult it is to explore the musical landscape of another language. Indeed, I had forgotten my own experience as a unmanned adolescent trying to get to grips with the mysteries of the Top 40 and other such things. Figuring out the landscape of the music of someone else’s culture is just as hard. Although I could understand the finer distinctions between grunge, metal and indie rock, this did not help me negotiate the differences between nouvelle chanson, les rocks, eurovision, and star academy.
The classics are easy to find, colossal singers etched in good Parisian sandstone such as Edith Piaf and Jacques Brels. But I was more interested in French artists who are still alive. Singers who I may have a chance to see in person, to watch their bodies sway through the cigarette smoke as their voice purrs through cracked-out speakers in a dank hall in park slope. Of course, you say, I could just ask someone. But it’s not that easy. It’s hard enough finding people who like the same stuff as you in your own language. But in another language you have to get over the double barrier of meeting someone who can translate and who also shares your taste.
Over the years, I’ve managed to scrape together a small but durable list of great french singers, but I am always on the lookout for more. My prayers were finally answered when I recently stumbled onto a treasure trove of french music, a music blog called Filles Sourires. Their tagline “Fragile girls. Gainsbourgian guys. Singing in French. Making me sigh. Any questions?” That’s certainly an aesthetic I could buy into.
Most useful was that “Filles Sourrires” recently asked their regular contributions for their top 10 lists of French albums released in the last ten years. This list of 100 albums (1 2 3 4 5 6 7 8 9 10) provides a comprehensive survey of the terrain of modern french music. I am gratified to see that many of my favorites are here, but there are also many that I have not heard off. For these of you who are Franco-curious, I suggest that you check these out. Do not be surprised if you have a sudden urge to buy a beret and a book a one-way ticket to Paris.
The real bioinformatics journal has not been created yet. Wait, let me explain. What we currently have are traditional journals that deal with articles on bioinformatics. The journal themselves, though they may provide digital PDF files, are still stuck in the paper paradigm. The journal itself is not bioinformatic.
What crack am I smoking? In a recent article on the future of science publishing, Michaell Nielsen speculates that the crunch in scientific publishing will not simply lead to an age of open-access science publishing, but in all likelihood, will lead to new forms of publishing centered around services.
Well there’s one service that I wish bioinformatics journals provide. How about actually providing what you promise? Here’s the situation. You find an 6 year old article in “BMC Bioinformatics” about some clever little algorithm. Tucked away at the end of the abstract is a link http://some.uni.edu/old_group/people/~postdocx/project69/index.html. You click on the link and several things may happen: 1) if you’re lucky, yet get a butt-ugly website with a working program or 2) you get a program but it doesn’t work with your configuration, or in all likelihood, 3) the website is not found anymore. Without the program, the article is next to useless.
The thing is, the majority of bioinformatics papers are either description of programs, or description of data-sets. Unlike traditional experimental or theory papers, code and data are things that grow with time. Programs can be continually improved – they gather bugfixes, and may even be rewritten with better algorithms. Datasets are often regenerated with the latest updates of standard databases, where many genomic analyses are often expanded to include more organisms.
You might think that dealing with the changes is going to require a lot of work. But the maintainence of a changing program is actually a solved problem. Most people who are serious about software use Github, or Google Code or any number of software repositories that gracefully handle mutating software. These websites provide excellent integration services for downloading the program, registering and tracking bugs, discussion pages, with a solid admin interface, and allows you to look at the history of the program. They make it easy to do all this stuff. Moreover, these repositories provide a permanent location for the program with a clean url.
Nevertheless, the way that academia works is that if you have written a neat little program that solves a bioinformatic problem, you must get it peer-reviewed and published in an academic journal in order to be recognized for official academic business. Thus your program will live on two different websites, where the journal website links to your program website. Of course, one could include the download as a supplementary file on the journal website, but then it’s only a single file download without any kind of proper software infrastructure.
So here’s my idea, why don’t we setup up a specialized bioinformatic journal that is tightly integrated to a software repositry? Let’s call it Biohub.org or something like that. Users are first encouraged to set up projects in biohub.org like they already do in Google Code or Github, as a software repository. There is nothing particularly cutting edge about that, as there is plenty of existing software to facilitate the construction of a site like that.
Then when a project reaches maturation, an article is written and sent to the editors who run the website. If the article passes peer-review then the project will be registered on the front page of Biohub.org as a peer-reviewed project, and the link will be directly link to the project page on the very same website. A new tab will pop up with the .html of the article, formatted in a way that is consistent with the rest of the software. The article and the software project will be one and the same.
The editors will be responsible for making a printable PDF that goes with the article, and these can be linked to a journal page of the website, which can be made to look like any other academic journal website. You can even slap an ISBN number and register these articles with the relevant scientific literature databases. More importantly, the journal runs the software repository so that the software will be there as long as the journal exists.
This actually makes sense for datasets as well. Datasets change overtime just like programs, and if stored in a software repositry, it provides an easy way to look at older versions of the database, as well as a provide a place for others to submit useful scripts, and start discussions (especially since most repositories provide wiki and forum services for every project). Furthermore, if the repositry as run as an open-source code repositry, then it makes it quite easy for collaborators to be added to projects.
If we get into the habit of setting up bioinformatic software and data in these centralized hubs, we can stitch a truly bioinformatic journal. No more will the work of some postdoc die on some long forgotten server unplugged in the back of someone’s lab. Our collective scientific output will live in an organic fusion of prose, code and data.
Update: great discussion on friendfeed
Finally ported my Molecular Dynamics perturbation method RIP from AMBER to GROMACS. I find it kind of eerie watching the tyrosine sweep out a hole in the water:
Anyone who does simulations of proteins should post this quote up on their wall:
… if you really want to understand the detailed molecular interactions that make it go in a particular direction, make certain contacts, break other contacts, hydrolyze GTP, you know, form bonds, etcetera, and do it all amazingly accurately, then you do need a high resolution picture of those states. But, that’s not going to be enough. It’s going to take a lot of work by biochemists, by computational people who do molecular dynamics and things like that to really, eventually, understand it in the sense that we would understand, say, a more typical reaction.
~ Venkatraman Ramakrishnan, in his Nobel prize phone interview
I went to my first large conference in several years, a few months ago. It was the Protein Society Symposium held in Boston. It’s very important that a conference is held in a decent city like Boston. That means you can escape from the conference hall, have some nice food, and merge with normal people between sessions. I went three times to this amazing French bistro several blocks from the conference center at the Mariott hotel. My tongue was every bit as stimulated as my brain.
Just to state the bleeding obvious, conferences are places where you can learn about what’s happening in the field in a fun and efficient way. It points out the fact that publishing a paper is only the beginning of a piece of work. The long process of selling your work to other researchers has now begun. And I do mean selling, as it’s vitally important that you be an entertaining speaker if you want your work to get known. Terrible speakers I will not give a shit.
Membrane proteins are finally getting the love they deserve
Of the talks I enjoyed this year, membrane protein studies really stood out. Paula Booth (Bristol University) is trying to experimentally capture the folding intermediates of membrane proteins. She is trying to denature folded membrane proteins meaning that she has to really fuck up the membrane, which is needed to keep the membrane proteins intact. The results are kind of preliminary but this is really heroic stuff. The membrane proteins are trapped in weird bicelles making the data extremely hard to interpret.
Olga Boudker (Weil Medical College) described a particularly dramatic conformational changes in a glutamate transporter. It’s a trimeric gate that involves a huge coordinated swinging motion of all monomers. She used crosslinkers to trap the glutamate transporter in the other state, and found some exquisite symmetries in the arrangement of alpha-helices between the swinging conformations.
One of the questions that have kept me up at night (I exaggerate) is how membrane proteins actually fold inside the cell. Gunner von Heijne (Stockholm) has been studying how freshly-translated nascent membrane-protein chains gets inserted into the membrane through the Sec61 transmembrane protein. He has studied the insertion of proteins by sticking tags into the sequence of the protein. If a particular part of the protein does not get pushed into the membrane the tag will end up inside the rough microsome organelle where the tags will get glycosolated. With this assay, Gunner has identified the membrane insertion affinities of a whole bunch of peptide sequences, providing a testable set of membrane insertion affinities.
Lighting up single molecules
I saw two talks that demonstrated the awesome power of FRET. Basically FRET can be used to detect changes of single molecules on a very fast time-scale. William Eaton (NIH) in an award talk (kind of like an artist’s retrospective) gave a masterful discussion of using FRET to measure the folding of a single molecule. He showed that once a molecule has decided to fold (or in technical terms, passed the folding transition state), it only takes 200 nanoseconds to fold. However, it may take 2 whole seconds for the entire ensemble to fold.
Dorothy Erie (University of North Carolina) used FRET to study the states of a DNA repair protein. By placing FRET tags on the DNA, she can detect bending angles of single DNA molecules. FRET signals are used to detect transitions of the protein. From careful studies in a whole bunch of different conditions, she built an free energy landscape of the DNA repair protein, complete with allowed and forbidden transitions.
More tools
It’s nice to know that chemical shifts have matured into a useful powerful tool. Paul Robustelli (Vendruscolo group, Cambridge) showed that MD simulations augmented by chemical shift constraints improve dramatically to sub-Angstrom RMSD ensembles. Much better than the 2 or 3 angstroms that we’re used to.
One of the big frontiers of protein studies is unstructured proteins, making up a significant portion of 40% of our proteins. Peter Wright (Scripps) has developed a whole battery of methods for understanding how unstructured loops actually bind to proteins in complexes using coupled folding/binding mechanisms.
Bertrand Garcia-Moreno (Johns Hopkins) showed how to measure the pKa of individual residues in the freaking interior of the protein.
Protein Engineering
I saw two great protein engineering talks. Stewart Loh (SUNY) added a 3rd EF hand to Calmodulin. The new protein now has two alternate conformations where the middle EF hand can bind with either of the other two EF hands, but not both. By playing around with allosteric sites, he engineered a completely switchable protein.
Donald Hilvert (Zurich) took a functional dimer with swapped domains, and engineered a monomer version. Surprisingly the monomer was not completely folded but was still fully functional.
Theory geeks
Finally it would be remiss of me not to mention theory talks and posters. Wilfred Van Gunsternen (ETH, Zurich) gave a wagging-finger overview of sampling problems in Molecular Dynamics. In particular, he described one particular memory method where, if you know a priori that a particular measurable has multiple values (e.g. the J-coupling), then if your simulations gets stuck in one value, then you simply add a memory potential to push the system out of that value and check out the other values. At the end you magically subtract out the memory potentials. This way you can sample the multiple values even if you have big potential well between them.
Wenxun Gan (Roux lab, University of Chicago) described the “string” method for targeted molecular dynamics. It’s a clever iterative method for generating a path of least resistance between two different conformations of a protein. Breaking up the paths into a bunch of points, swarms of short trajectories are generated at each point, and the median of that swarm is used to improve the positioning of the point of phase space.
Dana Vuzman (Levy lab, Weizmann) described a beautiful coarse grain simulation of homeodomain transcription factors sliding along the DNA, trying to find the binding site. At this level of description, you can carry out very long langevin dynamics simulations that simulate the diffusion of the protein along the DNA, identifying different modes of diffusion (sliding along the DNA, hopping, or diffusing on/off the DNA). But more important she found that the homeodomain proteins can actually hop between separated strands of DNA by a “monkey bar” mechanism with bridged itermediates, which surprisingly, can improve the search efficiency of the transcription factor.
My loose tips for conferencing
- stay at a nice hotel away from the conference, (but not too far away). In Boston, I stayed at the lovely Omni Parker House
- go to networking dinners, they are actually great for meeting people, and meeting people who want to be met.
- otherwise, don’t try to meet people randomly. You just end up talking to crazy people. Just be the lone wolf
- get away from the conference as often as you can
- do not waste your time and money on the conference dinner. The food will be mediocre, and the talks will be poor
- skip the mixer unless you know a bunch of people
- bring drinks in your bag
- wade in and out of sessions. Only go to topics that are reasonably close to your research area. I once tried to listen stuff that I didn’t really understand. But, unless the speaker is bill-clinton brilliant, it’s worse than listening to community radio
- smart-phones are great! I used it to google the speaker and terms I didn’t understand, check email, and read books during the boring talks
- if you get a chance to talk to a big swinging dick, know that you have maybe 5 minutes, and that is all. They will promise to meet you later, but it will never happen. Get contact details and an idea for a followup, there and then.
- if you can’t read a poster in a minute, skip it.
- stay at your friggin’ poster, unless no one is checking it out. the two hours in your poster is your only time to sell your work and get people interested in reading your paper(s)
- start collaborations at your poster, that’s really what you’re there for
