tag:blogger.com,1999:blog-116731752008-07-13T22:45:35.693-04:00On GeneticsStevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comBlogger25125tag:blogger.com,1999:blog-11673175.post-9889541785668011562008-01-20T17:28:00.001-05:002008-01-31T22:12:24.773-05:00Plant genomes, animal genomes, more and more genomes!I recently returned from the <a href="http://www.intl-pag.org/">Plant and Animal Genome Conference</a> (XVI, Jan. 12-16, in San Diego). This conference is much more applied than what I'm used to, but I came because it seemed a good place to see comparative genomics in full bloom, and that turned out to be true. I was struck by the extent to which the meeting was a showcase for vendors (Agilent, Sequenom, BioTrove, Illumina, Roche (now incorporating 454 and Nimblegen), Affymetrix, Applied Biosystems, Keygene, etc.), many of whom literally wined and dined conferees at their workshops.<p></p><p>However, I was also struck by the extent to which new high-throughput sequencing technologies are already in widespread use. Ronan O'Malley (<a href="http://pbio.salk.edu/pbioe/">Ecker lab</a>) described the sequencing of Cvi, a strain of Arabidopsis distinct from the Columbia accession already determined; in the process he compared 454 and Solexa sequencing. Steve Jacobson (<a href="http://www.mcdb.ucla.edu/Research/Jacobsen/LabWebSite/index.html">UCLA</a>) described the repeated re-sequencing of (bisulfite-modified) Columbia for the purpose of studying cytosine methylation. Several more plant genomes are in in the pipeline, and a sense of the pace is conveyed by the fact that plenary speaker Eddy Rubin (<a href="http://www.jgi.doe.gov/whoweare/rubin.html">JGI</a>) "announced" the completion of the <a href="http://www.phytozome.net/soybean">soybean genome</a> almost in passing.</p><p>Other plenary talks were uniformly excellent. I missed the initial talk, by <a href="http://www.finisterepartners.com/abo_ourteambios.aspx">Jerry Caulder</a>, which was apparently quite controversial. David Baulcombe referred to it by saying that the European perspective on genetically modified foods is different and that "by shying away from the hazards we don't gain credibility." Another notable aside was Michael Ashburner's statement that "there is no point in funding biomedical research unless you also fund informatics."<br /><br /><br /></p>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-29386408117900197122007-11-25T19:45:00.000-05:002007-11-25T20:11:01.953-05:00This week, it's ancestryLast weekend there was a lot of buzz about personal genomics (see Genome Technology Daily, "<a href="http://www.genome-technology.com/issues/blog/general/143452-1.html">It was a Helluva Weekend for Personal Genomics</a>"; or Eye on DNA, "<a href="http://www.eyeondna.com/2007/11/20/dna-network-members-discuss-personal-genomics-service-providers-23andme-decodeme-and-navigenics/">DNA Network Members Discuss Personal Genomics Service Providers 23andMe, deCODEme, and Navigenics</a>"; or my previous post). This weekend, it's ancestry. Today's papers had two interesting features on ancestry testing, both of which nicely echoed my own post about caution regarding ancestry testing ("<a href="http://ongenes.blogspot.com/2007/11/what-can-i-learn-about-my-ancestors.html">On Genes</a>"). First, the New York Times business section ("<a href="http://www.nytimes.com/2007/11/25/business/25dna.html">DNA Tests Find Branches but Few Roots</a>") discusses the business of ancestry testing. The article is nice in that it compares the cost of ancestry testing by various companies, shows that results differ, and quotes Henry Louis Gates Jr. making reasoned assessments of the role that DNA testing can play. Second, the Washington Post reviews "The Genetic Strand: Exploring a Family History Through DNA" by Edward Ball("<a href="http://www.washingtonpost.com/wp-dyn/content/article/2007/11/21/AR2007112101948.html">Blue Blood, Black Genes</a>").<br /><br />The theme is clear. You can only learn so much about your ancestors from DNA.Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-55977534411835586392007-11-17T09:56:00.001-05:002007-11-17T11:06:59.608-05:00Ready or not, personalized genetics is here.Yesterday's announcement by deCODE genetics that they would be launching a personalized genetics service, <a href="http://www.decodeme.com/">deCODEme</a> (<a href="http://www.decode.com/News/2007_11_16.php">news release</a>), means that a major player in gene discovery has just joined the growing field of companies offering personalized genetic services. As I wrote in my Nature Network blog, "On Genes" in "<a href="http://network.nature.com/blogs/user/smount/2007/10/24/the-scientist-blogger-and-the-personal-genome">The Scientist Blogger and the Personal Genome</a>," information about susceptibility to disease, potential for health or accomplishment and responsiveness to therapies is found in our genes, and it is going to be made available to people who want it. A lot of people are going to want it. Most are not going to be prepared to understand it. Even Jim Watson and J. Craig Venter aren't entirely sure what to make of their genomes. <a href="http://www.nature.com/naturejobs/2007/071018/full/nj7164-940a.html">Genetic counseling</a> may morph into a profession that serves everyone, not just those who faced with clear cases of genetic disease.<br /><br />Journalists and scientists also have a role to play. Let me highlight three useful responses.<br /><br />The New York Times has an excellent series called <span style="font-weight: bold;">"</span><a style="font-weight: bold;" href="http://topics.nytimes.com/top/news/national/series/dnaage/index.html">The DNA age</a><span style="font-weight: bold;">."</span> These articles (all by Amy Harmon, at least so far), "explore the impact of new genetic technology on American life." One published today, "<a href="http://www.nytimes.com/2007/11/17/us/17dna.html">My Genome, Myself: Seeking Clues in DNA</a>" describes her use of the 23andMe service.<br /><br />Bertalan Meskó, a blogger at "ScienceRoll," presents coverage of <a style="font-weight: bold;" href="http://scienceroll.com/personalized-medicine/">Personalized Medicine</a>, including a summary of breaking news (<a href="http://scienceroll.com/2007/11/17/personalized-genetics-still-rising/">today</a>) and a <a href="http://scienceroll.com/2007/11/10/navigenics-23andme-and-helix-health-the-review/">review</a> of services offered by Navigenics, 23andMe and Helix Health (last week, before the deCODE announcement).<br /><br />I have started "<span style="font-weight: bold;">Information on Genes</span>," (<a style="font-weight: bold;" href="http://ongenes.org/">ongenes</a><span style="font-weight: bold;">)</span>, a web site that is intended to be a place where answers to questions on genes, genetics and genomics are provided by experts in the field. Questions will be posted anonymously but answers will not. I plan to solicit answers from people in the know. My hope is that ongenes will provide useful information to anyone trying to understand genetic tests, including professionals in the field.Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-13469368116448700902007-09-06T12:57:00.000-04:002007-09-06T13:08:04.937-04:00Simons Foundation funds research on sporadic autism mutationsBecause I've dealt with the issue of sporadic autism linked to paternal age before (<a href="http://newsongenetics.blogspot.com/search/label/paternal%20age">links</a>) it seems worthwhile noting here that the <span class="bodycopy">Rutgers University Cell and DNA Repository will use a $7.8 million grant from the <a href="http://www.simonsfoundation.org/page.php?id=96">Simons Foundation</a> to establish a new collection of DNA samples to help autism researchers study sporadic germ-line mutations. This story is covered by <a href="http://www.genomeweb.com/issues/news/141967-1.html">GenomeWeb</a> today.<br /></span>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-62626534064549493112007-08-28T23:33:00.000-04:002007-09-03T11:33:05.221-04:00'On Genes," my blog on Nature NetworkAfter commenting on Nature Network ("<a href="http://ongenetics.blogspot.com/2007/07/whats-up-with-nature-nature-network.html">What's up with Nature?</a>"), I ended up creating a new blog over there. It's "On Genes," and the URL is <a href="http://network.nature.com/blogs/user/smount">network.nature.com/blogs/user/smount</a>. It's not clear what I'll put there as opposed to here. Perhaps one of the two blogs will die. Right now, the plan is to put more substantial scientific posts here and more news-oriented posts there.<br /><br />Along those lines, my first real post on the Nature Network blog, "<a href="http://network.nature.com/blogs/user/smount/2007/08/29/prism-distorts-our-view-of-the-open-access-debate">PRISM distorts our view of the open access debate</a>" was in response to Jonathan Eisen's blog entry “<a href="http://phylogenomics.blogspot.com/2007/08/prism-partnership-for-research.html">PRISM – Partnership for Research Integrity in Science and Medicine – Seems like a spoof but it is real, and sad</a>“). It makes me angry to see issues that concern me be taken up by a public relations firm that is so thoroughly dishonest. But I won't repeat that here. You can read about it <a href="http://network.nature.com/blogs/user/smount/2007/08/29/prism-distorts-our-view-of-the-open-access-debate">there</a>.Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-32679504923970118242007-07-29T15:44:00.000-04:002007-07-29T17:51:19.671-04:00Plants, Animals and the Ancient RNA Toolkit<a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.tolweb.org/Eukaryotes"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp1.blogger.com/_f_GeZsNWphQ/Rqzw3BHj6VI/AAAAAAAAAB8/W4xMPO2zXDg/s400/Euks-tree.jpg" alt="" id="BLOGGER_PHOTO_ID_5092710106632808786" border="0" /></a>Multicellularity has arisen independently several times, but most famously twice, in the two lineages giving rise to plants and animals. In fact, the last unicellular ancestors of these two lineages were not particularly closely related, and the last common ancestor of both plants and animals also gave rise to an enormous number of extant unicellular progeny, including all of the fungi. When I began serious work on the regulation of pre-mRNA splicing in plants in 2001 I did so with an awareness of how very similar the process is to pre-mRNA splicing in animals. This is all the more striking because so many species have lost this complexity. In fact, plants and animals share many processes that must have been present in the last common ancestor, but have been lost in many unicellular eukaryotes derived from that same ancestory. RNA figures heavily in the list, which includes microRNAs, U12 introns, the exon junction complex and complex alternative splicing.<br /><br />Although the last common ancestor of plants and animals was almost certainly much more complex than most modern unicellular eukaryotes (at least in terms of its genome), it was probably not multicellular. The signals that control development in animals (wnts, hedgehog, FGFs, TGF-betas, etc.) are completely missing in plants. Likewise, the genes involved in meristem maintenance, ethylene-signaling, auxin-signaling and so on are missing in animals. It's also worth pointing out that the opisthokont clade (which includes animals and the fungi) is well-established (see the figure, which is from the <a href="http://www.tolweb.org/tree/">Tree of Life Web Project</a>).<br /><br />Perhaps most convincing are the exceptions: the processes shared by animals and plants but missing from most unicellular eukaryotes are not missing from all. U12 introns were recently found in distantly related protists and in a fungus (see my <a href="http://newsongenetics.blogspot.com/2006/10/u12-introns-in-unicellular-eukaryotes.html">comment</a>). MicroRNAs were recently described in <span style="font-style: italic;">Chlamydomonas reinhardtii</span>, a unicellular green alga (<a href="http://www.ncbi.nlm.nih.gov/sites/entrez?Db=pubmed&Cmd=ShowDetailView&TermToSearch=17470535&ordinalpos=5&itool=EntrezSystem2.PEntrez.Pubmed.Pubmed_ResultsPanel.Pubmed_RVDocSum">Zhao et al., 2007</a>). There is even a miRNA family that is appears to be conserved between plants and animals and targets a homologous family of splicing regulators (<a href="http://www.plantcell.org/cgi/content/full/18/12/3355">Arteaga-Vazquez et al. 2006</a>).<br /><br />It is therefore frustrating to read commentaries that are written as though genomic complexity is new. For example, Ram and Ast (<a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=pubmed&cmd=Retrieve&dopt=AbstractPlus&list_uids=17070958&query_hl=4&itool=pubmed_docsum">2007</a>) mistakenly generalize from <span style="font-style: italic;">S. cerevisiae</span> to <span style="font-style: italic;">S. pombe</span> (which retains more genomic complexity of several sorts, including alternative splicing) and talk about "before and after" incorrectly. Their conclusion, that "SR proteins had already facilitated the splicing of weak introns before the evolution of alternative splicing" may be correct, but complex alternative splicing was almost certainly present in the last common ancestor of plants and animals. I say this based on the fact that it had many genes whose products function in the regulation of alternative splicing, and which have been lost in unicellular descendants lacking complex alternative splicing (among these is a repertoire of at least four SR proteins).<br /><br />What is most interesting to me is the correlation between developmental complexity and retention of genomic complexity, including alternative splicing and miRNAs. It might not have evolved with multicellularity, but the ancient RNA toolkit might be very useful when it comes to building a complex organism.Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-92108632429047012752007-07-21T11:32:00.000-04:002007-08-03T13:19:50.695-04:00What's up with Nature? Nature network, screwy renewals, more.Nature (Nature Publishing Group, to be precise) has been aggressively embracing the internet in new and interesting ways. Their main page at <a href="http://www.nature.com/index.html">nature.com</a> no longer has a list of journals. Instead, journals is just one of many choices (it's at the top, to be sure, and they now have no less than 77), including podcasts, gateways (which aggregate related content across journals), feeds, blogs, jobs, society partners, conferences, regional websites and miscellany (which is creatively titled "launch pad"). It's all a bit unfocused, but much of it is very useful. For example, <a href="http://www.connotea.org/user/smount">Connotea</a> is the shared bookmarks site that I settled on after some <a href="http://ongenetics.blogspot.com/2006/07/shared-bookmarks-for-literature-what.html">deliberation and experimentation</a>, and that decision reflects the quality of the site. Nature provides useful tools oriented towards literature citations, the most important of which allow easy capture of bibliographic information (with one annoying bug that involves authors with multiple names) and export of libraries.<br /><br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://network.nature.com/"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp3.blogger.com/_f_GeZsNWphQ/RqJAJBHj6UI/AAAAAAAAAB0/L4iz-kaqVdY/s400/NatureNetwork.jpg" alt="" id="BLOGGER_PHOTO_ID_5089701052545231170" border="0" /></a>Nature's newest venture is "<a href="http://network.nature.com/">Nature Network</a>," which is social networking for the scientist. I quickly found and joined groups for people working on bioinformatics, Drosophila and Arabidopsis. I didn't find a group working on splicing, so I created one. Nature Network could be quite useful, but I wonder if it is going to succeed. To do so, it must "catch on," a phenomenon that his hard to predict and depends very much on the site providing useful tools not available elsewhere. Right now, most of what it offers seems redundant, but at the very least it provides a professional alternative to Yahoo groups for <span style="font-style: italic;">ad hoc</span> groups of scientists who want to create an online forum for exchange on a particular topic. One especially interesting choice is the elevation of London and Boston to a special status. I'm sure that Nature Network San Francisco will come soon, but I see lots of problems with this. Would the East Bay have their own Nature Network? I can't wait to find out if New York or Washington will be added first. Where will it end? Recalling the desperate enthusiasm with which I have often seen local politicians embrace biotechnology, I fear that this could get competitive and ugly, even before Nature Network runs out of space on their local menu toolbar.<br /><br />Of course, the weekly journal is still the keystone of Nature Publishing Group. I have had a personal subscription for over 20 years and I read the journal, in print, every week, bringing it along with me to meals and whatnot. This year, they are being very aggressive about renewals and they're getting it very wrong. My annual renewal expires in September. About a month ago I received a phone call in my office, inviting me to renew. Yes, <span style="font-weight: bold;">they called me</span>. Promised a 30% discount, I did so. I renewed online in an attempt to be sure that I generated a renewal of my existing subscription, following the instructions of the person who called me. The result was an entirely new subscription, which expired not in September of 2008, but in July of 2008. I also found that I had three or four customer IDs associated with my account (for only two journals, the other being Nature Genetics). After many rounds of email with their customer service my subscriptions were simplified under a single subscriber number with the proper expiration date. I should emphasize that the replies were prompt, cordial and helpful; the problem is with their system. I assumed that everything was fine, despite the numerous entries on "/myaccount/show/subs," shown here for your amusement.<br /><br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp2.blogger.com/_f_GeZsNWphQ/RqI6jxHj6SI/AAAAAAAAABk/G0LSW-4u-to/s1600-h/Nature.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp2.blogger.com/_f_GeZsNWphQ/RqI6jxHj6SI/AAAAAAAAABk/G0LSW-4u-to/s400/Nature.jpg" alt="" id="BLOGGER_PHOTO_ID_5089694915036965154" border="0" /></a>Then, I received two copies of Nature in the mail. Inquiry generated a response that came down to this:<br /><blockquote><span style="font-family:hevetica,sans-serif;">So this is the reason you are receiving two copies of the same journal Nature but they are two different volumes and issues.so you will be receiving two copies of Nature till Sep 2007.<br /></span></blockquote>I decided to leave well enough alone.<br /><br />Today, I received an email, from Sarah Greaves, PhD, Publisher, Nature, herself, that read in part<br /><blockquote><span style="font-family:hevetica,sans-serif;">Your current subscription to Nature is now up for renewal. To ensure you don’t miss a single issue, I am pleased to offer you a 30% discount from our normal subscription rate.<br /><br />This offer expires on SEPTEMBER 27th and is only available online through this email, so act now to ensure you don’t miss out on this exclusive rate. </span></blockquote>I wanted to run screaming from the room, but I opted instead for writing this post.Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-48596528089780515322007-02-14T14:01:00.000-05:002007-02-14T14:48:53.367-05:00Michael Crichton weighs in on patenting and the Genomic Research and Accessibility ActIn <a href="http://www.nytimes.com/2007/02/13/opinion/13crichton.html?ex=1329109200&en=2e30337139d379a2&ei=5124&partner=permalink&exprod=permalink">yesterday's New York Times</a>, <a href="http://www.michaelcrichton.net/">Michael Crichton</a> (author of "Jurassic Park" and "Next") wrote in favor of the Genomic Research and Accessibility Act, which would ban the patenting of genes found in nature. He correctly points out that genes are not inventions and attributes the fact that they can be patented to "a mistake by an underfinanced and understaffed government agency, The United States Patent Office." I note that the bill, <a href="http://becerra.house.gov/HoR/CA31/News/Press+Releases/2007/02-09-07+REPS+BECERRA+WELDON+INTRODUCE+BILL+TO+BAN+THE+PRACTICE+OF+GENE+PATENTING.htm">as described by co-sponsor Xavier Becerra</a>, is not retroactive, so, while it's a no-brainer that the patent office should not be granting patents for the discovery of natural phenomena, this bill won't do much to facilitate the promise of personalized medicine (because most of the genes that matter have already been patented). I have discussed the possibility that we might find relief in the courts before (regarding <a href="http://smmquicknotes.blogspot.com/2006/08/ebay-inc-vs-mercexchange-llc.html">EBay Inc. vs. MercExchange, LLC</a> and <a href="http://ogquicknotes.blogspot.com/2006/03/labcorp-vs-metabolite-laboratories.html">Labcorp vs. Metabolite Laboratories</a>). Certainly, gene patents disserve the public interest, but that is not enough, and the prospect of understanding the law in these cases is daunting.Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-5035853967344147292006-12-25T14:40:00.000-05:002006-12-25T15:27:21.898-05:00Science Blogs listed on the OMMBID blogThe OMMBID (Online Metabolic and Molecular Bases of Inherited Diseases) <a href="http://books.mcgraw-hill.com/medical/ommbid/blog/index.php">blog</a> has published a <a href="http://books.mcgraw-hill.com/medical/ommbid/blog/?p=111">list of science blogs and related websites</a>. It's nice to have been included. When I get some time I will browse that list and update my own lists of favorites on <a href="http://www.connotea.org/user/smount/tag/blog">Connotea</a> and my <a href="http://outfoxing.com/OnGenetics/Sites.html">summary page</a>. <a href="http://outfoxing.com/OnGenetics/Sites.html"></a>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1152634151678113222006-07-11T11:24:00.000-04:002006-09-05T09:55:25.090-04:00Shared bookmarks for the literature -- what to do?After some <a href="http://stevemount.blogspot.com/2005/08/is-sharing-bookmarks-social-revolution.html">initial skepticism</a>, I agree that "social bookmarking" is nice. It's very useful to put bookmarks to the literature online and see what articles others have cited, and the old approach of journal-specific browsing (associated with hard-copy volumes, but also including eTOCs) is certainly outmoded. I'm experimenting now with <a href="http://del.icio.us/">del.icio.us</a> (as both <a href="http://del.icio.us/ongenetics">ongenetics</a> and <a href="http://del.icio.us/RNAinfo">RNAinfo</a>) and <a href="http://www.connotea.org/user/smount">Connotea</a>. An <a href="http://www.rnasociety.org/surveys">RNA Society survey</a> generated a few votes for <a href="http://www.citeulike.org/user/smount">CiteUlike</a>, which looks great, although it is not easy to get it to accept an article from PubMed (in order to get the correct URL active you have to select the article you want from a list; if it is only result of your search, PubMed continues to display the URL for the search). A user named <a href="http://www.citeulike.org/user/Cortel">Cortel</a> has lots of relevant citations, so I may continue to keep track of him, whether I settle on CiteUlike or not. I also created my parallel blog, "<a href="http://ogquicknotes.blogspot.com/">Quick Notes on Genetics</a>," primarily with the idea of citing articles, and I keep a list of especially relevant articles tied to my lab's web page (the <a href="http://www.chemlife.umd.edu/labs/mount/readings.html">Mount lab reading room</a>). Finally, it's worth mentioning the <a href="http://www.f1000biology.com/my/">Faculty of 1000</a> in this context. This is all <span style="font-weight: bold;">way </span>too much. It's not clear what I will settle on, but most of these will be forgotten once I stop exploring and develop a routine for finding and sharing the articles that interest me.Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1147461513342904802006-05-12T14:46:00.000-04:002006-05-12T15:18:33.410-04:00The genetic architecture of complex traits: significant differences can involve noncoding DNA and can be epistaticClark et al., in "A distant upstream enhancer at the maize domestication gene <i>tb1</i> has pleiotropic effects on plant and inflorescent architecture," describe a significant QTL residing in noncoding (and largely repetitive) DNA far upstream of the affected gene (<a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=16642024&query_hl=1&itool=pubmed_docsum">PubMed</a>, <a href="http://www.nature.com/ng/journal/v38/n5/full/ng1784.html">Nature Genetics</a>). I am reminded of the graduate genetics course given by Michael Freeling that I sat in on while a postdoc at Berkeley (in 1984, while studying the effects of transposable element insertions on gene expression in Drosophila in Gerry Rubin's lab). He emphasized epigenetic phenomenon and truly expected that novel molecular mechanisms (such as transposon instability) would reveal "a molecular clock that really ticks." From <a href="http://plantbio.berkeley.edu/faculty/faculty_pages/freeling.html">his web page</a> it looks like he's continuing on the same tack today.<br /><br />A paper in the previous month's issue of Nature Genetics (Carlborg <i>et al.</i>, "Epistasis and the release of genetic variation during long-term selection," <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=16532011&query_hl=4&itool=pubmed_docsum">PubMed</a>, <a href="http://www.nature.com/ng/journal/v38/n4/full/ng1761.html">Nature Genetics</a>) reported a genetic network of four interacting loci affecting chicken growth ("Growth4, Growth6 and Growth12 had a significantly larger effect on growth in homozygous Growth9 individuals than [others]"). This kind of genetic interaction is precisely what any developmental geneticist would expect, yet breeders and population geneticists often cling to simple linear models. This paper will certainly help, not least of all because it involves a method for the detection of epistatic QTLs.Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1141832305117141922006-03-08T10:00:00.000-05:002006-03-08T10:43:12.920-05:00From HapMap to selection mapIt was the article by Nicolas Wade in the New York Times (<a href="http://www.nytimes.com/2006/03/07/science/07evolve.html">"Still Evolving, Human Genes Tell New Story"</a>) that alerted me to the new article in PLoS Biology by Voight et al. (<a href="http://biology.plosjournals.org/perlserv/?request=get-document&doi=10.1371/journal.pbio.0040072">"A Map of Recent Positive Selection in the Human Genome"</a>, from Jonathan Pritchard's group at the University of Chicago). I've been anticipating a list of human genes under selection for some time, and it's exciting to see this published. This paper, perhaps more than any other, marks the transition to a new and controversial era in genetics. On the positive side, we're going to learn a lot very quickly about the genetics of human differences. This will provide many benefits and engage curiosity in satisfying and useful ways.<br /><br />On the other hand, the uncritical acceptance of results that are statistical in nature (and have a real possibility of being wrong) is disturbing. A recent visitor to Sarah Tishkoff's lab (Jeff Jensen, from Cornell, where he works with Aquadro and Bustamonte) gave a talk about the statistical problem of distinguishing selection from certain demographic phenomena that made me think the interpretation of selection maps is going to be extremely uncertain. It is surprising that none of those issues were addressed in Wade's article, especially so because the New York Times typically fills their science articles with quotes from others in the field. I felt the same unease a few weeks ago when watching a PBS documentary <a href="http://www.pbs.org/wnet/aalives/">"African American Lives,"</a> in which famous African-Americans were given overly specific information about their ancestry without appropriate statistical disclaimers.<br /><br />I suppose that we will all be talking a lot more about selection and race with my friends who are not geneticists, and putting a lot more population genetics into my graduate genetics course. Clearly, the idea that population genetics is <i>passé </i>is now <i>passé</i>.Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1124941282368122452005-08-24T23:41:00.000-04:002005-08-26T12:00:44.046-04:00Alternative splicing and host defense in flies and plantsIn an article appearing online in <a href="http://www.sciencemag.org/cgi/content/abstract/1116887v1">Science</a> this week, and discussed in <a href="http://www.the-scientist.com/news/20050819/01">The Scientist</a>, Watson et al. (<a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=16109846&query_hl=7">PubMed</a>) implicate the Drosophila Dscam gene in host defense. <a href="http://www.dana-farber.org/res/physician/detail.asp?personID=549&RD=True&group=%28Researcher%29">They</a> detect secreted forms of the protein in hemolymph and show that the gene enhances phagocytosis of bacteria by hemocytes. They also demonstrate conservation of the potential for extreme isoform diversity across insect taxa, an extension of earlier work from <a href="http://genetics.uchc.edu/Graveley/Research/Research.html">the Graveley lab</a> (Graveley et al. 2004; <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15383675&query_hl=12">PubMed</a>, <a href="http://www.rnajournal.org/cgi/content/full/10/10/1499">RNA journal</a>). Isoform diversity due to alternative splicing is therefore implicated in the generation of adaptive variation in host defense molecules. It is interesting that isoform diversity due to alternative splicing of Toll-like proteins has likewise been implicated in plant defense (reviewed by <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=14557042&amp;amp;query_hl=17">Kazan 2003</a> and <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=12234730&amp;amp;query_hl=23">Jordan et al. 2002</a>; an example is <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=14523247&amp;query_hl=15">Zhang & Gassmann 2003</a>). <br /><br />What kind of adaptation does this make possible? Certainly, extreme variability allows rapid adaptation on a population level. Furthermore, the presence of membrane bound and secreted forms of the same molecule presents the possibility of adaptive immunity through clonal selection of hemocytes that see antigen. Louisa Wu pointed me to an article in Nature Immunity (<a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15970937&query_hl=2">Little, Hultmark and Read 2005</a>) making the point that neither memory nor specificity has been ruled out in invertebrate immunity. True adaptive immunity in insects would be very exciting, but we're a long way from that. How could variation in isoform production among hemocytes in Dscam isoforms be heritable? Through epigenetic silencing of splicing factors? We're just at the beginning of this story.<br /><br />The authors say this:<br /><br /><span style="font-size:85%;"><span style="font-family: arial;">broad conservation of receptor diversity strongly suggests important </span><br /> <span style="font-family: arial;">functions and future studies will have to further address </span><br /> <span style="font-family: arial;">whether the presence of diverse immune receptors in </span><br /> <span style="font-family: arial;">invertebrates increases the effectiveness of immune responses </span><br /> <span style="font-family: arial;">of individual animals. Alternatively, given the relative short </span><br /> <span style="font-family: arial;">life span of many invertebrates, it may be that immune </span><br /> <span style="font-family: arial;">receptor diversity is less important ontogenetically but rather </span><br /> <span style="font-family: arial;">enhances the adaptive potential of animal populations to </span><br /> <span style="font-family: arial;">changing environmental and pathogenic threats. </span></span>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1124244915288974262005-08-16T21:52:00.000-04:002005-09-09T16:06:58.253-04:00Nature Genetics and the Mid-Atlantic Plant Molecular Biology Society Conference<p class="MsoNormal">There is always something interesting in Nature Genetics, but the <a href="http://www.nature.com/ng/journal/v37/n7/index.html">July issue</a> seems especially rich. </p> <p class="MsoNormal"><b><a href="http://www.nature.com/ng/journal/v37/n7/full/ng0705-653.html">Postdocs</a></b><br />I appreciate this editorial.<span style=""> </span>The advice here (e.g. that postdocs and advisors make a formal plan, and that postdocs ask themselves such questions as "is this the most important scientific question I can ask") is excellent. Anyone considering a postdoc, or taking on a postdoc, should read this.<br /> </p> <p class="MsoNormal"><b><a href="http://www.nature.com/ng/journal/v37/n7/full/ng0705-655.html">Race</a></b><br />I am often asked (especially by educated non-scientists in my acquaintance) about genetics and race.<span style=""> </span>This is an old debate and there are excellent sources of information and opinion (including a Social Sciences Research Council <a href="http://raceandgenomics.ssrc.org/">forum</a>, a <a href="http://www.nature.com/ng/journal/v36/n11s/index.html">special issue</a> of Nature Genetics and an <a href="http://www.amazon.com/exec/obidos/tg/detail/-/0805837574/qid=1124228576/002-7144358-8596821?v=glance&s=books">edited volume by Jefferson Fish</a>; a more complete listing is on <a href="http://anthropology.net/node/19">Anthropology.net</a>). <span style=""> </span>The bottom line is that race is indeed a social construct. (At least it is very poorly defined within biology, and what biological definitions might be partially valid differ significantly from the way the concept is normally used in our society).<span style=""> </span>The <a href="http://news.google.com/news?hl=en&ned=&ie=UTF-8&q=BiDil+approval+race&btnG=Search+News">licensing of BiDil</a> specifically for African Americans is therefore troubling.<span style=""> </span>It seems to me that if a drug differs in either safety or efficacy for one "race" or another, then the underlying basis is probably either a genetic difference or a cultural difference.<span style=""> </span>In the first case, the relevant genetic difference itself, or a related biomarker, would be much more reliable than popular notions of race.<span style=""> </span>On the other hand, if the basis is cultural, the relevant practice (such as lifestyle or diet) should be identified.<span style=""> </span>I was therefore gratified to see Nature Genetics publish this letter from Jonathan Kahn making the case against the misuse of race, as well as a sidebar showing how the media has misrepresented their own statements.</p> <p class="MsoNormal"><b>Transcriptional Gene Silencing, RNA polymerase IV and siRNAs<o:p></o:p></b><br />The association of specific RNAs (siRNAs) with silenced chromosomes presents something of a paradox (since the siRNAs themselves must be transcribed).<span style=""> </span>This paradox is elegantly resolved by the discovery of "RNA polymerase IV," which is presumed transcribe otherwise silent regions, at least in Arabidopsis (Kanno et al.: "Atypical RNA polymerase subunits required for RNA-directed DNA methylation" <a href="http://www.nature.com/ng/journal/v37/n7/full/ng1580.html">Nature Genetics</a>; <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15924141&query_hl=1">PubMed</a>; other recent papers cited therein and a <a href="http://www.nature.com/ng/journal/v37/n7/full/ng0705-659.html">News and Views</a> by Vaucheret).<span style=""> </span>In other species RNA polymerase II is implicated (e.g. <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15965464&query_hl=6">Schramke et al.</a>) but there may be less siRNA corresponding to silenced loci in those species. On a related note, I was impressed by the massive amounts of MPSS data on Arabidopsis siRNAs presented by Pam Green at the <a href="http://psi081.ba.ars.usda.gov/MAPMBS/MAPMBS2005/Agenda.htm">MAPMBS meeting</a> last week.<span style=""> </span>This data includes over 75,000 different siRNA sequences and will soon to be online at <a href="http://mpss.dbi.udel.edu/">http://mpss.dbi.udel.edu/</a> in a browsable form.</p> <p class="MsoNormal"><b>Structural genomic variation within species</b><br />One of the insights I came away from last year's MAPMBS meeting with was the idea (Rafalski, <a href="http://psi081.ba.ars.usda.gov/MAPMBS/MAPMBS2004/Abstracts/Ab%20inv%20sp%20Rafalski.pdf">MAPMBS2004</a>, <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15659640&query_hl=8">PubMed</a>) that "races" of maize show significant variation in gene content due to small (sub megabase scale) structural differences: insertion, deletion and inversion.<span style=""> </span>Although a speaker at this year's meeting expressed the opinion (based on sequence data) that the case in maize may have been overstated, another paper in Nature Genetics (Tuzun et al., Fine-scale structural variation of the human genome; <a href="http://www.nature.com/ng/journal/v37/n7/full/ng1562.html">NG</a>; <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15895083&query_hl=3">PubMed</a>) reports 297 cases of "intermediate scale" structural variation in a single human individual!<span style=""> </span>It will be interesting to see how this plays out with more time, but SNPs may well be displaced by presence/absence variation as the focus of attention in human genetics.<span style=""> </span>As Charles Lee notes in his <a href="http://www.nature.com/ng/journal/v37/n7/full/ng0705-660.html">News and Views</a> piece, what we see depends on our technology for looking, and I am reminded that a lot of early work in population genetics was based on inversions visible on polytene chromosomes. </p> <p class="MsoNormal"><o:p> </o:p></p>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1123905067639219932005-08-12T23:44:00.000-04:002005-08-12T23:51:07.646-04:00Global regulation of alternative splicing: starting with Nova<p>The global analysis of alternative splicing is complicated by the fact that standard microarrays, and even tiling arrays without junction oligos, do a poor job of reporting on the ratio between alternatively spliced mRNA isoforms that share most of their nucleotides. Only in the past few years has alternative splicing data from arrays been reported (see <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=11988574&query_hl=24">Clark et al., 2002</a>; <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=14684825&query_hl=25">Johnson et al., 2003</a>; <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15499012&query_hl=28">Stolc et al. 2004</a> and <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15610736&query_hl=19">Pan et al. 2004</a>). <span style=""> </span>It was therefore exciting to see the paper by <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=16041372&query_hl=18">Ule et al.</a> in the new issue of Nature Genetics reporting the effect of Nova2 knockouts on global patterns of alternative splicing in the mouse brain.<p>A custom microarray from Affymetrix was used for this study. Although I applaud the efforts of Hui Wang and John Blume to bring alternative splicing to the Affymetrix platform (and [full disclosure] I own some Affymetrix stock), custom chips are extremely expensive. I have my eyes on the Agilent platform used by <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15610736&query_hl=19">Pan et al.</a> and what I would really like to see is the widespread use of a common (inexpensive) platform so that publicly available data can be mined for unexpected associations. </p> <p>Another notable aspect of this study is the truly remarkable degree of functional connection between proteins whose isoforms appear to be regulated by Nova2. <span style=""> </span>Figure 5 in this paper makes a compelling case for the idea that while transcriptional regulators can turn gene sets on and off, splicing regulators can fine-tune an entire module for a specific task.</p> <p>Finally, it is important to note that these experiments are facilitated by the fact that Nova2 knockout mice are viable, which rendered tissue-specific ablation (as practiced by <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=14963485&query_hl=16">Ding et al.</a> and <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15652482&query_hl=16">Xu et al.</a> for similar studies on SR proteins) unnecessary.<span style=""> </span>That is why we consider it a good thing that so many of the <i style="">Arabidopsis</i> SR proteins we work with are not essential.<span style=""> </span>This does not mean they are not important! <span style=""> </span></p>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1122316630861569962005-07-25T14:27:00.000-04:002005-07-25T14:38:30.030-04:00PLoS GeneticsI just got an email that <a href="http://www.plosjournals.org/perlserv/?request=index-html">PLoS</a> <a href="http://genetics.plosjournals.org/">Genetics</a> has launched. It looks great, and given the quality of the other <a href="http://www.plosjournals.org/perlserv/?request=index-html">PLoS</a> journals (particularly <a href="http://biology.plosjournals.org/">Biology</a> and <a href="http://compbiol.plosjournals.org/">Computational Biology</a>), I'm expecting a lot from it.Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1122044866695409622005-07-22T10:58:00.000-04:002005-07-23T11:00:33.676-04:00Parameters for using blastn with noncoding queries<p>If one wants to look for a conserved noncoding RNA in a new genome using the best possible tools, then one should use sophisticated structure-based methods such as Klein and Eddy's RSEARCH ( <a href="http://www.biomedcentral.com/1471-2105/4/44">BMC Bioinformatics 4:44 </a>, <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=14499004">PubMed</a>), and should consult the RNA database <a href="http://rfam.wustl.edu/">Rfam </a>(Griffths-Jones et al., <a href="http://nar.oupjournals.org/cgi/content/abstract/31/1/439">2003: Rfam: an RNA family database </a>and <a href="http://nar.oupjournals.org/cgi/content/full/33/suppl_1/D121">2005: Rfam: annotating non-coding RNAs in complete genomes </a>. <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15608160&query_hl=3">PubMed</a>). However, alignment tools such as blast or fasta are more readily available, so it is often expedient to use alignment when other tools would do better.<span style=""> </span>If you do that, you must adjust the parameters – you will never find noncoding RNAs using the default parameters for blast. <span style=""> </span>I confronted this problem at the Drosophila genome jamboree in 1999 and published the parameters I used there in the paper I wrote with Helen Salz (<a href="http://www.jcb.org/cgi/reprint/150/2/F37">J. Biol. Chem.</a>; <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=10908584">PubMed</a>).</p> <p>Now, I've posted a discussion and "how-to" guide (<a href="http://www.stevemount.com/Posting0004.html">Posting 4</a> on SteveMount.com) based on work that Chau Nguyen (a University of Maryland Computer Science and Biology double major) did with me a few years ago. <span style=""> </span>These are written for use on the NCBI <a href="http://www.ncbi.nlm.nih.gov/blast/Blast.cgi?CMD=Web&LAYOUT=TwoWindows&AUTO_FORMAT=Semiauto&ALIGNMENTS=50&ALIGNMENT_VIEW=Pairwise&CLIENT=web&DATABASE=nr&DESCRIPTIONS=100&ENTREZ_QUERY=(none)&EXPECT=10&FILTER=L&FORMAT_OBJECT=Alignment&FORMAT_TYPE=HTML&HITLIST_SIZE=100&NCBI_GI=on&PAGE=Nucleotides&PROGRAM=blastn&SERVICE=plain&SET_DEFAULTS.x=34&SET_DEFAULTS.y=8&SHOW_OVERVIEW=on&END_OF_HTTPGET=Yes&SHOW_LINKOUT=yes">blastn </a>server, but are easily adapted to running blast locally.<span style=""> </span>Briefly, my advice is to use the parameters <b>-r 5 -q -4 -G 10 -E 4 -W 7. </b><span style=""> </span>These values not only find mammalian U6atac using plant U6atac but provide an alignment across the entire snRNA. <span style=""> </span>If you don't find what you want, you may want to make adjustments based on the more thorough discussion in <a href="http://www.stevemount.com/Posting0004.html">the posting</a>, where I describe several parameter sets there that will correctly idenfity plant snRNA genes using animal snRNA queries.<span style=""> </span></p> <p>Bear in mind two caveats: limit the size of your query and be prepared to use independent criteria for identifying correct hits.<span style=""> </span>These searches require more computing resources than standard blast searches and it will generally take longer than the estimated time for your results to come back. <span style=""> </span>For related reasons, you should not attempt to use these parameters for queries longer than about 500 bp. (if you are using a noncoding RNA as the query do not include nontranscribed flanking sequence in your query; you may even want to remove poorly conserved parts of the RNA itself from your query). <span style=""> </span>Also, because the assumptions that go into calculating E values are violated by these parameters, the E values reported in your output will be meaningless (except as relative numbers; better matches will still have lower E values). <span style=""> </span>Do not pay attention to the E values (except when comparing results obtained with the same parameter set) and do not report them. <span style=""> </span>However, the lack of reliable E values is not license to believe nonsense; your results should be validated by external criteria such as secondary structure and conservation of known functional regions.</p><p>Good luck! <span style=""> </span>If you have experience that bears on this, or can cite relevant literature, please let <a href="mailto:smount@umd.edu">me</a> know and I'll update the posting. </p>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1120171816428666592005-06-30T18:39:00.000-04:002005-06-30T19:02:21.030-04:00Things that are not exons<p>I have thought for many years that the genomics community needs a term other than 'exon' for coding segments. This post points out how lacking such a name has led to misuse of the word 'exon'. I also suggest that the word 'croe' be used instead, but my primary purpose is to call attention to the need for new names. I would be happy to have other names used properly.<br /></p> <p><br />This was presented at the <a href="http://biolinfo.org/as-sig/">Alternative Splicing SIG</a> at <a href="http://www.iscb.org/ismb2005/">ISMB</a>. My presentation in PowerPoint form is available <a href="http://stevemount.outfoxing.com/Nomenclature-ISMB-SIG.ppt">here</a> and is posted on my web site as <a href="http://stevemount.outfoxing.com/Posting0003.html">Posting 3</a>. My hope is that the term be introduced into the <a href="http://song.sourceforge.net/">Sequence Ontology</a>, but I'll leave it up to my friends there to get it right.<br /></p> <p>An exon is defined as a segment of a gene that is present in the mature mRNA product of that gene. Genes for noncoding RNAs that are spliced are divided into exons and introns (examples include tRNAs and rRNAs, as well as a variety of noncoding RNA polymerase II transcripts) and every spliced mRNA has at least two exons that are partly noncoding, containing the 5' UTR and the 3' UTR. However, the need to refer to isolated coding segments that are often complete exons but are sometimes only a part of an exon has led many people to use the term 'exon' inappropriately, and this has created confusion. In one extreme case, a published paper presents an "exon size distribution" which includes many coding segments that are only part of an exon. There are many other examples. </p> <p>Some people are careful to get it right, and many of them use the term CDS to refer to these coding segments. For example, Michael Zhang, in his excellent 2002 review of computational genefinding (<a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=12209144">PubMed</a>) writes "To discriminate CDS from intervening sequence, the best content measures are the so-called frame-specific hexamer frequencies" and "... hexamer frequencies alone can detect most [long] CDS regions." However, CDS has shortcomings as a word. Foremost among them is its ambiguous meaning. The same exact term is used to refer to the entire coding region of a gene. This is analogous to using the same word for exon and mRNA. </p> <p>I am grateful to Myles Axton (Nature Genetics 37 :15 (01 Jan 2005) "Touching Base; <a href="http://www.nature.com/ng/journal/v37/n1/full/ng0105-15.html">Full Text </a> | <a href="http://www.nature.com/cgi-taf/DynaPage.taf?file=/ng/journal/v37/n1/full/ng0105-15.html&filetype=pdf">PDF </a> |) for introducing the readers of Nature Genetics to his term for coding segments that are less than an entire exon, which is CROE ( <b>c</b>oding <b>r</b>egion <b>o</b>f an <b>e</b>xon, pronounced as in "crow"). Because the term 'exon' never communicates anything about where coding information lies, it is important that the term 'croe' apply as well to coding regions that are coincident with an exon. People should be able to say "the croes of this gene" when they refer to the units that together make up a full CDS. </p> <p><b>Alternatively spliced segments. </b>I have a related concern that there be a term for segments that appear as indels when two alternatively spliced mRNAs (or cDNAs) are compared. This can be a complete exon, part of an exon (occurring between two alternative splice sites) or an intron, and need not be coding. Kondrashov and Koonin refer to these various mechanisms as generating LDAS (length difference alternative splicing; 2003 <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&list_uids=12615001&dopt=Citation">PubMed </a> | <a href="http://dx.doi.org/10.1016/S0168-9525%2802%2900029-X"> Trends in Genetics </a> 19:115-9) but do not suggest a name for the segments themselves (other than "alternative segment," or "inserted alternative segment," which terms they use repeatedly). One idea is 'asproe,' for <b>a</b>lternatively <b>sp</b>liced <b>r</b>egion <b>o</b>f an <b>e</b>xon, which has the advantage of being paired with croe (but the disadvantage that a single insertion may consist of two or more croes, <b>a</b>lternatively <b>sp</b>liced <b>r</b>egion <b>o</b>f <b>e</b>xons and will often be less than an entire croe). It is a useful concept. If one has in hand cDNA or EST sequences that differ by an insertion the mode of alternative splicing is unknown, but the alternatively spliced region is clear, even when genomic sequence is not available. Finally, there could be two terms here. One to refer to the alternative segment at the nucleotide level and another to refer to the alternative segment at the protein level. These need not correspond; an interesting case is where the length of the segment is not a multiple of 3 nucleotides, so that the coding of downstream regions is affected. A classical case, found in the first complete eukaryotic genome sequence (SV40), comes from the small t antigen, in which overlapping reading frames are created by alternative splicing. </p>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1120056653733594212005-06-29T10:26:00.000-04:002005-06-29T13:15:44.403-04:00ISMB 2005I have been in Detroit for a week. <span style=""> </span>The two-day <a href="http://www.biolinfo.org/as-sig">Alternative Splicing meeting</a> preceding <a href="http://www.iscb.org/ismb2005/">ISMB </a>was outstanding, and really crystallized a community of people who are working on genome scale analysis of alternative splicing.<br /><br />Two things that really struck me at this meeting were:<br /><br />1) the importance of ontologies (and, more generally, the formal description of scientific knowledge). <span style=""> </span>There were 51 posters in the section on ontologies and NLP. <span style=""> </span>One title that caught my eye was "Transforming Full-Text Literature to Formalized Facts." <span style=""> </span>I was trained to believe that scientific publication was the formalization of facts! <span style=""> </span>I see that it's not good enough anymore. <span style=""> </span>Ewan Birney articulated this clearly in his Keynote address this morning when he said that databases are Biology, "the starting point and the end point of our understanding." <span style=""> </span>I heard calls at this meeting for the formal annotation of data on function analogous to the submission of sequence data. <span style=""> </span>This is clearly coming. <span style=""> </span>Experimental scientists who want their results to be included in emerging system-wide descriptions will have to participate, and informaticians will have to find a way to collect formal descriptions of functional data (Janet Thornton, in her keynote, refered to this as data harvesting and showed a <a href="http://www.ibiblio.org/Dave/Dr-Fun/df200207/df20020729.jpg">cartooon</a>).<br /><br />2) The idea that very few people can speak "both languages" (Biology and Computing) is outdated. <span style=""> </span>Being at the alternative splicing workshop really brought this home. <span style=""> </span>It reminds me of being in Miami, where virtually everyone speaks both English and Spanish perfectly. <span style=""> </span>It's still true that the majority of Biologists are still inadequately familiar with databases and computers, and that the majority of computer scientists don't "get" biological questions, but virtually everyone here (a large meeting with well over 1,000 people) is completely bilingual.<span style=""> </span>This is a change from just five years ago and it means that we can stop worrying about translation and get on with the research. <span style=""> </span><br /><br />Another very interesting point was in the keynote by Jill Mesirov on the use of Gene Sets. <span style=""> </span>By using predefined sets of genes (her "knowledge base") she was able to apply rank statistics to find signficant differences between microarray data sets between which no single gene shows a significant difference. <span style=""> </span>She has published on these methods (e.g. <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15016911&query_hl=1">Brunet <i style="">et al.</i> 2004</a>) but it was new to me.<br /><br />The hotel (Renaissance Marriott) was nice in many ways, but had its problems. <span style=""> </span>When I arrived, they could not make keys; I had to be let into my room by a valet and come back later. <span style=""> </span>Once in my room, I discovered that the phone didn't work. <span style=""> </span>The internet was constantly going down (which caused problems for two of the three presentations I saw that used it). <span style=""> </span>Twice (2/7 days), housecleaning did not replace the coffee packets. <span style=""> </span>Access to the hotel itself, and navigation among the first three floors, was absurdly indirect. <span style=""> </span>This design feature is apparently related to ideas of security more evocative of the middle ages (embattled castles protected by moats) than the Renaissance (intellectual excitement derived from an open exchange of people and ideas).<span style=""> </span>The architecture reflects a philosophy which ignores the fact that inaccessibility leads to marginalization. <span style=""> </span>This center houses the General Motors corporate headquaters and I was led to an image of GM executives cowering like Quasimoto in his tower, in this case the Detroit <a href="http://www.detroitrenaissance.com/">Dark Ages Center</a>, while life goes on below them (and without them).<span style=""> </span>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1118519698074606942005-06-11T15:50:00.000-04:002005-06-11T15:57:21.793-04:00Cultural Transmission of FitnessIt was more or less by chance that I read the recent article by Heyer, Sibert and Austerlitz in the April issue of Trends in Genetics about what they call cultural transmission of fitness as carefully as I did. I had it with me on a plane today, and the seats on Northwest Airlink were just too close together for me to get out my laptop. CTF is the nongenetic transmission of fitness, and they make an intuitively compelling case (<a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15797619&query_hl=3">PubMed</a>) that CTF can have a huge effect on effective population size and coalescence times. Their model appears applicable not only to the transmission of true culture in human populations, but also to epigenetic changes and artificial selection. It's not every day that a new idea in population genetics is articulated, and I found this fairly exciting. However, the idea is more a formulation of ideas that I've been vaguely aware of for a long time than an entirely new idea. This does not to take anything away from them; a formal statement of a phenomenon and its consequences is what constitutes progress in population genetics (the real work is presented in Sibert, Austerlitz and Heyer, Theoretical Population Biology 2002; <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=12167356&query_hl=3">PubMed</a>). Furthermore, their citations suggest that the idea has been around for a while (although it's new to me). In fact, the applicability of this model to my previous post has apparently already been tested and rejected ("CTF was [not detected] in Ashkenazi Jews")!Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1117858256622511182005-06-04T00:04:00.000-04:002005-06-09T22:43:29.976-04:00Selection vs. differential allele flow<p class="MsoNormal"><span style="">The media is reporting (<a href="http://www.nytimes.com/2005/06/03/science/03gene.html">NYTimes</a>; <a href="http://www.economist.com/science/displaystory.cfm?story_id=4032638">Economist</a>) that there is a paper in press in <i style=""><a href="http://www.cambridge.org/uk/journals/journal_links.asp?mnemonic=JBS">The Journal of Biosocial Science</a></i> that attributes the pattern of inherited diseases among Ashkenazi Jews to selection for intelligence.<span style=""> </span>This hypothesis breaks not one but several taboos by talking about race, selection and intelligence, so I'm reluctant to say anything at all about it.<span style=""> </span>However, I think that they missed something (I won't be sure until I see the paper, which is not out yet).<span style=""> </span> <span style=""> </span> Selection, "red in tooth and claw," need not be invoked.<span style=""> </span>Differential migration out of the population could have a powerful effect and seems to have been overlooked.<span style=""> </span></span><span style="">In a minority population with asymmetric gene flow (in other words, whenever the rate of assimilation into greater society exceeds the rate of acquisition of new converts) any genetic variation that disfavors assimilation will increase in frequency in the minority population.<span style=""> </span>It is plausible that intelligence could be enhanced by this (for example, if intelligence improved one's ability to learn Torah or become a rabbi and those things made assimilation less likely).<span style=""> </span>It is also plausible that alleles causing non-lethal genetic diseases could actually be favored within a minority population by reducing the probability that affected individuals would leave, which seems likely if the community provided care not available outside and not needed by healthier relatives who were therefore more likely to leave. </span></p>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1116077552152124172005-05-14T08:44:00.000-04:002005-05-14T09:36:06.236-04:00Sean Carroll on Kojo NnamdiI happened to catch <a href="http://www.molbio.wisc.edu/carroll/index.html">Sean Carroll</a> on <a href="http://www.wamu.org/programs/kn/05/05/12.php">the Kojo Nnamdi show</a> Thursday. He was refreshingly articulate and reasonable on the subject of evolution and religion, pointing to the middle ground and quoting the Pope. This is especially refreshing at a time when so many on the political right have confused a faith they share with much of mainstream American with political views that have no place in a civilized society. The title of this book, <a href="http://www.molbio.wisc.edu/carroll/Endless_Forms.html">"Endless Forms Most Beautiful,"</a> cites an explicitly religious quote from Darwin ("There is a grandeur in this view of life, with its several powers, having been originally breathed by the Creator into a few forms or into one; and that, whilst this planet has gone cycling on according to the fixed law of gravity, from so simple a beginning endless forms most beautiful and most wonderful have been, and are being evolved"). I haven't read the book, but I'm familiar with much of Sean Carroll's work and I suspect that he does an outstanding job of laying out that which is indeed most wonderful.Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1113230648902989192005-04-11T10:21:00.000-04:002005-04-11T10:44:08.903-04:00Arabidopsis minisymposium<p>The sixth Arabidopsis <a href="http://www.life.umd.edu/labs/ATRIUM/Symposium/">minisymposium</a>, joined this year with the spring Mid-Atlantic section <a href="http://www.aspb.org/">ASPB</a> meeting, was a big success. It's great to be at the center of something, and helping to host a regional meeting of such high quality definitely makes me feel that I am at the center of Arabidopsis research, even though my colleagues are entirely responsible for the excellent selection of speakers and I still have to pause and mentally review whenever anyone relies on my knowledge of photosynthesis, parts of the flower or plant hormones. Caren gets credit for putting the two meetings together and for inviting Susan Lolle to tell us about the work that put Arabidopsis on the front page of the New York Times. Heven, Zhongchi, June and their students all deserve credit for making this happen. </p> <p>My own talk was well received, even though it was the last and delayed by an unplanned break when the projector overheated after about eight hours of nearly continuous use. I am happy to have made the case before this audience that RNA processing, including alternative splicing, is important in plants.<span style=""> </span>I was aided in this by talks that presented roles for RNA binding proteins in crosstalk between ethylene and auxin (<a href="http://www.cals.ncsu.edu/genetics/alonso/alonso.html">Jose Alonso</a>), pollen tube growth (<a href="http://www.brown.edu/Departments/Molecular_Biology/pgl/">Mark Johnson</a>) and leaf polarity (<a href="http://waksman.rutgers.edu/kerstetter/Kerstetter.html">Randy Kerstetter</a>); RNA binding proteins are getting hot! The question I most appreciate came from Ken Birnbaum, who challenged me to think of an example in which a forward genetic screen identified regulated alternative splicing. Of course, there is the regulation of flowering time through <i>FLC</i> (<a href="http://www.biochemsoctrans.org/bst/032/0565/bst0320565.htm">Simpson and Dean</a>) but that appears to be regulated by polyadenylation, not alternative splicing. </p>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1112738480941970922005-04-05T17:51:00.000-04:002005-04-05T18:13:59.843-04:00What you can do with a dozen genomesI really enjoyed the <a href="http://www.isr.umd.edu/ISR/about/dls_previous.html#ericgreen">ISR Distinguished Lecture</a> by <a href="http://www.genome.gov/Staff/Green/">Eric Green</a> a few weeks ago. It reinforced my excitement about the idea that having a dozen genomes will allow us to obtain qualitatively different information than we’ve been able to obtain from a single genome. In addition to the alignment-based methods he described, there is the (rather amazing) possibility of reconstructing the ancestral sequence (see <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15574820">Blanchette et al. 2004</a>, a very nice paper by an all-star cast) and methods of assigning gene function based on patterns of duplication and loss (e.g. the recent paper by <a href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=15696156">Li, Pellegrini and Eisenberg</a> in Nature Biotechnology). A talk by Najib El-Sayed on Friday about three trypanosomic genomes (<i>Trypanosoma brucei, Trypanosoma cruzi</i> and <i>Leishmania</i>) underscored the prospect of understanding the responses of genomes to selection. It strikes me that with 20-30 appropriately related genomes one could deduce whether individual nucleotides within a conserved block are under selection, an incredibly powerful tool (of course, I’m thinking about ESEs). Like many new methods, comparative genomics will yield insights in ways that will not be fully appreciated until the data are at hand. It is exciting, and it reminds me of the excitement we all felt during the late 70s, when the first sequences were being obtained.<br /><p><span style="">This was originally posted on <a href="http://stevemount.blogspot.com/">Steve's View</a> a few weeks ago, soon after the talk from Eric Green. </span></p>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.comtag:blogger.com,1999:blog-11673175.post-1111692238723503232005-03-24T14:22:00.000-05:002005-03-24T14:23:58.723-05:00"On Genetics"I have always been interested in the nature of genetic information, including its expression, transmission and change. After five months of intermittent blogging, it occurs to me that it would be useful to separate my personal diaries, opinions and recollections from commentary on genetics and genomics that might be of interest to students and colleagues. For that reason, I'm creating this blog, "On Genetics," which is devoted to my comments on scientific matters related to genetics, genomics and gene expression. If you want to see my views on more personal or political matters visit <A HREF="http://stevemount.blogspot.com/">"Steve's View."</A>Stevehttp://www.blogger.com/profile/15264977010144529019noreply@blogger.com