Skip to Main Content (access key 1)
Skip to Search (access key 2)
Skip to Search GO (access key 3)
Skip to comments (access key 4)
Skip to navigation (access key 5)
Skip to top of page (access key 6)
Saturday, November 17, 2007 | Science : Genetics | print version Print | Comments

Document Evolutionary comparison finds new human genes

by PhysOrg

Reposted from:
http://www.physorg.com/news114446301.html

Over millions of years of evolution, a gene called GRIA2 has continued to do it's job of making a receptor for neurotransmitters. The portions of the gene that code for amino acids that make up a protein change in different ways from other parts of the genome, so computer algorithms can use these distinctive patterns of evolutionary change to identify new genes that have been missed by other methods. A portion of GRIA2 is shown here in an alignment of the genomes of several species, beneath a graph of the computer analysis. Peaks in the graph identify exons (regions that are expressed), separated by introns (non-coding regions). When a cell reads the gene to make a protein the introns are edited out.

Using supercomputers to compare portions of the human genome with those of other mammals, researchers at Cornell have discovered some 300 previously unidentified human genes, and found extensions of several hundred genes already known.

The discovery is based on the idea that as organisms evolve, sections of genetic code that do something useful for the organism change in different ways.

The research is reported by Adam Siepel, Cornell assistant professor of biological statistics and computational biology, Cornell postdoctoral researcher Brona Brejova and colleagues at several other institutions in the online version of the journal Genome Research, and it will appear in the December print edition.

The complete human genome was sequenced several years ago, but that simply means that the order of the 3 billion or so chemical units, called bases, that make up the genetic code is known. What remains is the identification of the exact location of all the short sections that code for proteins or perform regulatory or other functions.

More than 20,000 protein-coding genes have been identified, so the Cornell contribution, while significant, doesn't dramatically change the number of known genes. What's important, the researchers say, is that their discovery shows there still could be many more genes that have been missed using current biological methods. These methods are very effective at finding genes that are widely expressed but may miss those that are expressed only in certain tissues or at early stages of embryonic development, Siepel said.

"What's exciting is using evolution to identify these genes," Siepel said. "Evolution has been doing this experiment for millions of years. The computer is our microscope to observe the results."

Four different bases -- commonly referred to by the letters G, C, A and T -- make up DNA. Three bases in a row can code for an amino acid (the building blocks of proteins), and a string of these three-letter codes can be a gene, coding for a string of amino acids that a cell can make into a protein.

Siepel and colleagues set out to find genes that have been "conserved" -- that are fundamental to all life and that have stayed the same, or nearly so, over millions of years of evolution.

The researchers started with "alignments" discovered by other workers -- stretches up to several thousand bases long that are mostly alike across two or more species. Using large-scale computer clusters, including an 850-node cluster at the Cornell Center for Advanced Computing, the researchers ran three different algorithms, or computing designs -- one of which Siepel created -- to compare these alignments between human, mouse, rat and chicken in various combinations.

Over millions of years, individual bases can be swapped -- C to G, T to A, for example -- by damage or miscopying. Changes that alter the structure of a protein can kill the organism or send it down a dead-end evolutionary path. But conserved genes contain only minor changes that leave the protein able to do its job. The computer looked for regions with those sorts of changes by creating a mathematical model of how the gene might have changed, then looking for matches to this model.

After eliminating predictions that matched already known genes, the researchers tested the remainder in the laboratory, proving that many of the genes could in fact be found in samples of human tissue and could code for proteins. The researchers were sometimes able to identify the proteins by comparison with databases of known proteins. The discovered genes mainly have to do with motor activity, cell adhesion, connective tissue and central nervous system development, functions that might be expected to be common to many different creatures.

The entire project, from building and testing the mathematical models to running final laboratory tests, took about three years, Siepel said. The work was supported by the National Cancer Institute, a National Science Foundation Early Career Development Grant and a University of California graduate research fellowship.

Source: Cornell University

Comments 1 - 5 of 5 |

Reload Comments | Back to Top | Page Numbers

1. Comment #88897 by Xenocratic on November 19, 2007 at 6:29 am

First for the first time ever!!!

Other Comments by Xenocratic

2. Comment #88904 by konquererz on November 19, 2007 at 6:40 am

 avatarThe more we learn about the gene, the more we can marvel at the evolutionary process! Dawkins really had it spot on when he said that it is a wonder that we should even be here at all. With all the different changes that can and do take place, its amazing that I, as the individual person I am, should have come into existence at all! How much more wonderful is that then being created? And how much more special does that make each person! Religion has absolutely nothing on science when it comes to wonderment.

Other Comments by konquererz

3. Comment #88963 by Xenocratic on November 19, 2007 at 9:24 am

I concur heartily with your sentiments, konquererz. Dawkins has also pointed out that, because each human being represents one particular alternative out of around 40 trillion genetically unique possibilities, there are more people who haven't been born than there are atoms in the known universe. How wonderful to marvel at the utterly mind boggling uniqueness of each person who has ever existed. And all thanks to scientific knowledge!

Other Comments by Xenocratic

4. Comment #88998 by arogop on November 19, 2007 at 12:52 pm

 avatarThe random but minor changes with our DNA that occur every time we reproduce is probably the reason that most pregnancies end in a natural abortion.

Other Comments by arogop

5. Comment #107653 by the_ultimate_samurai on January 4, 2008 at 11:18 pm

hmm, this makes me wonder if there may be a time when we can make a computer to naturally understand the genetic language, be able to read it as a mater of fact, and use that to extrapolate information about something.
for instance, suppose one had an unknown creature (maybe an alien or something) and could read the genetic code, understand it at a fundamental level, then extrapolate all the ways this DNA could have been, extrapolate how it may have been once, and maybe even the environment in which it lived (since all creatures carry the unique print of their environment on their DNA, by taking how they were and all the ways they could have been, you can plot a course from where they were to where they are, and eliminate all the ways they arent, to see how they were best fit for their environment.)
or just taking the human DNA and showing how we could have been, and how we were. its kinda fascinating if you think about it.
now im sure it would be a loooooong time before technology ever allows for such a thing, but in the terms of potential, its pretty nice. and it would certainly prove beyond a shadow of a doubt (that is so completely even a creationist couldnt deny it) that humans evolved from apes. when you can physically play a video of the human evolution as recorded in the dna, the expression of all the different homonids, there are few who could deny that.

Other Comments by the_ultimate_samurai
Reload Comments | Back to Top

Comment Entry: Please Login

Register a new account

Username:

Password: