#  Genomics and Speciation 

 



 ##  

  expand\_more  

 
  

 

The diversity of life on our planet is generated by the process of speciation, evolutionary splitting into divergent groups that can coexist in a state of nature. The changes at genomic level that cause speciation are still obscure. This is a time to jettison preconceived ideas, and to absorb what new data can tell us.  
Today, these data will frequently include whole genome sequences. *Heliconius melpomene* is the first butterfly species to have its genome both sequenced and mapped to chromosomes, by the *Heliconius* Genome Consortium (led by two former postdocs, Chris Jiggins and Owen McMillan). The new genome has considerably enhanced already active work on understanding the developmental genetics and evolution of mimicry, on the conservation of genomic structure (synteny), and on the evolution of olfactory and chemosensory genes, as well as on speciation. The *Heliconius* genome is only the second member of the Lepidoptera to be sequenced and mapped to chromosomes. The Lepidoptera is an important group with 180,000 species described, around 10% of the planet's known species. The genome is one of the key ENSEMBL genomes now used for comparative genomics purposes in many different fields.

   ![genomics_and_speciation](/sites/g/files/omnuum6606/files/styles/hwp_1_1__960x960_scale/public/malletlab/files/fig2.jpeg?itok=2LFnKqhS) 

 

**a.** Maps of the 21 *Heliconius* chromosomes (colour) and of the 28 *Bombyx* chromosomes (grey) based on positions of 6,010 orthologue pairs demonstrate highly conserved synteny and a shared n = 31 ancestor. Dotted lines within chromosomes indicate major chromosomal fusions. The major differences in chromosome number are explained by ten fusions on the *Heliconius* lineage (to give *n*=21 pairs of chromosomes), and three on the *Bombyx* lineage (*n*=28), both from the Lepidoptera modal karyotype of *n*=31.  
  
**b.** Maximum-likelihood tree showing expansions of chemosensory protein (CSP) genes in the two butterfly genomes.  
From Fig. 2 of [*Heliconius* Genome Consortium 2012](http://www.nature.com/nature/journal/vaop/ncurrent/full/nature11041.html)  
In spite of considerable conservation of synteny, we now realize that a single reference sequence in the genus is not enough. One problem is that separate species, and even colour pattern races within species may differ in indels or highly divergent regions that could be important in developmental switching.   
These are hard to find using just a single reference sequence. Therefore, the Consortium's next project is to sequence 16 genomes from across the genus.   
This will considerably improve reference mapping (for instance, in the *Heliconius erato* group), as well as improving our understanding of genomic evolution in the Lepidoptera, and in general.

[The *Heliconius melpomene* genome on ENSEMBL](http://metazoa.ensembl.org/Heliconius_melpomene/Info/Index)  
[The *Heliconius* genome browser](http://www.butterflygenome.org/) and [downloadable complete genomes](https://lepbase.cog.sanger.ac.uk/)

LepBase: *Heliconius* and other Lepidoptera genome data <https://lepbase.cog.sanger.ac.uk/>

#### How to sequence a genome on a tight budget: the case of *Heliconius*

The international group now known as the *Heliconius* Genome Consortium was conceived in around 2000, first meeting that year as a symposium at the Association of Tropical Biologists meeting at Bloomington, Indiana. In 2004 we put in the first white paper to NHGRI to sequence *Heliconius*. However, this was not chosen for funding. By 2009 next-generation sequencing became readily feasible, and the Consortium decided to go it alone. In 2009, [we drew up plans to sequence the *Heliconius melpomene* genome at a meeting in Harvard University](http://www.heliconius.org/2009/heliconius-genome-consortium-meeting-summary/). Nine laboratories each committed $15,000 to the enterprise, and the main sequencing work was carried out for us by Baylor. We were very fortunate that many other people and groups worldwide contributed analyses, assembly and annotations. By 2011 we had a draft genome, and in [2012 our paper was published in Nature](http://www.nature.com/nature/journal/vaop/ncurrent/full/nature11041.html).

What next. We next decided on a comparative data project, and ended up sequencing 20 genomes in collaboration with the Broad Institute of Harvard and MIT in Cambridge, Mass. Once again this was a collaborative, international project. We sequenced other related Heliconiini: *Dryas iulia, Eueides tales*, as well as re-sequencing *Heliconius erato* and *H. melpomene* and other de novo Heliconius genomes. This time the assemblies were made with high fidelity 2 x 250 bp paired end PCR-free Illumina, and the DISCOVAR assembly method. In [2019 our new paper was published in Science](https://pubmed.ncbi.nlm.nih.gov/31672890/). All the results were deposited in [LepBase](https://lepbase.cog.sanger.ac.uk/).

For further details of the Heliconius Genome consortium, see: [heliconius.org](http://www.heliconius.org/?s=heliconius+genome+consortium)

#### The laboratories contributing to the original *Heliconius* Genome Consortium

(alphabetical order)  
[Richard ffrench-Constant](http://biosciences.exeter.ac.uk/staff/index.php?web_id=richard_ffrench-constant), University of Exeter, Cornwall Campus  
[Chris Jiggins](http://heliconius.zoo.cam.ac.uk/), Cambridge University  
[Mathieu Joron](http://isyeb.mnhn.fr/joron/), formerly at Muséum d'Histoire Naturelle, Paris; now at CNRS Montpellier  
[Marcus Kronforst](http://www.kronforstlab.org/), University of Chicago  
[James Mallet](http://www.ucl.ac.uk/taxome/jim), formerly University College London; now at Harvard University  
[W. Owen McMillan](http://www.stri.si.edu/english/scientific_staff/staff_scientist/scientist.php?id=62), Smithsonian Tropical Research Institute, Panamá  
[Sean Mullen](http://people.bu.edu/smullen/Mullen_Lab_at_Boston_University/Home.html), Boston University  
[Robert Reed](http://reedlab.org/), Cornell University, and [Adriana Briscoe](http://visiongene.bio.uci.edu/Adriana_Briscoe/Briscoe_Lab.html), University of California, Irvine (both then at UC Irvine)