Scholarly article on topic 'The complete mitochondrial genome of Solemya velum (Mollusca: Bivalvia) and its relationships with Conchifera'

The complete mitochondrial genome of Solemya velum (Mollusca: Bivalvia) and its relationships with Conchifera Academic research paper on "Biological sciences"

CC BY
0
0
Share paper
Academic journal
BMC Genomics
OECD Field of science
Keywords
{""}

Academic research paper on topic "The complete mitochondrial genome of Solemya velum (Mollusca: Bivalvia) and its relationships with Conchifera"

The complete mitochondrial genome of Solemya velum (Mollusca: Bivalvia) and its relationships with Conchifera

Plazzi et al.

Bio Med Central

Plazzi et al. BMC Genomics 2013,14:409 http://www.biomedcentral.com/1471-2164/14/409

Genomics

RESEARCH ARTICLE Open Access

The complete mitochondrial genome of Solemya velum (Mollusca: Bivalvia) and its relationships with Conchifera

Federico Plazzi*, Anisa Ribani and Marco Passamonti

Abstract

Background: Bivalve mitochondrial genomes exhibit a wide array of uncommon features, like extensive gene rearrangements, large sizes, and unusual ways of inheritance. Species pertaining to the order Solemyida (subclass Opponobranchia) show many peculiar evolutionary adaptations, f.i. extensive symbiosis with chemoautotrophic bacteria. Despite Opponobranchia are central in bivalve phylogeny, being considered the sister group of all Autobranchia, a complete mitochondrial genome has not been sequenced yet.

Results: In this paper, we characterized the complete mitochondrial genome of the Atlantic awning clam Solemya velum: A-T content, gene arrangement and other features are more similar to putative ancestral mollusks than to other bivalves. Two supranumerary open reading frames are present in a large, otherwise unassigned, region, while the origin of replication could be located in a region upstream to the cox3 gene.

Conclusions: We show that S. velum mitogenome retains most of the ancestral conchiferan features, which is unusual among bivalve mollusks, and we discuss main peculiarities of this first example of an organellar genome coming from the subclass Opponobranchia. Mitochondrial genomes of Solemya (for bivalves) and Haliotis (for gastropods) seem to retain the original condition of mollusks, as most probably exemplified by Katharina.

Keywords: Solemya velum, Mitochondrial genome, Gene arrangement, Origin of replication, Mitogenomics

Background

Bivalves and mitochondrial DNA

In animals, the mitochondrial genome (mtDNA) is typically a small, circular and compact molecule, generally encoding for 37 genes: 13 protein-coding genes (PCGs), 2 rRNAs, and 22 tRNAs [1-3]. Even if striking exceptions to this standard are known [4-10], most differences among animal mtDNAs involve gene content and arrangement.

Mollusks have shown high variability in mitochondrial genome architecture [3,11-13], with respect to many genomic features, i.e. length, gene arrangement, strand assignment, gene duplications and losses, nucleotide composition, and more. Within mollusks, gastropods and bivalves show extensive variations, even with differences within the same family or genus [11,14,15]. Furthermore, a major peculiar trait of mitochondrial genome in some

* Correspondence: federico.plazzi@unibo.it

Department of BiologicalGeologicaland EnvironmentalSciences, University of Bologna, Via Selmi, 3, Bologna 40126, Italy

bivalve species is the presence of an alternative pattern of mitochondrial inheritance known as Doubly Uniparental Inheritance or DUI [16-20], that involves two separate lineages of mtDNAs. One mitochondrial genome (called F) is transmitted from the mother to the complete offspring, whereas the other one (called M) is transmitted from the father to sons, where it localizes in germline and, therefore, in sperm. Still, differences in gene order were also detected between the two DUI-related lineages within the same species [13,21,22].

Mitochondrial gene order has been shown to be a good marker for phylogenetic relationships, because rearrangements are random discrete events, and retro-mutation is very unlikely (see, f.i., [23-26]; and reference therein). This is even more true for arrangements of protein coding or rRNA genes, because they are much rarer than the ones involving tRNAs [1,11,13,27,28].

Tracing the "archaic" mtDNA gene arrangement of Mollusca, we can start from groups sequenced so far seeming to retain most ancestral features. Among them,

O© 2013 Plazzi et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Central Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, anc reproduction in any medium, provided the original work is properly cited.

chitons are aculiferan mollusks that are considered sister group to all conchiferans, which include scaphopods, cephalopods, gastropods, and bivalves (see, f.i., [29-31]; and reference therein). For PCGs and rRNAs, the gene arrangement of the chiton Katharina tunicata [32] is also shared by Haliotis, a primitive gastropod [33]; a single inversion separates it from the gene arrangement of Caenogastropoda (with the exception of D. gregarium; see [15,34]); few steps are required to transform the gene order of Katharina in that of Nautilus, a primitive ceph-alopod [35]. A single translocation also separates it from the aplacophoran C. nitidulum ([GenBank:EF211990]). Moreover, the gene arrangement of Katharina tunicata shows some outstanding similarities to lophophorates and even arthropods [27,32,36]. This clearly points out that Katharina may have the most "archaic" gene order known so far among Mollusca, and maybe the ancestral mollusk gene arrangement [13,28].

Many exceptions to typical gene content are known: f.i., it is well known that the atp8 gene has been reported as missing in several bivalve species, as discussed in ([37]; and reference therein). atp8 is present on the same strand in all Unionoida ([37], and reference therein); as a single exception, it is incomplete in the male mtDNA of Pyganodon grandis [22,38]. Moreover, it has been found in some heterodonts, like Loripes lacteus ([GenBank: EF043341]), Lucinella divaricata ([GenBank:EF043342]), Meretrix lamarckii [39], and Meretrix lusoria [40]; a putative atp8 has also been recently reported from the mytilid Musculista senhousia [18]. Conversely, it was not recovered in the mactrid Coelomactra antiquata [41].

Among the other exceptions to gene features and content, the rrnS gene is duplicated in some species of genus Crassostrea, while the rrnL gene is split in two separate fragments in all ostreids known to date ([14,42-44]; [GenBank:FJ841968]); finally, two versions of cox2 were found in the Musculista senhousia M mtDNA [18].

The taxonomic position of Solemya

Despite sharing a common bivalve shell, two major kind of bivalvian mollusks are known. The first group, the Opponobranchia, including most of the protobranchiate bivalves, is generally deemed to retain many ancestral features, while the second, the Autobranchia, i.e. lamelli-branch bivalves with completely functional filter-feeding gills, is a big assemblage of species showing a more derived morphology, when compared to putative ancestral mollusks. Among bivalves, complete mitochondrial genomes are nowadays available for Autobranchia only, which show many differences with respect to the condition of K. tunicata.

In the subclass Opponobranchia there are basically two groups of living bivalves: Nuculida and Solemyida. The genus Solemya and relatives have edentulous or nearly

edentulous, generally equivalved shells, with an homogeneous aragonitic ostracum, a glossy, thick and brown periostracum, and the mollusk shows a broad fringe, siphonate mantle and burrowing habits [45,46]. Most solemyids are involved in symbiosis with chemoauto-trophic, gill-hosted, bacteria, enabling life in unusual habitats like deep-sea vents [46,47].

In this paper, we present the first complete mitochon-drial genome of a representative of Opponobranchia, the Atlantic awning clam Solemya velum Say, 1822. This organelle genome was completely annotated and compared to other available bivalve and conchiferan complete mitochondrial genomes. Because of the sister-group relationship between Opponobranchia and Autobranchia, the characteristics of the Solemya mitogenome have proved useful to compare bivalve mtDNA features with those of other mollusks. In this paper we discuss our findings with particular reference to: (i) the presence of the atp8 gene; (ii) nucleotide composition; (iii) strand patterns; (iv) origin of replication; (v) supranumerary ORFs; (vi) gene arrangement.

Methods

DNA extraction, Long-PCR reactions and sequencing

Specimens of Solemya velum were collected by and commercially purchased at the Woods Hole Oceanographic Institution (Massachusetts, USA) in Summer 2003. A standard phenol:chloroform protocol was used to extract total genomic DNA from a pool of 5 individuals, because of the small dimensions of the obtained specimens.

The technique of Long-PCR amplification, paired with sequencing with primer-walking or shotgun cloning, was used to amplify the complete mitochondrial genome in several overlapping fragments. The Herculase® II Fusion Enzyme (Stratagene) kit was used to perform Long-PCR reactions up to 10,000 bp. The reaction conditions were set as follows: 10 ^L 5x Herculase® II Fusion reaction buffer, nucleotides 250 ^M each, primers 0.25 ^M each, 0.5 ^L Herculase® II Fusion, 5 ^L template DNA, ddH2O up to 50 ^L. Cycle conditions were set up as an initial denatur-ation step at 92°C for 2', 40 cycles of denaturation at 92°C for 10', annealing at 48-52°C for 30'', and extension at 68°C for 10', and a final extension step of 68°C for 8'. Primers used for Long-PCR were used to sequence long amplicons and new internal specific primers to complete primer-walking were designed with Primer3 online tool [48].

Routine PCR amplification was performed for amplicons <2,000 bp with GoTaq® Flexi DNA Polymerase (Promega) as in [49]. Amplicons were purified through PEG precipitation [50], or with Wizard® SV Gel and PCR Clean-Up System (Promega); when necessary, they were ligated into a pGEM® T-Easy Vector (Promega) and transformed into MAX Efficiency® DH5a™ Chemically Competent Cells (Invitrogen) as

in [51]. Additional files 1 and 2 list all primers used for this study. All sequencing reactions were carried out through the Macrogen Europe (Amsterdam, The Netherland) facility. Only in the case of long-PCR with primers COI2F and Solemya_3a1894R (see Additional file 1), the 10,000 bp long amplicon was purified with 5% PEG precipitation [50] and sequenced via shotgun cloning. This was carried out by Macrogen Korea (Seoul, South Korea).

Sequence annotation

Protein-coding genes were annotated using the online ORF Finder tool [52]; the software Glimmer 3.02 [53] under iterated pipeline for assessing ORF features was used to confirm results; homology search was carried out with BLAST ([54,55; and reference therein]). We investigated structures and putative functions of unknown ORFs through the @TOME 2.0 ([56]; and reference therein) and InterProScan ([57]; and reference therein) online tools: signal peptides were sought with SignalP

[58], while similarities were detected using HHsearch

[59], SP3 [60] and Fugue [61].

Start codons of PCGs were set at the first start codon found by ORF Finder that did not overlap with an upstream gene; whenever a stop codon was overlapping with a following gene, it was moved backwards to the first suitable codon starting with T/TA (thus annotating a hypothetical truncated T--/TA- stop codon). In cases of neighboring PCGs, these in silico predictions were tested looking for a secondary structure with a possible cleavage signaling function in the connecting region, using the Mfold server [62] and a folding temperature of 14°C. tRNA genes were predicted with tRNAscan-SE 1.21 [63,64] and ARWEN 1.2 [65]. The Mfold server was used to predict the secondary structure of unassigned regions; all secondary structures were graphically edited with VARNA 3.7 [66].

Codon usage and nucleotide composition statistics were computed using MEGA 5.03 [67] and Microsoft Excel® 2007; repeated sequences were found with Spectral Repeat Finder v 1.1 [68]. The mitochondrial genome map was prepared using GenomeVx [69], setting cox1 as the starting point of the mtDNA and labeling its coding strand as " + ".

Phylogenetic analysis

Complete mitochondrial genomes of bivalves and other mollusks were downloaded from GenBank in November 2011 (Additional file 3). Summarizing, we included in our dataset 30 bivalves, 23 gastropods, 6 cephalopods, 1 scaphopod, 1 polyplacophoran, 1 chaetodermomorph, and the polychaete outgroup Platynereis dumerilii [70]. We assessed phylogenetic representativeness of this sample through the AvTD method as in [49]. We used

the software PhyRe [71] and set the number of splits, merges, and moves to 2, shuffling at the family level. Sequences were managed through CLC Sequence Viewer 6.6.2 (CLC bio A/S), Microsoft Excel® 2007, and MEGA 5.03.

Each gene, with the exception of atp8, was separately translated into amminoacids and aligned with MAFFT 6 [72] and Muscle 3.8.31 [73,74], using the M-Coffee merging algorithm [75,76]. Gblocks [77,78] was used to select blocks of conserved positions suitable for phylogenetic analysis under default (stringent) conditions.

PartitionFinderProtein 1.0.1 [79], using the greedy option and Bayesian Information Criterion (BIC), tested the best partitioning scheme of our dataset, which was chosen for subsequent analysis, as well as the concatenated alignment and the completely partitioned model. Best-fitting amminoacid substitutions models were selected with ProtTest 3.2 ([80]; and reference therein), through Phyml [81] and BIC for model selection.

The software RAxML 7.2.8 [82,83] was used for maximum likelihood analyses, using both the fast (-x) and the standard (-b) bootstrap algorithm with 200 replicates. The PROTCAT model [84] was implemented for optimization of individual per-site substitution rates, using models suggested by ProtTest 3.2. Trees were graphically edited by PhyloWidget [85], Dendroscope [86], and Inkscape softwares.

Results

Genomic features

The complete mitochondrial genome of Solemya velum was found to be 15,660 bp long. It was deposited into GenBank database under Accession Number [GenBank: NC_017612]. All genes of the standard metazoan mito-chondrial genome were found, including the atp8 gene (Figure 1). With the only exception of trnT, genes are organized in a large cluster on the " + " strand (from trnG to trnF) and in a slightly shorter cluster on the "-" strand (from trnE to atp6). 22 tRNAs are present: as usual for animal mtDNA, two serine-encoding tRNAs, trnSl (AGN) and trnS2(UCN), and two leucine-encoding tRNAs, trnL1(CUN) and trnL2(UUR) were found. The secondary structure of tRNA genes was predicted and is shown in Additional file 4; as expected [1,18,87-90], trnS1(AGN) presents a reduced DHU arm.

Nucleotide composition and A-T/G-C proportions were computed for each single gene and for PCGs, third codon positions, ribosomal genes, tRNAs, and URs taken as a whole (Additional file 5): the total A-T content of S. velum mitochondrial genome is 68.11%. A chi-square test with 1 d. f. demonstrated that the A-T composition of S. velum mtDNA is significantly different from that of other bivalves, gastropods, scaphopods (p < 0.005), and K. tunicata (Polyplacophora; p < 0.010); however, no significant

Qo/omi/o i/all im

Name Type Start End Length (bp) UNs Strand An ti codon Start codon Stop«

cox 7 PCG 1 1536 1536 17 + ATG TAA

cox2 tmS2 PCG tRNA 1554 2272 2240 2340 687 69 31 + -1 + TGA ATG TAA

tmT tRNA 2340 2405 66 5 - TGT

ned4L PCG 2411 2707 297 11 + ATG TAA

nmM PCG 2719 4062 1344 4 + ATA TAA

tmH tRNA 4067 4132 66 1 + GTG

nsd5 PCG 4134 5860 1727 0 + ATG TA-

tmF tRNA 5861 5927 67 31 + GAA

atp6 PCG 5959 6654 696 5 ATG TAA

tmD tRNA 6660 6728 69 7 - GTC

aip8 PCG 6736 6900 165 14 ATG TAG

cytb PCG 6915 8037 1123 21 ATT T-

nad6 PCG 8059 8559 501 -2 ATA TAG

nad1 PCG 8629 9570 942 0 ATG TAA

tmL2 tRNA 9571 9638 68 1 - TAA

tmL1 tRNA 9640 9710 71 0 - TAG

rmL rRNA 9711 11088 1376 0 -

tmV tRNA 11087 11159 73 0 TAC

rmS tmM tmC tmY rRNA tRNA tRNA tRNA 11160 12045 12117 12188 12044 12113 12186 12252 885 0 - CAT GCA GTA

70 67 -1 0

tmW tmQ tmE tRNA tRNA tRNA 12253 12325 12395 12322 12395 12462 70 71 68 2 -1 105 TCA TTG TTC

tm G cox3 tRNA PCG 12568 12636 12635 13415 68 780 0 + 1 + TCC ATG TAA

tmK tRNA 13417 13486 70 372 + TTT TGC TCG

tmA tmR tRNA tRNA 13859 13937 13928 14004 70 68 8 + 1 +

tmN tRNA 14006 14074 69 0 + GTT

tml tRNA 14075 14144 70 0 + GAT

nad3 PCG 14145 14497 353 0 + ATG TA-

tmS1 tRNA 14498 14566 69 0 + GCT

nad2 PCG 14587 15655 1089 5 + ATG TAA

Figure 1 Genomic map and annotation of S. velum complete mitochondrial genome. Genes located on the externalside of the map are

on " +" strand (i.e. that encoding cox/), whereas genes on the internalside are on "-" strand. URs > 100 bp are shown and they were arbitrarily set on " +" strand. UNs, number of unassigned nucleotides after the gene: if negative, overlapping nucleotides with the following gene; PCG, Proteir Coding Gene.

difference was found with mtDNA A-T composition of C. nitidulum (Caudofoveata) and Cephalopoda (see raw data in Additional file 6).

A-T content and A-T/G-C skew are shown in Figure 2 for S. velum and three other mollusks for comparison: Katharina tunicata (Polyplacophora), Unio pictorum (Bivalvia: Palaeoheterodonta), and Meretrix petechialis (Bivalvia: Heterodonta). A-T content is often similar to that of K. tunicata (and M. petechialis), while skews, at the genomic level, are more similar to those of U. pictorum. We also plotted the A-C and the G-T content along the mitochondrial genome (using the " + " strand) with a sliding window of 151 bp (Figure 3). Overall, there is bias towards neither pair, but we evidenced a region of high A-C content spanning approximately from the atp6 to the rrnL gene, i.e. part of the mitochondrial genome where all genes are on the "-" strand.

The most common start codon (Figure 1) is ATG (10 PCGs), but also alternative codons were detected, in accordance with previous findings in different invertebrates [1,91]. Most probably, truncated stop codon are used in three genes, namely nad3/nad5 (TA-) and cytb (T-). As already shown (f.i., [18,91,92]), these are common in meta-zoan mitochondrial genomes, with TAA stop codon subsequently restored by post-transcriptional polyadenilation. In five cases (Figure 1), two PCGs are not separated by any tRNA and are neighboring: in all cases, a stem-loop

structure with a putative cleavage function of the polycis-tronic primary transcript has been found (Additional file 7). S. velum mtDNA contains 3,735 codifying codons, whose usage is shown in Additional file 8. Most used codon is UUA (Leu), while less used codon is CGC (Arg). The commonest amminoacid is leucine, while the rarest is cysteine.

The third base of the codon is most often an A (38.93%) or a T (38.32%), whereas C (12.39%) and G (10.36%) are less represented in this position. This may simply relate to the degeneracy of the mitochondrial code, but it is possible that natural selection is even looser in these positions because of a wobble effect: as already reported for other metazoans ([18]; and reference therein), only some codons in the PCGs do have their relative tRNA/anticodon on the mtDNA (see Additional file 8).

We found precise patterns of nucleotide content in fourfold degenerate third codon position along the mtDNA molecule (Figure 4). We used all the PCGs as a possible starting point for the analysis and best results were obtained when cox3 was used as the first gene in the analysis. All the correlations were found to be significant with the exception of C content (see caption to Figure 4 for details).

Unassigned regions

Relatively few unassigned regions (URs) are present in the mitochondrial genome of S. velum (4.12% of the

-0.10 -0.20

0.40 ■ 0.20 0.00 - H ik ll IM 1 ■ 1 ,0 1 1.. -1 ■ fr+l-l

-0.20 -0.40 1 ■ 1 p 1 I 1

CD CO 00

F Q. Q.

C\J Co -Q C\| Co o FV CO (0

o ° c c

S OS2.

CD O cl

Ezzd ^ a: on

■g-s

Figure 2 Compositional patterns of S. velum and other mollusks mitochondrial genomes. A-T content, A-T skew, and G-C skew are computed for each single gene and for many genomic regions following the legend below the chart.

genome length; Figure 1; Additional files 5 and 9). Most of them are between 11 and 31 bp, but the largest ones are UR7 (105 bp) and UR8 (372 bp), between the trnE/ trnG and trnK/trnA gene pairs, respectively.

The putative secondary structures of UR7 and UR8 are shown in Figure 5. UR7 folds as a double hairpin; UR8 folds in a more complex pattern, with several stem-and -loop substructures. A repeated 17-bp long motif was found in this region (5'-ACCAGCCGGTTTTTCTA-3'), starting at bases 220 and 337 of UR8 sequence. Both UR7 and UR8 have a high A-T content (84.76% and 70.43%, respectively), making of UR7 the A-T-richest region in the genome.

Two small ORFs of 117 and 195 nucleotides were found within UR8 and were called ORF117 and ORF195, respectively. ORF117 starts at nucleotide 13,524 (start codon: ATA) and ends at 13,640 (stop codon: TAA) on the " + " strand; according to Glimmer results, ORF117 has a "raw" scoring value of 3.05, which is comparable with that of other PCGs, like nad4L and nad6 (2.58 and 3.19, respectively). A BLAST search could find a possible

homolog within the putative control region of Haliotis rubra [33], which was also confirmed by Glimmer. The A-T content of ORF117 is 59.83% and the most used codon is UUU (Phe), with 7 hits.

ORF195 starts at nucleotide 13,846 (start codon: ATT) and ends at 13,652 (stop codon: TAG) on the "-" strand. ORF195 was not confirmed by Glimmer, but SignalP could retrieve a weak similarity with a signal peptide in the first 29 amminoacids of the putative translated protein; the A-T content of ORF195 is 73.33% and the most used codon is AAA (Lys), with 12 hits.

InterProScan with TMHMM 2.0 online tool could not identify any domain within ORF117, while a transmembrane domain was found within ORF195 (amminoacids 15-32). Using the @TOME 2.0 server, HHsearch, SP3 and Fugue could find some similarities between ORF117 and the DNA-binding domains of some proteins ([PDB:1TNS], [PDB:1C20], [PDB:3GNA], [PDB:1G4D]); HHsearch and Fugue also showed homology of ORF195 with two membrane-linked proteins ([PDB:1FME], [PDB:1MEQ]).

Phylogenetic analysis

The Phylogenetic Representativeness of our sample of mollusk mitogenomes is shown in Additional file 10. Using the original mollusk master list as published by [93], the AvTD of our sample is lying exactly on the mean AvTD of 100 random subsamples of equal size (Additional file 10a, diamond) and it is above the highest random result (Additional file 10b, diamond) if a change in underlying taxonomy is simulated.

The overall length of concatenated alignment, after Gblocks masking, was of 1,782 amminoacids and the nad4L gene was completely excluded from the analysis, lacking suitable blocks. The software PartitionFinderProtein selected a 3-blocks model: the first cluster was atp6-cytb-nad2-nad3-nad4-nad5; the second one was cox1-cox2-cox3-nad1; the nad6 gene was given its own partition. All models selected by ProtTest and piped to RAxML are listed in Additional file 11.

The six ML searches converged on similar trees: following the partitioning scheme selected by Par-titionFinderProtein and using the complete bootstrap procedure we obtained the tree shown in Figure 6. Katharina tunicata is the sister taxon of all other mollusks; a node with low bootstrap support (BS = 20.5) separates Solemya + (Haliotis + Caenogastropoda) from (Graptacme + Cephalopoda) + (Heterobranchia + Auto-branchia). In this scenario, both Bivalvia and Gastropoda seem polyphyletic, but deep nodes are weakly supported (13.0 < BS <44.5). Conversely, Cephalopoda, Heterobranchia, Amarsipobranchia sensu [49], Palae-oheterodonta, and Caenogastropoda were recovered as monophyletic with high BS.

Gene order

As expected from the high plasticity of molluscan (and particularly bivalvian) mitochondrial genomes, many differences were found between S. velum and other mollusks' gene arrangement. However, we found that Katharina tunicata and Haliotis rubra (Gastropoda: Vetigastropoda) share the same gene order (recall that we excluded tRNAs for this analysis) and, more interestingly, that this gene order is highly similar to that of S. velum, the only difference being an event of inversion of the atp8-atp6-nad5-nad4-nad4L cluster (Figure 7). If tRNAs were taken into account, this inversion would hold true (even if the cluster is slightly different due to trnT, trnH, trnF, and trnD). The same inversion event can be partially recovered in the unpublished partial mitochondrial genome of the opponobranchiate Nucula nucleus (Bivalvia: Nuculida), which is available in GenBank under the Accession Number [GenBank:EF211991].

Discussion

Gene content

The Solemya mtDNA contains all genes of the standard metazoan mitochondrial genome (Figure 1). It is tempting to conclude that the loss/degeneracy of atp8 is restricted to Amarsipobranchia, given the presence of this gene in palaeoheterodonts and in S. velum (and, following the GenBank partial mitochondrial genome, also in Nucula nucleus). Whether the absence of atp8 gene is real or just an outcome of incorrect annotations (see, f.i., [18,37]), its presence in a bivalve like Solemya further supports, if necessary, that the ancestral bivalve condition is the retention of a fully functional atp8 gene.

mtDNA (bp)

Figure 3 A-C and G-T content along the mitochondrial genome of S. velum. A-C (pale blue) and G-T (dark red) contents are computed on a sliding window of size 151 bp. A linear sketch of the complete genome as annotated in Figure 1 is depicted above the plot: blue, PCGs; green, tRNAs; brown, rRNAs; gray, URs >100 bp; genes on the " +" strand are above the black line and genes on the "-" strand are below it.

Moreover, it is common in metazoans to find neighboring atp6 and atp8 on the same strand [1] and it has been suggested that uncleaved transcripts may be co-translated [94,95]. Tough this arrangement is not found in many phyla, like Plathyhelminthes, Nematoda, Annelida, Sipunculida, Brachiopoda, and Mollusca (and atp8 itself is also lacking in some of them; [13]), nevertheless atp6 and atp8 are neighboring in S. velum (Tab 3). The same association can be found in basal mollusks, like C. nitidulum and K. tunicata, and in other conchiferans, like cephalopods (with the exception of N. macromphalus), Caenogastropoda, and Heterobranchia (albeit on the opposite strand).

Genome features

Mean A-T content in main molluscan classes ranges between 63.51% (bivalves) and 74.12% (scaphopod G. eborea): S. velum has a high A-T content (68.11%), being significantly more similar to aculiferans and cephalopods than to other bivalves (Additional file 6): actually, the A-T content of Autobranchia is between 55.20% (M. yessoensis; [96]) and 69.70% (V. philippinarum, [GenBank:NC_003354]). Irrespective of the functional constraints and gene features, we found an unbiased, when not low, G-T content in the " + " strand (Figure 3). Most commonly, the leading strand is G-T rich ([35,97-103]; but see [104-106]). The G-T content is not particularly high in any region of the S. velum " + " strand, and it even drops to very low values where the molecule encodes PCGs on the "-" strand (Figure 3).

All genes are located on the same strand in Amarsipobranchia, and most of them in Palaeoheterodonta. Contrastingly, in Solemya velum, genes are evenly

distributed among " + " and "-" strands, with 18 and 19 genes, respectively. Even if a H-biased distribution of genes is found in other lophotrochozoans, like annelids, brachio-pods, bryozoans and platyhelminths (see, f.i., [13,27]), an even distribution is the commonest situation among Mollusca (Additional file 6; but see [15]) and, notably, as for A-T content, S. velum is quite similar to Caudofoveata, Cephalopoda, Polyplacophora, and Scaphopoda. This gene distribution on both strands rises a stimulating question on strand assignment: which is the leading (heavy; antisense) strand in S. velum? Patterns evidenced in S. velum resemble those of N. macromphalus [35]. Contrarily, a strand with a sharper G-T predominance has been signaled, f.i., in some gastropods [15] and in Katharina [35]. It seems that mtDNAs with most genes on the same strand (e.g., Caenogastropoda, Amarsipobranchia) tend to have higher G-T values than mtDNAs with genes evenly distributed on either strand (e.g., Cephalopoda, Palaeoheterodonta).

Control region and origins of replication

The animal mtDNA control region (CR) should contain or neighbor the origins of replication (ORs). [107] and, specifically for bivalves, Breton and colleagues ([22]; and reference therein) proposed several parameters to annotate the CR, like (i) UR length, (ii) evidence for secondary structure with T-rich loops, (iii) high A-T content and (iv) repetitive elements and palindromes. The best candidates for S. velum control region are the unassigned regions UR7 and UR8 (see Additional file 9), but, as expected, they give no BLAST hits with putative CRs of other mollusks. Among the two largest URs, UR8 is the longest un-assigned region. Both URs evidenced complex putative

Figure 4 Location of the origin of replication of the H strand. A (green), C (blue), G (black), and T (red) content at four-fold degenerate codons of PGCs are shown. Percent contents of each PCG are plotted at the midpoint of the ORF using the first nucleotide of cox3 ORF as the starting point. We also included ORF117 in this analysis (see text for further details). Equations are as follows. %A, y = 0.0017 x + 39.88, r2 = 0.64, p < 0.005;%C, y = 0.0001 x 15.07, r2 = 0.01, p = 0.71;%G, y = - 0.0009x14.09, r2 = 0.30, p < 0.05;%oT, y = - 0.0009x30.96, r2 = 0.39, p < 0.05.

secondary structures (Figure 5); again, both URs have a high A-T content, but while the A-T content of UR8 is 70.43% (somewhat near the overall genome score of 68.11%), it is up to 84.76% for UR7, much more than other putative CRs of mollusks [108]. On the other side, the only 17 bp-long repeated motif found in these URs was found in UR8. So, based on the above mentioned characteristics, it is not possible to unambiguously assign the CR function to either UR.

[102] suggested that mutations at four-fold degenerate sites should be completely neutral, being positions under no or limited selection. Therefore, in absence of selective constraints, the heavy (antisense) strand would accumulate G and T at these sites, while the light (sense) strand would accumulate A and C. Consequently, A-T and G-C skews at the four-fold degenerate codon sites are known to be significantly correlated with the single-strand duration during duplication, and therefore with the position of each PCG with respect to the OR of that strand [22,102]. Precise linear patterns of the percentages of

each of the 4 nucleotides were found when cox3 was used as the starting point (Figure 4), being the correlation significant for 3 nucleotides out of 4. The slope was positive for A and C, and negative for G and T. This is very similar to the findings of [102] for mammals and [15] for vermetid gastropods. These data finally point to three conclusions: (i) the strand we call " +" (i.e., the one encoding coxl) is the heavy (antisense) strand, while the strand we call "-" is the light (sense) strand; (ii) the CR of S. velum mtDNA is located immediately before cox3 in the UR7 region; (iii) as UR7 is located at the H/L switch, we suggest this to be the OR of both strands, working in either direction.

The region of the putative OR of the H strand encompasses UR7 and a cassette of tRNAs on the "-" strand, namely trnM, trnC, trnY, trnW, trnQ, and trnE, a situation already signaled in the family Vermetidae [15] and in the unionid Inversidens japanensis [22]. As shown by [109], tRNAs on one strand can sometimes work as OR in the opposite one by forming alternative secondary

dG = -23.40 1<C3 l/mo

Figure 5 UR secondary structures. Putative secondary structures of largest URs of S. velum mitochondrialgenome were inferred by software Mfold. The Gibbs energy (dG) is shown at the bottom of each structure. a, UR7; b, UR8.

Figure 6 Maximum likelihood (ML) tree. The tree shown is the consensus tree of 200 standard bootstrap replicates and the bootstrap proportion (BP) is shown for each node. Bivalves are shown in red; the asterisk indicates Solemya velum.

structures other than conventional cloverleaves. Further comparison with other bivalves is tricky because all genes are on the same strand within Amarsipobranchia, but, similarly, the CR of Mytilus spp. is located after a group of 7 tRNAs and the rrnL gene [21]; the major non-coding region (MNR) of Mimachamys nobilis is located after a cassette of 8 tRNAs [96]; the CR of Meretrix spp. is located after 7 consecutive tRNAs [39,40,110,111].

The supranumerary ORFs within UR8

If UR7 is the putative CR on Solemya mtDNA, the function of UR8 remains unknown. Its length and secondary structure may have some kind of signaling function, but

it is quite noteworthy that two ORFs were found here: ORF117 and ORF195. Are they functional or not? Remarkably, they span over the almost complete UR8, leaving only small unassigned nucleotide stretches of 37, 11, and 12 bp, similar to other intergenic spacers in S. velum mtDNA (see Additional file 9).

ORF117 has a Glimmer "raw" score comparable to other PCGs of S. velum mtDNA (i.e., nad4L and nad6). Although it is nested in the A-T rich UR8 (70.43%), it has a lower A-T content (59.83%), so that its composition is actually different from the rest of the UR8. Moreover, notwithstanding the low (for S. velum) A-T content, the most used codon is UUU (Phe), which is also among the most used ones in all PCGs of this mtDNA, and therefore its

codon composition is similar to that of other PCGs. The nucleotide pattern at four-fold degenerate sites is also consistent with other PCGs (Figure 4). Finally, ORF117 is almost completely located in a region of UR8 with few secondary structures (Figure 5). The presence of an homolog of ORF117 in the putative CR of Haliotis rubra, a species that, as above mentioned, seems to retain most ancestral features of molluscan mtDNA [33], may be related to a common origin of this ORF.

Conversely, ORF195 is not found by Glimmer and has a higher A-T content than ORF117 (73.33%); consistently, the most used codon is AAA (Lys). However, possible homology of ORF195 with membrane proteins is confirmed to some extent by the finding of a large transmembrane domain; the presence of a signal peptide constitutes a further in silico evidence favoring the functionality of this ORF.

It is not easy to assign to a protein a functional role only relying on bioinformatics data: expectedly, given the low homology scores and the short length of both ORFs, many different kinds of proteins and ligands were suggested by tools hosted on the @TOME 2.0 server. The presence of supranumerary ORFs in mitochondrial genomes has been reported elsewhere (f.i., [11,22,112,113]; and references therein) and they mostly are of obscure function, but they generally share either a DNA-binding motif or a transmembrane region.

The commonest hit of ORF117 was with DNA-binding domains of other polypeptides, and many of them were top-ranked using the alignments scores as a sorting criterion. The putative transmembrane region of ORF195 is 19

amminoacids long and it is found in the N-terminal part of the peptide; it is followed by 12 positively charged amminoacids (either K or R) out of 32 in the C-terminal half of the protein. Interestingly, this architecture is the same described by [22] for supranumerary sex-linked ORFs in unionid mitochondrial genomes. Breton and colleagues suggest a possible role for these ORFs, which must be involved in the complex machinery of the DUI mechanism. Present findings may confirm their claim that natural selection is working on maintaining the structure, rather than the sequence, of transmembrane sup-ranumerary mitochondrial ORFs [22].

The presence of a putative transmembrane signaling peptide in ORF195 and the DNA-binding signal in ORF117 may suggest a regulatory role for both these proteins; moreover, their presence in the S. velum mtDNA might constitute an evidence of the ancestral presence of such supranumerary ORFs in all bivalves. However, it has to be noted that this remains an in silico analysis and that some features of ORF195 could be randomly due to the high A-T content (f.i., the signal peptide): therefore, detailed analyses of mRNA gene expression are required to shed light on these issues.

Phylogenetic analysis and the usefulness of mitochondrial markers

The position of Solemya is unexpected, being nested within Gastropoda in our tree. However, node support is evidently very low (44.5), so our analysis does not point to a diphyly of Bivalvia, but rather to a polytomy at the base of the tree, including Solemya, Katharina and

Nautilus macromphalus

Sepia officinalis

Chaetoderma nitidulum

1 tandem duplication event

Solemya velum

Caenogastropoda*

Figure 7 Gene rearrangements. Reconstruction of relationships among gene arrangements of Katharina tunicata/Haliotis rubra/Haliotis tuberculata, Solemya velum, C. nitidulum, Caenogastropoda, N. macromphalus, and S. officinalis is shown, after the exclusion of tRNAs. Genes involved in the rearrangements are shaded in gray. If a minus sign ("-") is present, the gene is encoded on the "-" strand, otherwise it is encoded on the " +" strand. The asterisk is to signalthat D. gregarium is an exception to the displayed common gene arrangement of Caenogastropoda; square brackets refer to a possible gene arrangement ancestralto N. macromphalus (subclass Nautiloida) and S. officinalis.

Haliotis. We may suggest that the lineage leading to Opponobranchia arose so early in the molluscan radiation that little or no phylogenetic signal of this event may be retrievable in Solemya mtDNA. The structural similarities of Solemya mtDNA to Haliotis and Katharina, and the huge differences to other bivalvian mitochondrial genomes, may have affected the clustering, hence the incorrect weak relationship to Gastropoda, which has rather to be considered an artifact.

The diphyly of Bivalvia is not a new find however (see, f.i., [114-116]), even if most recent phylogenomic studies could strongly retrieve bivalves as monophyletic [29-31]. On this aspect, [117] misinterpreted our previous results [49,51], as this is the first time we obtain bivalves as diphyletic with mitochondrial DNA; this is simply because in previous works, focusing on internal relationships of the Class, we invariably forced bivalves to be monophyletic [49,51]. It is remarkable that consistency with bivalves' lower-level taxonomy was always maintained by our previous mtDNA analyses, a consistency which is actually lacking in [117]. On the other hand, mtDNA fails to retrieve strong phylogenetic signal for the most basal molluscan phylogenetic events, thus retrieving controversial results. Other molecular markers are needed on the issue.

Mitochondrial gene arrangement may better help in tracing basal phylogenetic relationships [23-26]. Remarkably, Solemya gene order connects to the Katarina one by a single gene inversion event (Figure 7), once again pointing out that Solemya belongs to a group of mol-lusks maintaining archaic mtDNA features. The same inversion can be traced in N. nucleus: the presence of the inversion can be extrapolated by the sequenced regions and therefore can be considered very likely. This may be taken as evidence for a sister group relationship between Nuculida and Solemyida, thus supporting again the monophyly of Opponobranchia. Given the questioned status of Nuculanoidea (f.i., [30,49,51,117,118]; and reference therein), it would be very interesting to obtain the complete mitochondrial genome of a species belonging to this super-family and to compare it with the one of S. velum. On the other hand, gene orders of autobranchiate bivalves known so far are so highly derived and hardly connectible (if not at all) to this archaic condition that the gene order of S. velum is useless in tracing phylogenetic relationships between Opponobranchia and other bivalves. Only the invention of a slow-evolving autobranch bivalve mtDNA (if it exists) would help to trace Bivalvia deep phylogenetic relationships based on mtDNA gene arrangements.

Conclusions

In previous paragraphs we extensively discussed many features of the mitochondrial genome of S. velum, in terms of gene/nucleotide content, strand identification, putative control region, and gene arrangement. All evidences

gathered from different (and partially independent) sources point towards the same conclusion: S. velum retains most of the ancestral mtDNA features of conchiferans, like H. rubra does within gastropods. The large similarities found with K. tunicata, an outgroup of conchiferans, on one side, and the great differences found with other known bivalves, on the other side, lead us to polarize genomic characters and conclude that the mtDNA of S. velum has been probably "freezed" in a condition very similar to that of the most recent common ancestor of Opponobranchia and Autobranchia.

If this is true, the wide discontinuity between S. velum and Autobranchia is intriguing: in facts, no mtDNA representing an intermediate state is known to date, the only exceptions being some genomic features of Palaeoheterodonta (f.i., nucleotide skews and the proportion of unassigned nucleotides; unpublished observation). In the branch leading to Autobranchia, mitochondrial genome evolved on its own, like for the translocation of most - if not all - genes on a single strand; the decrease in A-T content; a possible tendency towards heavy changes in atp8 gene; multiple and lineage-specific events of gene rearrangement. The evolution of Autobranchia seems to be tightly coupled with a dramatic increase of gene rearrangement events. Which factors triggered this boost of genomic evolution, while the main cladogenetic event leading to the Opponobranchia-Autobranchia split was taking place in the lower Cambrian [51]? An exhaustive answer is probably beyond the scope of this paper, but it is tempting to investigate whether the appearance of DUI played a main role in this burst [16,17]: further research on DUI evolution and the characterization of mitochondrial inheritance in Opponobranchia can surely shed more light on this issue. The recent discovery of DUI in the nuculanid Ledella ultima [118] is very interesting in this regard and has still to be evaluated in the light of the controversial phylogenetic position of the group.

Additional files

Additional file 1: Primers used in this study for Long-PCR reactions.

Primers that were used for the same experiment as a forward/reverse couple were marked with the same letter. The position of the primer annealing site on the complete molecule is reported in the "Target" column [119-121].

Additional file 2: Primers used in this study for routine PCR reactions. Primers that were used for the same experiment as a forward/ reverse couple were marked with the same letter. The position of the primer annealing site on the complete molecule is reported in the "Target" column [120-123].

Additional file 3: The dataset used for this study. Additional file 4: Secondary structures of tRNAs. Additional file 5: Nucleotide composition of Solemya velum mitochondrial genome.

Additional file 6: Mitogenomic features of taxa used in this study.

%UNs, percentage of unassigned nucleotides over the totallength of the

genome; H, number of genes on the putative H strand; L, number of genes on the putative L strand; aa, number of amminoacids encoded by the totality of protein coding genes (excluding stop codons). The rrnS gene is duplicated in C. gigas; length of either copy is reported.

Additional file 7: Secondary structures of regions between two consecutive PCGs.

Additional file 8: Codon usage in Solemya velum mitochondrial genome. The totalfrequency of each amminoacid is reported under the three-letter/one-letter name; underlined codons correspond to anticodons of mitochondrialtRNAs. Alltruncated (TA-/T-) stop codons were attributed to TAA. RSCU, Relative Synonymous Codon Usage. Additional file 9: Unassigned regions longer than 10 bp. Additional file 10: Phylogenetic Representativeness. Test is reported as in [49] for (a) the originalmaster list of mollusks taken from [93] and (b) a set of 100 shuffled master lists simulating taxonomicalrevisions. In both cases, AvTD is plotted on the left axis in the upper part of the chart and VarTD on the right one in the lower part. Sample size is plotted on x-axis; the greatest AvTD value (upper thick continue line), the AvTD mean (thin continue line), the AvTD 95% lower confidence limit (lower thick continue line), the VarTD 95% upper confidence limit (upper thick dashed line), the VarTD mean (thin dashed line), and the lowest VarTD value (lower thick dashed line) are shown. All these lines (with the exception of greatest AvTD and lowest VarTD) are shown as two-tailed 95% confidence limits for shuffling test (b). Sample used for this work is shown as a black diamond (AvTD)/circle (VarTD).

Additional file 11: Amminoacid substitution models selected by ProtTest 3.2 [80] and piped to RAxML 7.2.8 [82,83].

Competing interests

The authors declare that they have no competing interests. Authors' contributions

FP participated in the design of the study and in molecular lab work, analyzed the data and drafted the manuscript; AR carried out the molecular lab work and analyzed the data; MP conceived the study and helped to draft the manuscript. Allauthors read and approved the finalmanuscript.

Aknowledgements

We would like to thank Jeffrey L. Boore for providing specimens of S. velum used for this study and for the stimulating discussions on mitochondrial genomics with MP. This work was supported by the University and Research Italian Ministry (MIUR PRIN07, grant number 2007NSHJL8_002, and PRIN09, grant number 2009NWXMXX_002) and the "Canziani Bequest" fund (University of Bologna, grant number A.31.CANZELSEW). FP was financially supported by Fondazione delMonte di Bologna e Ravenna.

Received: 6 August 2012 Accepted: 11 June 2013 Published: 18 June 2013

References

1. Boore JL: Animal mitochondrial genome. Nucleic Acids Res 1999,27:1767-1780.

2. Burger G, Gray MW, Lang BF: Mitochondrial genomes: anything goes. Trends Genet 2003, 19:709-716.

3. Simison WB, Boore JL: Molluscan evolutionary genomics. In Phylogeny and evolution of the mollusca. Edited by Ponder W, Lindberg DR. Berkeley: University of California Press; 2008:447-461.

4. Armstrong MR, Blok VC, Phillips MS: A multipartite mitochondrial genome in the potato cyst nematode Globodera pallida. Genetics 2000,154:181-192.

5. Cameron SL, Yoshizawa K, Mizukoshi A, Whiting MF, Johnson KP: Mitochondrial genome deletions and minicircles are common in lice (Insecta: Phthiraptera). BMC Genomics 2011, 12:394.

6. Gibson T, Blok VC, Dowton M: Sequence and characterization of six mitochondrial subgenomes from Globodera rostochiensis: multipartite structure is conserved among close nematode relatives. J Mol Evol 2007, 65:308-315.

7. Shao R, Kirkness EF, Barker SC: The single mitochondrial chromosome typical of animals has evolved into 18 minichromosomes in the human body louse, Pediculus humanus. Genome Res 2009, 19:904-912.

8. Suga K, Welch DBM, Tanaka Y, Sakakura Y, Haglwarak A: Two circular chromosomes of unequal copy number make up the mitochondrial genome of the rotifer Brachionus plicatilis. Mol Biol Evol 2008, 25:1129-1137.

9. Watanabe KI, Bessho Y, Kawasaki M, Horl H: Mitochondrial genes are found on minicircle DNA molecules in the mesozoan animal Dicyema. J Mol Biol 1999, 286:645-650.

10. Wei D-D, Shao R, Yuan M-L, Dou W, Barker SC, Wang J-J: The multipartite mitochondrial genome of liposcelis bostrychophila: insights into the evolution of mitochondrial genomes in bilateral animals. PLoS One 2012, 7:e33973.

11. Glssl C, lannelli F, Pesole G: Evolution of the mitochondrial genome of Metazoa as exemplified by comparison of congeneric species. Heredity 2008, 101:301-320.

12. Grande C, Templado J, Zardoya R: Evolution of gastropod mitochondrial genome arrangements. BMC Evol Biol 2008, 8:61.

13. Vallès Y, Boore JL: Lophotrochozoan mitochondrial genomes. Integr Comp Biol 2006, 46:544-557.

14. Mllbury CA, Gaffney PM: Complete mitochondrial DNA sequence of the eastern oyster crassostrea virginica. Mar Biotechnol 2005, 7:697-712.

15. Rawllngs TA, Maclnnls MJ, Bleler R, Boore JL, Colllns TM: Sessile snails, dynamic genomes: gene rearrangements within the mitochondrial genome of a family of caenogastropod molluscs. BMC Genomics 2010,11:440.

16. Breton S, Doucet-Beaupré H, Stewart DT, Hoeh WR, Bller PU: The unusual system of doubly uniparental inheritance of mtDNA: isn't one enough? Trends Genet 2007, 23:465-474.

17. Passamontl M, Ghlselll F: Doubly uniparental inheritance: two mitochondrial genomes, one precious model for organelle DNA inheritance and evolution. DNA Cell Biol 2009, 28:1 -10.

18. Passamontl M, Rlccl A, Mllanl L, Ghlselll F: Mitochondrial genomes and doubly uniparental inheritance: new insights from musculista senhousia sex-linked mitochondrial DNAs (bivalvia mytilidae). BMC Genomics 2011, 12:442.

19. Zouros E: An unusual type of mitochondrial-DNA inheritance in the blue mussel Mytilus. Proc Natl Acad Sci USA 1994, 91:7463-7467.

20. Zouros E: Mitochondrial-DNA inheritance - Reply. Nature 1994, 368:818.

21. Breton S, Burger G, Stewart DT, Bller PU: Comparative analysis of gender associated complete mitochondrial genomes in marine mussels (Mytilus spp.). Genetics 2006, 172:1107-1119.

22. Breton S, Doucet-Beaupré H, Stewart DT, Plontklvska H, Karmakar M, Bogan AE, Bller PU, Hoeh WR: Comparative mitochondrial genomics of freshwater mussels (bivalvia: unionoida) with doubly uniparental inheritance of mtDNA: gender-specific open reading frames and putative origins of replication. Genetics 2009, 183:1575-1589.

23. Boore JL, Brown WM: Big trees from little genomes: mitochondrial gene order as a phylogenetic tool. Curr Opin Genet Dev 1998, 8:668-674.

24. Dreyer H, Stelner G: The complete sequence and gene organization of the mitochondrial genome of the gadilid scaphopod Siphonondentalium labatum (Mollusca). Mol Phylogenet Evol 2004, 31:605-617.

25. Rokas A, Holland PWH: Rare genomic changes as a tool for phylogenetics. TREE 2000, 15:454-459.

26. Stechmann A, Schlegel M: Analysis of the complete mitochondrial DNA sequence of the brachiopod Terebratulina retusa places Brachiopoda within the protostomes. Proc R Soc Lond B 1999, 266:2043-2052.

27. Jang KH, Hwang UW: Complete mitochondrial genome of Bugula neritina (Bryozoa, Gymnolaemata, Cheilostomata): phylogenetic position of Bryozoa and phylogeny of lophophorates within the Lophotrochozoa. BMC Genomics 2009, 10:167.

28. Xu W, Jameson D, Tang B, Hlggs PG: The relationship between the rate of molecular evolution and the rate of genome rearrangement in animal mitochondrial genomes. J Mol Evol 2006, 63:375-392.

29. Kocot KM, Cannon JT, Todt C, Cltarella MR, Kohn AB, Meyer A, Santos SR, Schander C, Moroz LL, Lleb B, Halanych KM: Phylogenomics reveals deep molluscan relationships. Nature 2011, 477:452-456.

30. Smlth SA, Wllson NG, Goetz FE, Feehery C, Andrade SCS, Rouse GW, Glrlbet G, Dunn CW: Resolving the evolutionary relationships of molluscs with phylogenomic tools. Nature 2011, 480:364-367.

31. Wllson NG, Rouse GW, Glrlbet G: Assessing the molluscan hypothesis Serialia (Monoplacophora + Polyplacophora) using novel molecular data. Mol Phylogenet Evol 2010, 54:187-193.

32. Boore JL, Brown WM: Complete DNA sequence of the mitochondrial genome of the black chiton, Katharina tunicata. Genetics 1994,138:423-443.

33. Maynard BT, Kerr LJ, McKiernan JM, Jansen ES, Hanna PJ: Mitochondrial DNA sequence and gene organization in the Australian blacklip abalone Haliotis rubra (Leach). Mar Biotechnol 2005, 7:645-658.

34. Cunha RL, Grande C, Zardoya R: Neogastropod phylogenetic relationships based on entire mitochondrial genomes. BMC Evol Biol 2009, 9:210.

35. Boore JL: The complete sequence of the mitochondrial genome of Nautilus macromphalus (Mollusca: Cephalopoda). BMC Genomics 2006, 7:182.

36. Helfenbein KG, Boore JL: The mitochondrial genome of phoronis architecta—comparisons demonstrate that phoronids Are lophotrochozoan protostomes. Mol Biol Evol 2004, 21:153-157.

37. Breton S, Stewart DT, Hoeh WR: Characterization of a mitochondrial ORF from the gender-associated mtDNAs of mytilus spp. (Bivalvia: mytilidae): identification of the "missing" ATPase 8 gene. Mar Genom 2010, 3:11-18.

38. Doucet-Beaupre H, Breton S, Chapman EG, Blier PU, Bogan AE, Stewart DT, Hoeh WR: Mitochondrial phylogenomics of the bivalvia (mollusca): searching for the origin and mitogenomics correlates of doubly uniparental inheritance of mtDNA. BMC Evol Biol 2010, 10:50.

39. Wang H, Zhang S, Xiao G, Liu B: Complete mtDNA of the Meretrix lamarckii (Bivalvia: Veneridae) and molecular identification of suspected M. lamarckii based on the whole mitochondrial genome.

Marine Genomics 2011,4:263-271.

40. Wang H, Zhang S, Li Y, Liu B: Complete mtDNA of Meretrix lusoria (Bivalvia: Veneridae) reveals the presence of an atp8 gene, length variation and heteroplasmy in the control region. Comp Biochem Physiol Part D Genomics Proteomics 2010, 5:256-264.

41. Meng X, Zhao N, Shen X, Hao J, Liang M, Zhu X, Cheng H, Yan B, Liu Z: Complete mitochondrial genome of coelomactra antiquata (mollusca: bivalvia): the first representative from the family mactridae with novel gene order and unusual tandem repeats. Comp Biochem Phys D 2012, 7:175-179.

42. Danic-Tchaleu G, Heurtebise S, Morga B, Lapegue S: Complete mitochondrial DNA sequence of the European flat oyster Ostrea edulis confirms Ostreidae classification. BMC Res Notes 2011, 4:400.

43. Wu X, Xu X, Yu Z, Wei Z, Xia J: Comparison of seven Crassostrea mitogenomes and phylogenetic analyses. Mol Phylogenet Evol 2010, 57:448-454.

44. Yu H, Li Q: Mutation and selection on the wobble nucleotide in tRNA anticodons in marine bivalve mitochondrial genomes. PLoS One 2011, 6:e16147.

45. Newell ND: Classification of the Bivalvia. Am Mus Novit 1965, 2206:1-25.

46. Taylor JD, Glover EA, Williams ST: Ancient chemosynthetic bivalves: systematics of Solemyidae from eastern and southern Australia (Mollusca: Bivalvia). Memoirs Queensland Museum - Nature 2008, 54:75-104.

47. Taviani M, Angeletti L, Ceregato A: Chemosynthetic Bivalves of the Family Solemyidae (Bivalvia, Protobranchia) in the Neogene of the Mediterranean Basin. J Paleontol 2011, 85:1067-1076.

48. Rozen S, Skaletsky HJ: Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol 2000, 132:365-386.

49. Plazzi F, Ceregato A, Taviani M, Passamonti M: A molecular phylogeny of bivalve mollusks: ancient radiations and divergences as revealed by mitochondrial genes. PLoS One 2011, 6:e27147.

50. Lis JT, Schleif R: Size fractionation of double-stranded DNA by precipitation with polyethylene glycol. Nucleic Acid Res 1975, 2:383-389.

51. Plazzi F, Passamonti M: Towards a molecular phylogeny of Mollusks: Bivalves' early evolution as revealed by mitochondrial genes. Mol Phylogenet Evol 2010, 57:641 -657.

52. Tatusov T, Tatusov R: ORF Finder. http://www.ncbi.nlm.nih.gov/projects/gorf/.

53. Delcher AL, Bratke KA, Powers EC, Salzberg SL: Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 2007, 23:673-679.

54. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215:403-410.

55. Ye J, McGinnis S, Madden TL: BLAST: improvements for better sequence analysis. Nucleic Acids Res 2006, 34:W6-W9.

56. Pons J-L, Labesse G: @TOME-2: a new pipeline for comparative modeling of protein-ligand complexes. Nucleic Acids Res 2009, 37:W485-W491.

57. Zdobnov EM, Apweiler R: InterProScan - an integration platform for the signature-recognition methods in InterPro. Bioinformatics 2001,17:847-848.

58. Petersen TN, Brunak S, von Heijne G, Nielsen H: SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods 2011,8:785-786.

59. Söding J: Protein homology detection by HMM-HMM comparison. Bioinformatics 2005, 21:951 -960.

60. Zhou H, Zhou Y: SPARKS 2 and SP3 servers in CASP6. Proteins 2005, 58:321 -328.

61. Shi J, Blundell TL, Mizuguchi K: FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent Gap penalties. J Mol Biol 2001, 310:243-257.

62. Zuker M: Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res 2003, 31:3406-3415.

63. Lowe TM, Eddy SR: TRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 1997,25:955-964.

64. Schattner P, Brooks A, Lowe TM: The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res

2005, 33(suppl 2):W686-W689.

65. Laslett D, Canback B: ARWEN, a program to detect tRNA genes in metazoan mitochondrial nucleotide sequences. Bioinformatics 2008,24:172-175.

66. Darty K, Denise A, Ponty J: VARNA: interactive drawing and editing of the RNA secondary structure. Bioinformatics 2009, 25:1974-1975.

67. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol 2011, 28:2731-2739.

68. Sharma D, Issac B, Raghava GP, Ramaswamy R: Spectral repeat finder (SRF): identification of repetitive sequences using Fourier transformation. Bioinformatics 2004, 20:1405-1412.

69. Conant GC, Wolfe KH: GenomeVx: simple web-based creation of editable circular chromosome maps. Bioinformatics 2008, 24:861 -862.

70. Boore JL, Brown WM: Mitochondrial genomes of Galathealinum, Helobdella, and Platynereis: sequence and gene arrangement comparisons indicate that Pogonophora is not a phylum and Annelida and Arthropoda are not sister taxa. Mol Biol Evol 2000,17:87-106.

71. Plazzi F, Ferrucci RR, Passamonti M: Phylogenetic representativeness: a new method for evaluating taxon sampling in evolutionary studies. BMC Bioinformatics 2010, 11:209.

72. Katoh K, Misawa K, Kuma K, Miyata T: MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res 2002, 30:3059-3066.

73. Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 2004, 32:1792-1797.

74. Edgar RC: MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 2004, 5:113.

75. Moretti S, Armougom F, Wallace IM, Higgins DG, Jongeneel CV, Notredame C: The M-Coffee web server: a meta-method for computing multiple sequence alignments by combining alternative alignment methods. Nucleic Acids Res 2007, 35:W645-W648.

76. Wallace IM, O'Sullivan O, Higgins DG, Notredame C: M-Coffee: combining multiple sequence alignment methods with T-Coffee. Nucleic Acids Res

2006, 34:1692-1699.

77. Castresana J: Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol 2000, 17:540-552.

78. Talavera G, Castresana J: Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol 2007, 56:564-577.

79. Lanfear R, Calcott B, Ho SYW, Guindon S: PartitionFinder: combined selection of partitioning schemes and substitution models for phylogenetic analyses. Mol Biol Evol 2012, 29:1695-1701.

80. Darriba D, Taboada GL, Doallo R, Posada D: ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics 2011, 27:1164-1165.

81. Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol 2003, 52:696-704.

82. Stamatakis A: RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 2006, 22:2688-2690.

83. Stamatakis A, Hoover P, Rougemont J: A rapid bootstrap algorithm for the RAxML Web servers. Syst Biol 2008, 57:758-771.

84. Stamatakis A: Phylogenetic models of rate heterogeneity: a high performance computing perspective. In Proceedings of 20th international parallel and distributed processing symposium. Rhodes; 2006. http:// ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1639535&tag=1.

85. Jordan GE, Piel WH: PhyloWidget: Web-based visualizations for the tree of life. Bioinformatics 2008, 15:1641-1642.

86. Huson DH, Richter DC, Rausch C, Dezulian T, Franz M, Rupp R: Dendroscope - An interactive viewer for large phylogenetic trees. BMC Bioinformatics 2007, 8:460.

Feng X, Llu D-F, Wang N-X, Zhu C-D, Jlang G-F: The mitochondrial genome of the butterfly Papilio xuthus (Lepidoptera: Papilionidae) and related phylogenetic analyses. Mol Biol Rep 2010, 37:3877-3888. Garey GR, Wolstenholme DR: Platyhelminth mitochondrial DNA: evidence for early evolutionary origin of a tRNA that contains a dihydrouridine-arm replacement loop, and of serine-specifying AGA and AGG codons. J Mol Evol 1989, 28:374-387.

Klm I, Cha SY, Yoon MH, Hwang JS, Lee SM, Sohn HD, Jln BR: The complete nucleotide sequence and gene organization of the mitochondrial genome of the oriental mole cricket, Gryllotalpa orientalis (Orthoptera: Gryllotalpidae). Gene 2005, 353:155-168.

Sheffleld NC, Song H, Cameron SL, Whltlng MF: A comparative analysis of mitochondrial genomes in coleoptera (arthropoda: insecta) and genome descriptions of Six New beetles. Mol Biol Evol 2008, 25:2499-2509. Serb JM, Lydeard C: Complete mtDNA sequence of the North American freshwater mussel, Lampsilis ornata (Unionidae): an examination of the evolution and phylogenetic utility of mitochondrial genome organization in Bivalvia (Mollusca). Mol Biol Evol 2003, 20:1854-1886. Fernandez-Sllva P, Enrlquez JA, Montoya J: Replication and transcription of mammalian mitochondrial DNA. Exp Physiol 2003, 88:4156. Mlllard V: Classification of Mollusca: A Classification of World Wide Mollusca. Volume 3. 2nd edltlon. South Afrlca; 2001.

Boore JL, Staton JL: The mitochondrial genome of the sipunculid Phascolopsis gouldii supports its association with Annelida rather than Mollusca. Mol Biol Evol 2002, 17:540-552.

Watklns RF, Beckenbach AT: Partial sequence of a sponge mitochondrial genome reveals sequence similarity to cnidaria in cytochrome oxidase subunit II and the large ribosomal RNA subunit. J Mol Evol 1999,48:542-554. Wu X, Xu X, Yu Z, Kong X: Comparative mitogenomic analyses of three scallops (Bivalvia: Pectinidae) reveal high level variation of genomic organization and a diversity of transfer RNA gene sets. BMC Research Notes 2009, 2:69.

Beletskll A, Bhagwat AS: Transcription-induced mutations: increase in C to T mutations in the nontranscribed strand during transcription in Escherichia coli. Proc Natl Acad Sci USA 1996, 93:13919-13924. Franclno MP, Ochman H: Strand asymmetries in DNA evolution.

Trends Genet 1997, 13:240-245.

Frederlco LA, Kunkel TA, Shaw BA: A sensitive genetic assay for the detection of cytosine deamination: Determination of rate constant and the activation energy. Biochemistry 1990, 29:2532-2537. Hassanln A, Léger N, Deutsch J: Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of metazoa, and consequences for phylogenetic inferences. Syst Biol 2005, 54:277-298.

Perna NT, Kocher TD: Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes. J Mol Evol 1995, 41:353-358.

Reyes A, Glssl C, Pesole G, Saccone C: Asymmetrical directional mutation pressure in the mitochondrial genome of mammals. Mol Biol Evol 1998, 15:957-966.

Sancar A, Sancar GB: DNA repair enzymes. Annu Rev Biochem 1998, 57:29-67. Falth JJ, Pollock DD: Likelihood analysis of asymmetrical mutation bias gradients in vertebrate mitochondrial genomes. Genetics 2003,165:735-745. Rodakls GC, Cao L, Mlzl A, Kenchlngton EL, Zouros E: Nucleotide content gradients in maternally and paternally inherited mitochondrial genomes of the mussel Mytilus. J Mol Evol 2007, 65:124-136. Saccone C, Glssl C, Reyes A, Larlzza A, Sblsà E, Pesole G: Mitochondrial DNA in metazoa: degree of freedom in a frozen event. Gene 2002, 286:3-12. Brugler MR, France SC: The mitochondrial genome of a deep-sea bamboo coral (Cnidaria, Anthozoa, Octocorallia, Isidiidae): genome structure and putative origins of replication are not conserved among octocorals. J Mol Evol 2008, 67:125-136.

Arunkumar KP, Nagaraju J: Unusually long palindromes are abundant in mitochondrial control regions of insects and nematodes. PLoS One 2006, 1:e110.

Sellgmann H, Krlshnan NM, Rao BJ: Possible multiple origins of replication in primate mitochondria: alternative role of tRNA sequences. J Theor Biol 2006, 241:321-332.

He C, Wang J, Gao X, Song W, Ll H, Ll Y, Llu W, Su H: The complete mitochondrial genome of the hard clam Meretrix meretrix. Mol Biol Rep 2011, 38:3401 -3409.

Ren J, Shen X, Sun M, Jiang F, Yu Y, Chi Z, Liu B: The complete mitochondrial genome of the clam Meretrix petechialis (Mollusca: Bivalvia: Veneridae). Mitochondr DNA 2009, 20:78-87. Pont-Kingdon G, Okada NA, Macfarlane JL, Beagley CT, Watkins-Sims CD, Cavalier-Smith T, Clark-Walker GD, Wolstenholme DR: Mitochondrial DNA of the coral Sarcophyton glaucum contains a gene for a homologue bacterial MutS: a possible case of gene transfer from the nucleus to the mitochondrion. J Mol Evol 1998, 46:419-431.

Shao Z, Shannon G, Chaga OY, Lavrov DV: Mitochondrial genome of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa): a linear DNA molecule encoding a putative DNA-dependant DNA polymerase. Gene 2006, 381:92-101.

Giribet G, Distel DL: Bivalve phylogeny and molecular data. In Molecular systematics and phylogeography of mollusks. Edited by Lydeard C, Lindberg DR. Washington: Smithsonian Books; 2003:45-90. Giribet G, Okusu A, Lindgren AR, Huff SW, Schrödl M, Nishiguchi MK: Evidence for a clade composed of molluscs with serially repeated structures: Monoplacophorans are related to chitons. Proc Natl Acad Sci USA 2006, 103:7723-7728.

Giribet G, Wheeler W: On bivalve phylogeny: A high-level analysis of the Bivalvia (Mollusca) based on combined morphology and DNA sequence data. invert Biol 2002, 121:271-324.

Sharma PP, González VL, Kawauchi GY, Andrade SCS, Guzmán A, Collins TM, Glover EA, Harper EM, Healy JM, Mikkelsen PM, Taylor JD, Bieler R, Giribet G: Phylogenetic analysis of four nuclear protein-encoding genes largely corroborates the traditional classification of Bivalvia (Mollusca).

Mol Phylogenet Evol 2012, 65:64-74.

Boyle EE, Etter RJ: Heteroplasmy in a deep-sea protobranch bivalve suggests an ancient origin of doubly uniparental inheritance of mitochondria in Bivalvia. Mar Biol 2013, 160:413-422.

Simon C, Buckley TR, Frati F, Stewart JB, Beckenbach AT: Incorporating molecular evolution into phylogenetic analysis, and a New compilation of conserved polymerase chain reaction primers for animal mitochondrial DNA. Annu Rev Ecol Evol Syst 2006, 37:545-579. Palumbi SR, Martin A, Romano S, McMillan WO, Stice L, Grabowski G: The simple fool's guide to PCR. Hawaii: Kewalo Marine Laboratory and University of Hawaii; 1996.

Stewart FJ, Baik AHY, Cavanaugh CM: Genetic subdivision of chemosynthetic endosymbionts of Solemya velum along the Southern New England Coast. Appl Environ Microb 2009, 75:6005-6007. Matsumoto M: Phylogenetic analysis of the subclass Pteriomorpha (Bivalvia) from mtDNA COI sequences. Mol Phylogenet Evol 2003,27:429-440. Burger G, Lavrov DV, Forget L, Lang BF: Sequencing complete mitochondrial and plastid genomes. Nat Protoc 2007, 2:603-614.

doi:10.1186/1471-2164-14-409

Cite this article as: Plazzl et al.: The complete mitochondrial genome of Solemya velum (Mollusca: Bivalvia) and its relationships with Conchifera.

BMC Genomics 2013 14:409.

Submit your next manuscript to BioMed Central and take full advantage of:

• Convenient online submission

• Thorough peer review

• No space constraints or color figure charges

• Immediate publication on acceptance

• Inclusion in PubMed, CAS, Scopus and Google Scholar

• Research which is freely available for redistribution

Submit your manuscript at www.biomedcentral.com/submit

Central