The Peanut Genome

This page describes the high-quality genome assembly for peanut (Arachis hypogaea), cultivar Tifrunner, which was released in December, 2017 (Bertioli et al., 2019), and the improved version 2.0. An explanation of the reason for the corrected version and a list of differences is available here.

IMPORTANT NOTE: the version 2.0 assembly (gnm2) for Arachis hypogaea var. Tifrunner has not been submitted to GenBank.

Tifrunner is an important U.S. variety, with good market and growth characteristics and resistance to several peanut diseases (early and late leaf spot and TSWV/spotted wilt).

This is a project of the International Peanut Genome Initiative, in order to accelerate breeding progress and get more productive, disease-resistant, stress-tolerant varieties to farmers. The IPGI project has sequenced the genomes of the two diploid progenitors of cultivated peanut, as well as the genome of cultivated peanut itself.


Arachis hypogaea: Peanut

Download folders for Arachis hypogaea at data store (includes genomes and other available data types)

Arachis hypogaea cv. Tifrunner

Assemblies:
Genome assembly 1 for Arachis hypogaea, cultivar Tifrunner
Genome assembly 2 for Arachis hypogaea, cultivar Tifrunner. This version has structural changes relative to assembly 1.

Annotations:
Gene models for genome assembly 2 for Arachis hypogaea, cultivar Tifrunner.
Arachis hypogaea accession Tifrunner annotation files from genome assembly 1.0
Annotation 2, using the BIND pipeline, for Arachis hypogaea, cultivar Tifrunner

Genome browsers:
JBrowse2 arahy.Tifrunner.gnm1: JBrowse2 Linear Genome View
JBrowse2 arahy.Tifrunner.gnm2: JBrowse2 Linear Genome View
JBrowse2 arahy.Tifrunner.gnm2.x.aradu.V14167.gnm2.DWM1.paf.gz: JBrowse2 Dotplot View
JBrowse2 arahy.Tifrunner.gnm2.x.aradu.K30065.gnm1.DWM1.paf.gz: JBrowse2 Dotplot View
JBrowse2 arahy.Tifrunner.gnm2.x.arahy.Shitouqi.gnm1.DWM1.paf.gz: JBrowse2 Dotplot View
JBrowse2 arahy.Tifrunner.gnm2.x.aradu.K30060.gnm1.DWM1.paf.gz: JBrowse2 Dotplot View
JBrowse2 arahy.Tifrunner.gnm2.x.araip.K30076.gnm2.DWM1.paf.gz: JBrowse2 Dotplot View
JBrowse2 arahy.Tifrunner.gnm2.x.arahy.Tifrunner.gnm1.DWM1.paf.gz: JBrowse2 Dotplot View
JBrowse2 arahy.Tifrunner.gnm2.x.arahy.BaileyII.gnm1.DWM1.paf.gz: JBrowse2 Dotplot View
JBrowse2 arahy.Tifrunner.gnm2.x.arahy.Fuhuasheng.gnm1.DWM1.paf.gz: JBrowse2 Dotplot View

Other resources:
NCBI Tifrunner assembly 2.0: Tifrunner gnm2.ann2 (assembly 2.0 annotation 2.0) at NCBI
LIS SequenceServer: SequenceServer BLAST against the Tifrunner v2 assembly

Changes between the genome assemblies version 1 and version 2

Note: The gene models on gnm2.ann1 are the result of liftover, not a new annotation procedure; ie the gene model structures and basic names stay the same, though some coordinates changed and a few gene models got duplicates.

Additional details about the A. hypogaea assembly

The assembly size is 2,556 Mbp, which we estimate to span more than 99% of the actual genome. The scaffold N50 (a measure of the assembly contiguity) is 135.2 MB (the scale of the complete peanut chromosomes). A total of 48.25x of PACBIO sequence (avg. read length of 11,525) was used to generate the initial assembly, which was subsequently polished using Illumina sequences and ARROW. Homozygous SNPs and INDELs were corrected in the release sequence using ~40x of Illumina reads (2x250, 800bp insert, library ID ICIH and ICID). Synteny with the diploid A. duranensis and A. ipaensis, along with 1 genetic map and 2 synthetic maps (provided by David Bertioli) were used to identify misjoins in the raw assembly. The resulting assembly was then scaffolded using HiC data. Post scaffolding, 6 additional breaks were made to resolve misjoins introduced during the scaffolding procedure.

The original sequences were combined with the duplicated tetrasomic regions and joined together using 26 joins to create the 20 A. hypogaea chromosomes. During the construction of the chromosomes, all 500bp scaffolded gaps were converted to 1,000 bp gaps, and the map joins that were added consisted of 10,000 bp gaps. Chromosomes were numbered as Arahy.01-Arahy.20, where the A genome is represented as Arahy.01-Arahy.10 and the B genome is represented as Arahy.11-Arahy.20. 99.3% of the assembled sequence is contained in the chromosomes.

Arachis hypogaea cv. Fuhuasheng

Assemblies:
Genome assembly for tetraploid peanut cultivar Fuhuasheng, from Chen et al., 2019

Annotations:
No annotations yet.

Genome browsers:
JBrowse2 arahy.Fuhuasheng.gnm1: JBrowse2 Linear Genome View

Arachis hypogaea cv. Shitouqi

Assemblies:
Genome assembly for peanut cultivar Shitouqui, from Zhuang et al., 2019.

Annotations:
No annotations yet.

Genome browsers:
JBrowse2 arahy.Shitouqi.gnm1: JBrowse2 Linear Genome View

Arachis hypogaea cv. Bailey II

Assemblies:
Arachis hypogaea accession Bailey II, genome assembly 1

Annotations:
Annotation 1 for Arachis hypogaea accession Bailey II, genome assembly 1

Genome browsers:
JBrowse2 arahy.BaileyII.gnm1: JBrowse2 Linear Genome View
JBrowse2 arahy.BaileyII.gnm1.x.araca.K10017.gnm1.Z93X.paf.gz: JBrowse2 Dotplot View
JBrowse2 arahy.BaileyII.gnm1.x.aradu.V14167.gnm2.Z93X.paf.gz: JBrowse2 Dotplot View
JBrowse2 arahy.BaileyII.gnm1.x.araip.K30076.gnm2.Z93X.paf.gz: JBrowse2 Dotplot View
JBrowse2 arahy.BaileyII.gnm1.x.arast.V10309.gnm1.Z93X.paf.gz: JBrowse2 Dotplot View


Arachis duranensis: Diploid wild peanut - A-genome

Download folders for Arachis duranensis at data store (includes genomes and other available data types)

Arachis duranensis cv. V14167

Assemblies:
Genome assembly 2 for Arachis duranensis, accession V14167
Genome assembly 1 for Arachis duranensis, accession V14167

Annotations:
genome annotations

Genome browsers:
JBrowse2 aradu.V14167.gnm2: JBrowse2 Linear Genome View
JBrowse2 aradu.V14167.gnm1: JBrowse2 Linear Genome View
JBrowse2 aradu.V14167.gnm2.x.aradu.K30065.gnm1.96TT.paf.gz: JBrowse2 Dotplot View
JBrowse2 aradu.V14167.gnm2.x.araip.K30076.gnm2.96TT.paf.gz: JBrowse2 Dotplot View
JBrowse2 aradu.V14167.gnm2.x.aradu.K30060.gnm1.96TT.paf.gz: JBrowse2 Dotplot View
JBrowse2 aradu.V14167.gnm2.x.arast.V10309.gnm1.96TT.paf.gz: JBrowse2 Dotplot View
JBrowse2 aradu.V14167.gnm2.x.araca.K10017.gnm1.96TT.paf.gz: JBrowse2 Dotplot View

Other resources:
Download at Data Store: DataStore directory for Arachis duranensis.
Genome assembly at NCBI: Genome assembly at NCBI, V14167

Arachis duranensis cv. K30060

Assemblies:
Genome assembly 1 for Arachis duranensis, accession K30060

Annotations:
No annotations yet.

Genome browsers:
JBrowse2 aradu.K30060.gnm1: JBrowse2 Linear Genome View
JBrowse2 aradu.K30060.gnm1.x.aradu.K30065.gnm1.LXVF.paf.gz: JBrowse2 Dotplot View

Other resources:
Download at Data Store: DataStore directory for Arachis duranensis.
Genome assembly at NCBI: Genome assembly at NCBI, K30060

Arachis duranensis cv. K30065

Assemblies:
Genome assembly 1 for Arachis duranensis, accession K30065, collected from rio seco Bolivia

Annotations:
No annotations yet.

Genome browsers:
JBrowse2 aradu.K30065.gnm1: JBrowse2 Linear Genome View

Other resources:
Download at Data Store: DataStore directory for Arachis duranensis.
Genome assembly at NCBI: Genome assembly at NCBI, K30065


Arachis ipaensis: Diploid wild peanut - B genome

Download folders for Arachis ipaensis at data store (includes genomes and other available data types)

Arachis ipaensis cv. K30076

Assemblies:
Genome assembly 1 for Arachis ipaensis, accession K30076.
Genome Assembly 2 for A. ipaensis K30076. A high-quality chromosome-scale assembly, based on PacBio RSII and Sequel reads, ordered into pseudomolecules by Dovetail using Hi-C Chicago libraries and the HiRise and SNAP.

Annotations:
genome annotations for Arachis ipaensis K30076.gnm1.ann1

Genome browsers:
JBrowse2 araip.K30076.gnm1: JBrowse2 Linear Genome View
JBrowse2 araip.K30076.gnm2: JBrowse2 Linear Genome View
JBrowse2 araip.K30076.gnm2.x.aradu.K30060.gnm1.08ZF.paf.gz: JBrowse2 Dotplot View
JBrowse2 araip.K30076.gnm2.x.aradu.K30065.gnm1.08ZF.paf.gz: JBrowse2 Dotplot View
JBrowse2 araip.K30076.gnm2.x.araca.K10017.gnm1.08ZF.paf.gz: JBrowse2 Dotplot View

Other resources:
Genome assembly at NCBI: Genome assembly at NCBI, K30076


Arachis cardenasii: Mani silvestre (wild peanut)

Download folders for Arachis cardenasii at data store (includes genomes and other available data types)

Arachis cardenasii cv. GKP10017

Assemblies:
Genome assembly 1 for Arachis cardenasii, accession K10017

Annotations:
No annotations yet.

Genome browsers:
JBrowse2 araca.K10017.gnm1: JBrowse2 Linear Genome View


Arachis stenosperma: Amendoim silvestre (wild peanut)

Download folders for Arachis stenosperma at data store (includes genomes and other available data types)

Arachis stenosperma cv. V10309

Assemblies:
Genome assembly 1 for Arachis stenosperma, genotype V10309

Annotations:
GenBank RefSeq annotation for Genome assembly 1 for Arachis stenosperma, genotype V10309

Genome browsers:
JBrowse2 arast.V10309.gnm1: JBrowse2 Linear Genome View