Genomic Selection for Resilient Crop Breeding in South Punjab

Dataset Description

For this project, entitled "Genomic Research for Wheat Genes Prediction in South Punjab", data was supplied by the Supervisor and is comprised of two main aspects: genotypic data (SNPs) and phenotypic data (agronomic traits). Such datasets serve as the foundation on which to search for the genetic relationships with important wheat traits across the South Punjab area.

Data Sources: Two datasets were used in this research:

Genotypic Dataset Column Description

Column Name Description
rs SNP identifier (e.g., 1A_1208114)
alleles Possible allelic variation at the SNP site (e.g., A/G)
chrom Chromosome location (e.g., 1A, 3B)
pos Physical position on the chromosome
strand DNA strand direction (+ or -)
assembly Reference genome assembly version
center Data generating center
protLSID Protocol identifier (usually NA)
assayLSID Assay identifier (technical ID)
panelLSID Panel ID used in SNP genotyping
QCcode Quality control code (optional)
Genotype Graph

Phenotypic Dataset Column Description

Column Name Description
Genotypes Wheat variety/cultivar name
DTH_2022-23 Days to heading in 2022-23
SL_2022-23 Spike length (cm)
PH_2022-23 Plant height (cm)
NDVI_2022-23 Normalized Difference Vegetation Index
GPS_2022-23 Grains per spike
TGW_2022-23 Thousand grain weight
Plant height General plant height
Spikes per meter square Number of spikes/m²
FLA Average Flag leaf area
NDVI General NDVI value
Spike length Spike length (cm)
Grain per spike Avg. grains per spike
Spikes Length Spikes/m² (repeat)
TGW Thousand grain weight (repeat)
Yield per meter square Grain yield per m²
Days to heading Sowing to heading duration
Phenotype Graph