Multivariate Principal Component Analysis and Clustering Methods for Assessing Genetic Diversity in Bread Wheat (Triticum aestivum L.) Genotypes

Authors

  • Aavula Naveen Dept. of Genetics and Plant Breeding, Banaras Hindu University, Varanasi, Uttar Pradesh (221 005), India
  • Dharavath Hathiram Genetics Division, Indian Agricultural Research Institute, New Delhi (110 012), India
  • Patel Supriya Dept. of Genetics and Plant Breeding, Acharya N.G. Ranga Agricultural University, Tirupati (517 502), India
  • T. Danakumara Genetics Division, Indian Agricultural Research Institute, New Delhi (110 012), India
  • V. K. Mishra Dept. of Genetics and Plant Breeding, Banaras Hindu University, Varanasi, Uttar Pradesh (221 005), India
  • B. Sinha Dept. of Genetics and Plant Breeding, Banaras Hindu University, Varanasi, Uttar Pradesh (221 005), India
  • A. Harika Dept. of Genetics and Plant Breeding, Banaras Hindu University, Varanasi, Uttar Pradesh (221 005), India https://orcid.org/0000-0001-8225-751X

DOI:

https://doi.org/10.23910/1.2024.5580

Keywords:

Wheat, genetic diversity, phenotypic variation, PCA, clustering

Abstract

The present experiment was conducted during rabi season (November, 2019–May, 2020) at Agricultural Research Farm of Banaras Hindu University, Varanasi, Uttar Pradesh, India to assess the genetic diversity and phenotypic variation among 50 bread wheat accessions. The genotypes were grown in a randomized complete block design with three replications, and data was collected on 14 quantitative traits. The multivariate techniques, including Principal Component Analysis (PCA) and K-means clustering were employed to identify traits contributing the most to phenotypic variation and to classify the genotypes into distinct groups based on their characteristics. PCA identified 13 principal components, with the first five explaining 65.2% of the total variation. The first two principal components accounted for 37% of the total phenotypic variation, with grain yield plant-1, thousand seed weight, harvest index, and canopy temperature as major contributors to PC1, while total biomass and biomass contributed primarily to PC2. K-means clustering grouped the genotypes into five distinct clusters, with clusters 2 and 5 having the highest number of genotypes (14 each). Cluster 1 exhibited the highest mean values for germination percentage, days to maturity, and vegetative index. Hierarchical clustering further confirmed the genetic diversity, delineating five distinct clusters using Ward’s method with Euclidean distance. This study provides valuable insights for breeders aiming to enhance traits such as yield and other morphological characteristics through heterosis and transgressive breeding.

Downloads

Download data is not yet available.

Downloads

Published

2024-11-25

How to Cite

1.
Naveen A, Hathiram D, Supriya P, Danakumara T, Mishra VK, Sinha B, et al. Multivariate Principal Component Analysis and Clustering Methods for Assessing Genetic Diversity in Bread Wheat (Triticum aestivum L.) Genotypes. IJBSM [Internet]. 2024 Nov. 25 [cited 2025 Sep. 17];15(Nov, 11):01-8. Available from: https://ojs.pphouse.org/index.php/IJBSM/article/view/5580

Issue

Section

Articles