Statistical Learning for High-Dimensional Data: A Comprehensive Approach to Dimensionality Reduction in Machine Learning

Authors

  • Irsa Sajjad Department of Mathematics, National University of Modern Languages, Islamabad, Pakistan
  • Sumaira Sharif Department of Mathematics, University of Central Punjab, Lahore, Pakistan
  • Maria Malik Department of Statistics, COMSATS University, Lahore, Pakistan
  • Aysha Qayyum University of Management and Technology Lahore, Pakistan
  • Sharqa Hashmi Govt Graduate College for Women Lahore, Pakistan

DOI:

https://doi.org/10.63278/mme.vi.1698

Keywords:

Dimensionality Reduction, PCA, LDA, Statistical Analysis, t-SNE

Abstract

Dimensionality reduction is a crucial process in machine learning, particularly when dealing with high-dimensional data. As the number of features increases, models often suffer from overfitting, computational complexity, and a lack of interpretability. This paper explores statistical methods for dimensionality reduction, focusing on techniques such as Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), and t-SNE. These methods aim to preserve the underlying structure of data while reducing its dimensions for better model performance. By analyzing the mathematical foundations of these techniques, we evaluate their application across various machine learning models, demonstrating their utility in improving model efficiency and interpretability. Experimental results validate the effectiveness of these statistical methods in practical machine learning tasks.

Downloads

How to Cite

Sajjad, Irsa, Sumaira Sharif, Maria Malik, Aysha Qayyum, and Sharqa Hashmi. 2025. “Statistical Learning for High-Dimensional Data: A Comprehensive Approach to Dimensionality Reduction in Machine Learning”. Metallurgical and Materials Engineering, May, 1228-36. https://doi.org/10.63278/mme.vi.1698.

Issue

Section

Research