Abstract

Spectroscopic data provide relevant information about the composition of samples and have been used for research in scientific disciplines such as chemistry, geology, archaeology, Mars research, pharmacy, and medicine, as well as in important industrial applications. In archaeology, they allow the characterization and classification of artifacts and ecofacts, the analysis of patterns, and the characterization and study of the exchange of materials, among other uses. Spectrometers produce a large amount of data, of the so-called “big data” type, which requires the use of multivariate statistical techniques, mainly principal component analysis, cluster analysis, and discriminant analysis. This work focuses on reducing the dimensionality of the data by selecting a small subset of variables to characterize the samples, and it presents a mathematical methodology for selecting the most efficient variables. The objective is to identify a subset of variables, based on spectral features, that allows the samples under study to be characterized with the fewest possible errors when performing quantitative analyses or discriminating between different samples. The subset is not predetermined; it is obtained for each set of samples from the most important features of the samples under study, which allows for a good fit to the data. The reduction in the number of variables is substantial and depends on the previously chosen minimum difference between features, while retaining a close fit to the raw data. Thus, instead of 2151 variables, a minimum optimal subset of 32 valleys and 31 peaks is obtained for a minimum difference between peaks or between valleys of 20 nm. This methodology has been applied to a sample of minerals and rocks extracted from the ECOSTRESS 1.0 spectral library.
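
The abstract does not spell out the selection procedure, so the following is only a minimal sketch of one way to pick peaks and valleys subject to a minimum wavelength separation. It is not the authors' algorithm. The function names (`local_extrema`, `select_features`), the greedy "most extreme first" rule, and the toy spectrum are all assumptions made for illustration; only the idea of a minimum difference between features (20 nm in the abstract) comes from the text.

```python
def local_extrema(values, kind="peak"):
    """Return indices of strict local maxima ('peak') or minima ('valley')."""
    sign = 1.0 if kind == "peak" else -1.0
    found = []
    for i in range(1, len(values) - 1):
        v = sign * values[i]
        if v > sign * values[i - 1] and v > sign * values[i + 1]:
            found.append(i)
    return found


def select_features(wavelengths, values, kind="peak", min_gap_nm=20.0):
    """Greedy selection (an assumed rule, not taken from the paper):
    visit candidate extrema from most to least extreme and keep one only
    if it lies at least min_gap_nm away from every feature already kept."""
    sign = 1.0 if kind == "peak" else -1.0
    candidates = local_extrema(values, kind)
    candidates.sort(key=lambda i: sign * values[i], reverse=True)
    kept = []
    for i in candidates:
        if all(abs(wavelengths[i] - wavelengths[j]) >= min_gap_nm for j in kept):
            kept.append(i)
    return sorted(kept)


# Toy spectrum sampled every 5 nm (hypothetical values, not ECOSTRESS data).
wavelengths = [0, 5, 10, 15, 20, 25, 30, 35, 40]
reflectance = [0, 1, 0, 3, 0, 2, 0, 5, 0]

peaks = select_features(wavelengths, reflectance, kind="peak", min_gap_nm=20)
valleys = select_features(wavelengths, reflectance, kind="valley", min_gap_nm=20)
print([wavelengths[i] for i in peaks])    # [15, 35]
print([wavelengths[i] for i in valleys])  # [10, 30]
```

In this sketch the selected wavelengths would then serve as the reduced variable set in place of the full spectrum. Raising `min_gap_nm` yields fewer variables, which mirrors the trade-off between subset size and fit to the raw data described in the abstract.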

Details

Title
Variables Selection from the Patterns of the Features Applied to Spectroscopic Data—An Application Case
Author
Romero-Béjar, José L. 1; Esquivel, Francisco Javier 2; Esquivel, José Antonio 3

1 Department of Statistics and Operations Research, University of Granada, 18011 Granada, Spain; Instituto de Investigación Biosanitaria (ibs.GRANADA), 18014 Granada, Spain; Institute of Mathematics, University of Granada (IMAG), Ventanilla 11, 18001 Granada, Spain
2 Department of Statistics and Operations Research, University of Granada, 18011 Granada, Spain; Laboratory of 3D Archaeological Modelling, University of Granada, 18011 Granada, Spain
3 Laboratory of 3D Archaeological Modelling, University of Granada, 18011 Granada, Spain; Department of Prehistory and Archaeology, University of Granada, 18011 Granada, Spain
Publication title
Volume
13
Issue
1
First page
99
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
22277390
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
Online publication date
2024-12-29
Milestone dates
2024-11-04 (Received); 2024-12-25 (Accepted)
First posting date
29 Dec 2024
ProQuest document ID
3153862563
Document URL
https://www.proquest.com/scholarly-journals/variables-selection-patterns-features-applied/docview/3153862563/se-2?accountid=208611
Copyright
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-01-10
Database
ProQuest One Academic