TAN Chao,TAN Cheng,CHENG Bin,ZOU Qin,CHEN Hui,WU Tong,LIN Zan.Improving the Accuracy of Spectral Recognition of Expired Drug by an Ensemble Model and Virtual Sample Generation[J].Journal of Instrumental Analysis,2025,44(06):1131-1138.
TAN Chao,TAN Cheng,CHENG Bin,ZOU Qin,CHEN Hui,WU Tong,LIN Zan.Improving the Accuracy of Spectral Recognition of Expired Drug by an Ensemble Model and Virtual Sample Generation[J].Journal of Instrumental Analysis,2025,44(06):1131-1138. DOI: 10.12452/j.fxcsxb.25020465.
Improving the Accuracy of Spectral Recognition of Expired Drug by an Ensemble Model and Virtual Sample Generation
The qualitative identification of fake drugs based on near-infrared(NIR) spectroscopy needs to extract characteristic information and establish prediction models from complex,overlapped and unstable spectra by using computers and chemometrics. In this kind of task,there may also be an imbalanced classification problem where there are relatively few samples of a certain class. Based on the generation of virtual samples and ensemble modeling,this approach has the potential to improve the recognition accuracy for imbalanced training set. In this paper,azithromycin was taken as the research object,a group of experimental samples were designed,and an ensemble algorithm of partial least squares discriminant analysis(PLS-DA) based on virtual samples was proposed to construct a classifier for identifying whether a drug sample had expired. The performance of single and ensemble models was compared in ten different spectral ranges,and the influence of different imbalance ratios,the composition of minority class samples and ensemble size were also discussed. The sensitivity of ensemble models was improved by about 9% on average. Finally,the overall effectiveness of the ensemble learning strategy was confirmed. The proposed ensemble algorithm shows more advantages when there are too few minority class samples,and the method can also be used for other types of systems.
关键词
Keywords
references
Shang H , Shang L W , Wu J J , Xu Z B , Zhou S W , Wang Z H , Wang H J , Yin J H . Spectrochim. Acta A , 2023 , 287 : 121990 .
Neto A J S , Lopes D de C . J. Food Compos. Anal. , 2024 , 135 : 106637 .
Tan C , Chen H , Chen M X , Lin Z . Infrared Phys. Technol. , 2024 , 139 : 105334 .
Liu L , Wang B , Xu X X , Xu J . Infrared Phys. Technol. , 2025 , 145 : 105713 .
Suarin N A S S , Chia K S , Fuzi S F Z M . Knowl.-Based Syst. , 2024 , 295 : 111817 .
Leng H Y , Zhang Z Y , Chen C , Chen C . Spectrochim. Acta A , 2024 , 320 : 124581 .
Qi C C , Zhou N N , Hu T , Wu M T , Chen Q S , Wang H , Zhang K J , Lin Z . Smart Agric. Technol. , 2025 , 10 : 100728 .
Daud S N S S , Sudirman R , Shing T W . Biomed. Signal Proces. Control , 2023 , 83 : 104649 .