Identification and Prediction of Fresh Gasoline Locations and Branding Using Newly Targeted Compound Chromatograms with Chemometrics and Machine Learning

Document Type : Research Article

Authors

1 Jabatan Kimia Malaysia, 46661 Petaling Jaya, Selangor, Malaysia

2 East Coast Environmental Research Institute (ESERI), Universiti Sultan Zainal Abidin, Gong Badak, 21300 Kuala Nerus, Terengganu, Malaysia

3 Faculty of Bioresources and Food Industry, Universiti Sultan Zainal Abidin, Besut Campus, 22200 Besut, Terengganu, Malaysia

Abstract
Forensic investigations place significant importance on the identification and utilisation of gasoline in crime scenes, particularly in cases involving arson. This study employed gas chromatography-mass spectrometry (GC-MS) to analyse gasoline samples. Additionally, chemometrics techniques, specifically principal component analysis (PCA), discriminant analysis (DA), and classification and regression tree (CART) machine learning, were utilised to identify and differentiate the gasoline brands and geographical origins. This study encompasses three widely recognized gasoline brands obtained from stations located across eight distinct Malaysian states, which also contains an oil refinery. A novel chromatogram, known as the targeted compounds chromatogram (TCC), was developed. It consists of toluene, p-xylene, propyl benzene, 1-ethyl-2-methylbenzene, mesitylene, and indane, which were identified as markers using factor analysis. The TCC was then applied to fifty-three training samples, resulting in a 94% accurate classification of the brands and locations of origin. A unique machine-learning model called Classification and Regression Tree (CART) was constructed and effectively used to analyse 100 unidentified real gasoline samples. The model achieved a mean absolute error (MAE) of 1.1 for location and 0.4 for brand. Furthermore, the accuracy of the estimator remained consistent even when changes were made to the training data set. The results collected clearly illustrate the capacity of this methodology to assist in solving of criminal investigations.

Keywords

Subjects


  1. A. Sinkov, P.M.L. Sandercock and J.J. Harynuk, “Chemometric classification of casework arson samples based on gasoline content”, in Forensic Sci Int., 235 (2014) 24–31.
  2. D. Pert, M.G. Baron and J.W. Birkett, “Review of analytical techniques for arson residues”, in J Forensic Sci., 51(5) (2006) 33–49.
  3. M. Hupp, L.J. Marshall and D.I. Campbell, “Chemometric analysis of diesel fuel for forensic and environmental applications”, in Anal Chim Acta, 606(2) (2008) 159–71.
  4. Stauffer, Eric and Lentini, “ASTM standards for fire debris analysis: A review”, in Forensic science international, 132 (2003) 63–7.
  5. L. Flumignan, N. Boralle and J.E. de Oliveira, “Screening Brazilian commercial gasoline quality by hydrogen nuclear magnetic resonance spectroscopic fingerprintings and pattern-recognition multivariate chemometric analysis”, in Talanta, 82(1) (2010) 99–105.
  6.  
  1. Suresh and K.P. Jagath, “An experimental study of the corrosion process of metals in virtue of crude oils and the characteristics”, in Chemical Review and Letters, 2 1 (2019) 21-32.
  2. González, J. Ayuso and J.A. Álvarez, “Application of an HS-MS for the detection of ignitable liquids from fire debris”, in Talanta, (2015) 142:150–6.
  3. J.C. Peschier, M.M.P. Grutters and J.N. Hendrikse, “Using alkylate components for classifying gasoline in fire debris samples”, in J Forensic Sci, 63 2 (2018) 20–30.
  4. Boczkaj, M. Jaszczołt and A. Przyjazny, “Application of normal-phase high-performance liquid chromatography followed by gas chromatography for analytics of diesel fuel additives”, in Anal Bioanal Chem, 405 18 (2013) 95–103.
  5. M. Balabin and R.Z. Safieva, “Gasoline classification by source and type based on near infrared (NIR) spectroscopy data”, in Fuel, 87 7 (2008) 96–101.
  6. M. Balabin, R.Z. Safieva and E.I. Lomakina, “Gasoline classification using near infrared (NIR) spectroscopy data: comparison of multivariate techniques”, in Anal Chim Acta, 671 (2010) 27–35.
  7. Jais, Daud and Sharifah, “Forensic Analysis of Accelerant on Different Fabrics Using Attenuated Total Reflectance-Fourier Transform Infrared Spectroscopy (ATR-FTIR) and Chemometrics Techniques”, in Malaysian Journal of Medicine and Health Sciences, (2020).
  8. C.C. Pereira, V.L. Skrobot and E.V.R. Castro, “Determination of gasoline adulteration by principal components analysis-linear discriminant analysis applied to FTIR spectra”, in Energy Fuels, 20 3 (2006) 97–102.
  9. S.G. Teixeira, F.S. Oliveira and H.C. Dossantos, “Multivariate calibration in Fourier transform infrared spectrometry as a tool to detect adulterations in Brazilian gasoline”, in Fuel, 87 3 (2008) 346–352.
  10. M.V. Malmquist, R.R. Olsen and A.B. Hansen, “Assessment of oil weathering by gas chromatography-mass spectrometry, time warping and principal component analysis”, in J Chromatogr A, 1164 (2007) 262–270.
  11. González, M.J. Ferreiro and M. Barbero, “Novel method based on ion mobility spectrometry sum spectrum for the characterization of ignitable liquids in fire debris”, in Talanta, 199 (2019) 189–194.
  12. Q. Lee, P.M.L. Sandercock and J.J. Harynuk, “The influence of temperature on the pyrolysis of household materials”, in J Anal Appl Pyrol, (2016);118:75–85.
  13. Figueiredo, B.Y. Christophe and Delphine, “Evaluation of an untargeted chemometric approach for the source inference of ignitable liquids in forensic science”, in Forensic Science International, (2018).
  14. Kumar, and V. Sharma, “Chemometrics in Forensic Science”, in Trends in Analytical Chemistry, 105 (2018) 191–201.
  15. G.M. Ghazi, L.C Lee and A.S. Samsudin, “Evaluation of ensemble data preprocessing strategy on forensic gasoline classification using untargeted GC–MS data and classification and regression tree (CART) algorithm”, in Microchemical Journal, (2022) 182.
  16. ASTM E1618-19, “Standard test method for ignitable liquid residues in extracts from fire debris samples by gas chromatography-mass spectrometry”, in ASTM International, (2019) www.astm.org.
  17. Hasan, A. Majhoo, M. Sami and A. Aldulaimi, “Predicting Hydration Enthalpy of Low Molecular Weight Organic Molecules using COSMO-SAC Modeling”, in Chemical Review and Letters6(1) (2022) 86-94. doi: 10.22034/crl.2023.399189.1225
  18. Steinberg, “Chapter 10 CART: Classification and regression trees”, (2009) https://www.semanticscholar.org.
  19. Gonzalez, Y. Kroyan and T. Sarjovaara, “Prediction of Gasoline Blend Ignition Characteristics Using Machine Learning Models”, in Energy and Fuels, 35 (2021) 9332–9340.
Volume 7, Issue 1 - Serial Number 1
January and February 2024
Pages 134-141

  • Receive Date 19 July 2023
  • Revise Date 02 February 2024
  • Accept Date 07 February 2024