Normalizing Gas‐Chromatography–Mass Spectrometry Data: Method Choice can Alter Biological Inference

Bioessays 40 (6):1700210 (2018)
  Copy   BIBTEX

Abstract

We demonstrate how different normalization techniques in GC‐MS analysis impart unique properties to the data, influencing any biological inference. Using simulations, and empirical data, we compare the most commonly used techniques (Total Sum Normalization ‘TSN’; Median Normalization ‘MN’; Probabilistic Quotient Normalization ‘PQN’; Internal Standard Normalization ‘ISN’; External Standard Normalization ‘ESN’; and a compositional data approach ‘CODA’). When differences between biological classes are pronounced, ESN and ISN provides good results, but are less reliable for more subtly differentiated groups. MN, TSN, and CODA approaches produced variable results dependent on the structure of the data, and are prone to false positive biomarker identification. In contrast, PQN exhibits the lowest false positive rate, though with occasionally poor model performance. Because ESN requires extensive pre‐planning, and offers only mixed reliability, and ISN, TSN, MN, and CODA approaches are prone to introducing artefactual differences, we recommend the use of PQN in GC–MS research.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 93,590

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Field Normalization of Scientometric Indicators.Ludo Waltman & Nees Jan van Eck - 2019 - In Wolfgang Glänzel, Henk F. Moed, Ulrich Schmoch & Mike Thelwall (eds.), Springer Handbook of Science and Technology Indicators. Springer Verlag. pp. 281-300.

Analytics

Added to PP
2018-05-03

Downloads
9 (#449,242)

6 months
3 (#1,723,834)

Historical graph of downloads
How can I increase my downloads?

Citations of this work

No citations found.

Add more citations