Non-targeted metabolomics
Non-targeted metabolomics (i.e. discovery metabolomics) is a non-hypothetical approach that aims to obtain as many metabolites as possible from a single analysis. The main purpose is to compare the metabolomes (all metabolites of a given organism) of the control and experimental groups to find out the differences in their metabolites and to explore the metabolic pathways between the differential metabolites. The analysis generally includes:
a. Metabolic profiling (also called differential expression analysis): looking for metabolites of interest with statistically significant abundance changes in a set of experimental and control samples.
b. Identification: After performing metabolic profiling, the chemical structure of these metabolites is determined.
c. Interpretation: The final step in the research process, explaining the association between the metabolites found and the biological process or biological state.
Research Process
The non-targeted metabolomics research process can be mainly divided into experimental design, sample collection and processing, metabolite extraction and concentration, sample detection, data analysis, metabolite identification, and finally biological interpretation. Because the metabolome changes extremely fast, the metabolite species are various, the concentration varies greatly, the chemical properties are different, and the data information is huge, each step may have a large impact on the final results, so it is crucial to realize the standardized operation of metabolomics
Technology platform
The main platforms for non-targeted metabolomics are nuclear magnetic resonance (NMR), gas chromatography-mass spectrometry (GC-MS) and liquid chromatography-mass spectrometry (LC-MS), each of which has a certain bias, so researchers need to choose the appropriate platform according to the purpose of the study and the type of samples. The purpose is to achieve a more complete and comprehensive study.
Table Comparison of major assay platforms for non-targeted
Major Assay Platforms |
Advantages |
Disadvantages |
NMR |
1. High throughput 2. Universality 3. Good objectivity and reproducibility 4. Fast (2~3 min/sample) 5. Simple pretreatment, no derivatization and separation required 6. Non-invasive 7. Can detect most organic compounds in the sample 8. Long instrument life |
1. Detection dynamic range is narrow 2. Lower sensitivity and resolution than MS 3. The sample amount is relatively large (0.1 ~ 0.5 mL) 4. Expensive instrumentation and maintenance costs |
GC-MS |
1. High throughput 2. High precision, sensitivity and reproducibility 3. Has a reference standard spectral database, easy to characterize 4. The sample amount is moderate (0.1~0.2 mL) 5. Can detect most organic and some inorganic molecules in the sample |
1. Requires derivatization 2. Requires separation 3. Slow analysis speed (20~40 min/sample) 4. It is difficult to identify new compounds 5. Not suitable for the analysis of difficult volatile, thermally unstable substances |
LC-MS |
1. High throughput 2. High separation rate and sensitivity 3. Wide dynamic range of detection 4. Simple sample processing, no need for derivatization 5. Small sample size (10~100 μL) 6. Applicable to thermally unstable, non-volatile, not easily derivatized and large molecular weight substances |
1. Slow analysis speed (15~40 min/sample) 2. Lack of standard spectral database for reference 3. Difficult to identify new compounds 4. Higher cost 5. Short instrument life |
Data analysis methods
The data obtained from non-targeted metabolomics are complex and multidimensional, and the chemometric methods need to be fully applied to deeply explore the information in them. The data are generally pre-processed first, and the specific process includes normalization, data transformation, centralization, and normalization steps. Only after pre-processing, simplification and dimensionality reduction of the data are realized, and reliable data models are established. The pattern recognition methods applied to metabolomics data analysis mainly include unsupervised learning methods and supervised learning methods.
The unsupervised learning methods mainly include: principal component analysis (PCA), nonlinear mapping, cluster analysis, etc. Supervised learning methods are mainly based on partial least squares (PLS), neural network improvement methods, least squares-discriminant analysis (PLS-DA), orthogonal least squares (OPLS), etc., among which PCA and PLS-DA are commonly used pattern recognition methods in metabolomics.
Non-targeted metabolomics data processing process
Sample types
Serum - Plasma - Urine - Cerebrospinal fluid - Saliva - Feces - Microorganisms - Cells - Plants - Tissues – Other
Lead time
The non-targeted metabolomics services and period offered by Wuhan Biobank are listed in the following table.:
Platform | Instruments | Database |
Number of compounds detected |
Cycle time | Detection price |
GC-MS |
Aglient 7890A/5975C |
1、NIST |
Tens to hundreds | About 15 working days |
Please inquire |
UPLC-TOF |
SCIEX TripleTOF 5600 plus |
1、Self-built database |
thousands |
About 20 working days | Please inquire |