Unveiling the role of protein palmitoylation in gastric cancer diagnosis via machine learning.
Shen. Zhong Hua ZH; Wu. Jingyu J
Key Findings
- Palmitoylation-related genes are linked to gastric cancer development.
- Four core genes (ASPA, RBM20, COL4A1, MAL) were identified as strong diagnostic markers.
- A Gradient Boosting Machine model using these genes achieved an AUC of 0.963, indicating high diagnostic accuracy.
- SHAP analysis highlighted the contribution of each gene to the model’s predictions.
- The study lacks animal validation and clear mechanisms linking the genes to immune cells.
Practical Outcomes
- For biohackers, the study doesn’t provide a new supplement, dosage, or protocol to try. It mainly offers early‑stage scientific insight that could eventually lead to better gastric cancer tests, but there’s no immediate actionable benefit for longevity, metabolic health, or performance optimization.
Summary
Scientists used AI to look at a type of protein modification called palmitoylation and found four genes that could help detect stomach cancer early. The computer model was very accurate, but the work is still at the research stage and doesn’t give any direct advice for everyday health or longevity practices.
Abstract
Gastric cancer (GC) is a highly morbid and mortal gastrointestinal malignancy, urgently requiring sensitive and specific biomarkers for detection. Protein palmitoylation, a reversible lipid modification process, has been connected to tumor formation, yet its function in gastric cancer (GC) is still insufficiently explored. This research creatively combined palmitoylation-associated characteristics with machine learning methods, utilizing the SHapley Additive exPlanations (SHAP) framework to boost the interpretability of the model. Gene expression profiling datasets from public repositories were collected, with batch effects corrected. Genes with differential expression (DEGs) were pinpointed, and an analysis of functional enrichment was carried out. Through intersection analysis of DEGs and a palmitoylation gene set, and integration of LASSO regression, SVM-RFE, and random forest algorithms, four core genes (ASPA, RBM20, COL4A1, and MAL) were selected. Ten machine learning models were built, among which the Gradient Boosting Machine (GBM) model achieved the optimal performance (AUC = 0.963). SHAP analysis uncovered the notable contributions of the four core genes to model classification. The study also explored gene expression characteristics, immune cell correlations, and spatial heterogeneity. However, it has limitations such as lack of in-vivo animal model validation, unclear core gene-immune cell interaction mechanisms, and insufficient sample diversity. Overall, this research provides new insights into GC pathogenesis and directions for future studies on diagnosis and treatment.
Study Information
pubmed
2025
2025-10-28T00:00:00.000Z
10.1007/s12672-025-03804-5
34