Mode
Text Size
Log in / Sign up

INLA methods identified hypervariable CpG sites faster than MCMC in healthy individualsScientists find hidden switches in DNA that may control disease risk

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway
Note that INLA methods offered faster runtime and concordant results compared to MCMC for CpG site identification.

This cohort study included 158 healthy individuals within the MAMELI cohort setting. The primary objective was the identification of hypervariable CpG sites. The comparator involved MCMC-based methods. The study also assessed computational runtime comparison and concordance with MCMC-based methods as secondary outcomes.

Hypervariable CpG sites were identified using the INLA approach. Computational runtime was shorter by orders of magnitude when using INLA versus MCMC. Concordance between the two methods was reported as concordant. Specific absolute numbers, p-values, or confidence intervals were not reported for these outcomes.

Safety data, adverse events, and tolerability were not reported. Follow-up duration was not reported. The study phase was not reported. Funding or conflicts of interest were not reported. Practice relevance was not reported. Limitations were not explicitly listed in the provided data. Given the observational nature of the cohort and missing details, clinical application should be restrained.

The tags that change the most

Most research focuses on average methylation levels across large groups. But what if the real story isn’t the average — but the variation? What if the spots that change the most from person to person, or even day to day, are the ones that matter most?

That’s what this study explores. Instead of just measuring how much methylation is present, researchers looked for “hot spots” — CpG sites in DNA where methylation levels vary widely between healthy people.

These hypervariable sites aren’t random. They cluster near genes tied to inflammation and stress response. That’s a big clue. It suggests they may help the body adapt — but could also make some people more vulnerable to disease later on.

Think of it like a thermostat. A stable thermostat keeps your house comfortable. But one that swings wildly? It could signal a deeper problem. These DNA hot spots may be the body’s biological thermostats for stress and immune response.

A faster way to find hidden patterns

Finding these sites used to take weeks. The standard method, called MCMC, is accurate but slow. It’s like solving a maze by trying every path one by one.

The new method uses something called INLA — integrated nested Laplace approximation. It’s like using a map and a drone to find the exit fast. It’s not trial and error. It’s smart math that skips the long wait.

The researchers tested it on 158 healthy people from the MAMELI cohort. They used nanopore sequencing to read methylation across 13 key genes. The INLA method found the same hot spots as the old method — but in minutes instead of days.

This speed opens doors. Scientists can now test many models, tweak variables, and explore data in real time. Large studies that once took months can now be done in days.

The team found a set of CpG sites with unusually high variability. These sites were not in random parts of the genome. They were near regulatory regions — the control panels of genes.

Many were close to genes like NFKB1 and IL6, which play major roles in inflammation. Others were near stress-response genes like HSP90. That makes sense. These systems need to adapt quickly to threats. But too much flexibility could lead to dysfunction.

One site, near the SOD2 gene, showed extreme variation. SOD2 helps protect cells from damage. If its control switch is unstable, it might not respond properly when needed. That could raise the risk for chronic diseases like diabetes or heart disease later in life.

The findings don’t mean these people are sick. They’re healthy. But their DNA already shows differences in how they handle stress. That could help explain why some people get sick while others stay well — even with similar lifestyles.

But there's a catch.

This doesn’t mean this treatment is available yet.

The study only looked at healthy people. It didn’t track whether high variability led to disease. So we can’t say for sure that these hot spots cause illness. They might just be markers — signs of something else going on.

Also, the panel only covered 13 genes. The human genome has about 20,000. This is just a small window. Future studies need to scan the whole genome to find more hot spots.

And while INLA is fast and accurate, it’s still a statistical model. It works best when assumptions about the data hold true. In more complex cases, it might miss subtle patterns.

What this means for the future

Experts say this approach could change how we study disease risk. Instead of waiting for symptoms, we might one day use methylation variability as an early warning system.

Imagine a blood test that doesn’t just check cholesterol or glucose — but also scans for unstable gene controls. Doctors could spot rising risk long before disease takes hold. Then, with lifestyle changes or early interventions, they might prevent illness before it starts.

But we’re not there yet. This tool is for research only. It’s not approved for clinics. And even when it is, it will be one piece of a much bigger puzzle.

What happens next

The next step is to test this method in larger, more diverse groups. Researchers need to follow people over time to see if high methylation variability predicts future illness.

They’ll also expand beyond inflammation genes to cover the whole genome. And they’ll test whether lifestyle changes — like diet, sleep, or stress reduction — can stabilize these hot spots.

For now, the message is clear: health isn’t just about what genes you have. It’s about how they’re controlled. And some of the most important clues may be hiding in the parts of our DNA that change the most.

Study Details

Study typeCohort
EvidenceLevel 3
PublishedApr 2026
View Original Abstract ↓
DNA methylation is an epigenetic regulator of gene expression and cell identity, which can be shaped by both physiological and pathological factors, including environmental exposure. The identification of sites with high methylation variability can be computationally challenging, especially in large-scale studies. To address this, we propose a framework based on the integrated nested Laplace approximation (INLA) to model methylation with Bayesian generalized linear mixed models (GLMMs), accounting for subject covariates, genomic annotations, and cell composition. To validate the methodology, we sequenced 158 healthy subjects with nanopore and analyzed a panel of 13 genes related to inflammation and stress response. We identified a set of hypervariable CpG sites whose genomic context and methylation levels were consistent with a regulatory role, making them potential candidates for epigenomic association studies. In our comparison, INLA results were concordant with those obtained with MCMC-based methods, with runtimes shorter by orders of magnitude. The computational efficiency of the framework allows for fast exploratory data analysis, model testing, and iterative prototyping, making it viable for large-scale studies that otherwise would be computationally prohibitive.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.