Mode
Text Size
Log in / Sign up

Review of WGS integration in Emirati genome program reveals high breast cancer variant prevalence and risk stratification utilityYour DNA May Predict Breast Cancer Risk Decades Before Diagnosis

AI-generated summary of the cited source, checked by automated accuracy review. How we work

Key Takeaway
Note that Emirati breast cancer variant prevalence is high, with allele frequencies up to tenfold higher than global references.

This publication is an observational review examining the integration of whole-genome sequencing with electronic health records within the Emirati Genome Program. The study scope encompasses 436,780 individuals, including 229,309 women, to evaluate the prevalence and penetrance of pathogenic and likely pathogenic variants alongside the performance of polygenic risk scores. The setting is the United Arab Emirates, utilizing global reference datasets as a comparator to highlight population-specific genetic differences.

The analysis reports that the prevalence of pathogenic and likely pathogenic variants in women is 0.84%, contributing to 5.2% of breast cancer cases. The mean age of women with these variants was 45.9 +/- 11.1 years. Age-specific cumulative risk for BRCA1 c.4065_4068del reached 37.6% by age 60, while the corresponding risk for BRCA2 c.2808_2811del was 31%.

Allele frequency comparisons showed values up to tenfold higher in the Emirati population than in global reference datasets. Furthermore, the European-derived polygenic risk score model advanced the prediction of 10-year breast cancer risk onset for women in the top decile by a decade. Even within sister pairs, higher polygenic risk was observed, suggesting significant familial aggregation.

The authors acknowledge that the genetic architecture of breast cancer in Arab populations remains largely understudied. Consequently, the integration of monogenic, polygenic, and familial data defines a national framework for risk stratification, identifying disease-free women potentially eligible for targeted prevention. Clinicians should interpret these findings as descriptive of this specific cohort rather than causal evidence for broader populations.

A warning hidden in plain sight

Imagine being told your risk of breast cancer is average — when your DNA quietly tells a different story. For millions of women across the Middle East, that mismatch may have been hiding for decades.

Most breast cancer risk tools were designed using data from European and North American populations. For women whose ancestry traces to the Arab world, those tools may simply not apply.

Why this gap matters now

Breast cancer is the most common cancer in women worldwide. It is also highly treatable when found early. But effective screening depends on knowing who is at high risk — and that knowledge has not been equally distributed.

Until now, almost no large-scale genomic data existed for Arab populations. That meant doctors in countries like the United Arab Emirates were using risk calculators built from genetic data that largely excluded their patients.

What changed with this project

Previously, genetic risk for breast cancer was assessed using European-derived scores and a handful of known gene mutations — mainly BRCA1 and BRCA2. Women in the UAE were screened using those same tools, even though the underlying data did not reflect their genetic background.

But here's where things get more complex: researchers analyzed 436,780 individuals through the Emirati Genome Program, one of the largest national sequencing efforts ever launched. By pairing whole-genome sequencing with electronic health records and family trees reconstructed for more than 48,000 families, they could see breast cancer risk in a way no prior study of this population had attempted.

Risk scores that were built for someone else

Think of a polygenic risk score (PRS) as a weather forecast built for a different city. It captures general patterns, but local conditions — geography, genetics, history — can make that forecast unreliable. For a woman in Dubai, a risk model trained in Denmark is an imperfect fit.

The study found that two specific gene mutations — one in BRCA1 and one in BRCA2 — appear in Emirati women at rates up to ten times higher than in global reference databases. These variants were linked to cumulative breast cancer risks of 37.6% and 31% by age 60, respectively. That means roughly one in three women carrying these variants may develop breast cancer before their seventh decade of life.

Among 229,309 Emirati women in the dataset, about 0.84% carried a high-risk gene variant in one of thirteen cancer-associated genes recommended for testing. Those variants accounted for 5.2% of all breast cancer cases in the study — and those cases tended to occur nearly a decade earlier than average, with a mean diagnosis age of about 46 years.

Women in the top tenth of a polygenic risk score — meaning their combination of many small genetic variants added up to high risk — had a 10-year risk onset that was roughly a decade earlier than women at average risk.

This research does not mean that all Arab women need genetic testing immediately — but it does mean the tools used to decide who gets tested may need to be updated.

A broader view of inherited risk

What made this framework unusual is that it combined three types of risk: individual high-impact gene mutations, polygenic scores (the combined effect of thousands of smaller variants), and family history. When all three were considered together, the picture of who is truly at risk became much sharper — even distinguishing risk levels between sisters within the same family.

Researchers in this field generally agree that risk stratification works best when it is population-specific. This study provides the first data set large enough to begin building that specificity for Arab women.

If you are a woman of Arab or Middle Eastern descent, your current breast cancer risk assessment may be based on tools that were not designed with your genetics in mind. It is worth asking your doctor whether population-specific genetic counseling or expanded genetic testing might be appropriate, particularly if you have a family history of breast cancer or were diagnosed at a younger age.

Limits worth knowing

This study focused on Emirati nationals and may not fully represent other Arab populations, which vary genetically across the region. The European-derived polygenic risk score used in the analysis performed well but was not specifically built for this population. Results still need validation in other Arab cohorts before the framework can be widely adopted.

The research team envisions this framework becoming the foundation for a national precision prevention program in the UAE — one where women are categorized by their actual genomic risk rather than average population statistics. If validated and expanded to other countries in the region, this approach could reshape breast cancer screening across the Arab world, ensuring that the tools used to protect women's health actually reflect who those women are.

Study Details

EvidenceLevel 5
PublishedApr 2026
View Original Abstract ↓
The genetic architecture of Breast Cancer (BC) in Arab populations remains largely understudied, limiting the precision of current prevention and screening programs. The Emirati Genome Program (EGP), one of the world's first nation-wide sequencing initiatives, offers an unprecedented opportunity to delineate inherited BC risk across an entire population. We analyzed 436,780 EGP individuals, including 229,309 women, integrating whole-genome sequencing (WGS) with electronic health records (EHRs). We quantified the prevalence and penetrance of pathogenic and likely pathogenic (P/LP) variants across 13 NCCN-recommended BC genes, evaluated the performance of established polygenic risk scores (PRS), and reconstructed >48,000 pedigrees to measure familial aggregation. P/LP variants were identified in 0.84% of women, accounting for 5.2% of BC cases (mean age of 45.9+/- 11.1 years). Highly penetrant BRCA1 c.4065_4068del (p.Asn1355fs) and BRCA2 c.2808_2811del (p.Ala938Profs) variants showed age-specific cumulative risks of 37.6% and 31% by age 60, respectively, and allele frequencies up to tenfold higher in the Emirati population than in global reference datasets. The European-derived PRS model (PGS000004) demonstrated strong performance, advancing 10-year BC risk onset by a decade for women in the top decile. Family-based PRS discriminated affected from unaffected individuals, revealing higher polygenic risk even within sister pairs. Integration of monogenic, polygenic, and familial data defined a national framework for risk stratification, identifying disease-free women potentially eligible for targeted prevention. Nation-scale genome sequencing reveals, for the first time, the comprehensive landscape of inherited BC susceptibility within a Middle Eastern population. The integration of monogenic, polygenic, and familial data establishes a national framework for genomic risk stratification- transforming population genomics into a foundation for precision prevention and early detection in the UAE and beyond.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.