In Silico Platform Shows Promise for Creating Personality-Stable Elderly Digital Twin Agents

Photo by Cht Gsml / Unsplash

medRxiv Published April 3, 2026 Medically reviewed April 26, 2026 Study authors: Wang, J.; Yang, Z.; Zhu, Z.; Zhu, X.; Huang, Z.; Wang, H.; Tian, L.; Cao, Y.; Qu, X.; Qi, X.; Wu, B. DOI ↗ By Dr. Sofia Müller, MD · Lifespan & Whole-Person Care

Key Takeaway

Note: This is an in silico platform validation; no clinical patient data or outcomes are reported.

This study describes the development and psychometric validation of ELDER-SIM, an in silico platform for building personality-stable digital twin agents representing elderly individuals. The platform was tested under four conditions: a Baseline prompt-only model, and models enhanced with memory components (+Memory), a Cognitive Conceptualization Diagram (+CCD), and domain-specific fine-tuning (+LoRA). The primary outcome was personality consistency, measured by Cronbach's α for internal consistency, intraclass correlation coefficient (ICC) for test-retest reliability, and role discrimination accuracy.

Results showed acceptable to excellent internal consistency across all conditions, with Cronbach's α ranging from 0.70 to 0.94. Test-retest stability was consistently high, with ICC values ranging from 0.85 to 0.96. Role discrimination accuracy improved stepwise from Baseline (83.3%) to +Memory (88.9%), +CCD (94.4%), and +LoRA (97.2%). The +CCD condition showed the largest gain in internal consistency (mean α increased from 0.702 to 0.892), while the +LoRA condition achieved the highest overall metrics (α 0.940, ICC 0.958).

No safety, tolerability, or adverse event data were reported, as this was an in silico technical validation. Key limitations include the absence of reported sample size, population details, follow-up duration, and statistical significance measures (p-values or confidence intervals). The study did not involve human participants or clinical outcomes.

The authors suggest this framework could support more reliable longitudinal digital twins for elderly mental health and psychosocial care research, enabling reproducible in silico evaluation of interaction trajectories and intervention strategies prior to clinical deployment. However, this remains a proof-of-concept technical platform; its direct clinical practice relevance for patient care is not established.

Study Details

EvidenceLevel 5

PublishedMar 2026

View Original Abstract ↓

BackgroundLarge language models (LLMs) enable patient-facing conversational agents, creating a plausible pathway toward patient digital twins that can capture older adults lived experiences, beliefs, and behavioral responses across time. A central barrier to clinical-grade digital twins is personality drift--inconsistent trait expression across repeated, longitudinal interactions--which can undermine the reliability of generated trajectories and intervention-response simulation in geriatric care contexts. ObjectiveTo develop ELDER-SIM as a multi-role elderly-care conversational platform oriented toward building personality-stable elderly digital twin agents, and to propose a psychometric validation framework for quantifying and improving personality consistency in LLM-based agents across repeated interactions. MethodsELDER-SIM was implemented using n8n workflow orchestration with local LLM inference (Ollama/vLLM), integrating (1) Big Five (OCEAN) trait specifications, (2) a three-layer Cognitive Conceptualization Diagram (CCD) grounded in Becks cognitive behavioral therapy framework, and (3) a MySQL-based long-term memory module for persistent states. We conducted a systematic ablation study across four conditions--Baseline (prompt-only), +Memory, +CCD, and +LoRA (domain-specific fine-tuning using 19,717 instruction pairs derived from CHARLS)--and evaluated personality consistency using Cronbachs (internal consistency), intraclass correlation coefficient (ICC) (test-retest reliability), and role discrimination accuracy . ResultsPersonality measurement reliability was acceptable to excellent across conditions (Cronbachs : 0.70-0.94), with consistently high test-retest stability (ICC: 0.85- 0.96). Role discrimination improved stepwise from 83.3% (Baseline) to 88.9% (+Memory), 94.4% (+CCD), and 97.2% (+LoRA). CCD produced the largest gain in internal consistency (mean 0.702[->]0.892), while LoRA achieved the highest overall internal consistency ( 0.940) and ICC (0.958). ConclusionsELDER-SIM provides a psychometrically validated approach for constructing personality-consistent elderly digital twin agents. By demonstrating that structured cognitive modeling and domain adaptation materially reduce personality drift, the framework supports more reliable longitudinal digital twins for elderly mental health and psychosocial care, enabling reproducible in silico evaluation of interaction trajectories and intervention strategies before clinical deployment.

In Silico Platform Shows Promise for Creating Personality-Stable Elderly Digital Twin Agents

Study Details

Group dance interventions improved physical performance and cognition in community-dwelling older adults with sarcopenia.

Why Group Dance Is Becoming Medicine for Aging Muscles

Clinical research that matters. Delivered to your inbox.

In Silico Platform Shows Promise for Creating Personality-Stable Elderly Digital Twin Agents

Study Details

Group dance interventions improved physical performance and cognition in community-dwelling older adults with sarcopenia.

Why Group Dance Is Becoming Medicine for Aging Muscles

Clinical research that matters. Delivered to your inbox.

Related in Geriatrics & Aging

From Other Specialties