Mode
Text Size
Log in / Sign up

In Silico Platform Shows Promise for Creating Personality-Stable Elderly Digital Twin Agents

In Silico Platform Shows Promise for Creating Personality-Stable Elderly Digital Twin Agents
Photo by Cht Gsml / Unsplash
Key Takeaway
Note: This is an in silico platform validation; no clinical patient data or outcomes are reported.

This study describes the development and psychometric validation of ELDER-SIM, an in silico platform for building personality-stable digital twin agents representing elderly individuals. The platform was tested under four conditions: a Baseline prompt-only model, and models enhanced with memory components (+Memory), a Cognitive Conceptualization Diagram (+CCD), and domain-specific fine-tuning (+LoRA). The primary outcome was personality consistency, measured by Cronbach's α for internal consistency, intraclass correlation coefficient (ICC) for test-retest reliability, and role discrimination accuracy.

Results showed acceptable to excellent internal consistency across all conditions, with Cronbach's α ranging from 0.70 to 0.94. Test-retest stability was consistently high, with ICC values ranging from 0.85 to 0.96. Role discrimination accuracy improved stepwise from Baseline (83.3%) to +Memory (88.9%), +CCD (94.4%), and +LoRA (97.2%). The +CCD condition showed the largest gain in internal consistency (mean α increased from 0.702 to 0.892), while the +LoRA condition achieved the highest overall metrics (α 0.940, ICC 0.958).

No safety, tolerability, or adverse event data were reported, as this was an in silico technical validation. Key limitations include the absence of reported sample size, population details, follow-up duration, and statistical significance measures (p-values or confidence intervals). The study did not involve human participants or clinical outcomes.

The authors suggest this framework could support more reliable longitudinal digital twins for elderly mental health and psychosocial care research, enabling reproducible in silico evaluation of interaction trajectories and intervention strategies prior to clinical deployment. However, this remains a proof-of-concept technical platform; its direct clinical practice relevance for patient care is not established.

Study Details

EvidenceLevel 5
PublishedMar 2026
View Original Abstract ↓
BackgroundLarge language models (LLMs) enable patient-facing conversational agents, creating a plausible pathway toward patient digital twins that can capture older adults lived experiences, beliefs, and behavioral responses across time. A central barrier to clinical-grade digital twins is personality drift--inconsistent trait expression across repeated, longitudinal interactions--which can undermine the reliability of generated trajectories and intervention-response simulation in geriatric care contexts. ObjectiveTo develop ELDER-SIM as a multi-role elderly-care conversational platform oriented toward building personality-stable elderly digital twin agents, and to propose a psychometric validation framework for quantifying and improving personality consistency in LLM-based agents across repeated interactions. MethodsELDER-SIM was implemented using n8n workflow orchestration with local LLM inference (Ollama/vLLM), integrating (1) Big Five (OCEAN) trait specifications, (2) a three-layer Cognitive Conceptualization Diagram (CCD) grounded in Becks cognitive behavioral therapy framework, and (3) a MySQL-based long-term memory module for persistent states. We conducted a systematic ablation study across four conditions--Baseline (prompt-only), +Memory, +CCD, and +LoRA (domain-specific fine-tuning using 19,717 instruction pairs derived from CHARLS)--and evaluated personality consistency using Cronbachs (internal consistency), intraclass correlation coefficient (ICC) (test-retest reliability), and role discrimination accuracy . ResultsPersonality measurement reliability was acceptable to excellent across conditions (Cronbachs : 0.70-0.94), with consistently high test-retest stability (ICC: 0.85- 0.96). Role discrimination improved stepwise from 83.3% (Baseline) to 88.9% (+Memory), 94.4% (+CCD), and 97.2% (+LoRA). CCD produced the largest gain in internal consistency (mean 0.702[->]0.892), while LoRA achieved the highest overall internal consistency ( 0.940) and ICC (0.958). ConclusionsELDER-SIM provides a psychometrically validated approach for constructing personality-consistent elderly digital twin agents. By demonstrating that structured cognitive modeling and domain adaptation materially reduce personality drift, the framework supports more reliable longitudinal digital twins for elderly mental health and psychosocial care, enabling reproducible in silico evaluation of interaction trajectories and intervention strategies before clinical deployment.
Free Newsletter

Clinical research that matters. Delivered to your inbox.

Join thousands of clinicians and researchers. No spam, unsubscribe anytime.