Agent skill

multi-ancestry-prs-agent

Stars 2,009
Forks 275

Install this agent skill to your Project

npx add-skill https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/multi-ancestry-prs-agent

SKILL.md


name: 'multi-ancestry-prs-agent' description: 'AI-powered multi-ancestry polygenic risk score calculation and optimization for equitable disease risk prediction across diverse global populations.' measurable_outcome: Execute skill workflow successfully with valid output within 15 minutes. allowed-tools:

  • read_file
  • run_shell_command

Multi-Ancestry PRS Agent

The Multi-Ancestry PRS Agent provides AI-optimized polygenic risk score calculation designed to work across diverse ancestral populations. It addresses the critical limitation of European-biased GWAS by integrating trans-ancestry methods, improving risk prediction for underrepresented populations and enabling equitable precision medicine.

When to Use This Skill

  • When calculating PRS for non-European ancestry individuals.
  • For developing trans-ancestry risk prediction models.
  • To reduce PRS bias across ancestral populations.
  • When integrating multi-ancestry GWAS summary statistics.
  • For research on PRS portability and equity.

Core Capabilities

  1. Multi-Ancestry PRS: Calculate ancestry-aware polygenic scores.

  2. Trans-Ancestry Optimization: Optimize weights across populations.

  3. Local Ancestry Integration: Account for admixed genomes.

  4. Ensemble Methods: Combine multiple PRS approaches.

  5. Ancestry Calibration: Population-specific score calibration.

  6. Equity Assessment: Evaluate prediction fairness across groups.

Supported Ancestries

Ancestry Training Data Availability PRS Performance
European (EUR) Excellent High
East Asian (EAS) Good Good
South Asian (SAS) Moderate Moderate
African (AFR) Limited Lower
Hispanic/Latino (AMR) Limited Variable
Middle Eastern (MID) Very Limited Lower

Multi-Ancestry Methods

Method Approach Best For
PRS-CSx Cross-population shrinkage Multi-ancestry
PRS-Multi Multi-population training Large cohorts
Ensemble Combined methods Best overall
Local Ancestry Ancestry-specific weights Admixed populations
GPSMult Integrated multi-trait Correlated traits

Workflow

  1. Input: Individual genotypes, target ancestry, disease/trait.

  2. Ancestry Inference: Determine genetic ancestry.

  3. Method Selection: Choose optimal PRS approach.

  4. Score Calculation: Compute ancestry-aware PRS.

  5. Calibration: Apply population-specific calibration.

  6. Risk Stratification: Categorize into risk groups.

  7. Output: PRS, percentile, clinical interpretation.

Example Usage

User: "Calculate multi-ancestry coronary artery disease PRS for this admixed individual with African and European ancestry."

Agent Action:

bash
python3 Skills/Precision_Medicine/Multi_Ancestry_PRS_Agent/calc_prs.py \
    --genotypes patient_genotypes.vcf.gz \
    --ancestry admixed_AFR_EUR \
    --local_ancestry lai_segments.bed \
    --trait coronary_artery_disease \
    --method prs_csx \
    --gwas_summary_stats eur_gwas.txt,afr_gwas.txt \
    --calibration_cohort 1kg_admixed \
    --output prs_results/

Input Requirements

Input Format Purpose
Genotypes VCF/PLINK Individual variants
Ancestry Estimated or self-reported Method selection
GWAS Summary Stats Multiple ancestries Score weights
Local Ancestry LAI segments Admixture handling
Reference Panel Multi-ancestry LD calculation

Output Components

Output Description Format
PRS Score Raw polygenic score .csv
Percentile Population-specific ranking .csv
Risk Category High/Intermediate/Low .csv
Ancestry Breakdown Component scores .json
Confidence Interval Score uncertainty .json
Clinical Interpretation Risk explanation .md

Disease-Specific Performance

Disease Multi-Ancestry AUC EUR Only AUC Improvement
CAD 0.75-0.80 0.70-0.85 5-10% in non-EUR
Type 2 Diabetes 0.70-0.75 0.65-0.72 8-12% in AFR
Breast Cancer 0.65-0.72 0.60-0.70 5-8% globally
Alzheimer's 0.70-0.78 0.65-0.75 5-10% in diverse

AI/ML Components

PRS Optimization:

  • Bayesian shrinkage (PRS-CS)
  • Cross-population learning
  • Neural network weight optimization

Ancestry Inference:

  • Supervised classification
  • Unsupervised clustering (PCA, ADMIXTURE)
  • Local ancestry inference (RFMix)

Ensemble Learning:

  • Stacking multiple PRS methods
  • Ancestry-stratified weighting
  • Uncertainty quantification

Clinical Integration

Application PRS Role Clinical Action
Primary Prevention Risk stratification Screening intensity
Risk Communication Personalized risk Lifestyle modification
Treatment Selection Predicted response Drug choice
Family Screening Cascade testing Genetic counseling

Prerequisites

  • Python 3.10+
  • PLINK 2.0
  • PRSice-2, LDpred2, PRS-CSx
  • Multi-ancestry reference panels
  • GWAS summary statistics

Related Skills

  • PRS_Net_Deep_Learning_Agent - Deep learning PRS
  • Pharmacogenomics_Agent - Drug-gene interactions
  • PopEVE_Variant_Predictor_Agent - Variant interpretation
  • DiagAI_Agent - Clinical integration

Bias and Fairness

Bias Type Cause Mitigation
Discovery Bias EUR-dominated GWAS Multi-ancestry GWAS
LD Variation Population-specific LD Local ancestry adjustment
Allele Frequency Differing frequencies Population-specific weights
Effect Size Heterogeneous effects Trans-ancestry meta-analysis

Large-Scale Initiatives

Initiative Focus Contribution
All of Us US diversity 1M diverse participants
PAGE Multi-ethnic GWAS Discovery in diverse
H3Africa African genomics Continental diversity
Mexican Biobank Latin American Admixed populations
GBMI Global Biobank Multi-ancestry meta-analysis

Special Considerations

  1. Self-Reported Ancestry: May not match genetic ancestry
  2. Admixture: Require local ancestry methods
  3. Population Stratification: Careful covariate adjustment
  4. Clinical Validity: Validate in target population
  5. Health Equity: Consider access disparities

ESC Guidelines Integration (2025)

Recommendation PRS Role Evidence Level
CV Risk Assessment Risk modifier IIa, B
Statin Decisions Borderline risk reclassification IIa, B
Family History Enhancement Quantify genetic burden IIa, C

Limitations

Limitation Impact Research Needed
AFR Performance Lower accuracy More GWAS
Rare Variants Not captured WGS integration
Gene-Environment Not modeled Interaction studies
Clinical Utility Limited evidence Randomized trials

Author

AI Group - Biomedical AI Platform

Expand your agent's capabilities with these related and highly-rated skills.

FreedomIntelligence/OpenClaw-Medical-Skills

vcf-annotator

Annotate VCF variants with VEP, ClinVar, gnomAD frequencies, and ancestry-aware context. Generates prioritised variant reports.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

chemist-analyst

Analyzes events through chemistry lens using molecular structure, reaction mechanisms, thermodynamics, kinetics, and analytical techniques (spectroscopy, chromatography, mass spectrometry). Provides insights on chemical processes, material properties, reaction pathways, synthesis, and analytical methods. Use when: Chemical reactions, material analysis, synthesis planning, process optimization, environmental chemistry. Evaluates: Molecular structure, reaction mechanisms, yield, selectivity, safety, environmental impact.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

bio-alignment-io

Read, write, and convert multiple sequence alignment files using Biopython Bio.AlignIO. Supports Clustal, PHYLIP, Stockholm, FASTA, Nexus, and other alignment formats for phylogenetics and conservation analysis. Use when reading, writing, or converting alignment file formats.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

sleep-analyzer

分析睡眠数据、识别睡眠模式、评估睡眠质量,并提供个性化睡眠改善建议。支持与其他健康数据的关联分析。

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

metabolomics-workbench-database

Access NIH Metabolomics Workbench via REST API (4,200+ studies). Query metabolites, RefMet nomenclature, MS/NMR data, m/z searches, study metadata, for metabolomics and biomarker discovery.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

bio-hi-c-analysis-matrix-operations

Balance, normalize, and transform Hi-C contact matrices using cooler and cooltools. Apply iterative correction (ICE), compute expected values, and generate observed/expected matrices. Use when normalizing or transforming Hi-C matrices.

2,009 275
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results