Agent skill
prs-net-deep-learning-agent
Install this agent skill to your Project
npx add-skill https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/prs-net-deep-learning-agent
SKILL.md
name: 'prs-net-deep-learning-agent' description: 'Geometric deep learning-based polygenic risk score prediction using PRS-Net for modeling gene interactions, enhanced disease prediction, and cross-ancestry portability.' measurable_outcome: Execute skill workflow successfully with valid output within 15 minutes. allowed-tools:
- read_file
- run_shell_command
PRS-Net Deep Learning Agent
The PRS-Net Deep Learning Agent implements interpretable geometric deep learning for polygenic risk score prediction. PRS-Net models non-linear gene-gene interactions and biological network relationships to enhance disease prediction accuracy and improve cross-ancestry portability compared to traditional linear PRS methods.
When to Use This Skill
- When linear PRS methods show limited performance.
- For modeling complex gene-gene interactions.
- To improve PRS portability across ancestries.
- When biological interpretability of PRS is needed.
- For integrating pathway and network information.
Core Capabilities
-
Non-Linear PRS: Capture gene-gene interactions via deep learning.
-
Network Integration: Incorporate protein-protein interaction networks.
-
Interpretability: Identify important pathways and gene modules.
-
Cross-Ancestry Transfer: Improved portability via learned biology.
-
Multi-Task Learning: Joint modeling of related traits.
-
Uncertainty Quantification: Provide prediction confidence.
PRS-Net Architecture
| Component | Function | Innovation |
|---|---|---|
| Input Layer | Gene-level summaries | Aggregated variant effects |
| Network Encoder | PPI graph convolution | Biological structure |
| Attention Layer | Gene importance | Interpretability |
| Predictor | Disease/trait prediction | Non-linear mapping |
| Explanation | Pathway enrichment | Biological insights |
Comparison to Traditional PRS
| Aspect | Linear PRS | PRS-Net |
|---|---|---|
| Gene Interactions | Not modeled | GNN captures |
| Network Biology | Ignored | Integrated |
| Interpretability | Limited (SNP weights) | Pathway-level |
| Cross-Ancestry | Often poor | Improved |
| Computational Cost | Low | Moderate |
| Training Data Needed | Low | Moderate |
Workflow
-
Input: Individual genotypes, PPI network, training phenotypes.
-
Gene Summarization: Aggregate SNPs to gene-level scores.
-
Network Encoding: Learn representations on PPI graph.
-
Prediction: Non-linear disease risk prediction.
-
Interpretation: Extract important genes and pathways.
-
Cross-Ancestry: Apply to diverse populations.
-
Output: Risk scores, uncertainty, biological explanations.
Example Usage
User: "Calculate PRS-Net scores for Type 2 Diabetes with pathway-level interpretation."
Agent Action:
python3 Skills/Precision_Medicine/PRS_Net_Deep_Learning_Agent/prs_net_predict.py \
--genotypes cohort_genotypes.vcf.gz \
--ppi_network string_ppi.graphml \
--trait type2_diabetes \
--model_weights prs_net_t2d_v1.pt \
--interpret_pathways true \
--ancestry_calibration multi \
--output prs_net_results/
Input Requirements
| Input | Format | Purpose |
|---|---|---|
| Genotypes | VCF/PLINK | SNP data |
| PPI Network | GraphML, edge list | Gene relationships |
| Gene Mapping | BED | SNP-to-gene |
| Training Labels | Phenotype file | Model training |
| GWAS Summary | Optional | Initialization |
Output Components
| Output | Description | Format |
|---|---|---|
| PRS-Net Score | Non-linear polygenic score | .csv |
| Risk Percentile | Population ranking | .csv |
| Gene Importance | Attention weights | .csv |
| Pathway Enrichment | Top pathways | .csv |
| Module Visualization | Network subgraphs | .png |
| Uncertainty | Prediction confidence | .json |
Network Biology Integration
| Network | Source | Genes | Edges |
|---|---|---|---|
| STRING PPI | String-db | 19,000 | 5.5M |
| BioGRID | BioGRID | 18,000 | 1.2M |
| Reactome | Reactome | 10,000 | 250K |
| GO Biological Process | Gene Ontology | 18,000 | Hierarchical |
Performance Benchmarks
| Disease | Linear PRS AUC | PRS-Net AUC | Improvement |
|---|---|---|---|
| Type 2 Diabetes | 0.65 | 0.72 | +7% |
| Coronary Artery Disease | 0.70 | 0.76 | +6% |
| Schizophrenia | 0.62 | 0.68 | +6% |
| Alzheimer's Disease | 0.68 | 0.74 | +6% |
Cross-Ancestry Portability
| Ancestry | Linear PRS Drop | PRS-Net Drop |
|---|---|---|
| EUR → EAS | -15% | -8% |
| EUR → AFR | -30% | -18% |
| EUR → SAS | -20% | -12% |
| EUR → AMR | -18% | -10% |
AI/ML Components
Graph Neural Networks:
- Graph convolutional networks (GCN)
- Graph attention networks (GAT)
- Message passing neural networks
Interpretability:
- Attention visualization
- Integrated gradients
- Pathway enrichment analysis
Transfer Learning:
- Pre-training on EUR
- Fine-tuning on diverse
- Domain adaptation
Prerequisites
- Python 3.10+
- PyTorch, PyTorch Geometric
- NetworkX, igraph
- Scanpy (optional for visualization)
- GPU recommended
Related Skills
- Multi_Ancestry_PRS_Agent - Traditional multi-ancestry PRS
- PopEVE_Variant_Predictor_Agent - Variant interpretation
- Pharmacogenomics_Agent - Drug-gene interactions
- Pathway_Analysis - Pathway enrichment
Biological Interpretation
| Interpretation Level | Output | Clinical Use |
|---|---|---|
| Gene | Top contributing genes | Target identification |
| Pathway | Enriched pathways | Mechanism understanding |
| Module | Network subgraphs | Biological insight |
| Hub Genes | Central genes | Druggable targets |
Training Considerations
| Factor | Recommendation | Rationale |
|---|---|---|
| Sample Size | >10,000 | Deep learning needs data |
| Class Balance | Oversample or weight | Avoid bias |
| Validation | Cross-validation | Avoid overfitting |
| Regularization | Dropout, L2 | Generalization |
Special Considerations
- Interpretability Trade-offs: More complex = less interpretable
- Computational Requirements: GPU accelerates training
- Network Quality: PPI accuracy affects results
- Gene Mapping: SNP-to-gene assignment matters
- Overfitting: Regularization essential
Clinical Applications
| Application | PRS-Net Advantage | Benefit |
|---|---|---|
| Risk Stratification | Higher accuracy | Better prediction |
| Biological Insight | Pathway interpretation | Mechanism |
| Drug Targets | Hub gene identification | Therapeutic targets |
| Ancestry Equity | Better portability | Fairer prediction |
Limitations
| Limitation | Impact | Future Direction |
|---|---|---|
| Training Data | EUR-dominated | Diverse cohorts |
| Network Completeness | Missing edges | Multi-network integration |
| Rare Variants | Not well captured | WGS + rare variant methods |
| Clinical Validation | Limited trials | Prospective studies |
Author
AI Group - Biomedical AI Platform
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
vcf-annotator
Annotate VCF variants with VEP, ClinVar, gnomAD frequencies, and ancestry-aware context. Generates prioritised variant reports.
chemist-analyst
Analyzes events through chemistry lens using molecular structure, reaction mechanisms, thermodynamics, kinetics, and analytical techniques (spectroscopy, chromatography, mass spectrometry). Provides insights on chemical processes, material properties, reaction pathways, synthesis, and analytical methods. Use when: Chemical reactions, material analysis, synthesis planning, process optimization, environmental chemistry. Evaluates: Molecular structure, reaction mechanisms, yield, selectivity, safety, environmental impact.
bio-alignment-io
Read, write, and convert multiple sequence alignment files using Biopython Bio.AlignIO. Supports Clustal, PHYLIP, Stockholm, FASTA, Nexus, and other alignment formats for phylogenetics and conservation analysis. Use when reading, writing, or converting alignment file formats.
sleep-analyzer
分析睡眠数据、识别睡眠模式、评估睡眠质量,并提供个性化睡眠改善建议。支持与其他健康数据的关联分析。
metabolomics-workbench-database
Access NIH Metabolomics Workbench via REST API (4,200+ studies). Query metabolites, RefMet nomenclature, MS/NMR data, m/z searches, study metadata, for metabolomics and biomarker discovery.
bio-hi-c-analysis-matrix-operations
Balance, normalize, and transform Hi-C contact matrices using cooler and cooltools. Apply iterative correction (ICE), compute expected values, and generate observed/expected matrices. Use when normalizing or transforming Hi-C matrices.
Didn't find tool you were looking for?