Instructions to use demegire/personaplex-finetune-pharma with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Moshi
How to use demegire/personaplex-finetune-pharma with Moshi:
# pip install moshi # Run the interactive web server python -m moshi.server --hf-repo "demegire/personaplex-finetune-pharma" # Then open https://localhost:8998 in your browser
# pip install moshi import torch from moshi.models import loaders # Load checkpoint info from HuggingFace checkpoint = loaders.CheckpointInfo.from_hf_repo("demegire/personaplex-finetune-pharma") # Load the Mimi audio codec mimi = checkpoint.get_mimi(device="cuda") mimi.set_num_codebooks(8) # Encode audio (24kHz, mono) wav = torch.randn(1, 1, 24000 * 10) # [batch, channels, samples] with torch.no_grad(): codes = mimi.encode(wav.cuda()) decoded = mimi.decode(codes) - Notebooks
- Google Colab
- Kaggle
PersonaPlex Finetune — Pharma Adherence
Experimental research artifact. Not intended for production or clinical use. See Limitations below.
This repository contains the artifacts from an example finetuning run:
a LoRA finetune of nvidia/personaplex-7b-v1
on synthetic pharma adherence (patient-support) dialogues.
- Recipe and code: https://github.com/emotion-machine-org/personaplex-finetune
- Base model:
nvidia/personaplex-7b-v1(7B params,dep_q=16, voice + role conditioning over Moshi) - Run name:
adhery-v2-21 - Trained: April 2026, 1024 steps, batch 8, duration 80 s/sample
- Adapter: LoRA rank 64, scaling 2.0 (skip depformer, ~5.6% params trainable)
The full model card describing the recipe lives in the GitHub repo at
MODEL_CARD.md;
this README adapts that card to the specifics of this run.
What this is
A voice-native streaming language model finetuned to follow
patient-support / medication-adherence call structure: greeting,
medication check-in, side-effect probe, adherence nudge, close. Trained
with mid-conversation context injection (the puppeteer mechanism) so an
external system can inject talking points during the call.
Run artifacts included:
merged_step448/model.safetensors— merged checkpoint at step 448 (best so far on script-adherence eval)checkpoints/checkpoint_000064..000768/— LoRA adapter snapshots every 64 stepsargs.yaml— full training configmetrics.train.jsonl,metrics.eval.jsonl— train/eval loss curveswandb/,tb/— Weights & Biases and TensorBoard run logsgen_eval/step_000064..000768/— generation eval results per checkpointgemini_eval_step448/— Gemini judge transcripts + scoring at step 448
Training data
Synthetic patient-support dialogues (adhery-v2 dataset, ~2k samples,
mean duration 296 s) generated with Claude and rendered to speech with
VibeVoice 7B, aligned with
WhisperX. Each sample carries a text_prompt, voice_prompt, and
context_injections (frame-offset talking points).
The data is not redistributed here. The pipeline in
pipeline/
regenerates equivalent data from public sources.
Intended use
- Research on voice-native patient-support agents and mid-call context injection.
- Reproducing the pharma branch of the experiments documented in
docs/history/notes/combined_experiment_report.md.
Out of scope / Limitations
This model is experimental and was trained for research. Specifically:
- Not for production deployment of any kind, including any patient-facing context.
- Not medical advice. The model is not aligned for clinical correctness; it can and will hallucinate medication names, dosages, schedules, and side effects.
- English-only, biased toward American English (VibeVoice voice library).
- No red-team / jailbreak evaluation.
- No HIPAA / regulatory review.
- No human-subject testing. All eval is synthetic-judge based (Claude / Gemini).
- Synthetic training data is fluent but stylized; out-of-distribution prompts will surface that distribution shift.
- Voice cloning capability inherits from PersonaPlex. Follow consent norms.
Evaluation
- Generation eval (every 64 steps, in
gen_eval/): held-out prompts, 30 s generations, Claude-judged 1–5 on naturalness, accuracy, and script adherence. - Gemini eval at step 448 (
gemini_eval_step448/): larger held-out set with Gemini as judge. Seereviews.jsonfor per-prompt scoring.
Eval is judge-based and not a substitute for human review.
License
- Adapter weights: NVIDIA Open Model License (inherited from
nvidia/personaplex-7b-v1). - Code in the linked GitHub repo: MIT.
Citation
@software{personaplex_finetune_pharma,
title = {PersonaPlex Finetune — Pharma Adherence},
author = {emotion-machine-org},
year = {2026},
url = {https://github.com/emotion-machine-org/personaplex-finetune}
}
Underlying base models:
- Moshi: Défossez et al., arXiv:2410.00037
- PersonaPlex: NVIDIA, model card
- Downloads last month
- -