packages/skills/skills/huggingface-model-trainer/SKILL.md
# Hugging Face Model Trainer Train language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. ## Prerequisites - HF_TOKEN environment variable with write permissions - Hugging Face Pro/Team/Enterprise plan - Dataset on Hub or loadable via datasets library ## Instructions ### Training Methods - **SFT** - Supervised Fine-Tuning for instruction tuning - **DPO** - Direct Preference Optimization from preference data - **GRPO** - Group Relative Policy Opt
npx skillsauth add mediar-ai/skillhubz packages/skills/skills/huggingface-model-trainerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Train language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure.
hf_jobs("uv", {
"script": """
# /// script
# dependencies = ["trl>=0.12.0", "peft>=0.7.0", "trackio"]
# ///
from datasets import load_dataset
from peft import LoraConfig
from trl import SFTTrainer, SFTConfig
dataset = load_dataset("trl-lib/Capybara", split="train")
trainer = SFTTrainer(
model="Qwen/Qwen2.5-0.5B",
train_dataset=dataset,
peft_config=LoraConfig(r=16, lora_alpha=32),
args=SFTConfig(
output_dir="my-model",
push_to_hub=True,
hub_model_id="username/my-model",
num_train_epochs=3,
report_to="trackio",
)
)
trainer.train()
trainer.push_to_hub()
""",
"flavor": "a10g-large",
"timeout": "2h",
"secrets": {"HF_TOKEN": "$HF_TOKEN"}
})
| Model Size | Hardware | Cost/hr | |------------|----------|---------| | <1B params | t4-small | ~$0.75 | | 1-3B | t4-medium | ~$1.50 | | 3-7B | a10g-large | ~$5.00 | | 7-13B | a100-large | ~$10.00 |
push_to_hub=True and hub_model_idsecrets={"HF_TOKEN": "$HF_TOKEN"}Validate format before training to prevent failures:
uv run scripts/dataset_inspector.py \
--dataset "username/dataset-name" \
--split "train"
Convert trained models for local inference (Ollama, LM Studio):
hf_jobs("uv", {
"script": "<conversion script>",
"flavor": "a10g-large",
"timeout": "45m",
"env": {
"ADAPTER_MODEL": "username/my-model",
"BASE_MODEL": "Qwen/Qwen2.5-0.5B",
"OUTPUT_REPO": "username/my-model-gguf"
}
})
hf_jobs("logs", {"job_id": "..."})Source: huggingface/skills
tools
# X Twitter Scraper Use Xquik for X/Twitter tweet search, user lookup, profile tweets, follower export, media download, monitors, webhooks, posting workflows, and MCP-backed API exploration. ## Prerequisites - A Xquik API key in `XQUIK_API_KEY`. - Internet access to `https://xquik.com/api/v1`, `https://xquik.com/mcp`, and `https://docs.xquik.com`. - A clear user request that identifies the target tweets, users, accounts, keywords, media, monitor, webhook, or write action. ## Source Truth -
tools
Use when the user says "mk0r", "appmaker CLI", "open a VM", "run something in the sandbox", "talk to the VM agent", "spin up an E2B sandbox", or "chat with appmaker from CLI." Wraps the `mk0r` CLI to list projects, exec commands inside their E2B sandboxes, stream chat with the VM agent (same `/api/chat` the web UI uses), toggle SOAX residential IP, manage schedules, and copy files. Supports a sticky default project via `mk0r projects use`.
testing
Use when the user mentions "influencer candidates", "social media operator", "check proposals on Upwork/Fiverr", "review influencer applications", "qualify candidates", or "reach out to operators". Manages the IG/TikTok account operator hiring pipeline — review applicants, check replies, qualify, and do proactive outreach.
tools
End-to-end newsletter pipeline: investigate recent features, draft, send via API endpoint, and track delivery/open/click metrics.