Claude Code Skills
DataGenFlow includes built-in Claude Code skills that provide step-by-step guidance for common workflows. Skills activate automatically when Claude Code detects a matching task.
Table of Contents
Available Skills
| Skill | Triggers When You… |
|---|---|
creating-pipeline-templates | Create or modify YAML templates and seed files in lib/templates/ |
testing-pipeline-templates | Test a pipeline end-to-end, analyze results, or iterate on quality |
configuring-models | Set up LLM or embedding models (OpenAI, Anthropic, Gemini, Ollama) |
implementing-datagenflow-blocks | Create or modify pipeline blocks in lib/blocks/builtin/ |
debugging-pipelines | Troubleshoot pipeline failures or unexpected output |
writing-e2e-tests | Write Playwright e2e tests for the UI |
How Skills Work
Skills live in .claude/skills/ as SKILL.md files. When you describe a task in Claude Code, the matching skill loads automatically and guides the session with:
- Workflows — step-by-step procedures for the task
- Code patterns — correct usage of DataGenFlow APIs and conventions
- Checklists — verification steps before considering the task done
- Common mistakes — known pitfalls and their fixes
No manual activation needed. Just describe what you want to do.
Skill Reference
Creating Pipeline Templates
Location: .claude/skills/creating-pipeline-templates/
Guides through creating YAML pipeline definitions and seed files. Covers:
- Template YAML format and block configuration
- Available blocks (14 total) with inputs/outputs
- Common pipeline patterns (generation, validation, augmentation)
- Seed file format (JSON and Markdown)
- Validation via TemplateRegistry
Example trigger: “Create a pipeline template for generating product reviews”
Testing Pipeline Templates
Location: .claude/skills/testing-pipeline-templates/
Three-phase testing process for pipelines:
- Dry run — single-seed execution, trace analysis
- Small batch — 3-5 seeds, consistency and diversity check
- Quality iteration — identify weak blocks, adjust prompts/config, re-test
Example trigger: “Test the json_generation template end to end”
Configuring Models
Location: .claude/skills/configuring-models/
Provider-specific setup for LLM and embedding models:
- OpenAI — no endpoint needed,
sk-...API key - Anthropic — no endpoint needed,
sk-ant-...API key, no embeddings - Gemini — no endpoint needed, API key from AI Studio
- Ollama — endpoint required (
http://localhost:11434), no API key
Includes connection testing, model resolution order, and troubleshooting.
Example trigger: “Set up Ollama as my LLM provider”
Implementing Blocks
Location: .claude/skills/implementing-datagenflow-blocks/
Complete reference for creating pipeline blocks:
- Block structure (
BaseBlock,BaseMultiplierBlock) - UI integration patterns (model dropdowns, enum selectors, template editors)
- LLM and embedding integration via
llm_config_manager - State management and trace_id-keyed caching
- Unit testing patterns
Example trigger: “Create a new block that summarizes text”
Debugging Pipelines
Location: .claude/skills/debugging-pipelines/
Four-phase systematic debugging:
- Observe — gather evidence, read full errors
- Trace — follow data flow through blocks
- Analyze — form testable hypothesis
- Fix — minimal targeted change, verify
Example trigger: “My pipeline output has unexpected fields”
Writing E2E Tests
Location: .claude/skills/writing-e2e-tests/
Playwright test patterns for the DataGenFlow UI:
- Test file template with sync API and cleanup fixtures
- Navigation, file upload, and modal interaction patterns
- Common UI selectors for all pages
- Integration with
run_all_tests.sh
Example trigger: “Write e2e tests for the settings page”