acestep

⚠Review·Scanned 2/17/2026

The skill is a CLI wrapper and documentation for the ACE-Step music API using scripts/acestep.sh to generate and download music from http://127.0.0.1:8001 (or a user-supplied API URL). It executes shell commands (installers), performs network requests to ${api_url} endpoints, and reads/writes scripts/config.json including an api_key.

from clawhub.ai·v4ec6c1a·50.2 KB·0 installs

Scanned from 1.0.0 at 4ec6c1a · Transparency log ↗

$ vett add clawhub.ai/dumoedss/acestepReview findings below

ACE-Step Music Generation Skill

Use ACE-Step V1.5 API for music generation. Script: scripts/acestep.sh (requires curl + jq).

Prerequisites - ACE-Step API Service

IMPORTANT: This skill requires the ACE-Step API server to be running.

Required Dependencies

The scripts/acestep.sh script requires the following tools:

1. curl - For making HTTP requests to the API 2. jq - For parsing JSON responses

Check Dependencies

Before using this skill, verify that the required tools are installed:

# Check curl
curl --version

# Check jq
jq --version

Installing jq

If jq is not installed, the script will attempt to install it automatically. If automatic installation fails, install manually:

Windows:

# Using Chocolatey
choco install jq

# Or download from: https://jqlang.github.io/jq/download/
# Extract jq.exe and add to PATH

macOS:

# Using Homebrew
brew install jq

# Using MacPorts
port install jq

Linux:

# Debian/Ubuntu
sudo apt-get install jq

# Fedora/RHEL/CentOS
sudo yum install jq
# or
sudo dnf install jq

# Arch Linux
sudo pacman -S jq

Verification:

jq --version
# Should output: jq-1.x

If user reports jq installation issues, guide them through manual installation for their platform.

Before First Use

Ask the user about their setup:

"Do you have ACE-Step API service configured and running?"

If YES:
- Verify the API endpoint: curl -s http://127.0.0.1:8001/health
- If using remote service, ask for the API URL and update scripts/config.json
- Proceed with music generation
If NO or NOT SURE:
- Ask: "Do you have ACE-Step installed?"
  
  If installed but not running:
  - Use the acestep-docs skill to help them start the service
  - Guide them through startup process
  If not installed:
  - Offer to help download and install ACE-Step
  - Ask: "Would you like to use the Windows portable package or install from source?"
  - Use acestep-docs skill to guide through installation

Service Configuration

Local Service (Default):

{
  "api_url": "http://127.0.0.1:8001",
  "api_key": ""
}

Remote Service:

{
  "api_url": "http://your-server-ip:8001",
  "api_key": "your-api-key-if-needed"
}

To configure remote service, update scripts/config.json or use:

cd {skill_directory}/scripts/
./acestep.sh config --set api_url "http://remote-server:8001"
./acestep.sh config --set api_key "your-key"

Using acestep-docs Skill for Setup Help

IMPORTANT: For installation and startup, always use the acestep-docs skill to get complete and accurate guidance.

When user needs help with installation or startup, invoke the acestep-docs skill:

Use the Skill tool to invoke: acestep-docs

DO NOT provide simplified startup commands - each user's environment may be different. Always guide them to use acestep-docs for proper setup.

Health Check

To verify if service is running:

curl http://127.0.0.1:8001/health
# Should return: {"status":"ok",...}

If health check fails, use acestep-docs skill to help user start the service properly.

WORKFLOW: For user requests requiring vocals, you should:

Consult Music Creation Guide for lyrics writing, caption creation, duration/BPM/key selection
Write complete, well-structured lyrics yourself based on the guide
Generate using Caption mode with -c and -l parameters

Only use Simple/Random mode (-d or random) for quick inspiration or instrumental exploration.

Output Files

After generation, the script automatically saves results to the acestep_output folder in the project root (same level as .claude):

project_root/
├── .claude/
│   └── skills/acestep/...
├── acestep_output/          # Output directory
│   ├── <job_id>.json         # Complete task result (JSON)
│   ├── <job_id>_1.mp3        # First audio file
│   ├── <job_id>_2.mp3        # Second audio file (if batch_size > 1)
│   └── ...
└── ...

JSON Result Structure

Important: When LM enhancement is enabled (use_format=true), the final synthesized content may differ from your input. Check the JSON file for actual values:

Field	Description
`prompt`	Actual caption used for synthesis (may be LM-enhanced)
`lyrics`	Actual lyrics used for synthesis (may be LM-enhanced)
`metas.prompt`	Original input caption
`metas.lyrics`	Original input lyrics
`metas.bpm`	BPM used
`metas.keyscale`	Key scale used
`metas.duration`	Duration in seconds
`generation_info`	Detailed timing and model info
`seed_value`	Seeds used (for reproducibility)
`lm_model`	LM model name
`dit_model`	DiT model name

To get the actual synthesized lyrics, parse the JSON and read the top-level lyrics field, not metas.lyrics.

Script Commands

CRITICAL - Complete Lyrics Input: When providing lyrics via the -l parameter, you MUST pass ALL lyrics content WITHOUT any omission:

If user provides lyrics, pass the ENTIRE text they give you
If you generate lyrics yourself, pass the COMPLETE lyrics you created
NEVER truncate, shorten, or pass only partial lyrics
Missing lyrics will result in incomplete or incoherent songs

Music Parameters: Refer to Music Creation Guide for how to calculate duration, choose BPM, key scale, and time signature.

# need to cd skills path
cd {project_root}/{.claude or .codex}/skills/acestep/

# Caption mode - RECOMMENDED: Write lyrics first, then generate
./scripts/acestep.sh generate -c "Electronic pop, energetic synths" -l "[Verse] Your complete lyrics
[Chorus] Full chorus here..." --duration 120 --bpm 128

# Instrumental only
./scripts/acestep.sh generate "Jazz with saxophone"

# Quick exploration (Simple/Random mode)
./scripts/acestep.sh generate -d "A cheerful song about spring"
./scripts/acestep.sh random

# Options
./scripts/acestep.sh generate "Rock" --duration 60 --batch 2
./scripts/acestep.sh generate "EDM" --no-thinking    # Faster

# Other commands
./scripts/acestep.sh status <job_id>
./scripts/acestep.sh health
./scripts/acestep.sh models

Configuration

Important: Configuration follows this priority (high to low):

Command line arguments > config.json defaults
User-specified parameters temporarily override defaults but do not modify config.json
Only config --set command permanently modifies config.json

Default Config File (`scripts/config.json`)

{
  "api_url": "http://127.0.0.1:8001",
  "api_key": "",
  "generation": {
    "thinking": true,
    "use_format": false,
    "use_cot_caption": true,
    "use_cot_language": false,
    "batch_size": 1,
    "audio_format": "mp3",
    "vocal_language": "en"
  }
}

Option	Default	Description
`api_url`	`http://127.0.0.1:8001`	API server address
`api_key`	`""`	API authentication key (optional)
`generation.thinking`	`true`	Enable 5Hz LM (higher quality, slower)
`generation.audio_format`	`mp3`	Output format (mp3/wav/flac)
`generation.vocal_language`	`en`	Vocal language

API Reference

All responses wrapped: {"data": <payload>, "code": 200, "error": null, "timestamp": ...}

Endpoint	Method	Description
`/health`	GET	Health check
`/release_task`	POST	Create generation task
`/query_result`	POST	Query task status, body: `{"task_id_list": ["id"]}`
`/v1/models`	GET	List available models
`/v1/audio?path={path}`	GET	Download audio file

Query Result Response

{
  "data": [{
    "task_id": "xxx",
    "status": 1,
    "result": "[{\"file\":\"/v1/audio?path=...\",\"metas\":{\"bpm\":120,\"duration\":60,\"keyscale\":\"C Major\"}}]"
  }]
}

Status codes: 0 = processing, 1 = success, 2 = failed

Request Parameters (`/release_task`)

Parameters can be placed in param_obj object.

Generation Modes

Mode	Usage	When to Use
Caption (Recommended)	`generate -c "style" -l "lyrics"`	For vocal songs - write lyrics yourself first
Simple	`generate -d "description"`	Quick exploration, LM generates everything
Random	`random`	Random generation for inspiration

Core Parameters

Parameter	Type	Default	Description
`prompt`	string	""	Music style description (Caption mode)
`lyrics`	string	""	Full lyrics content - Pass ALL lyrics without omission. Use `[inst]` for instrumental. Partial/truncated lyrics = incomplete songs
`sample_mode`	bool	false	Enable Simple/Random mode
`sample_query`	string	""	Description for Simple mode
`thinking`	bool	false	Enable 5Hz LM for audio code generation
`use_format`	bool	false	Use LM to enhance caption/lyrics
`model`	string	-	DiT model name
`batch_size`	int	1	Number of audio files to generate

Music Attributes

Parameter	Type	Default	Description
`audio_duration`	float	-	Duration in seconds
`bpm`	int	-	Tempo (beats per minute)
`key_scale`	string	""	Key (e.g. "C Major")
`time_signature`	string	""	Time signature (e.g. "4/4")
`vocal_language`	string	"en"	Language code (en, zh, ja, etc.)
`audio_format`	string	"mp3"	Output format (mp3/wav/flac)

Generation Parameters

Parameter	Type	Default	Description
`inference_steps`	int	8	Diffusion steps
`guidance_scale`	float	7.0	CFG scale
`seed`	int	-1	Random seed (-1 for random)
`infer_method`	string	"ode"	Diffusion method (ode/sde)

Audio Task Parameters

Parameter	Type	Default	Description
`task_type`	string	"text2music"	text2music / continuation / repainting
`src_audio_path`	string	-	Source audio for continuation
`repainting_start`	float	0.0	Repainting start position (seconds)
`repainting_end`	float	-	Repainting end position (seconds)

Example Request (Simple Mode)

{
  "sample_mode": true,
  "sample_query": "A cheerful pop song about spring",
  "thinking": true,
  "param_obj": {
    "duration": 60,
    "bpm": 120,
    "language": "en"
  },
  "batch_size": 2
}