Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

scitix/volcano-scheduler-logs

Name: volcano-scheduler-logs
Author: scitix

skills/core/volcano-scheduler-logs/SKILL.md

npx skillsauth add scitix/siclaw volcano-scheduler-logs

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Volcano Scheduler Logs

Retrieve and analyze Volcano scheduler logs to understand scheduling decisions, failures, and performance issues.

Scope: This skill is for diagnosis only. It retrieves logs for analysis but does not modify any cluster state.

Usage

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh [options]

Parameters

| Parameter | Required | Description | |-----------|----------|-------------| | --keyword KEYWORD | no | Filter logs by keyword (case-insensitive) | | --pod POD | no | Filter logs related to specific pod name | | --since TIME | no | Show logs newer than relative time (e.g., 10m, 1h) | | --lines N | no | Number of lines to show (default: 100) | | --follow | no | Stream logs in real-time (Ctrl+C to stop) | | --previous | no | Show logs from previous container instance (after restart) |

Examples

Get recent scheduler logs:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh

Search for error messages:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --keyword error

Get logs for a specific pod:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --pod my-job-0

Get last 500 lines from the past hour:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --since 1h --lines 500

Stream logs for gang scheduling issues:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --keyword gang --follow

Check logs from previous scheduler instance (after crash/restart):

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --previous --lines 200

Common Keywords for Filtering

| Keyword | Use Case | |---------|----------| | error | Find error messages and failures | | FailedScheduling | Scheduling failures | | allocate | Resource allocation attempts | | gang | Gang scheduling decisions | | minMember | MinMember constraint issues | | preempt | Preemption events | | reclaim | Resource reclamation | | enqueue | Queue admission decisions | | bind | Pod binding attempts | | queue | Queue-related decisions | | proportion | Proportion plugin decisions | | priority | Priority-related decisions |

Understanding Scheduler Logs

Log Format

Volcano scheduler logs typically follow this format:

I0102 15:04:05.123456       1 scheduler.go:123] Starting scheduling session
I0102 15:04:05.234567       1 allocate.go:456] Try to allocate resources for Job <namespace>/<job-name>
E0102 15:04:05.345678       1 gang.go:789] Failed to schedule pod <pod-name>: minMember not satisfied

Log levels:

I - Info: Normal operation information
W - Warning: Unusual but non-fatal conditions
E - Error: Failures and errors
F - Fatal: Critical errors causing shutdown

Common Log Patterns

Session Start

Starting scheduling session
Starting scheduling loop

Indicates scheduler is processing a new batch of pending pods

Enqueue Decisions

Try to enqueue pod group
PodGroup <name> is enqueued
PodGroup <name> is pending

Shows whether pod groups are admitted to the queue

Allocation Attempts

Try to allocate resources for Job
Try to allocate for task

Shows scheduling attempts for specific jobs/pods

Gang Scheduling

minMember not satisfied
gang member not ready
Waiting for gang members

Indicates Gang constraint preventing scheduling

Resource Shortage

Insufficient cpu
Insufficient memory
0 nodes are available

Indicates resource constraint preventing scheduling

Preemption

Preempting pods
Found victim pods

Shows preemption decisions for high-priority workloads

Reclaim

Try to reclaim resources
Reclaiming resources from queue

Shows resource reclamation between queues

Diagnostic Use Cases

Case 1: Pod Stuck in Pending

Find relevant scheduler decisions:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --pod <pod-name> --since 30m

Look for:

FailedScheduling events
minMember not satisfied
Insufficient resource messages
enqueue decisions (is the PodGroup being admitted?)

Case 2: Gang Scheduling Issues

Check Gang plugin behavior:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --keyword gang --since 1h

Look for:

minMember related messages
Gang constraint validation
Comparison of running vs required members

Case 3: Queue Resource Issues

Check proportion and reclaim decisions:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --keyword "reclaim\|proportion" --since 30m

Look for:

Queue resource calculations
Reclaim triggers
Over-commit handling

Case 4: Scheduler Performance

Check for scheduling delays:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --lines 500 | grep -E "(Starting|Finished) scheduling"

Look for:

Long gaps between "Starting" and "Finished"
High frequency of scheduling loops
Errors causing retries

Case 5: Preemption Analysis

Check preemption decisions:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --keyword preempt --since 1h

Look for:

Which pods are being preempted
Priority comparisons
Preemption success/failure

Environment Variables

| Variable | Default | Description | |----------|---------|-------------| | VOLCANO_SCHEDULER_NS | volcano-system | Scheduler namespace | | VOLCANO_SCHEDULER_LABEL | app=volcano-scheduler | Label selector for scheduler pods |

Limitations

Log retention: Logs may be rotated based on cluster configuration
Multi-scheduler: If running multiple schedulers, logs will be interleaved
Log level: Default log level may not show all debug information
Previous logs: --previous only works if the container has restarted

Tips for Effective Log Analysis

Use time ranges: Narrow down with --since to focus on recent issues
Combine keywords: Search for error\|Failed\|failed to catch all failures
Check pod context: Always include --pod when investigating specific pods
Look for patterns: Repeating errors may indicate systemic issues
Correlate with events: Compare with kubectl get events timestamps

scitix/volcano-scheduler-logs

skills/core/volcano-scheduler-logs/SKILL.md

Retrieve and analyze Volcano scheduler logs. Filter by keyword, time range, or pod name to debug scheduling decisions.

88 stars

development

Updated Apr 12, 2026

$ install --global

skillsauth

npx skillsauth add scitix/siclaw volcano-scheduler-logs

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 12, 2026, 11:21 PM69.9s2 files scanned

SKILL.md

name:: volcano-scheduler-logs
description:: >-

Volcano Scheduler Logs

Retrieve and analyze Volcano scheduler logs to understand scheduling decisions, failures, and performance issues.

Scope: This skill is for diagnosis only. It retrieves logs for analysis but does not modify any cluster state.

Usage

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh [options]

Parameters

Examples

Get recent scheduler logs:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh

Search for error messages:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --keyword error

Get logs for a specific pod:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --pod my-job-0

Get last 500 lines from the past hour:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --since 1h --lines 500

Stream logs for gang scheduling issues:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --keyword gang --follow

Check logs from previous scheduler instance (after crash/restart):

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --previous --lines 200

Common Keywords for Filtering

Understanding Scheduler Logs

Log Format

Volcano scheduler logs typically follow this format:

I0102 15:04:05.123456       1 scheduler.go:123] Starting scheduling session
I0102 15:04:05.234567       1 allocate.go:456] Try to allocate resources for Job <namespace>/<job-name>
E0102 15:04:05.345678       1 gang.go:789] Failed to schedule pod <pod-name>: minMember not satisfied

Log levels:

I - Info: Normal operation information
W - Warning: Unusual but non-fatal conditions
E - Error: Failures and errors
F - Fatal: Critical errors causing shutdown

Common Log Patterns

Session Start

Starting scheduling session
Starting scheduling loop

Indicates scheduler is processing a new batch of pending pods

Enqueue Decisions

Try to enqueue pod group
PodGroup <name> is enqueued
PodGroup <name> is pending

Shows whether pod groups are admitted to the queue

Allocation Attempts

Try to allocate resources for Job
Try to allocate for task

Shows scheduling attempts for specific jobs/pods

Gang Scheduling

minMember not satisfied
gang member not ready
Waiting for gang members

Indicates Gang constraint preventing scheduling

Resource Shortage

Insufficient cpu
Insufficient memory
0 nodes are available

Indicates resource constraint preventing scheduling

Preemption

Preempting pods
Found victim pods

Shows preemption decisions for high-priority workloads

Reclaim

Try to reclaim resources
Reclaiming resources from queue

Shows resource reclamation between queues

Diagnostic Use Cases

Case 1: Pod Stuck in Pending

Find relevant scheduler decisions:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --pod <pod-name> --since 30m

Look for:

FailedScheduling events
minMember not satisfied
Insufficient resource messages
enqueue decisions (is the PodGroup being admitted?)

Case 2: Gang Scheduling Issues

Check Gang plugin behavior:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --keyword gang --since 1h

Look for:

minMember related messages
Gang constraint validation
Comparison of running vs required members

Case 3: Queue Resource Issues

Check proportion and reclaim decisions:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --keyword "reclaim\|proportion" --since 30m

Look for:

Queue resource calculations
Reclaim triggers
Over-commit handling

Case 4: Scheduler Performance

Check for scheduling delays:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --lines 500 | grep -E "(Starting|Finished) scheduling"

Look for:

Long gaps between "Starting" and "Finished"
High frequency of scheduling loops
Errors causing retries

Case 5: Preemption Analysis

Check preemption decisions:

bash skills/core/volcano-scheduler-logs/scripts/get-scheduler-logs.sh --keyword preempt --since 1h

Look for:

Which pods are being preempted
Priority comparisons
Preemption success/failure

Environment Variables

Limitations

Log retention: Logs may be rotated based on cluster configuration
Multi-scheduler: If running multiple schedulers, logs will be interleaved
Log level: Default log level may not show all debug information
Previous logs: --previous only works if the container has restarted

Tips for Effective Log Analysis

Use time ranges: Narrow down with --since to focus on recent issues
Combine keywords: Search for error\|Failed\|failed to catch all failures
Check pod context: Always include --pod when investigating specific pods
Look for patterns: Repeating errors may indicate systemic issues
Correlate with events: Compare with kubectl get events timestamps

Related Skills

scitix/gateway-diagnostics

testing

VerifiedTrustedCommunity

Show and ping the gateway of a network interface, on a Kubernetes node or inside a pod's network namespace. Auto-detects the gateway from the routing table (ip -j route), reports interface type (RoCE / Ethernet / IB), and tests reachability with ping. Use for default-route / gateway questions, network reachability checks, RoCE/RDMA data-path validation, and "can this node/pod reach its gateway" investigations.

209SKILL.mdUpdated Jun 7, 2026

scitix/gateway-diagnostics

scitix/skill-authoring

development

VerifiedTrustedCommunity

Guide for writing and improving Siclaw skills. Read this when creating or modifying a skill. Covers skill directory layout, SKILL.md format, script execution modes, and best practices.

209SKILL.mdUpdated Apr 23, 2026

scitix/skill-authoring

scitix/node-logs

devops

VerifiedTrustedCommunity

Retrieve logs from a Kubernetes node. Supports journalctl (systemd units) and file-based logs. Use when you need to inspect node-level logs (containerd, kubelet, etc.). Run via host_script (preferred) or node_script.

209SKILL.mdUpdated Apr 12, 2026

scitix/manage-skill

development

VerifiedTrustedCommunity

Guides the user to the Siclaw Web page to manage Skills. Use this guide when the user requests to create, edit, or view a Skill in a Channel conversation.

88SKILL.mdUpdated Apr 12, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/scitix/siclaw.git

# Copy into Claude Code skills folder (global)
cp -r siclaw/skills/core/volcano-scheduler-logs ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

scitix/siclaw

88 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

scitix/volcano-scheduler-logs

$ install --global

Security Scan Results

SKILL.md

Volcano Scheduler Logs

Usage

Parameters

Examples

Common Keywords for Filtering

Understanding Scheduler Logs

Log Format

Common Log Patterns

Session Start

Enqueue Decisions

Allocation Attempts

Gang Scheduling

Resource Shortage

Preemption

Reclaim

Diagnostic Use Cases

Case 1: Pod Stuck in Pending

Case 2: Gang Scheduling Issues

Case 3: Queue Resource Issues

Case 4: Scheduler Performance

Case 5: Preemption Analysis

Environment Variables

Limitations

Tips for Effective Log Analysis

See Also

Related Skills

scitix/gateway-diagnostics

scitix/skill-authoring

scitix/node-logs

scitix/manage-skill

scitix/volcano-scheduler-logs

$ install --global

Security Scan Results

SKILL.md

Volcano Scheduler Logs

Usage

Parameters

Examples

Common Keywords for Filtering

Understanding Scheduler Logs

Log Format

Common Log Patterns

Session Start

Enqueue Decisions

Allocation Attempts

Gang Scheduling

Resource Shortage

Preemption

Reclaim

Diagnostic Use Cases

Case 1: Pod Stuck in Pending

Case 2: Gang Scheduling Issues

Case 3: Queue Resource Issues

Case 4: Scheduler Performance

Case 5: Preemption Analysis

Environment Variables

Limitations

Tips for Effective Log Analysis

See Also

Related Skills

scitix/gateway-diagnostics

scitix/skill-authoring

scitix/node-logs

scitix/manage-skill