SKILL.md

name:: deepcode-agentic-coding
title:: DeepCode: Open Agentic Coding
version:: 0.0.2
engine:: skillxiv-v0.0.2-claude-opus-4.6
license:: MIT
url:: https://arxiv.org/abs/2512.07921
keywords:: [code generation, agent systems, paper-to-code, repository synthesis, agentic reasoning]
description:: Transform research specifications into production-grade codebases through strategic information management and autonomous agent orchestration. DeepCode surpasses PhD experts and commercial tools—critical when you need scientific code reproducibility at scale.

Overview

DeepCode treats repository synthesis as an optimization problem, strategically managing information flow to maximize relevant signals within finite context windows. The fully autonomous system orchestrates four complementary operations for code generation, transforming detailed specifications (like scientific papers) into functional, production-quality implementations.

When to Use

Converting academic papers into working code implementations
Scientific reproducibility and code generation from research
Large codebases requiring complex specification management
Scenarios needing to navigate context length limitations strategically
Applications where code quality rivals human expert work
Multi-file, complex project generation

When NOT to Use

Simple utility functions or scripts
Tasks with minimal context requirements
Projects requiring domain expertise beyond code structure
Real-time code generation with strict latency requirements
Specifications that are vague or incomplete

Core Technique

Strategic information orchestration through four complementary operations:

# Agentic code generation system
class DeepCodeAgent:
    def __init__(self, llm_model, context_window=8000):
        self.model = llm_model
        self.context_limit = context_window
        self.operations = {
            'blueprint_distillation': BlueprintDistillation(),
            'code_memory': StatefulCodeMemory(),
            'retrieval_augmented': RetrievalAugmentedGeneration(),
            'error_correction': ClosedLoopErrorCorrection()
        }

    def synthesize_repository(self, specification):
        """
        Transform specification into complete codebase.
        Manages information flow within context constraints.
        """
        # Stage 1: Blueprint distillation
        blueprint = self.distill_blueprint_from_specification(specification)

        # Stage 2: Initialize code memory
        code_memory = self.initialize_code_memory(blueprint)

        # Stage 3: Orchestrate generation with RAG
        generated_files = self.generate_with_rag(blueprint, code_memory)

        # Stage 4: Iterative error correction
        corrected_files = self.correct_and_refine(
            generated_files,
            blueprint,
            code_memory
        )

        return corrected_files

    def distill_blueprint_from_specification(self, spec):
        """
        Compress specification to essential architecture information.
        Source compression maximizes relevant signals in context.
        """
        # Extract key components from specification
        components = self.extract_components(spec)
        dependencies = self.extract_dependencies(spec)
        interfaces = self.extract_interfaces(spec)
        algorithms = self.extract_algorithms(spec)

        # Create compressed blueprint
        blueprint = {
            'components': components,
            'dependencies': dependencies,
            'interfaces': interfaces,
            'algorithms': algorithms,
            'estimated_context': self.estimate_total_context(spec)
        }

        # Truncate specification to fit context
        compressed_spec = self.compress_specification(
            spec,
            blueprint,
            self.context_limit
        )

        blueprint['compressed_spec'] = compressed_spec
        return blueprint

    def extract_components(self, specification):
        """
        Identify major software components from specification.
        """
        # Parse specification to find modules, classes, functions
        components = []
        prompt = f"""
        Identify major software components in:
        {specification[:2000]}

        Return: component names, responsibilities, dependencies
        """

        components_text = self.model.generate(prompt)
        return self.parse_components(components_text)

    def initialize_code_memory(self, blueprint):
        """
        Create structured memory for generated code.
        Enables efficient retrieval and cross-file references.
        """
        code_memory = StatefulCodeMemory()

        for component in blueprint['components']:
            # Initialize memory entry for each component
            code_memory.add_component(
                name=component['name'],
                purpose=component['purpose'],
                dependencies=component['dependencies']
            )

        # Register interfaces
        for interface in blueprint['interfaces']:
            code_memory.add_interface(
                name=interface['name'],
                signature=interface['signature']
            )

        return code_memory

    def generate_with_rag(self, blueprint, code_memory):
        """
        Retrieval-augmented generation strategically injects knowledge.
        Retrieves relevant code context when generating new sections.
        """
        generated_files = {}

        for component in blueprint['components']:
            # Retrieve relevant code context
            relevant_context = code_memory.retrieve(
                query=component['purpose'],
                top_k=3
            )

            # Construct generation prompt
            prompt = self.construct_generation_prompt(
                component=component,
                blueprint=blueprint,
                relevant_context=relevant_context,
                existing_code=generated_files
            )

            # Generate code for component
            generated_code = self.model.generate(prompt)

            # Validate and store
            generated_files[component['name']] = generated_code

            # Update memory with generated code
            code_memory.add_code(
                component['name'],
                generated_code
            )

        return generated_files

    def construct_generation_prompt(self, component, blueprint, relevant_context, existing_code):
        """
        Carefully craft prompt to maximize context usage.
        """
        prompt_parts = []

        # Part 1: Component specification
        prompt_parts.append(f"""
        Generate code for component: {component['name']}
        Purpose: {component['purpose']}
        """)

        # Part 2: Relevant context from code memory
        if relevant_context:
            prompt_parts.append(f"""
        Reference existing implementations:
        {relevant_context}
        """)

        # Part 3: Interface requirements
        prompt_parts.append(f"""
        Required interfaces:
        {self.format_interfaces(component['dependencies'])}
        """)

        # Part 4: Existing code to maintain consistency
        if existing_code:
            prompt_parts.append(f"""
        Maintain compatibility with existing code:
        {self.summarize_existing_code(existing_code)}
        """)

        prompt = "\n".join(prompt_parts)

        # Ensure prompt fits within context
        if len(prompt) > self.context_limit:
            prompt = prompt[:self.context_limit]

        return prompt

    def correct_and_refine(self, generated_files, blueprint, code_memory):
        """
        Closed-loop error correction refines outputs iteratively.
        """
        corrected_files = generated_files.copy()
        max_iterations = 3

        for iteration in range(max_iterations):
            # Test generated code
            errors = self.test_files(corrected_files)

            if not errors:
                break  # No errors, done

            # Generate corrections
            for file_name, error_list in errors.items():
                correction_prompt = f"""
                Fix errors in {file_name}:
                Current code:
                {corrected_files[file_name][:1000]}

                Errors:
                {self._format_errors(error_list)}

                Provide corrected code.
                """

                corrected_code = self.model.generate(correction_prompt)
                corrected_files[file_name] = corrected_code

                # Update code memory
                code_memory.update_code(file_name, corrected_code)

        return corrected_files

    def test_files(self, generated_files):
        """
        Execute and validate generated code.
        Identify errors for correction.
        """
        errors = {}

        for file_name, code in generated_files.items():
            # Attempt to execute and check for syntax/runtime errors
            exec_errors = self.execute_and_validate(code)

            if exec_errors:
                errors[file_name] = exec_errors

        return errors

    def compress_specification(self, spec, blueprint, context_limit):
        """
        Compress specification to fit within context window.
        Keep essential information, remove redundancy.
        """
        # Priority: algorithms > interfaces > examples
        essential_spec = ""

        # Add algorithms (highest priority)
        for algo in blueprint['algorithms']:
            essential_spec += f"\n{algo['description']}"

        # Add interfaces
        for interface in blueprint['interfaces']:
            essential_spec += f"\n{interface['signature']}"

        # Truncate if still too large
        if len(essential_spec) > context_limit:
            essential_spec = essential_spec[:context_limit]

        return essential_spec

The framework strategically optimizes information flow, achieving production-grade code quality comparable to PhD-level human experts.

Key Results

Surpasses commercial tools (Cursor, Claude Code) on code quality metrics
PhD-level expert performance on key reproduction metrics
Handles complex, multi-file repository synthesis
Outperforms prior research baselines significantly
PaperBench benchmark demonstrates effectiveness

Implementation Notes

Blueprint distillation compresses specification efficiently
Code memory enables cross-file consistency and references
RAG retrieves relevant context for each component
Closed-loop correction iteratively refines outputs
Context management crucial for large specifications

References

Original paper: https://arxiv.org/abs/2512.07921
Focus: Autonomous code generation from specifications
Domain: Software engineering, agent systems

SKILL.md

name:: deepcode-agentic-coding
title:: DeepCode: Open Agentic Coding
version:: 0.0.2
engine:: skillxiv-v0.0.2-claude-opus-4.6
license:: MIT
url:: https://arxiv.org/abs/2512.07921
keywords:: [code generation, agent systems, paper-to-code, repository synthesis, agentic reasoning]
description:: Transform research specifications into production-grade codebases through strategic information management and autonomous agent orchestration. DeepCode surpasses PhD experts and commercial tools—critical when you need scientific code reproducibility at scale.

Overview

When to Use

Converting academic papers into working code implementations
Scientific reproducibility and code generation from research
Large codebases requiring complex specification management
Scenarios needing to navigate context length limitations strategically
Applications where code quality rivals human expert work
Multi-file, complex project generation

When NOT to Use

Simple utility functions or scripts
Tasks with minimal context requirements
Projects requiring domain expertise beyond code structure
Real-time code generation with strict latency requirements
Specifications that are vague or incomplete

Core Technique

Strategic information orchestration through four complementary operations:

# Agentic code generation system
class DeepCodeAgent:
    def __init__(self, llm_model, context_window=8000):
        self.model = llm_model
        self.context_limit = context_window
        self.operations = {
            'blueprint_distillation': BlueprintDistillation(),
            'code_memory': StatefulCodeMemory(),
            'retrieval_augmented': RetrievalAugmentedGeneration(),
            'error_correction': ClosedLoopErrorCorrection()
        }

    def synthesize_repository(self, specification):
        """
        Transform specification into complete codebase.
        Manages information flow within context constraints.
        """
        # Stage 1: Blueprint distillation
        blueprint = self.distill_blueprint_from_specification(specification)

        # Stage 2: Initialize code memory
        code_memory = self.initialize_code_memory(blueprint)

        # Stage 3: Orchestrate generation with RAG
        generated_files = self.generate_with_rag(blueprint, code_memory)

        # Stage 4: Iterative error correction
        corrected_files = self.correct_and_refine(
            generated_files,
            blueprint,
            code_memory
        )

        return corrected_files

    def distill_blueprint_from_specification(self, spec):
        """
        Compress specification to essential architecture information.
        Source compression maximizes relevant signals in context.
        """
        # Extract key components from specification
        components = self.extract_components(spec)
        dependencies = self.extract_dependencies(spec)
        interfaces = self.extract_interfaces(spec)
        algorithms = self.extract_algorithms(spec)

        # Create compressed blueprint
        blueprint = {
            'components': components,
            'dependencies': dependencies,
            'interfaces': interfaces,
            'algorithms': algorithms,
            'estimated_context': self.estimate_total_context(spec)
        }

        # Truncate specification to fit context
        compressed_spec = self.compress_specification(
            spec,
            blueprint,
            self.context_limit
        )

        blueprint['compressed_spec'] = compressed_spec
        return blueprint

    def extract_components(self, specification):
        """
        Identify major software components from specification.
        """
        # Parse specification to find modules, classes, functions
        components = []
        prompt = f"""
        Identify major software components in:
        {specification[:2000]}

        Return: component names, responsibilities, dependencies
        """

        components_text = self.model.generate(prompt)
        return self.parse_components(components_text)

    def initialize_code_memory(self, blueprint):
        """
        Create structured memory for generated code.
        Enables efficient retrieval and cross-file references.
        """
        code_memory = StatefulCodeMemory()

        for component in blueprint['components']:
            # Initialize memory entry for each component
            code_memory.add_component(
                name=component['name'],
                purpose=component['purpose'],
                dependencies=component['dependencies']
            )

        # Register interfaces
        for interface in blueprint['interfaces']:
            code_memory.add_interface(
                name=interface['name'],
                signature=interface['signature']
            )

        return code_memory

    def generate_with_rag(self, blueprint, code_memory):
        """
        Retrieval-augmented generation strategically injects knowledge.
        Retrieves relevant code context when generating new sections.
        """
        generated_files = {}

        for component in blueprint['components']:
            # Retrieve relevant code context
            relevant_context = code_memory.retrieve(
                query=component['purpose'],
                top_k=3
            )

            # Construct generation prompt
            prompt = self.construct_generation_prompt(
                component=component,
                blueprint=blueprint,
                relevant_context=relevant_context,
                existing_code=generated_files
            )

            # Generate code for component
            generated_code = self.model.generate(prompt)

            # Validate and store
            generated_files[component['name']] = generated_code

            # Update memory with generated code
            code_memory.add_code(
                component['name'],
                generated_code
            )

        return generated_files

    def construct_generation_prompt(self, component, blueprint, relevant_context, existing_code):
        """
        Carefully craft prompt to maximize context usage.
        """
        prompt_parts = []

        # Part 1: Component specification
        prompt_parts.append(f"""
        Generate code for component: {component['name']}
        Purpose: {component['purpose']}
        """)

        # Part 2: Relevant context from code memory
        if relevant_context:
            prompt_parts.append(f"""
        Reference existing implementations:
        {relevant_context}
        """)

        # Part 3: Interface requirements
        prompt_parts.append(f"""
        Required interfaces:
        {self.format_interfaces(component['dependencies'])}
        """)

        # Part 4: Existing code to maintain consistency
        if existing_code:
            prompt_parts.append(f"""
        Maintain compatibility with existing code:
        {self.summarize_existing_code(existing_code)}
        """)

        prompt = "\n".join(prompt_parts)

        # Ensure prompt fits within context
        if len(prompt) > self.context_limit:
            prompt = prompt[:self.context_limit]

        return prompt

    def correct_and_refine(self, generated_files, blueprint, code_memory):
        """
        Closed-loop error correction refines outputs iteratively.
        """
        corrected_files = generated_files.copy()
        max_iterations = 3

        for iteration in range(max_iterations):
            # Test generated code
            errors = self.test_files(corrected_files)

            if not errors:
                break  # No errors, done

            # Generate corrections
            for file_name, error_list in errors.items():
                correction_prompt = f"""
                Fix errors in {file_name}:
                Current code:
                {corrected_files[file_name][:1000]}

                Errors:
                {self._format_errors(error_list)}

                Provide corrected code.
                """

                corrected_code = self.model.generate(correction_prompt)
                corrected_files[file_name] = corrected_code

                # Update code memory
                code_memory.update_code(file_name, corrected_code)

        return corrected_files

    def test_files(self, generated_files):
        """
        Execute and validate generated code.
        Identify errors for correction.
        """
        errors = {}

        for file_name, code in generated_files.items():
            # Attempt to execute and check for syntax/runtime errors
            exec_errors = self.execute_and_validate(code)

            if exec_errors:
                errors[file_name] = exec_errors

        return errors

    def compress_specification(self, spec, blueprint, context_limit):
        """
        Compress specification to fit within context window.
        Keep essential information, remove redundancy.
        """
        # Priority: algorithms > interfaces > examples
        essential_spec = ""

        # Add algorithms (highest priority)
        for algo in blueprint['algorithms']:
            essential_spec += f"\n{algo['description']}"

        # Add interfaces
        for interface in blueprint['interfaces']:
            essential_spec += f"\n{interface['signature']}"

        # Truncate if still too large
        if len(essential_spec) > context_limit:
            essential_spec = essential_spec[:context_limit]

        return essential_spec

The framework strategically optimizes information flow, achieving production-grade code quality comparable to PhD-level human experts.

Key Results

Surpasses commercial tools (Cursor, Claude Code) on code quality metrics
PhD-level expert performance on key reproduction metrics
Handles complex, multi-file repository synthesis
Outperforms prior research baselines significantly
PaperBench benchmark demonstrates effectiveness

Implementation Notes

Blueprint distillation compresses specification efficiently
Code memory enables cross-file consistency and references
RAG retrieves relevant context for each component
Closed-loop correction iteratively refines outputs
Context management crucial for large specifications

References

Original paper: https://arxiv.org/abs/2512.07921
Focus: Autonomous code generation from specifications
Domain: Software engineering, agent systems

Adoption

ADu2021/deepcode-agentic-coding

$ install --global

Security Scan Results

SKILL.md

Overview

When to Use

When NOT to Use

Core Technique

Key Results

Implementation Notes

References

Related Skills

ADu2021/flow-map-trajectory-tilting

ADu2021/flexible-data-mixture-of-experts

ADu2021/flexibility-trap-diffusion-reasoning

ADu2021/flex-continuous-agent-evolution

ADu2021/deepcode-agentic-coding

$ install --global

Security Scan Results

SKILL.md

Overview

When to Use

When NOT to Use

Core Technique

Key Results

Implementation Notes

References

Related Skills

ADu2021/flow-map-trajectory-tilting

ADu2021/flexible-data-mixture-of-experts

ADu2021/flexibility-trap-diffusion-reasoning

ADu2021/flex-continuous-agent-evolution