SWE-gen automates AI task creation by reversing GitHub pull requests
#AI

SWE-gen automates AI task creation by reversing GitHub pull requests

Startups Reporter
1 min read

Abundant AI's open-source tool converts merged GitHub PRs into verified Harbor tasks by recreating buggy states and validating fixes.

Featured image

Abundant AI has released SWE-gen, an open-source tool that automatically converts merged GitHub pull requests into validated tasks for AI systems. The solution addresses a critical bottleneck in AI training: creating high-quality, real-world coding tasks that accurately reflect developer workflows.

Traditional methods for generating AI programming tasks often rely on synthetic datasets or manual curation. SWE-gen instead mines actual software development history by analyzing merged PRs from public repositories. The tool works across programming languages by using Claude Code to detect a project's language, build system, and test framework during analysis.

SWE-gen llama genie

The core innovation lies in SWE-gen's verification pipeline:

  1. Bug recreation: Reverts merged PRs to reconstruct pre-fix buggy states
  2. Baseline validation: Confirms tests fail against the buggy version
  3. Fix verification: Applies the original PR changes and confirms tests pass

This containerized approach installs all dependencies at build time, ensuring reproducibility. Developers can generate individual tasks using specific PRs (swegen create --repo axios/axios --pr 7150) or process entire repositories continuously (swegen farm fastapi/fastapi).

Validation commands (swegen validate) confirm tasks meet the NOP (no-operation) and Oracle requirements, while swegen analyze runs agent trials to verify task solvability. The team recently demonstrated practical application by generating a dataset of 1,000 JavaScript/TypeScript tasks using this methodology.

Unlike synthetic benchmarks, SWE-gen produces tasks rooted in actual development scenarios, potentially accelerating AI agent training while maintaining real-world relevance. The Apache 2.0-licensed tool requires API keys for GitHub, OpenAI/Anthropic, and optional cloud sandbox integrations.

Comments

Loading comments...