Architecture

Data flow

scanRepo()          → { languages, frameworks, domains, entryPoints, size, health }
     ↓
buildRepoGraph()    → { files, edges, ranked, hubs, clusters, hotspots, stats }
     ↓
runLLM() x2         → discoveryFindings (architecture + domains, in parallel)
     ↓
generateChunked()   → skill files (base → domains in parallel batches of 3 → CLAUDE.md)
     ↓
writeSkillFiles()   → .claude/skills/**/*.md + CLAUDE.md

Import graph

Parses JS/TS (via es-module-lexer) and Python (regex). Resolves relative imports, path aliases from tsconfig.json, and Python multi-root packages.

File priority formula:

priority = fanIn * 3.0 + exportCount * 1.5 + isEntryPoint * 10.0 + gitChurn * 2.0 + 1/(depth+1) * 1.0

Clustering: Connected components via BFS on the undirected import graph. Files that import each other land in the same domain cluster.

Runner

runLLM() dispatches to runClaude() or runCodex() based on configured backend:

Claude: claude -p --verbose --output-format stream-json
Codex: codex exec --json --sandbox read-only --ephemeral

Auto-scales timeout by repo size. Parses <file path="...">content</file> tags. Enforces path safety (writes only to .claude/, .agents/, instruction files).

Target system

Target = where output goes. Backend = which CLI generates it. Config in .aspens.json.

claude target → .claude/skills/, CLAUDE.md
codex target → .agents/skills/, AGENTS.md

You can use Claude as backend to generate Codex output, or vice versa.

Key dependencies

Package	Purpose
`es-module-lexer`	JS/TS import parsing (WASM, no native deps)
`commander`	CLI framework
`@clack/prompts`	Interactive terminal prompts
`picocolors`	Terminal colors

Plus claude CLI or codex CLI at runtime for LLM operations.