Skip to content

Capstone

繁體中文 | 简体中文 | English

After finishing a track, build something yourself — this file is not a tutorial, not a walkthrough, and there is no model answer. Its purpose is to turn "I read the roadmap" into "I have something I can show + a grade I gave myself."

How to use this file:

  1. Pick a problem you actually have (work, research, daily life). Don't pick a toy problem — a capstone's value comes from being real.
  2. Check your track's "Prerequisites" and confirm the required stages have each passed their "Self-check".
  3. When done, self-assess with the matching rubric (4 levels: Not yet / Basic / Good / Excellent). Scoring honestly is more useful than scoring high.
  4. Want feedback? Post the artifact + your self-assessment to Discussion for peer review (optional, not required).

What each stage teaches / what you need before it / how you know you've learned it — all of that stays with the stage file's "Learning objectives / Prerequisites / Self-check". This file only defines the capstone itself.


Track A Capstone — CLI Power User

Prerequisites: Stage 0–2 + A1 + A2 + Stage 5 + A3 have each passed their self-check (Stage 8 is a shared hub across both tracks — recommended, but it does not gate capstone entry; the Track A capstone focuses on the CLI workflow).

Brief: Assemble a CLI-agent workflow you will reuse, automating something you currently do by hand.

Requirements (all mandatory):

  • A CLI agent (Claude Code or equivalent) at the core
  • At least 1 MCP server or a skill / command you wrote yourself
  • A clear input → a usable artifact out (not "chatting with it")
  • Reproducible by someone else: include how to run (install, configure, run, expected output)
  • Handles at least 1 failure case (missing input, API failure, what happens when the result is wrong)

Deliverables: a folder / repo with the artifact + README + evidence of at least one real run (log / screenshot / output file) + a reflection under 150 words (where you got stuck, what you'd change next time).

Time: 3–8 hours (not counting time spent learning the stages).

Track A scoring rubric (self-assessed, 4 levels)

DimensionNot yetBasicGoodExcellent
Problem realismToy problemSomewhat usefulReal, will reuseReal, with a quantifiable metric (time saved / fewer errors etc.)
Tool usePlain chat onlyUsed a CLI agent+ MCP or self-written skill/commandMulti-component, with justified choices
ReproducibilityOnly runs for youSteps incompleteOthers can run it from the READMEOne-click / fully automated with pre-checks
RobustnessCrashes on any errorRisks mentionedHandles 1 failureMultiple failures have fallbacks
Docs & reflectionNoneHas a READMEClear README + reflectionReflection points to a concrete next improvement

Track B Capstone — Agent Builder

Prerequisites: Stage 0–8 (including Stage 3, 4, 5, 6, 7, 7.5, 8) have each passed their self-check.

Brief: Design, build, and evaluate a small system that solves a concrete problem. Pick one:

  • A. Multi-agent: ≥ 2 cooperating agents with orchestration logic; or
  • B. RAG system: a complete retrieval + generation pipeline.

Requirements (all mandatory):

  • Has tool use
  • Has one outward interface (CLI / API / chat — any one, mapping to Stage 8)
  • Has explicit evaluation: define ≥ 5 test cases yourself + measure a pass rate / qualitative assessment (this one is non-negotiable — the thing this curriculum most often skips is "verification")
  • Failure-mode analysis: write down under what conditions it breaks and how you'd know
  • An architecture sketch (a diagram or a paragraph describing components and data flow)

Deliverables: a code repo + an architecture description + evaluation results (even just an N-cases / pass-rate table) + README + a reflection under 200 words (where the architecture call was wrong, what you'd do differently).

Time: 8–20 hours (not counting time spent learning the stages).

Track B scoring rubric (self-assessed, 4 levels)

DimensionNot yetBasicGoodExcellent
Problem definitionVagueHas a goalClear scope, acceptance-checkableClear, and states why it's worth doing
ArchitectureNo designJust runsJustified multi-agent / RAG choiceCan articulate the trade-offs
Implementation correctnessDoesn't runMain path runsHandles edge casesStable and the code is readable
Evaluation rigorNot testedTried it a few times by hand≥5 cases + pass rateHas a baseline comparison / a rerunnable regression
Robustness & failure analysisNoneRisks mentionedConcrete failure modesFailures are detected + mitigated
Interface & docsNoneRunsInterface + clear READMEOthers can use it directly
ReflectionNoneOne sentenceConcrete (names a specific problem in the architecture or component choice)Points to an architecture-level next step

Pick a brief by role (audience flavor)

Same capstone — just swap in your scenario; no need to do a separate one:

  • 🔬 researcher: literature Q&A / experiment-log organization / data-preprocessing agent
  • 💻 developer: a review/triage agent inside CI / repo Q&A / automated release notes
  • 🎓 teacher: question generation + grading support / material rewriting / course Q&A (mind the academic-integrity boundary)
  • 📊 knowledge-worker: meeting notes → action items / cross-document synthesis / first-draft weekly report
  • 👥 everyday-user: personal-data consolidation / scheduling and reminders / repetitive-chore automation

How to show your work

  • Post to the matching Discussion category with a link to the artifact + your rubric self-assessment.
  • Put it in your own portfolio / GitHub; describe it with concrete facts (what you did, what you measured), and avoid hype like "the strongest / the best in the world".
  • Want to review someone else's: give feedback against that person's track rubric — about the work, not the person.

This file only defines the capstone and its scoring. Each stage's learning content and pass conditions stay with that stage's file.