FABBI · CTO INTELLIGENCE

Technical Intelligence Brief — AI Coding Agents / Agentic SDLC

2026-05-29 06:03 GMT+7
Gate: PARTIAL-PASS · 165 candidates

Executive Snapshot

  • 165 candidates scanned
  • 70 GitHub repo signals
  • 30 X public-search signals
  • 40 HN/dev-web signals
  • 4/7 Fabbi domains actionable now

CTO Evaluation Matrix

Signal | Evidence | Counter-signal | Fabbi implication | Decision
Agent harness is the control plane | 70 GitHub + 40 HN items | Reddit blocked; papers low | NEXA/SYNCA need an internal benchmark | trial 82%
Context engineering determines quality | 30 X search/KOL links; GitHub repos on code agents | Engagement N/A due to public fallback | FARE = codebase memory + retrieval eval | adopt 78%
Enterprise readiness remains risky | Open issues/stars N/A per repo table; HN skeptical comments | No customer metrics | SYNCA governs HITL, audit, sandbox | watch 66%

Trend Radar

  • P0 Agent eval harness: hot now, 2-week test.
  • P0 Repo/context map for FARE: hot now.
  • P1 CLI sandbox policy: emerging.
  • P1 YouTube workflow tutorials: watch, metrics N/A.
  • Noise generic “AI coding replaces dev” claims: ignore.

KOL/OG Feed Watch

Platform | Author/channel | Timestamp | Engagement | URL | Why it matters
X/public-web | X search/KOL public | N/A | N/A blocked | coding agent KOL/search signal 1 | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
X/public-web | X search/KOL public | N/A | N/A blocked | coding agent KOL/search signal 2 | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
X/public-web | X search/KOL public | N/A | N/A blocked | coding agent KOL/search signal 3 | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
X/public-web | X search/KOL public | N/A | N/A blocked | agentic programming KOL/search signal 1 | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
YouTube | YouTube search | N/A | N/A API unavailable | Claude Code coding agent video signal 1 | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
YouTube | YouTube search | N/A | N/A API unavailable | Claude Code coding agent video signal 2 | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
YouTube | YouTube search | N/A | N/A API unavailable | Claude Code coding agent video signal 3 | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
HN | aanet | 2026-05-28T22:46:14Z | 1 pts/0 c | Clawd-on-Desk: a pixel desktop pet watching your AI coding agents | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
HN | SVI | 2026-05-28T21:03:24Z | 7 pts/1 c | Protestware for Coding Agents | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
HN | akashi_dev | 2026-05-28T20:44:37Z | 2 pts/0 c | Show HN: Rig – Local-first code graph for coding agents, in one npx command | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
GitHub | CoWork-OS | 2026-05-28T23:05:48Z | 330 stars | CoWork-OS/CoWork-OS | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
GitHub | nithisurender05 | 2026-05-28T23:05:41Z | 0 stars | nithisurender05/AgenticEduMCP | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.
GitHub | shuntaka9576 | 2026-05-28T23:05:37Z | 21 stars | shuntaka9576/agentoast | CTO signal on adoption/reliability/coding workflow; metrics missing if the API is blocked.

Repo Watch

Repo | Stars | Updated | Move
CoWork-OS/CoWork-OS | 330 stars | 2026-05-28T23:05:48Z | Trial if it fits NEXA/SYNCA
nithisurender05/AgenticEduMCP | 0 stars | 2026-05-28T23:05:41Z | Trial if it fits NEXA/SYNCA
shuntaka9576/agentoast | 21 stars | 2026-05-28T23:05:37Z | Trial if it fits NEXA/SYNCA
paultuanakotta/pi-slack-codex | 0 stars | 2026-05-28T23:05:30Z | Trial if it fits NEXA/SYNCA
realorange1994/mini-claude-go | 0 stars | 2026-05-28T23:05:26Z | Trial if it fits NEXA/SYNCA
jhanva/ai-skills | 0 stars | 2026-05-28T23:05:24Z | Trial if it fits NEXA/SYNCA
Gazi-AI/GCode | 0 stars | 2026-05-28T23:05:03Z | Trial if it fits NEXA/SYNCA
genkovich/sdd | 0 stars | 2026-05-28T23:05:03Z | Trial if it fits NEXA/SYNCA
TechMatrix-labs/pythinker-code | 0 stars | 2026-05-28T23:04:52Z | Trial if it fits NEXA/SYNCA
langchain-ai/open-swe | 9871 stars | 2026-05-28T23:04:44Z | Trial if it fits NEXA/SYNCA

Paper / Benchmark / Product Watch

  • Protestware for Coding Agents: 7 pts/1 c · SVI
  • Coding agent can read your .env file: 2 pts/0 c · nkko
  • Bill Gates AI on AI (one month later): 3 pts/0 c · vbutsomesayw

Benchmark focus: SWE-bench/Terminal-Bench-style task completion should become an internal acceptance gate. Product watch covered Claude Code, Codex, Cursor, Devin, OpenCode, and Gemini CLI via the query layer; direct changelog metrics were N/A in this run.
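
A minimal sketch of how such an acceptance gate could be scored, assuming each internal task records whether the agent's first attempt passed its tests. The `TaskResult` type, the task IDs, the sample data, and the 0.5 threshold are all hypothetical; the brief names pass@1 as a metric but does not set a threshold.

```python
from dataclasses import dataclass


@dataclass
class TaskResult:
    task_id: str            # hypothetical identifier for an internal task
    first_try_passed: bool  # did the agent's first attempt pass the task's tests?


def pass_at_1(results: list[TaskResult]) -> float:
    """Fraction of tasks solved on the first attempt."""
    if not results:
        raise ValueError("no task results to score")
    return sum(r.first_try_passed for r in results) / len(results)


def acceptance_gate(results: list[TaskResult], threshold: float = 0.5) -> bool:
    """True when pass@1 meets the threshold (the 0.5 default is an assumption)."""
    return pass_at_1(results) >= threshold


# Illustrative data only: 20 tasks, the first 12 pass on the first attempt.
results = [TaskResult(f"task-{i:02d}", i < 12) for i in range(20)]
print(f"pass@1 = {pass_at_1(results):.2f}")
print("gate:", "PASS" if acceptance_gate(results) else "FAIL")
```

Keeping the threshold as a parameter lets each team tune the gate once real baseline numbers exist.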

Impact Coverage

Domain | Now 0-2w | Next 1-2m | Later 3-6m | Decision
FARE | Build 50-file context eval | Repo graph memory | Team knowledge agent | adopt
NEXA | 20-task coding harness | CLI sandbox | multi-agent workflow | trial
SYNCA | Risk checklist 5 gates | audit log | governance console | trial
DOMUS | Monitor only | proposal automation | ops agent | watch
Japan/VN/Global | Pitch 12-20% dev cycle saving | case study | managed AI-SDLC offer | trial
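
For the FARE context eval, one plausible way to score retrieval is a top-k hit rate: the share of labeled queries for which at least one expected file appears among the first k retrieved files. The sketch below assumes that definition; the function name, the query IDs, and the file paths are hypothetical and not taken from any Fabbi repo.

```python
def retrieval_hit_rate(
    retrieved: dict[str, list[str]],
    expected: dict[str, set[str]],
    k: int = 5,
) -> float:
    """Share of queries whose top-k retrieved files include an expected file."""
    if not expected:
        raise ValueError("no labeled queries")
    hits = 0
    for query, relevant in expected.items():
        top_k = retrieved.get(query, [])[:k]
        if relevant.intersection(top_k):
            hits += 1
    return hits / len(expected)


# Hypothetical labels: which files a good context pack should surface per query.
expected = {
    "q1": {"src/auth/login.py"},
    "q2": {"src/billing/invoice.py"},
    "q3": {"README.md"},
}
# Hypothetical retriever output, ranked best first.
retrieved = {
    "q1": ["src/auth/login.py", "src/auth/token.py"],
    "q2": ["src/billing/tax.py"],
    "q3": ["docs/setup.md", "README.md"],
}
print(f"hit rate @5 = {retrieval_hit_rate(retrieved, expected):.2f}")
```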

CTO Recommendations

Action | ROI/time-saving | Risk | Owner | TTV | Validation
NEXA: build a 20-task internal SWE-bench mini harness | 18-28% | 3/5 | Head of Engineering | 10 days | pass@1, review defects, cycle time
FARE: standardize the context pack for 3 pilot repos | 12-22% | 2/5 | AI Platform Lead | 7 days | retrieval hit-rate, hallucination count
SYNCA: add a 5-gate HITL/sandbox policy for coding agents | 8-15% | 4/5 | Security/QA Lead | 14 days | blocked unsafe actions, audit completeness
Market: package an “AI-SDLC readiness assessment” for JP/VN | 10-18% | 2/5 | Delivery Director | 21 days | 2 pilot proposals, conversion rate
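
The SYNCA validation metrics (blocked unsafe actions, audit completeness) can be illustrated with a small policy gate. This is a sketch under stated assumptions: the brief does not define the five gates, so the blocked path patterns, the commands that need human approval, and the `Action` and `PolicyGate` names are all hypothetical. The `.env` rule echoes the HN item listed earlier in this brief.

```python
import fnmatch
from dataclasses import dataclass, field

# Hypothetical policy values for illustration, not Fabbi's actual gates.
BLOCKED_PATHS = [".env", "*/.env", "*.pem"]
NEEDS_HUMAN = ("git push", "rm -rf")


@dataclass
class Action:
    kind: str    # "read_file" or "run_command"
    target: str  # file path or shell command


@dataclass
class PolicyGate:
    audit_log: list[tuple[Action, str]] = field(default_factory=list)

    def check(self, action: Action) -> str:
        """Return 'allow', 'block', or 'ask_human' and record the decision."""
        decision = "allow"
        if action.kind == "read_file" and any(
            fnmatch.fnmatchcase(action.target, pattern) for pattern in BLOCKED_PATHS
        ):
            decision = "block"
        elif action.kind == "run_command" and action.target.startswith(NEEDS_HUMAN):
            decision = "ask_human"
        self.audit_log.append((action, decision))
        return decision


gate = PolicyGate()
actions = [
    Action("read_file", "src/app.py"),
    Action("read_file", "config/.env"),
    Action("run_command", "git push origin main"),
]
decisions = [gate.check(a) for a in actions]
blocked = sum(d == "block" for _, d in gate.audit_log)
audit_completeness = len(gate.audit_log) / len(actions)
print(decisions, blocked, audit_completeness)
```

Logging every decision, including allowed ones, is what makes audit completeness measurable as logged decisions over attempted actions.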

Source Appendix

# | Platform | Source | Metric | Notes
1 | GitHub | CoWork-OS/CoWork-OS | 330 stars | stars=330 forks=50 issues=7 updated=2026-05-28T23:05:48Z desc=Local-first personal agentic OS and everything app for coding, knowledge work, web design, automat
2 | GitHub | nithisurender05/AgenticEduMCP | 0 stars | stars=0 forks=0 issues=0 updated=2026-05-28T23:05:41Z desc=This repository contains research code for a NLP course research paper for studying agentic large lan
3 | GitHub | shuntaka9576/agentoast | 21 stars | stars=21 forks=0 issues=4 updated=2026-05-28T23:05:37Z desc=🍞 Toast notifications from AI coding agents on your macOS menu bar, with tmux pane switching
4 | GitHub | paultuanakotta/pi-slack-codex | 0 stars | stars=0 forks=0 issues=0 updated=2026-05-28T23:05:30Z desc=Pi Slack Bot 2026 - Best Free AI Coding Agent for Conversational Development
5 | GitHub | realorange1994/mini-claude-go | 0 stars | stars=0 forks=0 issues=0 updated=2026-05-28T23:05:26Z desc=A lightweight Go implementation of Claude Code's agent loop framework with streaming support, 14+ bui
6 | GitHub | jhanva/ai-skills | 0 stars | stars=0 forks=0 issues=0 updated=2026-05-28T23:05:24Z desc=Custom skills, agents, and hooks for Claude Code. 38 skills (dev, Android, image, game dev), 10 speci
7 | GitHub | Gazi-AI/GCode | 0 stars | stars=0 forks=0 issues=0 updated=2026-05-28T23:05:03Z desc=Local-first AI coding IDE with a browser UI, terminal launcher, staged edit review, plan tracking, sa
8 | GitHub | genkovich/sdd | 0 stars | stars=0 forks=0 issues=0 updated=2026-05-28T23:05:03Z desc=Spec-Driven Development for Claude Code: 12 atomic Socratic skills + a TDD implement engine (agent-te
9 | GitHub | TechMatrix-labs/pythinker-code | 0 stars | stars=0 forks=0 issues=3 updated=2026-05-28T23:04:52Z desc=Think first, then code. Review-first AI engineering agent for the terminal — code reviewer, security
10 | GitHub | langchain-ai/open-swe | 9871 stars | stars=9871 forks=1123 issues=18 updated=2026-05-28T23:04:44Z desc=An Open-Source Asynchronous Coding Agent
11 | GitHub | linny006/agent-eval-harness | 0 stars | stars=0 forks=0 issues=3 updated=2026-05-28T23:00:36Z desc=Live, open-source benchmark for comparing AI coding agents on real GitHub issues
12 | GitHub | dendron542/SWE_benchmarks_info | 0 stars | stars=0 forks=0 issues=0 updated=2026-05-28T22:13:02Z desc=None
13 | GitHub | ZaikoXeas/mcpbr | 0 stars | stars=0 forks=1 issues=1 updated=2026-05-28T21:39:00Z desc=🚀 Benchmark your MCP server with real GitHub issues for accurate performance metrics using a simple c
14 | GitHub | vasic-digital/Benchmark | 0 stars | stars=0 forks=0 issues=0 updated=2026-05-28T21:31:23Z desc=LLM benchmarking: SWE-bench, HumanEval, MMLU, leaderboard
15 | GitHub | sipyourdrink-ltd/bernstein | 497 stars | stars=497 forks=41 issues=16 updated=2026-05-28T21:18:14Z desc=Audit-grade multi-agent orchestration for CLI coding agents (Claude Code, Codex, Gemini CLI, +40
16 | GitHub | Trustableclaw/SWE-bench-Lite-Mac-20-Proof | 0 stars | stars=0 forks=0 issues=0 updated=2026-05-28T16:50:03Z desc=None
17 | GitHub | Grumpified-OGGVCT/model-trust-scorecard | 0 stars | stars=0 forks=0 issues=3 updated=2026-05-28T15:41:48Z desc=stop guessing whether a model’s “80 % SWE‑bench” claim is real by building a transparent, reproducibl
18 | GitHub | LING-6150/llm-codegen-eval | 0 stars | stars=0 forks=0 issues=0 updated=2026-05-28T15:36:30Z desc=Evaluation harness for LLM code generation, modeled after HumanEval/SWE-bench
19 | HN | Clawd-on-Desk: a pixel desktop pet watching your AI coding agents | 1 pts/0 c | points=1 comments=0
20 | HN | Protestware for Coding Agents | 7 pts/1 c | points=7 comments=1
21 | HN | Show HN: Rig – Local-first code graph for coding agents, in one npx command | 2 pts/0 c | points=2 comments=0
22 | HN | Coding agent can read your .env file | 2 pts/0 c | points=2 comments=0
23 | HN | Show HN: Bootstrap a team of coding agents from a template, OSS | 3 pts/0 c | points=3 comments=0
24 | HN | Bill Gates AI on AI (one month later) | 3 pts/0 c | points=3 comments=0
25 | HN | Ask HN: We dont need a programming language now? | 2 pts/4 c | points=2 comments=4
26 | HN | Show HN: I built a self-writing book on agentic coding | 2 pts/1 c | points=2 comments=1
27 | HN | Functional programming accelerates agentic feature development | 59 pts/31 c | points=59 comments=31
28 | HN | AI surpass Superman in Competitive Programming via Agentic RL [pdf] | 2 pts/1 c | points=2 comments=1
29 | HN | We Benchmarked Claude Code, Codex, Semgrep, CodeQL, Trent on 28 CWE-Bench CVEs | 5 pts/1 c | points=5 comments=1
30 | HN | Mini-SWE-agent scores up to 74% on SWE-bench in 100 lines of Python code | 2 pts/0 c | points=2 comments=0
31 | X/public-web | coding agent KOL/search signal 1 | N/A blocked | N/A public search fallback; metrics blocked
32 | X/public-web | coding agent KOL/search signal 2 | N/A blocked | N/A public search fallback; metrics blocked
33 | X/public-web | coding agent KOL/search signal 3 | N/A blocked | N/A public search fallback; metrics blocked
34 | X/public-web | agentic programming KOL/search signal 1 | N/A blocked | N/A public search fallback; metrics blocked
35 | X/public-web | agentic programming KOL/search signal 2 | N/A blocked | N/A public search fallback; metrics blocked
36 | X/public-web | agentic programming KOL/search signal 3 | N/A blocked | N/A public search fallback; metrics blocked
37 | X/public-web | SWE-bench KOL/search signal 1 | N/A blocked | N/A public search fallback; metrics blocked
38 | X/public-web | SWE-bench KOL/search signal 2 | N/A blocked | N/A public search fallback; metrics blocked
39 | YouTube | Claude Code coding agent video signal 1 | N/A API unavailable | N/A YouTube API unavailable; search URL fallback
40 | YouTube | Claude Code coding agent video signal 2 | N/A API unavailable | N/A YouTube API unavailable; search URL fallback
41 | YouTube | Claude Code coding agent video signal 3 | N/A API unavailable | N/A YouTube API unavailable; search URL fallback
42 | YouTube | OpenAI Codex coding agent video signal 1 | N/A API unavailable | N/A YouTube API unavailable; search URL fallback
43 | YouTube | OpenAI Codex coding agent video signal 2 | N/A API unavailable | N/A YouTube API unavailable; search URL fallback
44 | YouTube | OpenAI Codex coding agent video signal 3 | N/A API unavailable | N/A YouTube API unavailable; search URL fallback
45 | Papers/arXiv | N/A | N/A | error: The read operation timed out

Data Quality / Scan Health

Scanned 165 candidates. Breakdown: HN 40, GitHub 70, Papers/arXiv 5, YouTube 15, X/public-web 30, Facebook/public-web 5. PASS on source volume (>= 100). PARTIAL on social completeness: X, YouTube, and Facebook public sources were attempted; Reddit was blocked (403), Facebook metrics were N/A, the YouTube API was unavailable (search fallback used), and the paper count was 5/15. Confidence: 72/100. Caveat: down-weight social sentiment; do not down-weight GitHub/HN technical signals.
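
The gate arithmetic above can be checked with a short script. The per-source counts and the volume threshold of 100 come from this brief; the mapping from checks to a gate label, the `FAIL` label, and the `scan_gate` name are assumptions about how the pipeline might combine them. The confidence score is not recomputed here because the brief does not give its formula.

```python
# Per-source counts as reported in this brief.
breakdown = {
    "HN": 40,
    "GitHub": 70,
    "Papers/arXiv": 5,
    "YouTube": 15,
    "X/public-web": 30,
    "Facebook/public-web": 5,
}
MIN_VOLUME = 100  # from the brief: PASS source volume >= 100

# Social sources with blocked or missing metrics in this run, per the brief.
degraded_social = ["X/public-web", "YouTube", "Facebook/public-web", "Reddit"]


def scan_gate(counts: dict[str, int], degraded: list[str]) -> str:
    """Combine the volume and social-completeness checks (assumed rule)."""
    volume_ok = sum(counts.values()) >= MIN_VOLUME
    if not volume_ok:
        return "FAIL"  # assumed label; the brief does not name a failing state
    return "PASS" if not degraded else "PARTIAL-PASS"


total = sum(breakdown.values())
print(total, scan_gate(breakdown, degraded_social))
```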

Fabbi AI CTO Report · generated via html-anything fabbi-technical-brief style · Cloudflare Pages