AI articles

Read the latest AI development workflow material in one place.

A curated, date-sorted reading list for product and engineering teams tracking AI workflows, Claude Code, Windsurf, Kiro, GitHub Copilot, coding agents, and AI model behavior.

Updated Jun 22, 2026 Workflow focused External sources

AIDLC brief

What today's AI development articles mean for teams

The latest signals point to the same operating need: AI-assisted development is becoming a team system, not a solo prompting trick.

Scale is now an engineering concern

AI coding activity is affecting source-control and cloud capacity. Teams should add reliability, CI, review queue, and repository health signals to their AI adoption plan.

Vendor independence matters

Tool strategy is shifting quickly across Claude Code, Codex, Cursor, Copilot, Kiro, Windsurf, and Devin. A model-agnostic workflow protects teams from rewriting process every time tools change.

Instructions need governance

Repo instructions such as AGENTS.md and CLAUDE.md should be short, consistent, and reviewable. Bad instructions create hidden workflow risk just like bad code.

Sorted by published date

Newest articles first

Newest first.

What is GLM-5.2? Another open-source Chinese AI model has Silicon Valley's attention.

Useful if you track model choices for coding work: it highlights a new open model built for long coding tasks and agentic workflows, plus why teams may revisit closed-vs-open tool strategy.

Read external article

Forget prompt engineering: 'Loop engineering' is all the rage now

Useful for teams moving beyond one-off prompts: it explains recurring agent workflows, sub-agent review patterns, and the cost tradeoffs behind always-on AI coding loops.

Read external article

Probe-and-Refine Tuning of Repository Guidance for Coding Agents

Useful if you maintain AGENTS.md or repo policies: it focuses on improving repository guidance so coding agents follow project-specific workflow expectations more reliably.

Read external article

N-Version Programming with Coding Agents

Worth reviewing if one-shot agent output is too risky: it explores generating multiple independent implementations so teams can compare, test, and choose the safer result.

Read external article

StaminaBench: Stress-Testing Coding Agents over 100 Interaction Turns

Strong fit for long-running AI dev workflows: it examines whether coding agents stay reliable across extended sessions instead of only short benchmark tasks.

Read external article

AgentArmor: A Framework, Evaluation, & Mitigation of Coding Agent Failures

Helpful for rollout planning: it centers failure modes and mitigations for coding agents, which is useful when you need stronger review and guardrail decisions.

Read external article

SWE-Future: Forecast-Conditioned Data Synthesis for Future-Oriented Software Engineering Agents

Worth reviewing if your benchmark plan feels stale: it proposes future-oriented coding-agent tasks so teams can evaluate against likely repo evolution, not only replayed historical issues.

Read external article

Runtime Compliance Verification for AI Agents

Useful for teams adding policy checks around tool use: it shows how runtime monitors can block non-compliant agent actions instead of relying only on prompt reviews.

Read external article

A Framework for Evaluating Agentic Skills at Scale

Helpful if you maintain reusable repo instructions or skills: it focuses on testing whether those workflow artifacts actually change agent behavior and improve task outcomes.

Read external article

All Smoke, No Alarm: Oracle Signals in Agent-Authored Test Code

Strong code-review signal for teams shipping AI-generated changes: test files alone do not prove coverage, so reviews should check whether the tests assert real behavior.

Read external article

Microsoft turns to Amazon for help with GitHub's AI-driven capacity issues

Useful signal for engineering leaders: AI coding activity is changing source-control infrastructure demand, reliability planning, and cloud capacity assumptions around GitHub.

Read external article

SpaceX overtakes Amazon to become world's fifth most valuable company

Worth tracking if you compare coding tools strategically: the Cursor deal centers compute, developer access, and enterprise control in the AI coding race.

Read external article

The quiet, galactic ambitions of Cursor CEO Michael Truell

Tracks Cursor's position in the AI coding market, model-dependency concerns, Claude Code competition, and why tool teams are investing in their own model strategy.

Read external article

RHO: Your Coding Agent is Secretly a Roboticist

Shows coding agents optimizing multi-file prompt, tool, and control-code harnesses, which is relevant to teams designing repeatable agent workflows rather than one-off prompts.

Read external article

AIChilles: Automatically Uncovering Hidden Weaknesses in AI-Evolved Systems

Strong fit for AIDLC verification: the paper studies how AI-evolved programs can regress on unseen workloads and how lifecycle checks can expose hidden weaknesses.

Read external article

Configuration Smells in AGENTS.md Files

Directly relevant to repo instructions for coding agents, with examples of problems like context bloat, conflicting instructions, and instruction leakage in agent configuration files.

Read external article

Software Delegation Contracts: Measuring Reviewability in AI Coding-Agent Work

Useful if you review agent-produced changes: it shows how explicit task contracts and evidence bundles can make AI coding work easier to inspect, even when correctness does not improve.

Read external article

LLM Agents Can See Code Repositories

Helpful for teams working in larger repos: it suggests repository maps and visual structure can cut exploration overhead without hurting issue-resolution quality.

Read external article

FastContext: Training Efficient Repository Explorer for Coding Agents

Relevant if agent runs waste tokens on repo search: it separates exploration from solving so teams can pass tighter context instead of flooding the main agent with noisy reads.

Read external article

New in Kiro Web: Build with Spec, GitLab, and more

Useful for teams studying browser-based spec workflows, cross-repository tasks, GitLab support, and how structured specs reduce rework before AI agents implement changes.

Read external article

Do programming languages still matter to your AI coding agent teammate? Evidence at scale from chess engines

Helpful for teams choosing stack boundaries: agents stay broadly capable, but language choice still affects cost, performance, and how much validation work remains.

Read external article

Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents

Strong workflow signal for teams turning repeated reviewer feedback into enforceable agent rules instead of restating the same corrections every run.

Read external article