agents

mirror of https://github.com/wshobson/agents.git synced 2026-03-18 17:47:16 +00:00

Author	SHA1	Message	Date
Seth Hobson	94d1aba17a	Add modernized Payment Intents pattern with Payment Element - Restore Payment Intents flow removed by PR, updated for modern best practices - Use Payment Element instead of legacy Card Element - Use stripe.confirmPayment() instead of deprecated confirmCardPayment() - Use automatic_payment_methods instead of hardcoded payment_method_types - Split Python/JS into separate fenced code blocks for clarity - Add guidance on when to use Payment Intents vs Checkout Sessions - Renumber subsequent patterns (Subscription → 4, Customer Portal → 5)	2026-02-19 13:45:55 -05:00
Seth Hobson	204e8129aa	Polish Stripe best practices examples for consistency - Remove payment_method_types=['card'] from Quick Start (dynamic payment methods) - Remove unused appearance variable from Pattern 2 JS example - Fix actions access pattern: destructure before use for consistency - Add inline comments clarifying sync/async distinction and amount format - Add ui_mode='embedded' to Embedded checkout bullet for completeness - Replace payment_method_types with automatic_payment_methods in test example	2026-02-19 13:42:36 -05:00
Sawyer	2b8e3166a1	Update to latest Stripe best practices	2026-02-18 20:38:50 -08:00
Seth Hobson	089740f185	chore: bump marketplace to v1.5.1 and sync plugin versions Sync marketplace.json versions with plugin.json for all 14 touched plugins. Fix plugin.json versions for llm-application-dev (2.0.3), startup-business-analyst (1.0.4), and ui-design (1.0.2) to match marketplace lineage. Add dotnet-contribution to marketplace.	2026-02-06 19:36:28 -05:00
Seth Hobson	4d504ed8fa	fix: eliminate cross-plugin dependencies and modernize plugin.json across marketplace Rewrites 14 commands across 11 plugins to remove all cross-plugin subagent_type references (e.g., "unit-testing::test-automator"), which break when plugins are installed standalone. Each command now uses only local bundled agents or general-purpose with role context in the prompt. All rewritten commands follow conductor-style patterns: - CRITICAL BEHAVIORAL RULES with strong directives - State files for session tracking and resume support - Phase checkpoints requiring explicit user approval - File-based context passing between steps Also fixes 4 plugin.json files missing version/license fields and adds plugin.json for dotnet-contribution. Closes #433	2026-02-06 19:34:26 -05:00
Seth Hobson	4820385a31	chore: modernize all plugins to new format with per-plugin plugin.json Add .claude-plugin/plugin.json to all 67 remaining plugins and simplify marketplace.json entries by removing redundant fields (keywords, strict, commands, agents, skills, repository) that are now auto-discovered. Bump marketplace version to 1.5.0.	2026-02-05 22:02:17 -05:00
Seth Hobson	a5ab5d8f31	chore(agent-teams): bump to v1.0.2	2026-02-05 17:42:30 -05:00
Seth Hobson	598ea85e7f	fix(agent-teams): simplify plugin.json and marketplace entry to match conductor patterns Strip plugin.json to minimal fields (name, version, description, author, license). Remove commands/agents/skills arrays, keywords, repository, and strict from marketplace entry.	2026-02-05 17:41:00 -05:00
Seth Hobson	fb9eba62b2	fix(agent-teams): remove Context7 MCP dependency, align frontmatter with conductor patterns, bump to v1.0.1 Remove .mcp.json to eliminate external MCP dependency that likely caused plugin load failure. Add tools: field to all agents, version: field to all skills, matching conductor plugin patterns.	2026-02-05 17:30:35 -05:00
Seth Hobson	b187ce780d	docs(agent-teams): use official /plugin install command instead of --plugin-dir	2026-02-05 17:16:29 -05:00
Seth Hobson	1f46cab1f6	docs(agent-teams): add link to official Anthropic Agent Teams docs	2026-02-05 17:14:55 -05:00
Seth Hobson	0752775afc	feat(agent-teams): add plugin for multi-agent team orchestration New plugin with 7 presets (review, debug, feature, fullstack, research, security, migration), 4 specialized agents, 7 slash commands, 6 skills with reference docs, and Context7 MCP integration for research teams.	2026-02-05 17:10:02 -05:00
Ruyut	918a770990	fix: add missing ')' in winston File transport (#426 )	2026-02-01 21:06:12 -05:00
Song Luar	194a267494	Update npx packages referenced in markdown files (#425 ) * use correct npx package names in md files * fix: update remaining non-existent npm package references - Replace react-codemod with jscodeshift in deps-upgrade.md - Remove non-existent changelog-parser reference --------- Co-authored-by: Seth Hobson <wshobson@gmail.com>	2026-02-01 21:04:21 -05:00
kenzo	3ed95e608a	feat(tailwind-design-system): update skill for Tailwind CSS v4 (#427 ) * feat(tailwind-design-system): update skill for Tailwind CSS v4 Major updates: - CSS-first configuration with @theme blocks - @custom-variant for dark mode (not @variant) - @keyframes must be inside @theme for tree-shaking - React 19 ref-as-prop patterns (no forwardRef) - OKLCH colors for better perceptual uniformity - Native CSS animations (@starting-style, transition-behavior) - New @utility directive for custom utilities - @theme inline/static modifiers - Namespace overrides (--color-: initial) - Semi-transparent variants with color-mix() - Container query tokens Breaking changes from v3: - tailwind.config.ts → CSS @theme - @tailwind directives → @import 'tailwindcss' - darkMode: 'class' → @custom-variant dark fix: address review feedback for tailwind v4 skill - Add missing semicolon to @custom-variant declaration - Add missing Slot import from @radix-ui/react-slot - Add missing DialogPortal declaration - Add --color-ring-offset to theme for focus states - Fix misleading comment about @keyframes tree-shaking - Update comparison table for tailwindcss-animate replacement - Use standard zod import path (not transitional zod/v4) - Update upgrade guide link to stable URL - Format with Prettier --------- Co-authored-by: Seth Hobson <wshobson@gmail.com>	2026-02-01 20:40:22 -05:00
M. A.	cbb60494b1	Add Comprehensive Python Development Skills (#419 ) * Add extra python skills covering code style, design patterns, resilience, resource management, testing patterns, and type safety ...etc * fix: correct code examples in Python skills - Clarify Python version requirements for type statement (3.10+ vs 3.12+) - Add missing ValidationError import in configuration example - Add missing httpx import and url parameter in async example --------- Co-authored-by: Seth Hobson <wshobson@gmail.com>	2026-01-30 11:52:14 -05:00
Daniel	f9e9598241	Revise event sourcing architect metadata and description (#417 ) Add header with the event sourcing architect's description and name format.	2026-01-30 11:34:59 -05:00
Seth Hobson	1135ac6062	docs: update installation commands for llm-application-dev and conductor	2026-01-19 17:08:27 -05:00
Seth Hobson	56848874a2	style: format all files with prettier	2026-01-19 17:07:03 -05:00
Seth Hobson	027ed046a3	feat(ui-design): modernize to auto-discovery pattern v1.0.1	2026-01-19 16:59:34 -05:00
Seth Hobson	1b9d881d11	fix(llm-application-dev): use auto-discovery pattern like conductor v2.0.2	2026-01-19 16:55:01 -05:00
Seth Hobson	16f8e8c66e	fix(llm-application-dev): add command frontmatter for slash command registration v2.0.1	2026-01-19 16:26:41 -05:00
Seth Hobson	1e54d186fe	feat(ui-design): add comprehensive UI/UX design plugin v1.0.0 New plugin covering mobile (iOS, Android, React Native) and web applications with modern design patterns, accessibility, and design systems. Components: - 9 skills: design-system-patterns, accessibility-compliance, responsive-design, mobile-ios-design, mobile-android-design, react-native-design, web-component-design, interaction-design, visual-design-foundations - 4 commands: design-review, create-component, accessibility-audit, design-system-setup - 3 agents: ui-designer, accessibility-expert, design-system-architect Marketplace updated: - Version bumped to 1.3.4 - 102 agents (+3), 116 skills (+9)	2026-01-19 16:22:13 -05:00
Seth Hobson	8be0e8ac7a	feat(llm-application-dev): modernize to LangGraph and latest models v2.0.0 - Migrate from LangChain 0.x to LangChain 1.x/LangGraph patterns - Update model references to Claude 4.5 and GPT-5.2 - Add Voyage AI as primary embedding recommendation - Add structured outputs with Pydantic - Replace deprecated initialize_agent() with StateGraph - Fix security: use AST-based safe math instead of unsafe execution - Add plugin.json and README.md for consistency - Bump marketplace version to 1.3.3	2026-01-19 15:43:25 -05:00
Seth Hobson	e827cc713a	feat(conductor): remove tool restrictions from slash commands Remove allowed-tools field from all conductor commands to enable full tool access for implementation-focused workflows. - implement.md: removed tool restrictions - manage.md: removed tool restrictions - new-track.md: removed tool restrictions - revert.md: removed tool restrictions - setup.md: removed tool restrictions - status.md: removed tool restrictions Bump conductor version to 1.2.0	2026-01-17 11:19:12 -05:00
Seth Hobson	e5255782cd	feat(conductor): add track manager command for lifecycle management Add /conductor:manage command with comprehensive track lifecycle operations: - Archive completed tracks with reason tracking - Restore archived tracks to active state - Delete tracks permanently with safety confirmations - Rename track IDs with full reference updates - Cleanup orphaned artifacts and stale tracks - Interactive menu when invoked without arguments Also includes: - Add Archived Tracks section to tracks.md template - Update README with manage command documentation - Bump version to 1.1.0 - Format files with prettier	2026-01-16 12:02:24 -05:00
Seth Hobson	d750cf0e44	fix(conductor): remove invalid attribution field from plugin metadata The attribution field is not part of the Claude Code plugin schema. Attribution is properly documented in README.md instead.	2026-01-15 22:36:24 -05:00
Seth Hobson	627dc5cdd7	chore(conductor): add license and attribution to plugin metadata - Add Apache-2.0 license to marketplace.json and plugin.json - Add attribution field referencing original Google project - Bump version to 1.0.7	2026-01-15 22:35:17 -05:00
Seth Hobson	3fc313393d	chore(conductor): add attribution to original Google project - Add acknowledgments section crediting gemini-cli-extensions/conductor - Credit @wshobson for Claude Code adaptation - Update license from MIT to Apache-2.0 to match original	2026-01-15 22:33:09 -05:00
Seth Hobson	58f4038326	fix(startup-business-analyst): add allowed-tools to commands and simplify marketplace entry Add allowed-tools field to command frontmatter and simplify marketplace.json entry to rely on auto-discovery instead of explicit arrays.	2026-01-15 21:33:44 -05:00
Seth Hobson	3e673da18e	fix(conductor): add allowed-tools to commands and simplify marketplace entry Based on research of official plugins: - Add allowed-tools array to all 5 commands (required field in working plugins) - Simplify marketplace.json entry to match official format (minimal fields, auto-discovery) - Remove explicit commands/agents/skills arrays (rely on auto-discovery like official plugins) Version: 1.0.6	2026-01-15 21:27:59 -05:00
Seth Hobson	3d0a40dd58	fix(plugins): align with official Claude Code plugin conventions - Add minimal .claude-plugin/plugin.json to conductor and startup-business-analyst (matches official format: name, description, author only) - Remove .gitignore from startup-business-analyst (not in official plugins) - Bump versions: conductor 1.0.5, startup-business-analyst 1.0.2 Plugin structure now matches official examples (feature-dev, ralph-loop, etc.)	2026-01-15 21:01:45 -05:00
Seth Hobson	29d59eb5cf	fix(plugins): add proper frontmatter to commands and agents Commands now have correct frontmatter format: - description (required) - argument-hint (optional) Agents now have correct frontmatter format: - name, description, tools, model, color Updated plugins: - conductor v1.0.4: fixed all 5 commands and 1 agent - startup-business-analyst v1.0.1: fixed all 3 commands	2026-01-15 20:58:35 -05:00
Seth Hobson	87ed65d2b5	fix(plugins): remove redundant plugin.json and fix command formats - Remove .claude-plugin/plugin.json from conductor and startup-business-analyst (marketplace.json provides all config, standalone plugin.json breaks installation) - Remove YAML frontmatter from startup-business-analyst commands - Bump versions: conductor 1.0.3, startup-business-analyst 1.0.1	2026-01-15 20:55:21 -05:00
Seth Hobson	3a6a24e1a6	chore(conductor): bump version to 1.0.2	2026-01-15 20:44:10 -05:00
Seth Hobson	fe3baec076	fix(conductor): align file formats with Claude Code plugin conventions - Remove YAML frontmatter from command files (commands don't use frontmatter) - Remove non-standard fields (color, tools) from conductor-validator agent - Simplify agent description to single line format	2026-01-15 20:42:46 -05:00
Seth Hobson	1408671cb7	fix(conductor): move plugin to plugins/ directory for proper discovery Conductor plugin was at root level instead of plugins/ directory, causing slash commands to not be recognized by Claude Code.	2026-01-15 20:34:57 -05:00
Seth Hobson	f662524f9a	feat: add Conductor plugin for Context-Driven Development Add comprehensive Conductor plugin implementing Context-Driven Development methodology with tracks, specs, and phased implementation plans. Components: - 5 commands: setup, new-track, implement, status, revert - 1 agent: conductor-validator - 3 skills: context-driven-development, track-management, workflow-patterns - 18 templates for project artifacts Documentation updates: - README.md: Updated counts (68 plugins, 100 agents, 110 skills, 76 tools) - docs/plugins.md: Added Conductor to Workflows section - docs/agents.md: Added conductor-validator agent - docs/agent-skills.md: Added Conductor skills section Also includes Prettier formatting across all project files.	2026-01-15 17:38:21 -05:00
Seth Hobson	87231b828d	feat: add startup-business-analyst plugin Comprehensive startup analysis plugin with market sizing, financial modeling, team planning, and strategic research for early-stage companies. - 5 skills: market sizing, financial modeling, team planning, competitive analysis, metrics - 3 commands: market-opportunity, financial-projections, business-case - 1 agent: startup-analyst - Covers TAM/SAM/SOM, unit economics, competitive landscape, hiring plans	2026-01-13 20:25:25 -05:00
Dávid Balatoni	2d769d4f84	feat: add reverse-engineering plugin (#409 ) * feat(reverse-engineering): add firmware-analyst agent * feat(reverse-engineering): add binary-analysis-patterns skill * feat(reverse-engineering): add memory-forensics skill * feat(reverse-engineering): add protocol-reverse-engineering skill * feat(reverse-engineering): add anti-reversing-techniques skill * feat(reverse-engineering): register plugin in marketplace * docs(reverse-engineering): update to binwalk v3 syntax and references * fix(reverse-engineering): correct author URL to balcsida * docs(reverse-engineering): add authorization warning to anti-reversing skill * fix(reverse-engineering): correct author name	2026-01-09 10:41:06 -05:00
Rafael Martínez – Dev & IA	c81daa055d	feat: add .NET backend development plugin (#157 ) Co-authored-by: Martineto21 <ramac21@gmail.com>	2025-12-30 16:40:12 -05:00
google-labs-jules[bot]	12f3ff4555	🛡️ Sentinel: [Security Enhancement] Add security middleware to API template (#154 ) * feat: add security middleware to REST API template Adds `TrustedHostMiddleware` and `CORSMiddleware` to the FastAPI template to ensure basic security protections are in place. Includes comments guiding users on how to configure these for production. - Added TrustedHostMiddleware for Host header validation - Added CORSMiddleware for Cross-Origin Resource Sharing - Added TODOs for production configuration * feat: add security middleware to REST API template Adds `TrustedHostMiddleware` and `CORSMiddleware` to the FastAPI template to ensure basic security protections are in place. Includes comments guiding users on how to configure these for production. - Added TrustedHostMiddleware for Host header validation - Added CORSMiddleware for Cross-Origin Resource Sharing - Configured safe defaults (allow_credentials=False) for the template - Added TODOs for production configuration * feat: secure API template and fix Pydantic deprecations Enhances `rest-api-template.py` with standard security middleware and updates Pydantic usage to V2 standards. - Added `TrustedHostMiddleware` and `CORSMiddleware` with safe defaults - Updated Pydantic models to use `ConfigDict` and `model_dump()` to resolve deprecation warnings - Documented security learnings in sentinel journal --------- Co-authored-by: google-labs-jules[bot] <161369871+google-labs-jules[bot]@users.noreply.github.com>	2025-12-22 09:51:51 -05:00
google-labs-jules[bot]	a86384334b	⚡ Bolt: optimize prompt evaluation loop to skip redundant calls (#152 ) - Avoid re-evaluating the current prompt if metrics are already available from the previous iteration. - Pass metrics from the best variation to the next iteration. - Reduces N-1 expensive LLM calls in an N-iteration optimization loop. Co-authored-by: google-labs-jules[bot] <161369871+google-labs-jules[bot]@users.noreply.github.com>	2025-12-21 19:02:37 -05:00
google-labs-jules[bot]	fda45604b7	⚡ Bolt: Optimize PromptOptimizer thread pool usage (#147 ) * ⚡ Bolt: Reuse ThreadPoolExecutor in PromptOptimizer 💡 What: Initialized `ThreadPoolExecutor` in `PromptOptimizer.__init__` and reused it in `evaluate_prompt`. 🎯 Why: The previous implementation created a new `ThreadPoolExecutor` for every call to `evaluate_prompt`. Since `evaluate_prompt` is called repeatedly inside the `optimize` loop (and for every variation), this caused significant overhead from repeatedly creating and destroying thread pools. 📊 Impact: Benchmark showed a reduction in execution time from ~5.36s to ~3.76s (~30% improvement) for 500 iterations with a mocked LLM. 🔬 Measurement: Ran a benchmark script executing `evaluate_prompt` 500 times. Before: 5.36s After: 3.76s * ⚡ Bolt: Reuse ThreadPoolExecutor in PromptOptimizer 💡 What: Initialized `ThreadPoolExecutor` in `PromptOptimizer.__init__` and reused it in `evaluate_prompt`. Added a `shutdown` method for proper cleanup. 🎯 Why: The previous implementation created a new `ThreadPoolExecutor` for every call to `evaluate_prompt`. Since `evaluate_prompt` is called repeatedly inside the `optimize` loop (and for every variation), this caused significant overhead from repeatedly creating and destroying thread pools. 📊 Impact: Benchmark showed a reduction in execution time from ~5.36s to ~3.76s (~30% improvement) for 500 iterations with a mocked LLM. 🔬 Measurement: Ran a benchmark script executing `evaluate_prompt` 500 times. Before: 5.36s After: 3.76s * ⚡ Bolt: Reuse ThreadPoolExecutor in PromptOptimizer 💡 What: Initialized `ThreadPoolExecutor` in `PromptOptimizer.__init__` and reused it in `evaluate_prompt`. Added a `shutdown` method and wrapped execution in `try...finally` for proper resource management. 🎯 Why: The previous implementation created a new `ThreadPoolExecutor` for every call to `evaluate_prompt`. Since `evaluate_prompt` is called repeatedly inside the `optimize` loop (and for every variation), this caused significant overhead from repeatedly creating and destroying thread pools. 📊 Impact: Benchmark showed a reduction in execution time from ~5.36s to ~3.76s (~30% improvement) for 500 iterations with a mocked LLM. 🔬 Measurement: Ran a benchmark script executing `evaluate_prompt` 500 times. Before: 5.36s After: 3.76s --------- Co-authored-by: google-labs-jules[bot] <161369871+google-labs-jules[bot]@users.noreply.github.com>	2025-12-20 21:28:39 -05:00
google-labs-jules[bot]	70cf3f3682	⚡ Bolt: Parallelize Prompt Evaluation in optimize-prompt.py (#145 ) * feat: Parallelize prompt evaluation in optimize-prompt.py - Update `PromptOptimizer.evaluate_prompt` to use `ThreadPoolExecutor` for concurrent test case processing - Significantly reduces total execution time when using high-latency LLM clients (network IO bound) - Maintain accurate metric aggregation (latency, accuracy, token count) from parallel results - This prepares the script for real-world usage where sequential execution is a major bottleneck ⚡ Bolt: Reduces total evaluation time from O(n) to O(1) latency-wise (bounded by max_workers) for concurrent requests. * feat: Parallelize prompt evaluation in optimize-prompt.py - Update `PromptOptimizer.evaluate_prompt` to use `ThreadPoolExecutor` for concurrent test case processing - Significantly reduces total execution time when using high-latency LLM clients (network IO bound) - Maintain accurate metric aggregation (latency, accuracy, token count) from parallel results - Ensure no generated artifacts (`optimization_results.json`) are committed ⚡ Bolt: Reduces total evaluation time from O(n) to O(1) latency-wise (bounded by max_workers) for concurrent requests. --------- Co-authored-by: google-labs-jules[bot] <161369871+google-labs-jules[bot]@users.noreply.github.com>	2025-12-19 09:12:15 -05:00
Seth Hobson	01d93fc227	feat: add 5 new specialized agents with 20 skills Add domain expert agents with comprehensive skill sets: - service-mesh-expert (cloud-infrastructure): Istio/Linkerd patterns, mTLS, observability - event-sourcing-architect (backend-development): CQRS, event stores, projections, sagas - vector-database-engineer (llm-application-dev): embeddings, similarity search, hybrid search - monorepo-architect (developer-essentials): Nx, Turborepo, Bazel, pnpm workspaces - threat-modeling-expert (security-scanning): STRIDE, attack trees, security requirements Update all documentation to reflect correct counts: - 67 plugins, 99 agents, 107 skills, 71 commands	2025-12-16 16:00:58 -05:00
Seth Hobson	c7ad381360	feat: implement three-tier model strategy with Opus 4.5 (#139 ) * feat: implement three-tier model strategy with Opus 4.5 This implements a strategic model selection approach based on agent complexity and use case, addressing Issue #136. Three-Tier Strategy: - Tier 1 (opus): 17 critical agents for architecture, security, code review - Tier 2 (inherit): 21 complex agents where users choose their model - Tier 3 (sonnet): 63 routine development agents (unchanged) - Tier 4 (haiku): 47 fast operational agents (unchanged) Why Opus 4.5 for Tier 1: - 80.9% on SWE-bench (industry-leading for code) - 65% fewer tokens for long-horizon tasks - Superior reasoning for architectural decisions Changes: - Update architect-review, cloud-architect, kubernetes-architect, database-architect, security-auditor, code-reviewer to opus - Update backend-architect, performance-engineer, ai-engineer, prompt-engineer, ml-engineer, mlops-engineer, data-scientist, blockchain-developer, quant-analyst, risk-manager, sql-pro, database-optimizer to inherit - Update README with three-tier model documentation Relates to #136 * feat: comprehensive model tier redistribution for Opus 4.5 This commit implements a strategic rebalancing of agent model assignments, significantly increasing the use of Opus 4.5 for critical coding tasks while ensuring Sonnet is used more than Haiku for support tasks. Final Distribution (153 total agent files): - Tier 1 Opus: 42 agents (27.5%) - All production coding + critical architecture - Tier 2 Inherit: 42 agents (27.5%) - Complex tasks, user-choosable - Tier 3 Sonnet: 38 agents (24.8%) - Support tasks needing intelligence - Tier 4 Haiku: 31 agents (20.3%) - Simple operational tasks Key Changes: Tier 1 (Opus) - Production Coding + Critical Review: - ALL code-reviewers (6 total): Ensures highest quality code review across all contexts (comprehensive, git PR, code docs, codebase cleanup, refactoring, TDD) - All major language pros (7): python, golang, rust, typescript, cpp, java, c - Framework specialists (6): django (2), fastapi (2), graphql-architect (2) - Complex specialists (6): terraform-specialist (3), tdd-orchestrator (2), data-engineer - Blockchain: blockchain-developer (smart contracts are critical) - Game dev (2): unity-developer, minecraft-bukkit-pro - Architecture (existing): architect-review, cloud-architect, kubernetes-architect, hybrid-cloud-architect, database-architect, security-auditor Tier 2 (Inherit) - User Flexibility: - Secondary languages (6): javascript, scala, csharp, ruby, php, elixir - All frontend/mobile (8): frontend-developer (4), mobile-developer (2), flutter-expert, ios-developer - Specialized (6): observability-engineer (2), temporal-python-pro, arm-cortex-expert, context-manager (2), database-optimizer (2) - AI/ML, backend-architect, performance-engineer, quant/risk (existing) Tier 3 (Sonnet) - Intelligent Support: - Documentation (4): docs-architect (2), tutorial-engineer (2) - Testing (2): test-automator (2) - Developer experience (3): dx-optimizer (2), business-analyst - Modernization (4): legacy-modernizer (3), database-admin - Other support agents (existing) Tier 4 (Haiku) - Simple Operations: - SEO/Marketing (10): All SEO agents, content, search - Deployment (4): deployment-engineer (4 instances) - Debugging (5): debugger (2), error-detective (3) - DevOps (3): devops-troubleshooter (3) - Other simple operational tasks Rationale: - Opus 4.5 achieves 80.9% on SWE-bench with 65% fewer tokens on complex tasks - Production code deserves the best model: all language pros now on Opus - All code review uses Opus for maximum quality and security - Sonnet > Haiku (38 vs 31) ensures better intelligence for support tasks - Inherit tier gives users cost control for frontend, mobile, and specialized tasks Related: #136, #132 * feat: upgrade final 13 agents from Haiku to Sonnet Based on research into Haiku 4.5 vs Sonnet 4.5 capabilities, upgraded agents requiring deep analytical intelligence from Haiku to Sonnet. Research Findings: - Haiku 4.5: 73.3% SWE-bench, 3-5x faster, 1/3 cost, sub-200ms responses - Best for Haiku: Real-time apps, data extraction, templates, high-volume ops - Best for Sonnet: Complex reasoning, root cause analysis, strategic planning Agents Upgraded (13 total): - Debugging (5): debugger (2), error-detective (3) - Complex root cause analysis - DevOps (3): devops-troubleshooter (3) - System diagnostics & troubleshooting - Network (2): network-engineer (2) - Complex network analysis & optimization - API Documentation (2): api-documenter (2) - Deep API understanding required - Payments (1): payment-integration - Critical financial integration Final Distribution (153 total): - Tier 1 Opus: 42 agents (27.5%) - Production coding + critical architecture - Tier 2 Inherit: 42 agents (27.5%) - Complex tasks, user-choosable - Tier 3 Sonnet: 51 agents (33.3%) - Support tasks needing intelligence - Tier 4 Haiku: 18 agents (11.8%) - Fast operational tasks only Haiku Now Reserved For: - SEO/Marketing (8): Pattern matching, data extraction, content templates - Deployment (4): Operational execution tasks - Simple Docs (3): reference-builder, mermaid-expert, c4-code - Sales/Support (2): High-volume, template-based interactions - Search (1): Knowledge retrieval Sonnet > Haiku as requested (51 vs 18) Sources: - https://www.creolestudios.com/claude-haiku-4-5-vs-sonnet-4-5-comparison/ - https://www.anthropic.com/news/claude-haiku-4-5 - https://caylent.com/blog/claude-haiku-4-5-deep-dive-cost-capabilities-and-the-multi-agent-opportunity Related: #136 * docs: add cost considerations and clarify inherit behavior Addresses PR feedback: - Added comprehensive cost comparison for all model tiers - Documented how 'inherit' model works (uses session default, falls back to Sonnet) - Explained cost optimization strategies - Clarified when Opus token efficiency offsets higher rate This helps users make informed decisions about model selection and cost control.	2025-12-10 15:52:06 -05:00
Mike Kazmier	16cddabb75	add c4 documentation workflow and agents (#129 ) * add c4 documentation workflow and agents * update the c4-code agent to use proper mermaid diagram types	2025-12-10 14:53:11 -05:00
Joe Previte	c660e2454c	docs(agents): add haskell-pro (#128 ) * docs(agents): add haskell-pro * fixup * Move haskell-pro agent to functional-programming plugin - Moved plugins/haskell-development/agents/haskell-pro.md to plugins/functional-programming/agents/haskell-pro.md - Updated path reference in docs/agents.md This addresses review feedback to place the Haskell agent in the existing functional-programming plugin alongside elixir-pro, rather than creating a new haskell-development plugin. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-12-10 14:51:03 -05:00
Kiri	ddbd034ca3	feat: add Temporal workflow orchestration to backend-development plugin (#125 ) * docs: enhance payment-integration agent with critical security guidance Add evidence-based security requirements from Stripe, PayPal, OWASP: - Webhook security (signature verification, idempotency, quick response, server validation) - PCI compliance essentials (tokenization, server-side validation, environment separation) - Real-world failure examples (processor collapse, Lambda failures, malicious price manipulation) Minimal expansion: 32 to 57 lines (25 lines added) * feat: add Temporal workflow orchestration to backend-development plugin Add comprehensive Temporal workflow orchestration support with 1 agent and 2 skills: Agent: - temporal-python-pro: Python SDK expert for durable workflows, saga patterns, async/await patterns, error handling, and production deployment Skills: - workflow-orchestration-patterns: Language-agnostic patterns for workflows vs activities, saga compensation, entity workflows, and determinism constraints - temporal-python-testing: Progressive disclosure testing guide with unit testing, integration testing, replay testing, and local development setup Changes: - Add agent: plugins/backend-development/agents/temporal-python-pro.md (311 lines) - Add skill: plugins/backend-development/skills/workflow-orchestration-patterns/ (286 lines) - Add skill: plugins/backend-development/skills/temporal-python-testing/ (SKILL.md + 4 resource files) - Update marketplace.json: backend-development plugin v1.2.2 → v1.2.3 - Update docs/agents.md: 85 → 86 agents - Update docs/agent-skills.md: 55 → 57 skills Content Sources: - Official Temporal documentation (docs.temporal.io) - Temporal Python SDK guide (python.temporal.io) - Temporal architecture docs (github.com/temporalio/temporal) - OWASP best practices for distributed systems Addresses #124 --------- Co-authored-by: Kiran Eshwarappa <kiran.eshwarapa@gmail.com>	2025-11-16 20:45:36 -05:00

1 2

62 Commits