Commit Graph

62 Commits

Author SHA1 Message Date
Seth Hobson
94d1aba17a Add modernized Payment Intents pattern with Payment Element
- Restore Payment Intents flow removed by PR, updated for modern best practices
- Use Payment Element instead of legacy Card Element
- Use stripe.confirmPayment() instead of deprecated confirmCardPayment()
- Use automatic_payment_methods instead of hardcoded payment_method_types
- Split Python/JS into separate fenced code blocks for clarity
- Add guidance on when to use Payment Intents vs Checkout Sessions
- Renumber subsequent patterns (Subscription → 4, Customer Portal → 5)
2026-02-19 13:45:55 -05:00
Seth Hobson
204e8129aa Polish Stripe best practices examples for consistency
- Remove payment_method_types=['card'] from Quick Start (dynamic payment methods)
- Remove unused appearance variable from Pattern 2 JS example
- Fix actions access pattern: destructure before use for consistency
- Add inline comments clarifying sync/async distinction and amount format
- Add ui_mode='embedded' to Embedded checkout bullet for completeness
- Replace payment_method_types with automatic_payment_methods in test example
2026-02-19 13:42:36 -05:00
Sawyer
2b8e3166a1 Update to latest Stripe best practices 2026-02-18 20:38:50 -08:00
Seth Hobson
089740f185 chore: bump marketplace to v1.5.1 and sync plugin versions
Sync marketplace.json versions with plugin.json for all 14 touched
plugins. Fix plugin.json versions for llm-application-dev (2.0.3),
startup-business-analyst (1.0.4), and ui-design (1.0.2) to match
marketplace lineage. Add dotnet-contribution to marketplace.
2026-02-06 19:36:28 -05:00
Seth Hobson
4d504ed8fa fix: eliminate cross-plugin dependencies and modernize plugin.json across marketplace
Rewrites 14 commands across 11 plugins to remove all cross-plugin
subagent_type references (e.g., "unit-testing::test-automator"), which
break when plugins are installed standalone. Each command now uses only
local bundled agents or general-purpose with role context in the prompt.

All rewritten commands follow conductor-style patterns:
- CRITICAL BEHAVIORAL RULES with strong directives
- State files for session tracking and resume support
- Phase checkpoints requiring explicit user approval
- File-based context passing between steps

Also fixes 4 plugin.json files missing version/license fields and adds
plugin.json for dotnet-contribution.

Closes #433
2026-02-06 19:34:26 -05:00
Seth Hobson
4820385a31 chore: modernize all plugins to new format with per-plugin plugin.json
Add .claude-plugin/plugin.json to all 67 remaining plugins and simplify
marketplace.json entries by removing redundant fields (keywords, strict,
commands, agents, skills, repository) that are now auto-discovered.
Bump marketplace version to 1.5.0.
2026-02-05 22:02:17 -05:00
Seth Hobson
a5ab5d8f31 chore(agent-teams): bump to v1.0.2 2026-02-05 17:42:30 -05:00
Seth Hobson
598ea85e7f fix(agent-teams): simplify plugin.json and marketplace entry to match conductor patterns
Strip plugin.json to minimal fields (name, version, description, author, license).
Remove commands/agents/skills arrays, keywords, repository, and strict from marketplace entry.
2026-02-05 17:41:00 -05:00
Seth Hobson
fb9eba62b2 fix(agent-teams): remove Context7 MCP dependency, align frontmatter with conductor patterns, bump to v1.0.1
Remove .mcp.json to eliminate external MCP dependency that likely caused plugin load failure.
Add tools: field to all agents, version: field to all skills, matching conductor plugin patterns.
2026-02-05 17:30:35 -05:00
Seth Hobson
b187ce780d docs(agent-teams): use official /plugin install command instead of --plugin-dir 2026-02-05 17:16:29 -05:00
Seth Hobson
1f46cab1f6 docs(agent-teams): add link to official Anthropic Agent Teams docs 2026-02-05 17:14:55 -05:00
Seth Hobson
0752775afc feat(agent-teams): add plugin for multi-agent team orchestration
New plugin with 7 presets (review, debug, feature, fullstack, research,
security, migration), 4 specialized agents, 7 slash commands, 6 skills
with reference docs, and Context7 MCP integration for research teams.
2026-02-05 17:10:02 -05:00
Ruyut
918a770990 fix: add missing ')' in winston File transport (#426) 2026-02-01 21:06:12 -05:00
Song Luar
194a267494 Update npx packages referenced in markdown files (#425)
* use correct npx package names in md files

* fix: update remaining non-existent npm package references

- Replace react-codemod with jscodeshift in deps-upgrade.md
- Remove non-existent changelog-parser reference

---------

Co-authored-by: Seth Hobson <wshobson@gmail.com>
2026-02-01 21:04:21 -05:00
kenzo
3ed95e608a feat(tailwind-design-system): update skill for Tailwind CSS v4 (#427)
* feat(tailwind-design-system): update skill for Tailwind CSS v4

Major updates:
- CSS-first configuration with @theme blocks
- @custom-variant for dark mode (not @variant)
- @keyframes must be inside @theme for tree-shaking
- React 19 ref-as-prop patterns (no forwardRef)
- OKLCH colors for better perceptual uniformity
- Native CSS animations (@starting-style, transition-behavior)
- New @utility directive for custom utilities
- @theme inline/static modifiers
- Namespace overrides (--color-*: initial)
- Semi-transparent variants with color-mix()
- Container query tokens

Breaking changes from v3:
- tailwind.config.ts → CSS @theme
- @tailwind directives → @import 'tailwindcss'
- darkMode: 'class' → @custom-variant dark

* fix: address review feedback for tailwind v4 skill

- Add missing semicolon to @custom-variant declaration
- Add missing Slot import from @radix-ui/react-slot
- Add missing DialogPortal declaration
- Add --color-ring-offset to theme for focus states
- Fix misleading comment about @keyframes tree-shaking
- Update comparison table for tailwindcss-animate replacement
- Use standard zod import path (not transitional zod/v4)
- Update upgrade guide link to stable URL
- Format with Prettier

---------

Co-authored-by: Seth Hobson <wshobson@gmail.com>
2026-02-01 20:40:22 -05:00
M. A.
cbb60494b1 Add Comprehensive Python Development Skills (#419)
* Add extra python skills covering code style, design patterns, resilience, resource management, testing patterns, and type safety ...etc

* fix: correct code examples in Python skills

- Clarify Python version requirements for type statement (3.10+ vs 3.12+)
- Add missing ValidationError import in configuration example
- Add missing httpx import and url parameter in async example

---------

Co-authored-by: Seth Hobson <wshobson@gmail.com>
2026-01-30 11:52:14 -05:00
Daniel
f9e9598241 Revise event sourcing architect metadata and description (#417)
Add header with the event sourcing architect's description and name format.
2026-01-30 11:34:59 -05:00
Seth Hobson
1135ac6062 docs: update installation commands for llm-application-dev and conductor 2026-01-19 17:08:27 -05:00
Seth Hobson
56848874a2 style: format all files with prettier 2026-01-19 17:07:03 -05:00
Seth Hobson
027ed046a3 feat(ui-design): modernize to auto-discovery pattern v1.0.1 2026-01-19 16:59:34 -05:00
Seth Hobson
1b9d881d11 fix(llm-application-dev): use auto-discovery pattern like conductor v2.0.2 2026-01-19 16:55:01 -05:00
Seth Hobson
16f8e8c66e fix(llm-application-dev): add command frontmatter for slash command registration v2.0.1 2026-01-19 16:26:41 -05:00
Seth Hobson
1e54d186fe feat(ui-design): add comprehensive UI/UX design plugin v1.0.0
New plugin covering mobile (iOS, Android, React Native) and web
applications with modern design patterns, accessibility, and design systems.

Components:
- 9 skills: design-system-patterns, accessibility-compliance, responsive-design,
  mobile-ios-design, mobile-android-design, react-native-design,
  web-component-design, interaction-design, visual-design-foundations
- 4 commands: design-review, create-component, accessibility-audit, design-system-setup
- 3 agents: ui-designer, accessibility-expert, design-system-architect

Marketplace updated:
- Version bumped to 1.3.4
- 102 agents (+3), 116 skills (+9)
2026-01-19 16:22:13 -05:00
Seth Hobson
8be0e8ac7a feat(llm-application-dev): modernize to LangGraph and latest models v2.0.0
- Migrate from LangChain 0.x to LangChain 1.x/LangGraph patterns
- Update model references to Claude 4.5 and GPT-5.2
- Add Voyage AI as primary embedding recommendation
- Add structured outputs with Pydantic
- Replace deprecated initialize_agent() with StateGraph
- Fix security: use AST-based safe math instead of unsafe execution
- Add plugin.json and README.md for consistency
- Bump marketplace version to 1.3.3
2026-01-19 15:43:25 -05:00
Seth Hobson
e827cc713a feat(conductor): remove tool restrictions from slash commands
Remove allowed-tools field from all conductor commands to enable
full tool access for implementation-focused workflows.

- implement.md: removed tool restrictions
- manage.md: removed tool restrictions
- new-track.md: removed tool restrictions
- revert.md: removed tool restrictions
- setup.md: removed tool restrictions
- status.md: removed tool restrictions

Bump conductor version to 1.2.0
2026-01-17 11:19:12 -05:00
Seth Hobson
e5255782cd feat(conductor): add track manager command for lifecycle management
Add /conductor:manage command with comprehensive track lifecycle operations:
- Archive completed tracks with reason tracking
- Restore archived tracks to active state
- Delete tracks permanently with safety confirmations
- Rename track IDs with full reference updates
- Cleanup orphaned artifacts and stale tracks
- Interactive menu when invoked without arguments

Also includes:
- Add Archived Tracks section to tracks.md template
- Update README with manage command documentation
- Bump version to 1.1.0
- Format files with prettier
2026-01-16 12:02:24 -05:00
Seth Hobson
d750cf0e44 fix(conductor): remove invalid attribution field from plugin metadata
The attribution field is not part of the Claude Code plugin schema.
Attribution is properly documented in README.md instead.
2026-01-15 22:36:24 -05:00
Seth Hobson
627dc5cdd7 chore(conductor): add license and attribution to plugin metadata
- Add Apache-2.0 license to marketplace.json and plugin.json
- Add attribution field referencing original Google project
- Bump version to 1.0.7
2026-01-15 22:35:17 -05:00
Seth Hobson
3fc313393d chore(conductor): add attribution to original Google project
- Add acknowledgments section crediting gemini-cli-extensions/conductor
- Credit @wshobson for Claude Code adaptation
- Update license from MIT to Apache-2.0 to match original
2026-01-15 22:33:09 -05:00
Seth Hobson
58f4038326 fix(startup-business-analyst): add allowed-tools to commands and simplify marketplace entry
Add allowed-tools field to command frontmatter and simplify marketplace.json
entry to rely on auto-discovery instead of explicit arrays.
2026-01-15 21:33:44 -05:00
Seth Hobson
3e673da18e fix(conductor): add allowed-tools to commands and simplify marketplace entry
Based on research of official plugins:
- Add allowed-tools array to all 5 commands (required field in working plugins)
- Simplify marketplace.json entry to match official format (minimal fields, auto-discovery)
- Remove explicit commands/agents/skills arrays (rely on auto-discovery like official plugins)

Version: 1.0.6
2026-01-15 21:27:59 -05:00
Seth Hobson
3d0a40dd58 fix(plugins): align with official Claude Code plugin conventions
- Add minimal .claude-plugin/plugin.json to conductor and startup-business-analyst
  (matches official format: name, description, author only)
- Remove .gitignore from startup-business-analyst (not in official plugins)
- Bump versions: conductor 1.0.5, startup-business-analyst 1.0.2

Plugin structure now matches official examples (feature-dev, ralph-loop, etc.)
2026-01-15 21:01:45 -05:00
Seth Hobson
29d59eb5cf fix(plugins): add proper frontmatter to commands and agents
Commands now have correct frontmatter format:
- description (required)
- argument-hint (optional)

Agents now have correct frontmatter format:
- name, description, tools, model, color

Updated plugins:
- conductor v1.0.4: fixed all 5 commands and 1 agent
- startup-business-analyst v1.0.1: fixed all 3 commands
2026-01-15 20:58:35 -05:00
Seth Hobson
87ed65d2b5 fix(plugins): remove redundant plugin.json and fix command formats
- Remove .claude-plugin/plugin.json from conductor and startup-business-analyst
  (marketplace.json provides all config, standalone plugin.json breaks installation)
- Remove YAML frontmatter from startup-business-analyst commands
- Bump versions: conductor 1.0.3, startup-business-analyst 1.0.1
2026-01-15 20:55:21 -05:00
Seth Hobson
3a6a24e1a6 chore(conductor): bump version to 1.0.2 2026-01-15 20:44:10 -05:00
Seth Hobson
fe3baec076 fix(conductor): align file formats with Claude Code plugin conventions
- Remove YAML frontmatter from command files (commands don't use frontmatter)
- Remove non-standard fields (color, tools) from conductor-validator agent
- Simplify agent description to single line format
2026-01-15 20:42:46 -05:00
Seth Hobson
1408671cb7 fix(conductor): move plugin to plugins/ directory for proper discovery
Conductor plugin was at root level instead of plugins/ directory,
causing slash commands to not be recognized by Claude Code.
2026-01-15 20:34:57 -05:00
Seth Hobson
f662524f9a feat: add Conductor plugin for Context-Driven Development
Add comprehensive Conductor plugin implementing Context-Driven Development
methodology with tracks, specs, and phased implementation plans.

Components:
- 5 commands: setup, new-track, implement, status, revert
- 1 agent: conductor-validator
- 3 skills: context-driven-development, track-management, workflow-patterns
- 18 templates for project artifacts

Documentation updates:
- README.md: Updated counts (68 plugins, 100 agents, 110 skills, 76 tools)
- docs/plugins.md: Added Conductor to Workflows section
- docs/agents.md: Added conductor-validator agent
- docs/agent-skills.md: Added Conductor skills section

Also includes Prettier formatting across all project files.
2026-01-15 17:38:21 -05:00
Seth Hobson
87231b828d feat: add startup-business-analyst plugin
Comprehensive startup analysis plugin with market sizing, financial modeling, team planning, and strategic research for early-stage companies.

- 5 skills: market sizing, financial modeling, team planning, competitive analysis, metrics
- 3 commands: market-opportunity, financial-projections, business-case
- 1 agent: startup-analyst
- Covers TAM/SAM/SOM, unit economics, competitive landscape, hiring plans
2026-01-13 20:25:25 -05:00
Dávid Balatoni
2d769d4f84 feat: add reverse-engineering plugin (#409)
* feat(reverse-engineering): add firmware-analyst agent

* feat(reverse-engineering): add binary-analysis-patterns skill

* feat(reverse-engineering): add memory-forensics skill

* feat(reverse-engineering): add protocol-reverse-engineering skill

* feat(reverse-engineering): add anti-reversing-techniques skill

* feat(reverse-engineering): register plugin in marketplace

* docs(reverse-engineering): update to binwalk v3 syntax and references

* fix(reverse-engineering): correct author URL to balcsida

* docs(reverse-engineering): add authorization warning to anti-reversing skill

* fix(reverse-engineering): correct author name
2026-01-09 10:41:06 -05:00
Rafael Martínez – Dev & IA
c81daa055d feat: add .NET backend development plugin (#157)
Co-authored-by: Martineto21 <ramac21@gmail.com>
2025-12-30 16:40:12 -05:00
google-labs-jules[bot]
12f3ff4555 🛡️ Sentinel: [Security Enhancement] Add security middleware to API template (#154)
* feat: add security middleware to REST API template

Adds `TrustedHostMiddleware` and `CORSMiddleware` to the FastAPI template to ensure basic security protections are in place. Includes comments guiding users on how to configure these for production.

- Added TrustedHostMiddleware for Host header validation
- Added CORSMiddleware for Cross-Origin Resource Sharing
- Added TODOs for production configuration

* feat: add security middleware to REST API template

Adds `TrustedHostMiddleware` and `CORSMiddleware` to the FastAPI template to ensure basic security protections are in place. Includes comments guiding users on how to configure these for production.

- Added TrustedHostMiddleware for Host header validation
- Added CORSMiddleware for Cross-Origin Resource Sharing
- Configured safe defaults (allow_credentials=False) for the template
- Added TODOs for production configuration

* feat: secure API template and fix Pydantic deprecations

Enhances `rest-api-template.py` with standard security middleware and updates Pydantic usage to V2 standards.

- Added `TrustedHostMiddleware` and `CORSMiddleware` with safe defaults
- Updated Pydantic models to use `ConfigDict` and `model_dump()` to resolve deprecation warnings
- Documented security learnings in sentinel journal

---------

Co-authored-by: google-labs-jules[bot] <161369871+google-labs-jules[bot]@users.noreply.github.com>
2025-12-22 09:51:51 -05:00
google-labs-jules[bot]
a86384334b Bolt: optimize prompt evaluation loop to skip redundant calls (#152)
- Avoid re-evaluating the current prompt if metrics are already available from the previous iteration.
- Pass metrics from the best variation to the next iteration.
- Reduces N-1 expensive LLM calls in an N-iteration optimization loop.

Co-authored-by: google-labs-jules[bot] <161369871+google-labs-jules[bot]@users.noreply.github.com>
2025-12-21 19:02:37 -05:00
google-labs-jules[bot]
fda45604b7 Bolt: Optimize PromptOptimizer thread pool usage (#147)
*  Bolt: Reuse ThreadPoolExecutor in PromptOptimizer

💡 What:
Initialized `ThreadPoolExecutor` in `PromptOptimizer.__init__` and reused it in `evaluate_prompt`.

🎯 Why:
The previous implementation created a new `ThreadPoolExecutor` for every call to `evaluate_prompt`. Since `evaluate_prompt` is called repeatedly inside the `optimize` loop (and for every variation), this caused significant overhead from repeatedly creating and destroying thread pools.

📊 Impact:
Benchmark showed a reduction in execution time from ~5.36s to ~3.76s (~30% improvement) for 500 iterations with a mocked LLM.

🔬 Measurement:
Ran a benchmark script executing `evaluate_prompt` 500 times.
Before: 5.36s
After: 3.76s

*  Bolt: Reuse ThreadPoolExecutor in PromptOptimizer

💡 What:
Initialized `ThreadPoolExecutor` in `PromptOptimizer.__init__` and reused it in `evaluate_prompt`. Added a `shutdown` method for proper cleanup.

🎯 Why:
The previous implementation created a new `ThreadPoolExecutor` for every call to `evaluate_prompt`. Since `evaluate_prompt` is called repeatedly inside the `optimize` loop (and for every variation), this caused significant overhead from repeatedly creating and destroying thread pools.

📊 Impact:
Benchmark showed a reduction in execution time from ~5.36s to ~3.76s (~30% improvement) for 500 iterations with a mocked LLM.

🔬 Measurement:
Ran a benchmark script executing `evaluate_prompt` 500 times.
Before: 5.36s
After: 3.76s

*  Bolt: Reuse ThreadPoolExecutor in PromptOptimizer

💡 What:
Initialized `ThreadPoolExecutor` in `PromptOptimizer.__init__` and reused it in `evaluate_prompt`. Added a `shutdown` method and wrapped execution in `try...finally` for proper resource management.

🎯 Why:
The previous implementation created a new `ThreadPoolExecutor` for every call to `evaluate_prompt`. Since `evaluate_prompt` is called repeatedly inside the `optimize` loop (and for every variation), this caused significant overhead from repeatedly creating and destroying thread pools.

📊 Impact:
Benchmark showed a reduction in execution time from ~5.36s to ~3.76s (~30% improvement) for 500 iterations with a mocked LLM.

🔬 Measurement:
Ran a benchmark script executing `evaluate_prompt` 500 times.
Before: 5.36s
After: 3.76s

---------

Co-authored-by: google-labs-jules[bot] <161369871+google-labs-jules[bot]@users.noreply.github.com>
2025-12-20 21:28:39 -05:00
google-labs-jules[bot]
70cf3f3682 Bolt: Parallelize Prompt Evaluation in optimize-prompt.py (#145)
* feat: Parallelize prompt evaluation in optimize-prompt.py

- Update `PromptOptimizer.evaluate_prompt` to use `ThreadPoolExecutor` for concurrent test case processing
- Significantly reduces total execution time when using high-latency LLM clients (network IO bound)
- Maintain accurate metric aggregation (latency, accuracy, token count) from parallel results
- This prepares the script for real-world usage where sequential execution is a major bottleneck

 Bolt: Reduces total evaluation time from O(n) to O(1) latency-wise (bounded by max_workers) for concurrent requests.

* feat: Parallelize prompt evaluation in optimize-prompt.py

- Update `PromptOptimizer.evaluate_prompt` to use `ThreadPoolExecutor` for concurrent test case processing
- Significantly reduces total execution time when using high-latency LLM clients (network IO bound)
- Maintain accurate metric aggregation (latency, accuracy, token count) from parallel results
- Ensure no generated artifacts (`optimization_results.json`) are committed

 Bolt: Reduces total evaluation time from O(n) to O(1) latency-wise (bounded by max_workers) for concurrent requests.

---------

Co-authored-by: google-labs-jules[bot] <161369871+google-labs-jules[bot]@users.noreply.github.com>
2025-12-19 09:12:15 -05:00
Seth Hobson
01d93fc227 feat: add 5 new specialized agents with 20 skills
Add domain expert agents with comprehensive skill sets:
- service-mesh-expert (cloud-infrastructure): Istio/Linkerd patterns, mTLS, observability
- event-sourcing-architect (backend-development): CQRS, event stores, projections, sagas
- vector-database-engineer (llm-application-dev): embeddings, similarity search, hybrid search
- monorepo-architect (developer-essentials): Nx, Turborepo, Bazel, pnpm workspaces
- threat-modeling-expert (security-scanning): STRIDE, attack trees, security requirements

Update all documentation to reflect correct counts:
- 67 plugins, 99 agents, 107 skills, 71 commands
2025-12-16 16:00:58 -05:00
Seth Hobson
c7ad381360 feat: implement three-tier model strategy with Opus 4.5 (#139)
* feat: implement three-tier model strategy with Opus 4.5

This implements a strategic model selection approach based on agent
complexity and use case, addressing Issue #136.

Three-Tier Strategy:
- Tier 1 (opus): 17 critical agents for architecture, security, code review
- Tier 2 (inherit): 21 complex agents where users choose their model
- Tier 3 (sonnet): 63 routine development agents (unchanged)
- Tier 4 (haiku): 47 fast operational agents (unchanged)

Why Opus 4.5 for Tier 1:
- 80.9% on SWE-bench (industry-leading for code)
- 65% fewer tokens for long-horizon tasks
- Superior reasoning for architectural decisions

Changes:
- Update architect-review, cloud-architect, kubernetes-architect,
  database-architect, security-auditor, code-reviewer to opus
- Update backend-architect, performance-engineer, ai-engineer,
  prompt-engineer, ml-engineer, mlops-engineer, data-scientist,
  blockchain-developer, quant-analyst, risk-manager, sql-pro,
  database-optimizer to inherit
- Update README with three-tier model documentation

Relates to #136

* feat: comprehensive model tier redistribution for Opus 4.5

This commit implements a strategic rebalancing of agent model assignments,
significantly increasing the use of Opus 4.5 for critical coding tasks while
ensuring Sonnet is used more than Haiku for support tasks.

Final Distribution (153 total agent files):
- Tier 1 Opus: 42 agents (27.5%) - All production coding + critical architecture
- Tier 2 Inherit: 42 agents (27.5%) - Complex tasks, user-choosable
- Tier 3 Sonnet: 38 agents (24.8%) - Support tasks needing intelligence
- Tier 4 Haiku: 31 agents (20.3%) - Simple operational tasks

Key Changes:

Tier 1 (Opus) - Production Coding + Critical Review:
- ALL code-reviewers (6 total): Ensures highest quality code review across
  all contexts (comprehensive, git PR, code docs, codebase cleanup, refactoring, TDD)
- All major language pros (7): python, golang, rust, typescript, cpp, java, c
- Framework specialists (6): django (2), fastapi (2), graphql-architect (2)
- Complex specialists (6): terraform-specialist (3), tdd-orchestrator (2), data-engineer
- Blockchain: blockchain-developer (smart contracts are critical)
- Game dev (2): unity-developer, minecraft-bukkit-pro
- Architecture (existing): architect-review, cloud-architect, kubernetes-architect,
  hybrid-cloud-architect, database-architect, security-auditor

Tier 2 (Inherit) - User Flexibility:
- Secondary languages (6): javascript, scala, csharp, ruby, php, elixir
- All frontend/mobile (8): frontend-developer (4), mobile-developer (2),
  flutter-expert, ios-developer
- Specialized (6): observability-engineer (2), temporal-python-pro,
  arm-cortex-expert, context-manager (2), database-optimizer (2)
- AI/ML, backend-architect, performance-engineer, quant/risk (existing)

Tier 3 (Sonnet) - Intelligent Support:
- Documentation (4): docs-architect (2), tutorial-engineer (2)
- Testing (2): test-automator (2)
- Developer experience (3): dx-optimizer (2), business-analyst
- Modernization (4): legacy-modernizer (3), database-admin
- Other support agents (existing)

Tier 4 (Haiku) - Simple Operations:
- SEO/Marketing (10): All SEO agents, content, search
- Deployment (4): deployment-engineer (4 instances)
- Debugging (5): debugger (2), error-detective (3)
- DevOps (3): devops-troubleshooter (3)
- Other simple operational tasks

Rationale:
- Opus 4.5 achieves 80.9% on SWE-bench with 65% fewer tokens on complex tasks
- Production code deserves the best model: all language pros now on Opus
- All code review uses Opus for maximum quality and security
- Sonnet > Haiku (38 vs 31) ensures better intelligence for support tasks
- Inherit tier gives users cost control for frontend, mobile, and specialized tasks

Related: #136, #132

* feat: upgrade final 13 agents from Haiku to Sonnet

Based on research into Haiku 4.5 vs Sonnet 4.5 capabilities, upgraded
agents requiring deep analytical intelligence from Haiku to Sonnet.

Research Findings:
- Haiku 4.5: 73.3% SWE-bench, 3-5x faster, 1/3 cost, sub-200ms responses
- Best for Haiku: Real-time apps, data extraction, templates, high-volume ops
- Best for Sonnet: Complex reasoning, root cause analysis, strategic planning

Agents Upgraded (13 total):
- Debugging (5): debugger (2), error-detective (3) - Complex root cause analysis
- DevOps (3): devops-troubleshooter (3) - System diagnostics & troubleshooting
- Network (2): network-engineer (2) - Complex network analysis & optimization
- API Documentation (2): api-documenter (2) - Deep API understanding required
- Payments (1): payment-integration - Critical financial integration

Final Distribution (153 total):
- Tier 1 Opus: 42 agents (27.5%) - Production coding + critical architecture
- Tier 2 Inherit: 42 agents (27.5%) - Complex tasks, user-choosable
- Tier 3 Sonnet: 51 agents (33.3%) - Support tasks needing intelligence
- Tier 4 Haiku: 18 agents (11.8%) - Fast operational tasks only

Haiku Now Reserved For:
- SEO/Marketing (8): Pattern matching, data extraction, content templates
- Deployment (4): Operational execution tasks
- Simple Docs (3): reference-builder, mermaid-expert, c4-code
- Sales/Support (2): High-volume, template-based interactions
- Search (1): Knowledge retrieval

Sonnet > Haiku as requested (51 vs 18)

Sources:
- https://www.creolestudios.com/claude-haiku-4-5-vs-sonnet-4-5-comparison/
- https://www.anthropic.com/news/claude-haiku-4-5
- https://caylent.com/blog/claude-haiku-4-5-deep-dive-cost-capabilities-and-the-multi-agent-opportunity

Related: #136

* docs: add cost considerations and clarify inherit behavior

Addresses PR feedback:
- Added comprehensive cost comparison for all model tiers
- Documented how 'inherit' model works (uses session default, falls back to Sonnet)
- Explained cost optimization strategies
- Clarified when Opus token efficiency offsets higher rate

This helps users make informed decisions about model selection and cost control.
2025-12-10 15:52:06 -05:00
Mike Kazmier
16cddabb75 add c4 documentation workflow and agents (#129)
* add c4 documentation workflow and agents

* update the c4-code agent to use proper mermaid diagram types
2025-12-10 14:53:11 -05:00
Joe Previte
c660e2454c docs(agents): add haskell-pro (#128)
* docs(agents): add haskell-pro

* fixup

* Move haskell-pro agent to functional-programming plugin

- Moved plugins/haskell-development/agents/haskell-pro.md to plugins/functional-programming/agents/haskell-pro.md
- Updated path reference in docs/agents.md

This addresses review feedback to place the Haskell agent in the existing functional-programming plugin alongside elixir-pro, rather than creating a new haskell-development plugin.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

---------

Co-authored-by: Claude <noreply@anthropic.com>
2025-12-10 14:51:03 -05:00
Kiri
ddbd034ca3 feat: add Temporal workflow orchestration to backend-development plugin (#125)
* docs: enhance payment-integration agent with critical security guidance

Add evidence-based security requirements from Stripe, PayPal, OWASP:
- Webhook security (signature verification, idempotency, quick response, server validation)
- PCI compliance essentials (tokenization, server-side validation, environment separation)
- Real-world failure examples (processor collapse, Lambda failures, malicious price manipulation)

Minimal expansion: 32 to 57 lines (25 lines added)

* feat: add Temporal workflow orchestration to backend-development plugin

Add comprehensive Temporal workflow orchestration support with 1 agent and 2 skills:

**Agent:**
- temporal-python-pro: Python SDK expert for durable workflows, saga patterns,
  async/await patterns, error handling, and production deployment

**Skills:**
- workflow-orchestration-patterns: Language-agnostic patterns for workflows vs
  activities, saga compensation, entity workflows, and determinism constraints
- temporal-python-testing: Progressive disclosure testing guide with unit testing,
  integration testing, replay testing, and local development setup

**Changes:**
- Add agent: plugins/backend-development/agents/temporal-python-pro.md (311 lines)
- Add skill: plugins/backend-development/skills/workflow-orchestration-patterns/ (286 lines)
- Add skill: plugins/backend-development/skills/temporal-python-testing/ (SKILL.md + 4 resource files)
- Update marketplace.json: backend-development plugin v1.2.2 → v1.2.3
- Update docs/agents.md: 85 → 86 agents
- Update docs/agent-skills.md: 55 → 57 skills

**Content Sources:**
- Official Temporal documentation (docs.temporal.io)
- Temporal Python SDK guide (python.temporal.io)
- Temporal architecture docs (github.com/temporalio/temporal)
- OWASP best practices for distributed systems

Addresses #124

---------

Co-authored-by: Kiran Eshwarappa <kiran.eshwarapa@gmail.com>
2025-11-16 20:45:36 -05:00