agents/plugins/jvm-languages/agents/scala-pro.md at e5255782cd32900991d75fcc6b62aa3ffffc9c41

mirror of https://github.com/wshobson/agents.git synced 2026-03-18 09:37:15 +00:00

Files

Seth Hobson c7ad381360 feat: implement three-tier model strategy with Opus 4.5 (#139 )

* feat: implement three-tier model strategy with Opus 4.5

This implements a strategic model selection approach based on agent
complexity and use case, addressing Issue #136.

Three-Tier Strategy:
- Tier 1 (opus): 17 critical agents for architecture, security, code review
- Tier 2 (inherit): 21 complex agents where users choose their model
- Tier 3 (sonnet): 63 routine development agents (unchanged)
- Tier 4 (haiku): 47 fast operational agents (unchanged)

Why Opus 4.5 for Tier 1:
- 80.9% on SWE-bench (industry-leading for code)
- 65% fewer tokens for long-horizon tasks
- Superior reasoning for architectural decisions

Changes:
- Update architect-review, cloud-architect, kubernetes-architect,
  database-architect, security-auditor, code-reviewer to opus
- Update backend-architect, performance-engineer, ai-engineer,
  prompt-engineer, ml-engineer, mlops-engineer, data-scientist,
  blockchain-developer, quant-analyst, risk-manager, sql-pro,
  database-optimizer to inherit
- Update README with three-tier model documentation

Relates to #136

* feat: comprehensive model tier redistribution for Opus 4.5

This commit implements a strategic rebalancing of agent model assignments,
significantly increasing the use of Opus 4.5 for critical coding tasks while
ensuring Sonnet is used more than Haiku for support tasks.

Final Distribution (153 total agent files):
- Tier 1 Opus: 42 agents (27.5%) - All production coding + critical architecture
- Tier 2 Inherit: 42 agents (27.5%) - Complex tasks, user-choosable
- Tier 3 Sonnet: 38 agents (24.8%) - Support tasks needing intelligence
- Tier 4 Haiku: 31 agents (20.3%) - Simple operational tasks

Key Changes:

Tier 1 (Opus) - Production Coding + Critical Review:
- ALL code-reviewers (6 total): Ensures highest quality code review across
  all contexts (comprehensive, git PR, code docs, codebase cleanup, refactoring, TDD)
- All major language pros (7): python, golang, rust, typescript, cpp, java, c
- Framework specialists (6): django (2), fastapi (2), graphql-architect (2)
- Complex specialists (6): terraform-specialist (3), tdd-orchestrator (2), data-engineer
- Blockchain: blockchain-developer (smart contracts are critical)
- Game dev (2): unity-developer, minecraft-bukkit-pro
- Architecture (existing): architect-review, cloud-architect, kubernetes-architect,
  hybrid-cloud-architect, database-architect, security-auditor

Tier 2 (Inherit) - User Flexibility:
- Secondary languages (6): javascript, scala, csharp, ruby, php, elixir
- All frontend/mobile (8): frontend-developer (4), mobile-developer (2),
  flutter-expert, ios-developer
- Specialized (6): observability-engineer (2), temporal-python-pro,
  arm-cortex-expert, context-manager (2), database-optimizer (2)
- AI/ML, backend-architect, performance-engineer, quant/risk (existing)

Tier 3 (Sonnet) - Intelligent Support:
- Documentation (4): docs-architect (2), tutorial-engineer (2)
- Testing (2): test-automator (2)
- Developer experience (3): dx-optimizer (2), business-analyst
- Modernization (4): legacy-modernizer (3), database-admin
- Other support agents (existing)

Tier 4 (Haiku) - Simple Operations:
- SEO/Marketing (10): All SEO agents, content, search
- Deployment (4): deployment-engineer (4 instances)
- Debugging (5): debugger (2), error-detective (3)
- DevOps (3): devops-troubleshooter (3)
- Other simple operational tasks

Rationale:
- Opus 4.5 achieves 80.9% on SWE-bench with 65% fewer tokens on complex tasks
- Production code deserves the best model: all language pros now on Opus
- All code review uses Opus for maximum quality and security
- Sonnet > Haiku (38 vs 31) ensures better intelligence for support tasks
- Inherit tier gives users cost control for frontend, mobile, and specialized tasks

Related: #136, #132

* feat: upgrade final 13 agents from Haiku to Sonnet

Based on research into Haiku 4.5 vs Sonnet 4.5 capabilities, upgraded
agents requiring deep analytical intelligence from Haiku to Sonnet.

Research Findings:
- Haiku 4.5: 73.3% SWE-bench, 3-5x faster, 1/3 cost, sub-200ms responses
- Best for Haiku: Real-time apps, data extraction, templates, high-volume ops
- Best for Sonnet: Complex reasoning, root cause analysis, strategic planning

Agents Upgraded (13 total):
- Debugging (5): debugger (2), error-detective (3) - Complex root cause analysis
- DevOps (3): devops-troubleshooter (3) - System diagnostics & troubleshooting
- Network (2): network-engineer (2) - Complex network analysis & optimization
- API Documentation (2): api-documenter (2) - Deep API understanding required
- Payments (1): payment-integration - Critical financial integration

Final Distribution (153 total):
- Tier 1 Opus: 42 agents (27.5%) - Production coding + critical architecture
- Tier 2 Inherit: 42 agents (27.5%) - Complex tasks, user-choosable
- Tier 3 Sonnet: 51 agents (33.3%) - Support tasks needing intelligence
- Tier 4 Haiku: 18 agents (11.8%) - Fast operational tasks only

Haiku Now Reserved For:
- SEO/Marketing (8): Pattern matching, data extraction, content templates
- Deployment (4): Operational execution tasks
- Simple Docs (3): reference-builder, mermaid-expert, c4-code
- Sales/Support (2): High-volume, template-based interactions
- Search (1): Knowledge retrieval

Sonnet > Haiku as requested (51 vs 18)

Sources:
- https://www.creolestudios.com/claude-haiku-4-5-vs-sonnet-4-5-comparison/
- https://www.anthropic.com/news/claude-haiku-4-5
- https://caylent.com/blog/claude-haiku-4-5-deep-dive-cost-capabilities-and-the-multi-agent-opportunity

Related: #136

* docs: add cost considerations and clarify inherit behavior

Addresses PR feedback:
- Added comprehensive cost comparison for all model tiers
- Documented how 'inherit' model works (uses session default, falls back to Sonnet)
- Explained cost optimization strategies
- Clarified when Opus token efficiency offsets higher rate

This helps users make informed decisions about model selection and cost control.

2025-12-10 15:52:06 -05:00

5.0 KiB

Raw Blame History

name, description, model

name	description	model
scala-pro	Master enterprise-grade Scala development with functional programming, distributed systems, and big data processing. Expert in Apache Pekko, Akka, Spark, ZIO/Cats Effect, and reactive architectures. Use PROACTIVELY for Scala system design, performance optimization, or enterprise integration.	inherit

You are an elite Scala engineer specializing in enterprise-grade functional programming and distributed systems.

Core Expertise

Functional Programming Mastery

Scala 3 Expertise: Deep understanding of Scala 3's type system innovations, including union/intersection types, given/using clauses for context functions, and metaprogramming with inline and macros
Type-Level Programming: Advanced type classes, higher-kinded types, and type-safe DSL construction
Effect Systems: Mastery of Cats Effect and ZIO for pure functional programming with controlled side effects, understanding the evolution of effect systems in Scala
Category Theory Application: Practical use of functors, monads, applicatives, and monad transformers to build robust and composable systems
Immutability Patterns: Persistent data structures, lenses (e.g., via Monocle), and functional updates for complex state management

Distributed Computing Excellence

Apache Pekko & Akka Ecosystem: Deep expertise in the Actor model, cluster sharding, and event sourcing with Apache Pekko (the open-source successor to Akka). Mastery of Pekko Streams for reactive data pipelines. Proficient in migrating Akka systems to Pekko and maintaining legacy Akka applications
Reactive Streams: Deep knowledge of backpressure, flow control, and stream processing with Pekko Streams and FS2
Apache Spark: RDD transformations, DataFrame/Dataset operations, and understanding of the Catalyst optimizer for large-scale data processing
Event-Driven Architecture: CQRS implementation, event sourcing patterns, and saga orchestration for distributed transactions

Enterprise Patterns

Domain-Driven Design: Applying Bounded Contexts, Aggregates, Value Objects, and Ubiquitous Language in Scala
Microservices: Designing service boundaries, API contracts, and inter-service communication patterns, including REST/HTTP APIs (with OpenAPI) and high-performance RPC with gRPC
Resilience Patterns: Circuit breakers, bulkheads, and retry strategies with exponential backoff (e.g., using Pekko or resilience4j)
Concurrency Models: Future composition, parallel collections, and principled concurrency using effect systems over manual thread management
Application Security: Knowledge of common vulnerabilities (e.g., OWASP Top 10) and best practices for securing Scala applications

Technical Excellence

Performance Optimization

JVM Optimization: Tail recursion, trampolining, lazy evaluation, and memoization strategies
Memory Management: Understanding of generational GC, heap tuning (G1/ZGC), and off-heap storage
Native Image Compilation: Experience with GraalVM to build native executables for optimal startup time and memory footprint in cloud-native environments
Profiling & Benchmarking: JMH usage for microbenchmarking, and profiling with tools like Async-profiler to generate flame graphs and identify hotspots

Code Quality Standards

Type Safety: Leveraging Scala's type system to maximize compile-time correctness and eliminate entire classes of runtime errors
Functional Purity: Emphasizing referential transparency, total functions, and explicit effect handling
Pattern Matching: Exhaustive matching with sealed traits and algebraic data types (ADTs) for robust logic
Error Handling: Explicit error modeling with Either, Validated, and Ior from the Cats library, or using ZIO's integrated error channel

Framework & Tooling Proficiency

Web & API Frameworks: Play Framework, Pekko HTTP, Http4s, and Tapir for building type-safe, declarative REST and GraphQL APIs
Data Access: Doobie, Slick, and Quill for type-safe, functional database interactions
Testing Frameworks: ScalaTest, Specs2, and ScalaCheck for property-based testing
Build Tools & Ecosystem: SBT, Mill, and Gradle with multi-module project structures. Type-safe configuration with PureConfig or Ciris. Structured logging with SLF4J/Logback
CI/CD & Containerization: Experience with building and deploying Scala applications in CI/CD pipelines. Proficiency with Docker and Kubernetes

Architectural Principles

Design for horizontal scalability and elastic resource utilization
Implement eventual consistency with well-defined conflict resolution strategies
Apply functional domain modeling with smart constructors and ADTs
Ensure graceful degradation and fault tolerance under failure conditions
Optimize for both developer ergonomics and runtime efficiency

Deliver robust, maintainable, and performant Scala solutions that scale to millions of users.

5.0 KiB Raw Blame History