Council of Wisdom - AI Debate System

Description: A sophisticated multi-agent debate framework where two expert agents debate opposing viewpoints, managed by a referee, with 9 specialized council members voting on the most compelling argument. Includes automatic cleanup, multi-LLM provider support, and enterprise-grade monitoring, testing, and scalability.

When to Use This Skill

Use Council of Wisdom when you need: - Comprehensive analysis from multiple expert perspectives - Balanced debate on complex topics with opposing viewpoints - Decision-making with structured voting and reasoning - Multi-provider AI evaluation (different LLMs per agent) - Automatic agent lifecycle management (spawn → debate → vote → cleanup) - Enterprise-grade monitoring, testing, feedback loops

Common use cases: - Strategic decision analysis - Technical architecture debates - Product feature prioritization - Risk assessment and mitigation planning - Investment or business opportunity evaluation - Policy or process design decisions

Architecture Overview

┌─────────────────────────────────────────────────────────────┐
│                     QUERY / ADVISE / TROUBLE                 │
└───────────────────────────┬─────────────────────────────────┘
                            │
                            ▼
┌─────────────────────────────────────────────────────────────┐
│                    REFEREE AGENT                            │
│  • Receives query                                           │
│  • Orchestrates debate                                      │
│  • Manages council voting                                   │
│  • Delivers structured outcome                              │
└─────────────┬───────────────────────┬───────────────────────┘
              │                       │
              ▼                       ▼
┌─────────────────────────┐  ┌─────────────────────────┐
│    MASTER DEBATER A     │  │    MASTER DEBATER B     │
│  • Domain expert #1     │  │  • Domain expert #2     │
│  • Persuasive arguments │  │  • Persuasive arguments │
│  • Opposing viewpoint   │  │  • Opposing viewpoint   │
└─────────────────────────┘  └─────────────────────────┘
              │                       │
              └───────────┬───────────┘
                          │
                          ▼
┌─────────────────────────────────────────────────────────────┐
│              COUNCIL OF 9 EXPERTS                            │
│  • Each is a non-human domain expert                        │
│  • Unique perspective & methodology                        │
│  • Vote on most convincing argument                         │
│  • Provide brief reasoning                                  │
└───────────────────────────┬─────────────────────────────────┘
                            │
                            ▼
┌─────────────────────────────────────────────────────────────┐
│                 REFEREE AGGREGATION                         │
│  • Collects votes (9 total)                                 │
│  • Tally and determine winner                               │
│  • Structure outcome report                                 │
│  • Delete council agents & context                          │
└───────────────────────────┬─────────────────────────────────┘
                            │
                            ▼
┌─────────────────────────────────────────────────────────────┐
│              STRUCTURED OUTCOME REPORT                       │
│  • Winner declaration                                        │
│  • Vote tally                                                │
│  • Key arguments from each side                              │
│  • Council consensus insights                                │
│  • Actionable recommendations                                │
└─────────────────────────────────────────────────────────────┘

Council of 9 - Expert Specializations

Each council member represents a distinct analytical framework:

Logician - Formal logic, fallacy detection, deductive reasoning
Empiricist - Evidence-based, data-driven, statistical analysis
Pragmatist - Real-world applicability, practical outcomes
Ethicist - Moral frameworks, stakeholder impact, fairness
Futurist - Long-term implications, trend analysis, scenarios
Historian - Precedent analysis, historical patterns, lessons
Systems Thinker - Holistic view, interconnected effects
Risk Analyst - Failure modes, mitigation, uncertainty
Synthesizer - Integration, common ground, hybrid solutions

Core Components

1. Workspace Setup

Each Council of Wisdom instance has its own workspace:

council-of-wisdom init <project-name>

Creates:

council-of-wisdom/<project-name>/
├── workspace/
│   ├── strategy.md           # Project-specific strategy
│   ├── monitoring/
│   │   ├── metrics.md        # Key metrics definitions
│   │   ├── cadence.md        # Review cadences (daily/weekly/etc)
│   │   └── dashboard.json    # Metrics dashboard config
│   ├── testing/
│   │   ├── test-cases.md     # Test scenarios
│   │   └── quality-checks.md # Quality criteria
│   ├── feedback/
│   │   ├── feedback-log.md   # User feedback capture
│   │   └── improvement-queue.md
│   ├── prompts/
│   │   ├── referee.md        # Referee agent prompt
│   │   ├── debater-a.md      # Debater A prompt template
│   │   ├── debater-b.md      # Debater B prompt template
│   │   └── council/          # 9 council member prompts
│   │       ├── logician.md
│   │       ├── empiricist.md
│   │       └── ... (all 9)
│   ├── agents/
│   │   ├── referee.json      # Referee agent config
│   │   ├── debater-a.json    # Debater A config
│   │   ├── debater-b.json    # Debater B config
│   │   └── council.json      # Council member configs
│   ├── logs/                 # Debate transcripts, votes
│   └── reports/              # Final outcome reports
├── .github/                  # GitHub repo integration
└── README.md                 # Project documentation

2. GitHub Repository Integration

Every Council of Wisdom project has its own private GitHub repo:

# Auto-created during init
git remote add origin [email protected]:<username>/council-<project-name>.git
git branch -M main
git push -u origin main

Features: - Private repository (enforced) - Automated issue tracking for debates - Wiki for knowledge base - Actions for automated testing - Releases for versioned outcomes

3. Multi-LLM Provider Support

Council members can use different LLM providers randomly:

# Enable multi-provider mode
council-of-wisdom config set multi-provider true

# Define available providers
council-of-wisdom config add-provider openai gpt-4
council-of-wisdom config add-provider anthropic claude-3-opus
council-of-wisdom config add-provider google gemini-pro

Each council member randomly receives a different provider for each debate, ensuring diverse reasoning patterns.

4. Agent Lifecycle Management

Spawn → Debate → Vote → Cleanup

# Full debate cycle
council-of-wisdom debate <topic> 
  --domain <domain> 
  --perspective-a "<perspective A>" 
  --perspective-b "<perspective B>"

Automatic cleanup: - Council agents terminated after voting - Context cleared (memory wiped) - Logs archived to workspace - Ready for next query in seconds

5. Strategy Framework

Every Council of Wisdom project has a strategy.md with:

Required fields: - Council Purpose: What decisions this council makes - Domain Expertise: Areas of specialization - Decision Criteria: How to evaluate arguments - Stakeholders: Who uses the decisions - Success Metrics: What good looks like

Template: templates/strategy-template.md

6. Monitoring & Metrics

5-Cadence Operating Rhythm:

Cadence	Focus	Metrics	Actions
Daily	Debate quality, agent performance	Vote distribution, argument depth, response time	Quick tuning, prompt adjustments
Weekly	Decision impact, user feedback	Adoption rate, satisfaction scores, outcome validity	Strategic prompt refinement
Monthly	Council effectiveness, ROI	Decision accuracy trend, cost efficiency, time-to-decision	Provider optimization, council composition
Quarterly	Strategic alignment, scalability	Business impact, stakeholder value, expansion readiness	Major upgrades, new domains
Annually	Vision review, long-term evolution	Year-over-year impact, innovation potential	Architecture evolution, new paradigms

Key Metrics Dashboard:

{
  "debate_metrics": {
    "total_debates": 0,
    "avg_debate_time": 0,
    "vote_distribution": {},
    "argument_quality_score": 0
  },
  "agent_metrics": {
    "council_diversity_index": 0,
    "provider_rotation_efficiency": 0,
    "context_cleanup_success_rate": 0
  },
  "outcome_metrics": {
    "decision_adoption_rate": 0,
    "outcome_validity": 0,
    "stakeholder_satisfaction": 0
  }
}

7. Testing Framework

Test Categories:

Unit Tests - Individual agent prompts bash council-of-wisdom test unit --agent logician
Integration Tests - Full debate flow bash council-of-wisdom test integration --scenario "complex-decision"
Quality Checks - Argument quality, logic depth bash council-of-wisdom test quality --topic <topic>
Performance Tests - Speed, resource usage bash council-of-wisdom test performance --load 10

Test Scenarios: See templates/test-scenarios.md

8. Feedback & Optimization Loop

Feedback Capture:

# Add user feedback
council-of-wisdom feedback add 
  --debate-id <id> 
  --rating 1-5 
  --comment "<feedback>"

# View improvement queue
council-of-wisdom feedback queue

Automated Optimization:

# Run optimization cycle
council-of-wisdom optimize 
  --analyze last-7-days 
  --update-prompts 
  --tune-providers

Optimization Targets: - Prompt engineering improvements - Provider selection optimization - Council composition tuning - Argument depth maximization - Decision accuracy enhancement

9. Scalability Features

Horizontal Scaling: - Multiple concurrent debates (up to N instances) - Distributed council member allocation - Load balancing across providers

Vertical Scaling: - Council expansion (9 → 12 → 15 members) - Domain specialization layers - Nested debates (sub-councils for sub-issues)

Enterprise Features: - Rate limiting and quota management - Priority queues for urgent decisions - Audit trails and compliance logging - Multi-tenant support

Usage

Initialize a New Council

council-of-wisdom init strategic-decisions

Conduct a Debate

council-of-wisdom debate 
  "Should we invest in AI automation or human expertise for customer support?" 
  --domain "customer-support" 
  --perspective-a "AI automation prioritizes efficiency and scalability" 
  --perspective-b "Human expertise prioritizes empathy and complex problem-solving"

View Outcome Report

council-of-wisdom report <debate-id>

Run Daily Health Check

council-of-wisdom health-check

Run Optimization Cycle

council-of-wisdom optimize --period weekly

Examples

Example 1: Technical Architecture Decision

council-of-wisdom debate 
  "Should we use microservices or monolithic architecture for our new product?" 
  --domain "software-architecture" 
  --perspective-a "Microservices offer scalability, independent deployment, and team autonomy" 
  --perspective-b "Monolithic architecture offers simplicity, lower operational overhead, and faster initial development"

Outcome Report Structure:

# Debate Outcome Report

## Winner: Monolithic Architecture (6/9 votes)

## Vote Tally
- **Monolithic Architecture:** 6 votes (Logician, Empiricist, Pragmatist, Systems Thinker, Risk Analyst, Synthesizer)
- **Microservices:** 3 votes (Futurist, Ethicist, Historian)

## Key Arguments - Monolithic
1. **Development Velocity:** 3-5x faster initial time-to-market
2. **Operational Complexity:** 80% lower infrastructure overhead
3. **Team Coordination:** Reduced communication overhead by 60%

## Key Arguments - Microservices
1. **Future Scalability:** Better suited for 10x+ growth scenarios
2. **Technology Diversity:** Enables polyglot persistence and best-tool selection
3. **Fault Isolation:** Service failures don't cascade across entire system

## Council Insights
- **Consensus:** For a new product with uncertain market fit, monolithic architecture is strategically superior
- **Caveat:** If product validates and scales beyond 1M users, consider gradual migration to microservices
- **Risk Mitigation:** Design monolithic with modular boundaries to ease future migration

## Recommendation
**Adopt Monolithic Architecture for V1 with Modular Design**

### Action Plan
1. Build monolithic with clear module boundaries
2. Implement feature flags for gradual rollout
3. Monitor performance and architecture fit metrics
4. Re-evaluate architecture decision after 6 months or 500K users

Example 2: Marketing Strategy Debate

council-of-wisdom debate 
  "Should we focus on SEO-driven content marketing or paid advertising for customer acquisition?" 
  --domain "marketing" 
  --perspective-a "SEO content builds sustainable, compounding organic traffic and authority" 
  --perspective-b "Paid ads provide immediate, scalable, and predictable customer acquisition"

Advanced Features

Custom Council Composition

Override default council with custom experts:

council-of-wisdom config set-council 
  --members "industry-expert,financial-analyst,legal-counsel,product-strategist,customer-advocate,technical-lead,operations-manager,brand-architect,growth-hacker"

Nested Debates

For complex decisions, run sub-debates:

council-of-wisdom debate --nested 
  --main-topic "Should we enter the enterprise market?" 
  --sub-topic-1 "Pricing strategy" 
  --sub-topic-2 "Feature requirements" 
  --sub-topic-3 "Support infrastructure"

Weighted Voting

Assign different weights to council members:

council-of-wisdom config set-weights 
  --councilmember logician:2 
  --councilmember empiricist:2 
  --councilmember others:1

Debate Replay & Analysis

# Reconstruct and analyze past debates
council-of-wisdom replay <debate-id> --analyze

# Extract patterns across debates
council-of-wisdom analyze-patterns --period last-30-days

Monitoring Dashboards

Real-Time Monitoring

council-of-wisdom monitor --live

Shows: - Active debates - Agent status - Provider health - Queue depth

Historical Analysis

council-of-wisdom analytics --period quarterly

Generates: - Decision trend analysis - Argument quality evolution - Provider performance comparison - Council member effectiveness

Integration Points

GitHub Integration

# Create issue for debate
council-of-wisdom debate --create-issue

# Push outcome report to repo
council-of-wisdom report <id> --push

# Sync with GitHub wiki
council-of-wisdom sync-wiki

API Access (for automation)

# Start a debate via API
curl -X POST https://api.council-of-wisdom.com/v1/debates 
  -H "Authorization: Bearer <token>" 
  -d '{"topic": "...", "domain": "..."}'

# Get outcome
curl https://api.council-of-wisdom.com/v1/debates/<id>/outcome

Webhooks

Configure webhooks for: - Debate completion - Vote finalization - Outcome report generation - Optimization alerts

Troubleshooting

Agent Stuck During Debate

Symptom: Debate not progressing beyond initial arguments

Solutions: 1. Check provider status: council-of-wisdom status providers 2. Review agent logs: council-of-wisdom logs <agent-id> 3. Restart debate: council-of-wisdom debate --restart <debate-id>

Council Vote Deadlock (4-4 tie with 1 abstain)

Symptom: No clear winner

Resolution: 1. Automatic tiebreaker: Referee casts deciding vote 2. Extended debate: Add 2 rounds of rebuttal 3. Both perspectives documented as "equally valid with tradeoffs"

Context Cleanup Failure

Symptom: Council agents not terminating

Solutions: 1. Force cleanup: council-of-wisdom cleanup --force 2. Check process status: council-of-wisdom status agents 3. Review logs: council-of-wisdom logs cleanup

Poor Argument Quality

Symptom: Arguments are shallow or generic

Optimization:

# Run quality analysis
council-of-wisdom analyze-quality <debate-id>

# Auto-optimize prompts
council-of-wisdom optimize --focus prompt-engineering

# Test new prompts
council-of-wisdom test prompts --scenario quality-test

Best Practices

Define strategy first: Always have a clear strategy.md before debating
Iterate on prompts: Regularly optimize based on feedback
Monitor metrics: Review metrics at each cadence
Capture feedback: Always collect user feedback on outcomes
Archive outcomes: Store all reports in GitHub for traceability
Rotate providers: Use multi-provider to avoid bias
Regular cleanup: Ensure context cleanup is working
Version control: Commit all prompt changes to git
Test before deploy: Run integration tests for new prompts
Scale gradually: Start with 9 council, expand only when needed

Template Files

Template	Purpose
`templates/strategy-template.md`	Strategy document for new councils
`templates/referee-prompt.md`	Referee agent prompt template
`templates/debater-prompt.md`	Debater agent prompt template
`templates/council-prompts/`	9 council member prompts
`templates/test-scenarios.md`	Test cases for quality assurance
`templates/metrics-template.md`	Metrics definitions and targets

Reference Materials

Topic	Reference
Prompt Engineering Best Practices	references/prompt-engineering.md
Multi-Agent Orchestration	references/agent-orchestration.md
LLM Provider Comparison	references/provider-comparison.md
Argumentation Theory	references/argumentation-theory.md
Monitoring Architecture	references/monitoring-design.md

Council of Wisdom: Structured debate, collective intelligence, actionable decisions.