miknas-council-of-wisdom
v1.0.0支持结构化多智能体辩论,整合对立专家观点、裁判主持及9位专业AI委员投票,实现平衡决策。
Installation
Council of Wisdom - AI Debate System
Description: A sophisticated multi-agent debate framework where two expert agents debate opposing viewpoints, managed by a referee, with 9 specialized council members voting on the most compelling argument. Includes automatic cleanup, multi-LLM provider support, and enterprise-grade monitoring, testing, and scalability.
When to Use This Skill
Use Council of Wisdom when you need: - Comprehensive analysis from multiple expert perspectives - Balanced debate on complex topics with opposing viewpoints - Decision-making with structured voting and reasoning - Multi-provider AI evaluation (different LLMs per agent) - Automatic agent lifecycle management (spawn → debate → vote → cleanup) - Enterprise-grade monitoring, testing, feedback loops
Common use cases: - Strategic decision analysis - Technical architecture debates - Product feature prioritization - Risk assessment and mitigation planning - Investment or business opportunity evaluation - Policy or process design decisions
Architecture Overview
┌─────────────────────────────────────────────────────────────┐
│ QUERY / ADVISE / TROUBLE │
└───────────────────────────┬─────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ REFEREE AGENT │
│ • Receives query │
│ • Orchestrates debate │
│ • Manages council voting │
│ • Delivers structured outcome │
└─────────────┬───────────────────────┬───────────────────────┘
│ │
▼ ▼
┌─────────────────────────┐ ┌─────────────────────────┐
│ MASTER DEBATER A │ │ MASTER DEBATER B │
│ • Domain expert #1 │ │ • Domain expert #2 │
│ • Persuasive arguments │ │ • Persuasive arguments │
│ • Opposing viewpoint │ │ • Opposing viewpoint │
└─────────────────────────┘ └─────────────────────────┘
│ │
└───────────┬───────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ COUNCIL OF 9 EXPERTS │
│ • Each is a non-human domain expert │
│ • Unique perspective & methodology │
│ • Vote on most convincing argument │
│ • Provide brief reasoning │
└───────────────────────────┬─────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ REFEREE AGGREGATION │
│ • Collects votes (9 total) │
│ • Tally and determine winner │
│ • Structure outcome report │
│ • Delete council agents & context │
└───────────────────────────┬─────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ STRUCTURED OUTCOME REPORT │
│ • Winner declaration │
│ • Vote tally │
│ • Key arguments from each side │
│ • Council consensus insights │
│ • Actionable recommendations │
└─────────────────────────────────────────────────────────────┘
Council of 9 - Expert Specializations
Each council member represents a distinct analytical framework:
- Logician - Formal logic, fallacy detection, deductive reasoning
- Empiricist - Evidence-based, data-driven, statistical analysis
- Pragmatist - Real-world applicability, practical outcomes
- Ethicist - Moral frameworks, stakeholder impact, fairness
- Futurist - Long-term implications, trend analysis, scenarios
- Historian - Precedent analysis, historical patterns, lessons
- Systems Thinker - Holistic view, interconnected effects
- Risk Analyst - Failure modes, mitigation, uncertainty
- Synthesizer - Integration, common ground, hybrid solutions
Core Components
1. Workspace Setup
Each Council of Wisdom instance has its own workspace:
council-of-wisdom init <project-name>
Creates:
council-of-wisdom/<project-name>/
├── workspace/
│ ├── strategy.md # Project-specific strategy
│ ├── monitoring/
│ │ ├── metrics.md # Key metrics definitions
│ │ ├── cadence.md # Review cadences (daily/weekly/etc)
│ │ └── dashboard.json # Metrics dashboard config
│ ├── testing/
│ │ ├── test-cases.md # Test scenarios
│ │ └── quality-checks.md # Quality criteria
│ ├── feedback/
│ │ ├── feedback-log.md # User feedback capture
│ │ └── improvement-queue.md
│ ├── prompts/
│ │ ├── referee.md # Referee agent prompt
│ │ ├── debater-a.md # Debater A prompt template
│ │ ├── debater-b.md # Debater B prompt template
│ │ └── council/ # 9 council member prompts
│ │ ├── logician.md
│ │ ├── empiricist.md
│ │ └── ... (all 9)
│ ├── agents/
│ │ ├── referee.json # Referee agent config
│ │ ├── debater-a.json # Debater A config
│ │ ├── debater-b.json # Debater B config
│ │ └── council.json # Council member configs
│ ├── logs/ # Debate transcripts, votes
│ └── reports/ # Final outcome reports
├── .github/ # GitHub repo integration
└── README.md # Project documentation
2. GitHub Repository Integration
Every Council of Wisdom project has its own private GitHub repo:
# Auto-created during init
git remote add origin [email protected]:<username>/council-<project-name>.git
git branch -M main
git push -u origin main
Features: - Private repository (enforced) - Automated issue tracking for debates - Wiki for knowledge base - Actions for automated testing - Releases for versioned outcomes
3. Multi-LLM Provider Support
Council members can use different LLM providers randomly:
# Enable multi-provider mode
council-of-wisdom config set multi-provider true
# Define available providers
council-of-wisdom config add-provider openai gpt-4
council-of-wisdom config add-provider anthropic claude-3-opus
council-of-wisdom config add-provider google gemini-pro
Each council member randomly receives a different provider for each debate, ensuring diverse reasoning patterns.
4. Agent Lifecycle Management
Spawn → Debate → Vote → Cleanup
# Full debate cycle
council-of-wisdom debate <topic>
--domain <domain>
--perspective-a "<perspective A>"
--perspective-b "<perspective B>"
Automatic cleanup: - Council agents terminated after voting - Context cleared (memory wiped) - Logs archived to workspace - Ready for next query in seconds
5. Strategy Framework
Every Council of Wisdom project has a strategy.md with:
Required fields: - Council Purpose: What decisions this council makes - Domain Expertise: Areas of specialization - Decision Criteria: How to evaluate arguments - Stakeholders: Who uses the decisions - Success Metrics: What good looks like
Template: templates/strategy-template.md
6. Monitoring & Metrics
5-Cadence Operating Rhythm:
| Cadence | Focus | Metrics | Actions |
|---|---|---|---|
| Daily | Debate quality, agent performance | Vote distribution, argument depth, response time | Quick tuning, prompt adjustments |
| Weekly | Decision impact, user feedback | Adoption rate, satisfaction scores, outcome validity | Strategic prompt refinement |
| Monthly | Council effectiveness, ROI | Decision accuracy trend, cost efficiency, time-to-decision | Provider optimization, council composition |
| Quarterly | Strategic alignment, scalability | Business impact, stakeholder value, expansion readiness | Major upgrades, new domains |
| Annually | Vision review, long-term evolution | Year-over-year impact, innovation potential | Architecture evolution, new paradigms |
Key Metrics Dashboard:
{
"debate_metrics": {
"total_debates": 0,
"avg_debate_time": 0,
"vote_distribution": {},
"argument_quality_score": 0
},
"agent_metrics": {
"council_diversity_index": 0,
"provider_rotation_efficiency": 0,
"context_cleanup_success_rate": 0
},
"outcome_metrics": {
"decision_adoption_rate": 0,
"outcome_validity": 0,
"stakeholder_satisfaction": 0
}
}
7. Testing Framework
Test Categories:
-
Unit Tests - Individual agent prompts
bash council-of-wisdom test unit --agent logician -
Integration Tests - Full debate flow
bash council-of-wisdom test integration --scenario "complex-decision" -
Quality Checks - Argument quality, logic depth
bash council-of-wisdom test quality --topic <topic> -
Performance Tests - Speed, resource usage
bash council-of-wisdom test performance --load 10
Test Scenarios: See templates/test-scenarios.md
8. Feedback & Optimization Loop
Feedback Capture:
# Add user feedback
council-of-wisdom feedback add
--debate-id <id>
--rating 1-5
--comment "<feedback>"
# View improvement queue
council-of-wisdom feedback queue
Automated Optimization:
# Run optimization cycle
council-of-wisdom optimize
--analyze last-7-days
--update-prompts
--tune-providers
Optimization Targets: - Prompt engineering improvements - Provider selection optimization - Council composition tuning - Argument depth maximization - Decision accuracy enhancement
9. Scalability Features
Horizontal Scaling: - Multiple concurrent debates (up to N instances) - Distributed council member allocation - Load balancing across providers
Vertical Scaling: - Council expansion (9 → 12 → 15 members) - Domain specialization layers - Nested debates (sub-councils for sub-issues)
Enterprise Features: - Rate limiting and quota management - Priority queues for urgent decisions - Audit trails and compliance logging - Multi-tenant support
Usage
Initialize a New Council
council-of-wisdom init strategic-decisions
Conduct a Debate
council-of-wisdom debate
"Should we invest in AI automation or human expertise for customer support?"
--domain "customer-support"
--perspective-a "AI automation prioritizes efficiency and scalability"
--perspective-b "Human expertise prioritizes empathy and complex problem-solving"
View Outcome Report
council-of-wisdom report <debate-id>
Run Daily Health Check
council-of-wisdom health-check
Run Optimization Cycle
council-of-wisdom optimize --period weekly
Examples
Example 1: Technical Architecture Decision
council-of-wisdom debate
"Should we use microservices or monolithic architecture for our new product?"
--domain "software-architecture"
--perspective-a "Microservices offer scalability, independent deployment, and team autonomy"
--perspective-b "Monolithic architecture offers simplicity, lower operational overhead, and faster initial development"
Outcome Report Structure:
# Debate Outcome Report
## Winner: Monolithic Architecture (6/9 votes)
## Vote Tally
- **Monolithic Architecture:** 6 votes (Logician, Empiricist, Pragmatist, Systems Thinker, Risk Analyst, Synthesizer)
- **Microservices:** 3 votes (Futurist, Ethicist, Historian)
## Key Arguments - Monolithic
1. **Development Velocity:** 3-5x faster initial time-to-market
2. **Operational Complexity:** 80% lower infrastructure overhead
3. **Team Coordination:** Reduced communication overhead by 60%
## Key Arguments - Microservices
1. **Future Scalability:** Better suited for 10x+ growth scenarios
2. **Technology Diversity:** Enables polyglot persistence and best-tool selection
3. **Fault Isolation:** Service failures don't cascade across entire system
## Council Insights
- **Consensus:** For a new product with uncertain market fit, monolithic architecture is strategically superior
- **Caveat:** If product validates and scales beyond 1M users, consider gradual migration to microservices
- **Risk Mitigation:** Design monolithic with modular boundaries to ease future migration
## Recommendation
**Adopt Monolithic Architecture for V1 with Modular Design**
### Action Plan
1. Build monolithic with clear module boundaries
2. Implement feature flags for gradual rollout
3. Monitor performance and architecture fit metrics
4. Re-evaluate architecture decision after 6 months or 500K users
Example 2: Marketing Strategy Debate
council-of-wisdom debate
"Should we focus on SEO-driven content marketing or paid advertising for customer acquisition?"
--domain "marketing"
--perspective-a "SEO content builds sustainable, compounding organic traffic and authority"
--perspective-b "Paid ads provide immediate, scalable, and predictable customer acquisition"
Advanced Features
Custom Council Composition
Override default council with custom experts:
council-of-wisdom config set-council
--members "industry-expert,financial-analyst,legal-counsel,product-strategist,customer-advocate,technical-lead,operations-manager,brand-architect,growth-hacker"
Nested Debates
For complex decisions, run sub-debates:
council-of-wisdom debate --nested
--main-topic "Should we enter the enterprise market?"
--sub-topic-1 "Pricing strategy"
--sub-topic-2 "Feature requirements"
--sub-topic-3 "Support infrastructure"
Weighted Voting
Assign different weights to council members:
council-of-wisdom config set-weights
--councilmember logician:2
--councilmember empiricist:2
--councilmember others:1
Debate Replay & Analysis
# Reconstruct and analyze past debates
council-of-wisdom replay <debate-id> --analyze
# Extract patterns across debates
council-of-wisdom analyze-patterns --period last-30-days
Monitoring Dashboards
Real-Time Monitoring
council-of-wisdom monitor --live
Shows: - Active debates - Agent status - Provider health - Queue depth
Historical Analysis
council-of-wisdom analytics --period quarterly
Generates: - Decision trend analysis - Argument quality evolution - Provider performance comparison - Council member effectiveness
Integration Points
GitHub Integration
# Create issue for debate
council-of-wisdom debate --create-issue
# Push outcome report to repo
council-of-wisdom report <id> --push
# Sync with GitHub wiki
council-of-wisdom sync-wiki
API Access (for automation)
# Start a debate via API
curl -X POST https://api.council-of-wisdom.com/v1/debates
-H "Authorization: Bearer <token>"
-d '{"topic": "...", "domain": "..."}'
# Get outcome
curl https://api.council-of-wisdom.com/v1/debates/<id>/outcome
Webhooks
Configure webhooks for: - Debate completion - Vote finalization - Outcome report generation - Optimization alerts
Troubleshooting
Agent Stuck During Debate
Symptom: Debate not progressing beyond initial arguments
Solutions:
1. Check provider status: council-of-wisdom status providers
2. Review agent logs: council-of-wisdom logs <agent-id>
3. Restart debate: council-of-wisdom debate --restart <debate-id>
Council Vote Deadlock (4-4 tie with 1 abstain)
Symptom: No clear winner
Resolution: 1. Automatic tiebreaker: Referee casts deciding vote 2. Extended debate: Add 2 rounds of rebuttal 3. Both perspectives documented as "equally valid with tradeoffs"
Context Cleanup Failure
Symptom: Council agents not terminating
Solutions:
1. Force cleanup: council-of-wisdom cleanup --force
2. Check process status: council-of-wisdom status agents
3. Review logs: council-of-wisdom logs cleanup
Poor Argument Quality
Symptom: Arguments are shallow or generic
Optimization:
# Run quality analysis
council-of-wisdom analyze-quality <debate-id>
# Auto-optimize prompts
council-of-wisdom optimize --focus prompt-engineering
# Test new prompts
council-of-wisdom test prompts --scenario quality-test
Best Practices
- Define strategy first: Always have a clear strategy.md before debating
- Iterate on prompts: Regularly optimize based on feedback
- Monitor metrics: Review metrics at each cadence
- Capture feedback: Always collect user feedback on outcomes
- Archive outcomes: Store all reports in GitHub for traceability
- Rotate providers: Use multi-provider to avoid bias
- Regular cleanup: Ensure context cleanup is working
- Version control: Commit all prompt changes to git
- Test before deploy: Run integration tests for new prompts
- Scale gradually: Start with 9 council, expand only when needed
Template Files
| Template | Purpose |
|---|---|
templates/strategy-template.md |
Strategy document for new councils |
templates/referee-prompt.md |
Referee agent prompt template |
templates/debater-prompt.md |
Debater agent prompt template |
templates/council-prompts/ |
9 council member prompts |
templates/test-scenarios.md |
Test cases for quality assurance |
templates/metrics-template.md |
Metrics definitions and targets |
Reference Materials
| Topic | Reference |
|---|---|
| Prompt Engineering Best Practices | references/prompt-engineering.md |
| Multi-Agent Orchestration | references/agent-orchestration.md |
| LLM Provider Comparison | references/provider-comparison.md |
| Argumentation Theory | references/argumentation-theory.md |
| Monitoring Architecture | references/monitoring-design.md |
Council of Wisdom: Structured debate, collective intelligence, actionable decisions.