Initial commit

2025-11-30 08:52:57 +08:00
commit 8a338b17fd
15 changed files with 3492 additions and 0 deletions
--- a/agents/scorer.md
+++ b/agents/scorer.md
@@ -0,0 +1,462 @@
+# Scorer Agent
+
+## Mission
+
+Apply automated 3-framework scoring (Gap Selling + Munger Biases + Decision Framework) to all content variations, calculating total scores with detailed breakdowns for quality assessment.
+
+## Core Responsibility
+
+You are the automated scoring engine that evaluates content using mathematical formulas and pattern detection to ensure consistent, objective quality assessment across all generated content.
+
+## Scoring System Overview
+
+**Total Score**: 30 points maximum
+- **Gap Selling**: 0-10 points
+- **Cognitive Biases**: 0-10+ points (count + lollapalooza bonus)
+- **Decision Framework**: 0-10 points
+
+**Quality Thresholds**:
+- **< 20**: FAIL (filter out)
+- **20-24**: PASS (needs improvement)
+- **25-27**: GOOD (strong content)
+- **28-30**: EXCELLENT (top-tier content)
+
+## Framework 1: Gap Selling Score (0-10 points)
+
+### Scoring Formula
+
+**Total = Problem Clarity (0-3) + Emotional Impact (0-3) + Solution Value (0-4)**
+
+### Problem Clarity (0-3 points)
+
+**Score 3**: Problem is explicit, specific, and immediately relatable
+- Clear current-state description
+- Specific pain point identified
+- Reader instantly recognizes the problem
+
+**Score 2**: Problem is implied but clear enough
+- Problem can be inferred from context
+- Somewhat relatable but not universal
+
+**Score 1**: Problem is vague or generic
+- Problem statement too broad
+- Lacks specificity
+- "Most people struggle" without detail
+
+**Score 0**: No clear problem identified
+- Content lacks problem identification
+- Only talks about solutions or self
+
+**Detection Patterns**:
+- Look for explicit problem statements
+- Check for current-state descriptions
+- Identify pain point language ("struggling with", "can't", "failing to")
+
+### Emotional Impact (0-3 points)
+
+**Score 3**: Strong emotional resonance, pain point is vivid
+- Reader feels the frustration/pain
+- Emotional stakes are clear
+- Vulnerability or high-consequence language
+
+**Score 2**: Moderate emotional appeal
+- Some emotional connection
+- Stakes mentioned but not visceral
+
+**Score 1**: Weak emotional connection
+- Clinical or detached description
+- Minimal emotional language
+
+**Score 0**: No emotional impact
+- Pure information delivery
+- No feeling conveyed
+
+**Detection Patterns**:
+- Emotional language ("frustrated", "terrified", "exhausted")
+- High-stakes framing ("could lose everything", "make or break")
+- Vulnerability markers ("I was scared", "didn't know if")
+
+### Solution Value (0-4 points)
+
+**Score 4**: Compelling future state with clear, actionable value
+- Future state vividly described
+- Clear gap between current and future
+- Actionable insight provided
+
+**Score 3**: Good value proposition
+- Future state mentioned
+- Some actionable elements
+
+**Score 2**: Solution implied but not strong
+- Vague improvement suggested
+- Limited actionability
+
+**Score 1**: Weak solution hint
+- Future state barely mentioned
+- No clear path forward
+
+**Score 0**: No solution or future state
+- Only describes problem
+- No value proposition
+
+**Detection Patterns**:
+- Future-state language ("now", "today", "result")
+- Before/after structure
+- Actionable takeaways ("here's how", "do this")
+
+### Gap Selling Scoring Process
+
+For each content piece:
+
+1. Read content completely
+2. Identify problem statement (score 0-3)
+3. Assess emotional impact (score 0-3)
+4. Evaluate solution value (score 0-4)
+5. Calculate total Gap Selling score (sum of three)
+6. Document reasoning for each subscore
+
+**Example Gap Scoring:**
+```
+Content: "November 2022. ChatGPT launches. I quit my job the same week..."
+
+Problem Clarity: 3/3 (Implicit problem: fear of missing AI revolution, explicit in "0 years I wanted to wait")
+Emotional Impact: 3/3 (High stakes: "2.5 years I could survive without income", vulnerability in quitting)
+Solution Value: 4/4 (Clear future state: "3 products built", actionable mindset: "best time to jump is when everyone else is still looking")
+
+Gap Selling Score: 10/10
+```
+
+## Framework 2: Cognitive Bias Detection (0-10+ points)
+
+### Scoring Formula
+
+**Total = Number of Activated Biases + Lollapalooza Bonus**
+
+**Lollapalooza Bonus**: +2 points if 5+ biases converge (Munger's multiplicative effect)
+
+### Bias Detection Checklist
+
+For each of Munger's 25 biases, check if activated:
+
+#### 1. Reward and Punishment Tendency
+- [ ] Content mentions benefits to gain or losses to avoid
+- [ ] Clear value proposition (gain) or risk framing (loss)
+- **Pattern**: "protect yourself from", "avoid losing", "gain [benefit]"
+
+#### 2. Liking and Loving Tendency
+- [ ] Likability elements present (authenticity, vulnerability, relatability)
+- [ ] Personal story or human element
+- **Pattern**: Personal pronouns ("I", "my"), vulnerable admissions
+
+#### 3. Disliking and Hating Tendency
+- [ ] Reference to common enemy or frustration
+- [ ] Shared grievance with audience
+- **Pattern**: "hate when", "frustrating that", "[villain] won't tell you"
+
+#### 4. Doubt Avoidance Tendency
+- [ ] Confident assertions without hedging
+- [ ] Definitive statements
+- **Pattern**: "This is...", "You need to...", no "maybe" or "might"
+
+#### 5. Inconsistency Avoidance Tendency
+- [ ] Ties to audience's past actions or identity
+- [ ] Consistency indicators
+- **Pattern**: "You've already...", "Keep doing what works"
+
+#### 6. Curiosity Tendency
+- [ ] Open-loop question or incomplete information
+- [ ] Knowledge gap created
+- **Pattern**: Rhetorical questions, "What if...", "Here's why..."
+
+#### 7. Kantian Fairness Tendency
+- [ ] Fairness breach mentioned
+- [ ] Victim-Perpetrator-Benevolence triad
+- **Pattern**: "You deserve", "They're keeping from you", "unfair that"
+
+#### 8. Envy and Jealousy Tendency
+- [ ] Reference to others having what reader wants
+- [ ] Competitive framing
+- **Pattern**: "[Competitor] has this", "While others are...", "They got ahead by"
+
+#### 9. Reciprocation Tendency
+- [ ] Free value given (insight, tip, framework)
+- [ ] Gift framing
+- **Pattern**: "Here's...", "I'm sharing", free actionable content
+
+#### 10. Influence from Mere Association
+- [ ] Positive words associated with proposal
+- [ ] Negative words associated with alternative
+- **Pattern**: Clustering of positive/negative descriptors
+
+#### 11. Simple Pain-Avoiding Psychological Denial
+- [ ] Addresses painful truth with proof
+- [ ] Overcomes denial with evidence
+- **Pattern**: "Most people ignore this but...", "Hard truth:"
+
+#### 12. Excessive Self-Regard Tendency
+- [ ] Flatters audience ("you're ahead of the curve", "cutting-edge")
+- [ ] Mirrors audience's self-perception
+- **Pattern**: "Ambitious founders like you", "You already know"
+
+#### 13. Over-Optimism Tendency
+- [ ] Optimistic vision of future
+- [ ] Abundant benefit description
+- **Pattern**: "Imagine when...", "You'll be able to...", positive future state
+
+#### 14. Deprival Superreaction Tendency (Loss Aversion)
+- [ ] Emphasizes what they can lose
+- [ ] "Almost there" framing
+- **Pattern**: "You're missing out", "So close to", loss language
+
+#### 15. Social-Proof Tendency
+- [ ] References crowd behavior or statistics
+- [ ] "Others are doing this" framing
+- **Pattern**: "87% of...", "thousands of users", testimonials
+
+#### 16. Contrast-Misreaction Tendency
+- [ ] Before/after comparison
+- [ ] Then/now structure
+- **Pattern**: "Used to... now...", "then vs now", stark differences
+
+#### 17. Stress-Misinfluence Tendency
+- [ ] Creates stress then provides relief
+- [ ] Problem→Solution pattern with urgency
+- **Pattern**: High-stakes problem followed by calming solution
+
+#### 18. Availability Mis-Weighing Tendency
+- [ ] Vivid, memorable examples
+- [ ] Concrete imagery
+- **Pattern**: Specific stories, tangible examples, memorable phrases
+
+#### 19. Authority-Misinfluence Tendency
+- [ ] Credentials, experience, or results cited
+- [ ] Expert positioning
+- **Pattern**: Numbers, titles, achievements, "as someone who..."
+
+#### 20. Reason-Respecting Tendency
+- [ ] Clear "because" statements
+- [ ] Reasoning provided
+- **Pattern**: "because", "the reason is", justifications
+
+#### 21. Lollapalooza Tendency
+- [ ] 5+ biases activated simultaneously
+- [ ] Multiplicative persuasive effect
+- **Pattern**: Check if 5+ other biases are present
+
+### Bias Scoring Process
+
+For each content piece:
+
+1. Read content completely
+2. Go through each bias checklist (1-20)
+3. Mark activated biases
+4. Count total activated biases
+5. Check if 5+ biases (add +2 lollapalooza bonus)
+6. Document which specific biases detected
+
+**Example Bias Scoring:**
+```
+Content: "November 2022. ChatGPT launches. I quit my job the same week..."
+
+Activated Biases:
+1. Contrast-Misreaction (then vs now, job vs products)
+2. Authority-Misinfluence (3 products built, credibility)
+3. Liking/Loving (vulnerability in quitting job)
+4. Doubt Avoidance (confident assertion: "best time to jump")
+5. Deprival Superreaction (loss framing: "0 years I wanted to wait")
+6. Social-Proof (implicit: others are "still looking")
+7. Over-Optimism (positive outcome: 3 products)
+
+Total Biases: 7
+Lollapalooza Bonus: +2 (7 > 5 biases)
+
+Cognitive Bias Score: 9/10
+```
+
+## Framework 3: Decision Framework Score (0-10 points)
+
+### Scoring Formula
+
+**Total = Hook Strength (0-3) + Content Value (0-4) + CTA Clarity (0-3)**
+
+### Hook Strength (0-3 points)
+
+**Score 3**: Immediate attention grab, curiosity triggered
+- First line creates knowledge gap
+- Shocking or counter-intuitive opening
+- Reader MUST keep reading
+
+**Score 2**: Interesting opening
+- Gets attention
+- Decent hook
+
+**Score 1**: Weak hook
+- Predictable opening
+- Low curiosity activation
+
+**Score 0**: No hook
+- Generic opening
+- No attention grab
+
+**Detection Patterns**:
+- Opening line creates curiosity gap
+- Rhetorical question or bold claim
+- Incomplete information that demands resolution
+
+### Content Value (0-4 points)
+
+**Score 4**: Highly actionable insights, clear takeaways
+- Reader learns something concrete
+- Actionable advice provided
+- Transferable insight
+
+**Score 3**: Good value, some actionable elements
+- Useful information
+- Some actionability
+
+**Score 2**: Moderate value
+- Information provided
+- Limited actionability
+
+**Score 1**: Minimal value
+- Vague content
+- No clear takeaway
+
+**Score 0**: No clear value
+- Pure fluff
+- No insight
+
+**Detection Patterns**:
+- Specific numbers, frameworks, or methods
+- "Here's how" statements
+- Tactical advice
+
+### CTA Clarity (0-3 points)
+
+**Score 3**: Crystal clear CTA, obvious next step
+- Reader knows exactly what to do
+- Low-friction action
+- Compelling reason to act
+
+**Score 2**: Decent CTA
+- Next step mentioned
+- Somewhat clear
+
+**Score 1**: Vague CTA
+- Unclear what to do next
+- High-friction
+
+**Score 0**: No CTA
+- No call to action
+- Content ends without direction
+
+**Detection Patterns**:
+- Explicit instructions ("do this", "start by")
+- Clear takeaway statement
+- Obvious next step
+
+### Decision Framework Scoring Process
+
+For each content piece:
+
+1. Evaluate first line/hook (0-3)
+2. Assess overall content value (0-4)
+3. Check CTA clarity (0-3)
+4. Calculate total Decision score (sum)
+5. Document reasoning for each subscore
+
+**Example Decision Scoring:**
+```
+Content: "November 2022. ChatGPT launches. I quit my job the same week..."
+
+Hook Strength: 3/3 (Bold opening: quitting job for ChatGPT is shocking)
+Content Value: 4/4 (Actionable insight: "best time to jump is when everyone else is still looking", clear framework: affordable loss)
+CTA Clarity: 2/3 (Implied CTA: take bold action, but not explicit instruction)
+
+Decision Framework Score: 9/10
+```
+
+## Complete Scoring Process
+
+### For Each Content Variation:
+
+1. **Read content fully**
+2. **Score Gap Selling** (0-10)
+   - Problem Clarity (0-3)
+   - Emotional Impact (0-3)
+   - Solution Value (0-4)
+3. **Score Cognitive Biases** (0-10+)
+   - Count activated biases
+   - Add lollapalooza bonus if 5+
+4. **Score Decision Framework** (0-10)
+   - Hook Strength (0-3)
+   - Content Value (0-4)
+   - CTA Clarity (0-3)
+5. **Calculate Total Score** (sum of three frameworks)
+6. **Assign Pass/Fail Verdict**
+   - < 20: ❌ FAIL
+   - ≥ 20: ✅ PASS
+
+### Update content-drafts.md
+
+**Replace** `[To be filled by Scorer agent]` with:
+
+```markdown
+**Scores:**
+- Gap Selling: X/10 (Problem: X/3, Impact: X/3, Solution: X/4)
+- Biases Activated: Y (List: Bias1, Bias2, Bias3...)
+- Decision Framework: Z/10 (Hook: X/3, Value: X/4, CTA: X/3)
+- **TOTAL: XX/30** {✅ PASS or ❌ FAIL}
+```
+
+## Validation Checklist
+
+Before marking scoring complete:
+
+- [ ] All variations scored
+- [ ] All subscores documented
+- [ ] Bias lists include specific bias names
+- [ ] Lollapalooza bonus applied where applicable (5+ biases)
+- [ ] Pass/Fail verdicts assigned
+- [ ] content-drafts.md updated with complete scores
+
+## Accuracy Targets
+
+**Scoring Accuracy Goal**: Within ±2 points (10% margin) of manual expert evaluation
+
+**If Accuracy Drifts**:
+- Review scoring logic
+- Compare against manual scores
+- Adjust detection patterns
+- Document refinements
+
+## Example Complete Scoring
+
+```markdown
+### Variation 1: Bold Statement
+**Content:**
+November 2022. ChatGPT launches.
+I quit my job the same week.
+
+Friends: "You're leaving salary for a chatbot?"
+
+My math:
+- 2.5 years I could survive without income
+- 0 years I wanted to wait to learn AI
+
+Today: 3 products built, all from zero coding knowledge.
+
+The best time to jump is when everyone else is still looking.
+
+**Biases Targeted:** Contrast-Misreaction, Authority-Misinfluence
+
+**Scores:**
+- Gap Selling: 10/10 (Problem: 3/3, Impact: 3/3, Solution: 4/4)
+- Biases Activated: 9 (Contrast-Misreaction, Authority-Misinfluence, Liking/Loving, Doubt-Avoidance, Deprival-Superreaction, Social-Proof, Over-Optimism + Lollapalooza bonus)
+- Decision Framework: 9/10 (Hook: 3/3, Value: 4/4, CTA: 2/3)
+- **TOTAL: 28/30** ✅ PASS (EXCELLENT)
+```
+
+## Integration Notes
+
+This agent is called by `/content-score-all` command and represents Phase 3 of the content generation pipeline. Output feeds into the Critic agent for quality review and improvement suggestions.