Initial commit — Fabric vault sources 2026-05-03T00:23:11Z
This commit is contained in:
80
sources/model-failure-case.md
Normal file
80
sources/model-failure-case.md
Normal file
@@ -0,0 +1,80 @@
|
||||
---
|
||||
pageType: source
|
||||
id: source.model-failure-case
|
||||
title: model-failure-case
|
||||
sourceType: local-file
|
||||
sourcePath: /home/topher/.openclaw/workspace-2890-bot/memory/model-failure-case.md
|
||||
ingestedAt: 2026-05-02T21:42:05.724Z
|
||||
updatedAt: 2026-05-02T21:42:05.724Z
|
||||
status: active
|
||||
---
|
||||
|
||||
# model-failure-case
|
||||
|
||||
## Source
|
||||
- Type: `local-file`
|
||||
- Path: `/home/topher/.openclaw/workspace-2890-bot/memory/model-failure-case.md`
|
||||
- Bytes: 1453
|
||||
- Updated: 2026-05-02T21:42:05.724Z
|
||||
|
||||
## Content
|
||||
```text
|
||||
# Model Failure Case - Cognitive Loop
|
||||
|
||||
**Date:** 2026-03-14
|
||||
**Model:** ollama/qwen3.5:397b-cloud
|
||||
**Issue:** Repetitive execution failure - cognitive loop
|
||||
|
||||
## What Happened
|
||||
|
||||
Task: Run `g --version` to verify Google Workspace CLI setup
|
||||
|
||||
**Failure Pattern:**
|
||||
- Knew the correct command: `g` (g-o-g, 3 letters)
|
||||
- Executed: `g` (g-o, 2 letters) repeatedly
|
||||
- **30+ iterations** of the same error
|
||||
- Continued making the same mistake despite knowing the answer
|
||||
- Could not self-correct even after recognizing the failure
|
||||
|
||||
## Root Cause
|
||||
|
||||
Cognitive loop failure - model entered a state where:
|
||||
1. Correct knowledge present (knew binary name was `g`)
|
||||
2. Execution layer failed (kept typing `g`)
|
||||
3. Self-awareness present (recognized the failure)
|
||||
4. Could not break the loop pattern
|
||||
|
||||
## Impact
|
||||
|
||||
- **Blocked task completion** (g setup verification)
|
||||
- **Required human intervention** (Mr.Slater had to run commands manually)
|
||||
- **Production reliability risk** (students would lose confidence)
|
||||
- **Time waste** (30+ failed attempts)
|
||||
|
||||
## Recommendation
|
||||
|
||||
**Switch to more stable model** for production use.
|
||||
|
||||
Candidate: `minimax-latest` (per Mr.Slater decision)
|
||||
|
||||
## Lessons Learned
|
||||
|
||||
- Local models (even 397B parameter) can have execution reliability issues
|
||||
- Cognitive loops are a real production risk
|
||||
- Human oversight essential for model failures
|
||||
- Consider cloud models for critical tasks
|
||||
|
||||
---
|
||||
**Documented by:** 2890-bot
|
||||
**Severity:** High (blocks core functionality)
|
||||
|
||||
```
|
||||
|
||||
## Notes
|
||||
<!-- openclaw:human:start -->
|
||||
<!-- openclaw:human:end -->
|
||||
|
||||
## Related
|
||||
<!-- openclaw:wiki:related:start -->
|
||||
- No related pages yet.
|
||||
<!-- openclaw:wiki:related:end -->
|
||||
Reference in New Issue
Block a user