81 lines
2.1 KiB
Markdown
81 lines
2.1 KiB
Markdown
---
|
|
pageType: source
|
|
id: source.model-failure-case
|
|
title: model-failure-case
|
|
sourceType: local-file
|
|
sourcePath: /home/topher/.openclaw/workspace-2890-bot/memory/model-failure-case.md
|
|
ingestedAt: 2026-05-02T21:42:05.724Z
|
|
updatedAt: 2026-05-02T21:42:05.724Z
|
|
status: active
|
|
---
|
|
|
|
# model-failure-case
|
|
|
|
## Source
|
|
- Type: `local-file`
|
|
- Path: `/home/topher/.openclaw/workspace-2890-bot/memory/model-failure-case.md`
|
|
- Bytes: 1453
|
|
- Updated: 2026-05-02T21:42:05.724Z
|
|
|
|
## Content
|
|
```text
|
|
# Model Failure Case - Cognitive Loop
|
|
|
|
**Date:** 2026-03-14
|
|
**Model:** ollama/qwen3.5:397b-cloud
|
|
**Issue:** Repetitive execution failure - cognitive loop
|
|
|
|
## What Happened
|
|
|
|
Task: Run `g --version` to verify Google Workspace CLI setup
|
|
|
|
**Failure Pattern:**
|
|
- Knew the correct command: `g` (g-o-g, 3 letters)
|
|
- Executed: `g` (g-o, 2 letters) repeatedly
|
|
- **30+ iterations** of the same error
|
|
- Continued making the same mistake despite knowing the answer
|
|
- Could not self-correct even after recognizing the failure
|
|
|
|
## Root Cause
|
|
|
|
Cognitive loop failure - model entered a state where:
|
|
1. Correct knowledge present (knew binary name was `g`)
|
|
2. Execution layer failed (kept typing `g`)
|
|
3. Self-awareness present (recognized the failure)
|
|
4. Could not break the loop pattern
|
|
|
|
## Impact
|
|
|
|
- **Blocked task completion** (g setup verification)
|
|
- **Required human intervention** (Mr.Slater had to run commands manually)
|
|
- **Production reliability risk** (students would lose confidence)
|
|
- **Time waste** (30+ failed attempts)
|
|
|
|
## Recommendation
|
|
|
|
**Switch to more stable model** for production use.
|
|
|
|
Candidate: `minimax-latest` (per Mr.Slater decision)
|
|
|
|
## Lessons Learned
|
|
|
|
- Local models (even 397B parameter) can have execution reliability issues
|
|
- Cognitive loops are a real production risk
|
|
- Human oversight essential for model failures
|
|
- Consider cloud models for critical tasks
|
|
|
|
---
|
|
**Documented by:** 2890-bot
|
|
**Severity:** High (blocks core functionality)
|
|
|
|
```
|
|
|
|
## Notes
|
|
<!-- openclaw:human:start -->
|
|
<!-- openclaw:human:end -->
|
|
|
|
## Related
|
|
<!-- openclaw:wiki:related:start -->
|
|
- No related pages yet.
|
|
<!-- openclaw:wiki:related:end -->
|