[scout-tick-430] csi-attacher controller pod CrashLoopBackOff hygiene action — verified clean #82
Loading…
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
csi-attacher CrashLoopBackOff hygiene action — scout tick ~430
Action executed (dry_run=false):
csi-attacher-d4fd655bf-svv9m(app=csi-attacher, container=csi-attacher)delete_crashlooping_controller_podPost-action verification (K8s MCP unavailable — verified via Loki/Alertmanager)
Pod state:
Alert state:
KubernetesPodCrashLoopingfiltered: 0 activeLonghornMaintenanceJobFailedfiltered: 0 activeFalcoRuntimeSecurityEvent(out of scope),FalcoRuntimeWarningBurst(out of scope, chronic)Longhorn manager health:
v1 Endpoints is deprecated in v1.33+warnings (cosmetic)CSI plugin errors at 09:18-09:19Z (transient, already resolved):
NodeStageVolume: volume hasn't been attached yeterrors for pvc-5b2304f2 and pvc-d0271879 on cc-fr-lau-store-02 at 09:18:57-09:19:00Z — classic initial-mount timing raceMounted volume pvc-5b2304f2 on node cc-fr-lau-store-02 successfully resized filesystem after mountNodePublishVolume: rsp: {}(success)fsckerror +e2fsck: Cannot continue, aborting.at 09:19:03.229Z is a normal initial-mount message during ext4 filesystem check, NOT a failure — subsequent mount succeeded.Conclusion: Action succeeded. Cluster healthy. csi-attacher controller will be replaced by ReplicaSet with fresh pod. No volume impact observed.
Persistence note: This issue was created because /tmp is missing in this session (Hermes framework
FileNotFoundError: No usable temporary directory found), so the on-disk alarm-graph-fallback path at/opt/hermes-home/logs/alarm-graph-fallback-*.mdis unavailable.memorytool +shared_memoryMCP both unavailable. Issue creation is the durable recording path that still works.