'clean failed' 상태 해결방법

(undercloud) [stack@director ~]$ openstack baremetal node list
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+
| UUID                                 | Name     | Instance UUID | Power State | Provisioning State | Maintenance |
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+
| b6fd8f97-5cd7-4b77-8ce2-0c89a3c93ef1 | control1 | None          | power on    | clean failed       | False       |
| c36d02cf-ec5e-4503-bb9f-39450171e63e | control2 | None          | power off   | available          | False       |
| e9b14786-392a-4286-ac2b-af54cb379a1b | control3 | None          | power off   | available          | False       |
| 31976bfb-13cd-4c82-a275-26a8301086db | compute1 | None          | power off   | available          | False       |
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+

오버클라우드 배포에 앞서 introspect 작업 후 위와같이 clean failed 상태가 되는 경우가 있다.
이 상태가 되면 해당 노드에 문제가 발생할 수 있기 때문에 노드를 끄거나 컨트롤 할 수 없는 상태가 된다.

해결하려면 일단 노드를 manageable 상태로 변경해야 한다.

$ openstack baremetal node maintenance set <uuid>
$ openstack baremetal node abort <uuid>
$ openstack baremetal node maintenance unset <uuid>
$ openstack baremetal node manage <uuid>
$ openstack baremetal node provide <uuid>

일단 이렇게 manageable상태로 변경한 이후 장애 원인이 된 불량 디스크 교체라던지 로그 확인등 작업을 수행한다.
노드 문제가 해결이 되면 다시 원상태로 변경한다

# First, move it out of maintenance mode
$ openstack baremetal node maintenance unset $node_id

# Now, make the node available for scheduling by nova
$ openstack baremetal node provide $node_id

이후 해당 노드는 자동으로 cleaning 작업을 재 시작하며 완료되면 available상태로 변경된다.

## 수행 직후
(undercloud) [stack@director ~]$ openstack baremetal node list
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+
| UUID                                 | Name     | Instance UUID | Power State | Provisioning State | Maintenance |
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+
| b6fd8f97-5cd7-4b77-8ce2-0c89a3c93ef1 | control1 | None          | power off   | cleaning           | False       |
| c36d02cf-ec5e-4503-bb9f-39450171e63e | control2 | None          | power off   | available          | False       |
| e9b14786-392a-4286-ac2b-af54cb379a1b | control3 | None          | power off   | available          | False       |
| 31976bfb-13cd-4c82-a275-26a8301086db | compute1 | None          | power off   | available          | False       |
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+

## 자동으로 전원 켜지며 작업 수행
(undercloud) [stack@director ~]$ openstack baremetal node list
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+
| UUID                                 | Name     | Instance UUID | Power State | Provisioning State | Maintenance |
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+
| b6fd8f97-5cd7-4b77-8ce2-0c89a3c93ef1 | control1 | None          | power on    | clean wait         | False       |
| c36d02cf-ec5e-4503-bb9f-39450171e63e | control2 | None          | power off   | available          | False       |
| e9b14786-392a-4286-ac2b-af54cb379a1b | control3 | None          | power off   | available          | False       |
| 31976bfb-13cd-4c82-a275-26a8301086db | compute1 | None          | power off   | available          | False       |
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+

## 작업 수행중
(undercloud) [stack@director ~]$ openstack baremetal node list
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+
| UUID                                 | Name     | Instance UUID | Power State | Provisioning State | Maintenance |
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+
| b6fd8f97-5cd7-4b77-8ce2-0c89a3c93ef1 | control1 | None          | power on    | cleaning           | False       |
| c36d02cf-ec5e-4503-bb9f-39450171e63e | control2 | None          | power off   | available          | False       |
| e9b14786-392a-4286-ac2b-af54cb379a1b | control3 | None          | power off   | available          | False       |
| 31976bfb-13cd-4c82-a275-26a8301086db | compute1 | None          | power off   | available          | False       |
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+

## 작업 완료
(undercloud) [stack@director ~]$ openstack baremetal node list
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+
| UUID                                 | Name     | Instance UUID | Power State | Provisioning State | Maintenance |
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+
| b6fd8f97-5cd7-4b77-8ce2-0c89a3c93ef1 | control1 | None          | power off   | available          | False       |
| c36d02cf-ec5e-4503-bb9f-39450171e63e | control2 | None          | power off   | available          | False       |
| e9b14786-392a-4286-ac2b-af54cb379a1b | control3 | None          | power off   | available          | False       |
| 31976bfb-13cd-4c82-a275-26a8301086db | compute1 | None          | power off   | available          | False       |
+--------------------------------------+----------+---------------+-------------+--------------------+-------------+
로그인하면 댓글을 남길 수 있습니다.
  • clean_failed_상태_해결방법.txt
  • 마지막으로 수정됨: 2023/09/27 05:53
  • 저자 koov