Status | Job ID | Posted | Started | Updated | Runtime | Duration | In Waiting | Machine | Teuthology Branch | OS Type | OS Version | Description | Nodes
fail 7814080 2024-07-23 15:55:07 2024-07-23 15:56:56 2024-07-23 16:45:56 0:49:00 0:38:37 0:10:23 smithi main centos 9.stream orch:cephadm/mds_upgrade_sequence/{bluestore-bitmap centos_9.stream conf/{client mds mgr mon osd} fail_fs/no overrides/{ignorelist_health ignorelist_upgrade ignorelist_wrongly_marked_down pg-warn pg_health syntax} roles tasks/{0-from/reef/{v18.2.0} 1-volume/{0-create 1-ranks/2 2-allow_standby_replay/yes 3-inline/no 4-verify} 2-client/fuse 3-upgrade-mgr-staggered 4-config-upgrade/{fail_fs} 5-upgrade-with-workload 6-verify}} 2
Failure Reason:

reached maximum tries (51) after waiting for 300 seconds
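
Note: "reached maximum tries (51) after waiting for 300 seconds" is teuthology's bounded-wait failure: a condition was polled on a fixed interval and never became true before the retry budget ran out. A minimal sketch of an equivalent check follows; the polled command and the 6-second interval are assumptions for illustration, not taken from this job:

    # Hypothetical bounded wait: poll until the orchestrator reports the
    # upgrade finished, giving up after 51 tries (~300 seconds of sleeping).
    tries=0
    until ceph orch upgrade status | jq -e '.in_progress == false' >/dev/null; do
        tries=$((tries + 1))
        if [ "$tries" -ge 51 ]; then
            echo "reached maximum tries (51) after waiting for 300 seconds" >&2
            exit 1
        fi
        sleep 6
    done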

fail 7814081 2024-07-23 15:55:08 2024-07-23 15:56:56 2024-07-23 17:17:02 1:20:06 1:10:12 0:09:54 smithi main ubuntu 22.04 orch:cephadm/upgrade/{1-start-distro/1-start-ubuntu_22.04-squid 2-repo_digest/repo_digest 3-upgrade/staggered 4-wait 5-upgrade-ls agent/on mon_election/classic} 2
Failure Reason:

Command failed on smithi092 with status 1: 'sudo /home/ubuntu/cephtest/cephadm --image quay.ceph.io/ceph-ci/ceph:squid shell -c /etc/ceph/ceph.conf -k /etc/ceph/ceph.client.admin.keyring --fsid 70bd971a-490e-11ef-bcad-c7b262605968 -e sha1=4fb9f751b4567bf735172816a826f37e1d650dd6 -- bash -c \'ceph versions | jq -e \'"\'"\'.rgw | length == 1\'"\'"\'\''
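
Note: the staggered-upgrade step asserts that every rgw daemon reports a single version once its upgrade phase completes. `ceph versions` prints a JSON object keyed by daemon type, mapping each running version string to a daemon count, and `jq -e` sets a nonzero exit status when the expression evaluates to false. An illustrative failing input (version strings abbreviated, not from this run):

    # Two distinct rgw versions -> '.rgw | length == 1' is false -> exit status 1
    echo '{"rgw": {"ceph version 18.2.0 reef (stable)": 1,
                   "ceph version 19.1.0 squid (rc)": 1}}' \
        | jq -e '.rgw | length == 1'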

pass 7814082 2024-07-23 15:55:09 2024-07-23 15:56:56 2024-07-23 16:18:46 0:21:50 0:12:00 0:09:50 smithi main centos 9.stream orch:cephadm/smb/{0-distro/centos_9.stream_runc tasks/deploy_smb_basic} 2
fail 7814083 2024-07-23 15:55:10 2024-07-23 15:57:57 2024-07-23 16:32:24 0:34:27 0:24:38 0:09:49 smithi main ubuntu 22.04 orch:cephadm/workunits/{0-distro/ubuntu_22.04 agent/on mon_election/connectivity task/test_monitoring_stack_basic} 3
Failure Reason:

Command failed on smithi078 with status 5: 'sudo /home/ubuntu/cephtest/cephadm --image quay-quay-quay.apps.os.sepia.ceph.com/ceph-ci/ceph:4fb9f751b4567bf735172816a826f37e1d650dd6 shell -c /etc/ceph/ceph.conf -k /etc/ceph/ceph.client.admin.keyring --fsid ea5200b6-490e-11ef-bcad-c7b262605968 -- bash -c \'set -e\nset -x\nceph orch apply node-exporter\nceph orch apply grafana\nceph orch apply alertmanager\nceph orch apply prometheus\nsleep 240\nceph orch ls\nceph orch ps\nceph orch host ls\nMON_DAEMON=$(ceph orch ps --daemon-type mon -f json | jq -r \'"\'"\'last | .daemon_name\'"\'"\')\nGRAFANA_HOST=$(ceph orch ps --daemon-type grafana -f json | jq -e \'"\'"\'.[]\'"\'"\' | jq -r \'"\'"\'.hostname\'"\'"\')\nPROM_HOST=$(ceph orch ps --daemon-type prometheus -f json | jq -e \'"\'"\'.[]\'"\'"\' | jq -r \'"\'"\'.hostname\'"\'"\')\nALERTM_HOST=$(ceph orch ps --daemon-type alertmanager -f json | jq -e \'"\'"\'.[]\'"\'"\' | jq -r \'"\'"\'.hostname\'"\'"\')\nGRAFANA_IP=$(ceph orch host ls -f json | jq -r --arg GRAFANA_HOST "$GRAFANA_HOST" \'"\'"\'.[] | select(.hostname==$GRAFANA_HOST) | .addr\'"\'"\')\nPROM_IP=$(ceph orch host ls -f json | jq -r --arg PROM_HOST "$PROM_HOST" \'"\'"\'.[] | select(.hostname==$PROM_HOST) | .addr\'"\'"\')\nALERTM_IP=$(ceph orch host ls -f json | jq -r --arg ALERTM_HOST "$ALERTM_HOST" \'"\'"\'.[] | select(.hostname==$ALERTM_HOST) | .addr\'"\'"\')\n# check each host node-exporter metrics endpoint is responsive\nALL_HOST_IPS=$(ceph orch host ls -f json | jq -r \'"\'"\'.[] | .addr\'"\'"\')\nfor ip in $ALL_HOST_IPS; do\n curl -s http://${ip}:9100/metric\ndone\n# check grafana endpoints are responsive and database health is okay\ncurl -k -s https://${GRAFANA_IP}:3000/api/health\ncurl -k -s https://${GRAFANA_IP}:3000/api/health | jq -e \'"\'"\'.database == "ok"\'"\'"\'\n# stop mon daemon in order to trigger an alert\nceph orch daemon stop $MON_DAEMON\nsleep 120\n# check prometheus endpoints are responsive and mon down alert is firing\ncurl -s http://${PROM_IP}:9095/api/v1/status/config\ncurl -s http://${PROM_IP}:9095/api/v1/status/config | jq -e \'"\'"\'.status == "success"\'"\'"\'\ncurl -s http://${PROM_IP}:9095/api/v1/alerts\ncurl -s http://${PROM_IP}:9095/api/v1/alerts | jq -e \'"\'"\'.data | .alerts | .[] | select(.labels | .alertname == "CephMonDown") | .state == "firing"\'"\'"\'\n# check alertmanager endpoints are responsive and mon down alert is active\ncurl -s http://${ALERTM_IP}:9093/api/v1/status\ncurl -s http://${ALERTM_IP}:9093/api/v1/alerts\ncurl -s http://${ALERTM_IP}:9093/api/v1/alerts | jq -e \'"\'"\'.data | .[] | select(.labels | .alertname == "CephMonDown") | .status | .state == "active"\'"\'"\'\n\''
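
Note: the script runs under `set -e` inside `cephadm shell`, so status 5 is the exit status of the first failing command in it. (As an aside, the node-exporter probe hits /metric while the exporter serves /metrics; `curl -s` still exits 0 on an HTTP 404, so that line alone would not trip `set -e`.) The individual assertions can be replayed by hand; a sketch of the mon-down check against Prometheus, where the IP is a placeholder to be taken from `ceph orch host ls`:

    PROM_IP=172.21.15.78    # placeholder value
    curl -s "http://${PROM_IP}:9095/api/v1/alerts" \
        | jq -e '.data.alerts[] | select(.labels.alertname == "CephMonDown") | .state == "firing"'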

fail 7814084 2024-07-23 15:55:11 2024-07-23 15:58:07 2024-07-23 17:02:55 1:04:48 0:53:53 0:10:55 smithi main centos 9.stream orch:cephadm/mds_upgrade_sequence/{bluestore-bitmap centos_9.stream conf/{client mds mgr mon osd} fail_fs/yes overrides/{ignorelist_health ignorelist_upgrade ignorelist_wrongly_marked_down pg-warn pg_health syntax} roles tasks/{0-from/squid 1-volume/{0-create 1-ranks/2 2-allow_standby_replay/yes 3-inline/yes 4-verify} 2-client/fuse 3-upgrade-mgr-staggered 4-config-upgrade/{fail_fs} 5-upgrade-with-workload 6-verify}} 2
Failure Reason:

reached maximum tries (51) after waiting for 300 seconds

fail 7814085 2024-07-23 15:55:12 2024-07-23 15:59:08 2024-07-23 16:54:14 0:55:06 0:46:09 0:08:57 smithi main centos 9.stream orch:cephadm/upgrade/{1-start-distro/1-start-centos_9.stream-squid 2-repo_digest/repo_digest 3-upgrade/staggered 4-wait 5-upgrade-ls agent/off mon_election/connectivity} 2
Failure Reason:

Command failed on smithi002 with status 1: 'sudo /home/ubuntu/cephtest/cephadm --image quay.ceph.io/ceph-ci/ceph:squid shell -c /etc/ceph/ceph.conf -k /etc/ceph/ceph.client.admin.keyring --fsid 469d018c-490e-11ef-bcad-c7b262605968 -e sha1=4fb9f751b4567bf735172816a826f37e1d650dd6 -- bash -c \'ceph versions | jq -e \'"\'"\'.rgw | length == 1\'"\'"\'\''

dead 7814086 2024-07-23 15:55:13 2024-07-23 15:59:08 2024-07-24 04:10:25 12:11:17 smithi main centos 9.stream orch:cephadm/thrash/{0-distro/centos_9.stream 1-start 2-thrash 3-tasks/radosbench fixed-2 msgr/async-v1only root} 2
Failure Reason:

hit max job timeout

fail 7814087 2024-07-23 15:55:14 2024-07-23 16:00:19 2024-07-23 17:17:24 1:17:05 1:08:42 0:08:23 smithi main centos 9.stream orch:cephadm/mds_upgrade_sequence/{bluestore-bitmap centos_9.stream conf/{client mds mgr mon osd} fail_fs/yes overrides/{ignorelist_health ignorelist_upgrade ignorelist_wrongly_marked_down pg-warn pg_health syntax} roles tasks/{0-from/squid 1-volume/{0-create 1-ranks/2 2-allow_standby_replay/no 3-inline/yes 4-verify} 2-client/fuse 3-upgrade-mgr-staggered 4-config-upgrade/{fail_fs} 5-upgrade-with-workload 6-verify}} 2
Failure Reason:

reached maximum tries (51) after waiting for 300 seconds

fail 7814088 2024-07-23 15:55:15 2024-07-23 16:00:19 2024-07-23 16:23:04 0:22:45 0:12:24 0:10:21 smithi main ubuntu 22.04 orch:cephadm/workunits/{0-distro/ubuntu_22.04 agent/on mon_election/connectivity task/test_cephadm} 1
Failure Reason:

Command failed (workunit test cephadm/test_cephadm.sh) on smithi203 with status 1: 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=4fb9f751b4567bf735172816a826f37e1d650dd6 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin CEPH_BASE=/home/ubuntu/cephtest/clone.client.0 CEPH_ROOT=/home/ubuntu/cephtest/clone.client.0 CEPH_MNT=/home/ubuntu/cephtest/mnt.0 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/clone.client.0/qa/workunits/cephadm/test_cephadm.sh'
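
Note: a workunit is a script under qa/workunits in the Ceph source tree, cloned onto the test node and executed with the environment shown above. A stripped-down sketch of re-running it in place (the paths and CEPH_REF are the ones from this job and are only meaningful on the smithi node):

    cd /home/ubuntu/cephtest/clone.client.0/qa/workunits
    CEPH_ARGS="--cluster ceph" CEPH_ID=0 CEPH_CLI_TEST_DUP_COMMAND=1 \
        timeout 3h ./cephadm/test_cephadm.sh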