[Buildroot] [Hosting] Unplanned outage: Hypervisor issue with gprod1 on primary Ganeti cluster

Peter Korsgaard peter at korsgaard.com
Wed Jun 13 20:00:26 UTC 2018


>>>>> "Lance" == Lance Albertson <lance at osuosl.org> writes:

FYI,

If you had issues accessing sources.buildroot.{net,org} today, this was
most likely the reason.

The machine is up and running again.

 > All,
 > At approximately 2:36AM PDT (0900 UTC), one of the hypervisors (gprod1) in
 > our primary Ganeti cluster started having hardware issues. This took down
 > all of the instances running on that node. I attempted to bring the node
 > back online however the hardware issue prevented it to come back online. At
 > that point I failed all of the VM instances over to their secondary nodes
 > and forced another node to become the Ganeti master (since gprod1 WAS the
 > master). All of the instances were back online by around 7:40AM PDT (1400
 > UTC).

 > Everything at this point seems to be back to normal (except for gprod1). I
 > will look into bringing gprod1 back online later today.

 > Thank you and sorry for the outages this caused.

 > -- 
 > Lance Albertson
 > Director
 > Oregon State University | Open Source Lab

-- 
Bye, Peter Korsgaard


More information about the buildroot mailing list