Simplify oVirt storage configuration

Description

Today's outage was a clear reminder that our current storage configuration does not serve us well. We hardly know how to debug it, it does not seem to be resilient to the very issues it was supposed to protect against, and it introduces potential failure scenarios of its own.

I suggest we implement a new storage layout that meets the following criteria:

Ultimate simplicity at the lower level of the stack. More specifically:
1. The storage servers should be simple NFS or iSCSI servers. No DRBD and no exotic file systems.
2. Only simple storage will be presented to oVirt for use as storage domains.
Separation of resources between critical services - the "Jenkins" master, for example, should not share resources with the "resources" server or anything else. The separation should hold true down to the physical spindle level.
Duplication of services and use of local storage where possible - this is a longer-term effort, but we have some low-hanging fruit here, such as Artifactory, where simple DNS/LB-based fail-over between two identical hosts would probably suffice.
Complexity only where needed, and up the stack. For example, we can just have the storage for Jenkins mirrored at the VM level with fail-over to a backup VM.
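
The spindle-level separation criterion could be checked mechanically against an inventory. The following is only a minimal sketch, assuming a hypothetical mapping from each service to the physical disks backing it; the service and disk names are illustrative and not taken from the actual environment.

```python
from itertools import combinations


def shared_spindles(layout):
    """Return {(service_a, service_b): shared_disks} for every pair of
    services that is backed by at least one common physical disk."""
    conflicts = {}
    # Sort by service name so the pair keys come out in a stable order.
    for (a, disks_a), (b, disks_b) in combinations(sorted(layout.items()), 2):
        common = set(disks_a) & set(disks_b)
        if common:
            conflicts[(a, b)] = common
    return conflicts


# Hypothetical inventory: service name -> physical disks behind its storage.
layout = {
    "jenkins": {"disk02", "disk03", "disk04", "disk05"},
    "resources": {"disk06", "disk07", "disk08"},
    "artifactory": {"disk08", "disk09"},  # deliberately overlaps "resources"
}

for pair, disks in shared_spindles(layout).items():
    print(f"{pair[0]} and {pair[1]} share spindles: {sorted(disks)}")
```

An empty result means no two services share a disk; any entry points at a pair that violates the separation rule.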

Subtasks

100% Done

Linked issues

Activity

Former user August 23, 2018 at 3:05 PM

Both storage servers rebuilt to offer block storage via iSCSI plus NFS for some use cases. Closing the tracker ticket.

Former user August 8, 2018 at 9:10 AM
Edited

NFS migration complete, storage01 shut down and can be rebuilt.

Here's the partitioning from storage02 - I will likely repeat it unless we need some other specifics:

Physical size	Logical size	RAID	name	use
2x900G	0.9T	RAID1	centos	OS plus NFS shares
4x900G	1.8T	RAID10	jenkins	Jenkins
6x900G	3.6T	RAID50	prod-1	prod systems tier 1 like resources
4x900G	2.7T	RAID5	prod-2	prod systems tier 2
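
As a sanity check on the logical sizes in the table, the usable capacity per RAID level can be recomputed from the disk counts. This is a rough sketch that assumes 900 GB disks, ignores filesystem and controller overhead, and assumes the RAID50 array is built from two RAID5 groups (the comment does not state the group count); the function name is made up for illustration.

```python
DISK_GB = 900  # size of each physical disk, per the table


def usable_gb(raid, disks, disk_gb=DISK_GB, raid50_groups=2):
    """Approximate usable capacity in GB for the RAID levels in the table."""
    if raid == "RAID1":
        data_disks = 1  # all disks mirror the same data
    elif raid == "RAID10":
        data_disks = disks // 2  # striped mirror pairs
    elif raid == "RAID5":
        data_disks = disks - 1  # one disk's worth of parity
    elif raid == "RAID50":
        # Assumption: striped RAID5 groups, one parity disk per group.
        data_disks = disks - raid50_groups
    else:
        raise ValueError(f"unsupported RAID level: {raid}")
    return data_disks * disk_gb


# (name, RAID level, number of disks) as listed in the table.
arrays = [
    ("centos", "RAID1", 2),
    ("jenkins", "RAID10", 4),
    ("prod-1", "RAID50", 6),
    ("prod-2", "RAID5", 4),
]

for name, raid, disks in arrays:
    print(f"{name}: {usable_gb(raid, disks) / 1000:.1f}T")
```

Under these assumptions the results match the logical sizes listed: 0.9T, 1.8T, 3.6T and 2.7T.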

The rebuilt server also needs VLANs configured, the BIOS patched, and PXE enabled.

Former user June 1, 2018 at 11:52 AM

4.2.3 is out and the production HE environment is up-to-date.

New ticket logged to track NFS migration: OVIRT-2075

Eyal Edri April 30, 2018 at 7:34 AM

Let's target it for after 4.2.3, once we upgrade HE to the latest 4.2.3 and confirm it works well.

Eyal Edri April 1, 2018 at 11:51 AM

Can you give an update on this? What is the plan for reprovisioning the next storage server?

Done

Details
Assignee
Former user (Deactivated)
Reporter
Barak Korren (Deactivated)
Blocked By
pending upgrade of HE to 4.2.3 and verifying no open bugs on HE
Components
Priority
Highest

Created June 29, 2017 at 8:05 AM

Updated September 2, 2018 at 3:50 PM

Resolved August 23, 2018 at 3:05 PM
