Tech:Organisation

System administrators may have different access and different permissions on the farm. This page documents the roles of all groups and their responsibilities.

Site Reliability Engineering
Site Reliability Engineering (SRE, formerly known as 'Operations') is the department whose are responsible for day-to-day administration of the servers/services, acting during incidents and advising on managing the IT assets. They have unrestricted access on servers and contact the service providers (e.g. OVH) in case of issues with services.

Senior Site Reliability Engineer
A Senior Site Reliability Engineer (SSRE) is a Site Reliability Engineer considered to have a significant amount of experience in their SRE role. They have extra privileges in the approvals policy, check the integrity of the SRE director's actions and advise on the larger projects within SRE.

Director of Site Reliability Engineering
The Director of Site Reliability Engineering is a member of Site Reliability Engineering, delegated by Miraheze Limited's Board of Directors, to perform day-to-day management of Miraheze's Technical Operations. They maintain contact with the Service Providers, work as the budget holder in accordance with the approvals policy for the technical budget and advise on the larger projects within SRE.

Nominated Director
At least of the members of Miraheze Limited's Board of Directors is a person nominated by the Technical Team to be their representative. The nominated director is a point of contact within the Technical Team for all kinds of questions and respresents the interests of the Technical Team. They also work with the SRE director to ensure there is sufficient budget to keep the infrastructure running. While not mandatory, the SRE director and nominated director are likely to be the same person (helps to keep the lines short).

MediaWiki
MediaWiki Administrators (also known as 'mediawiki-admins', 'mw-admins', formerly known as 'MediaWiki System Administrators') are system administrators responsible for MediaWiki operations. They handle deployment of MediaWiki configuration changes, installing new extensions/skins, monitoring the security of the MediaWiki installation and advising on changes. They have access to the MediaWiki servers, with some sudo access. The full sudo access group is called 'mediawiki-roots' and the role is similar to mediawiki-admins. MediaWiki Administrators are delegated by Site Reliability Engineering.

Cache
Cache Administrators ('cache-admins') is the name of the group of system administrators responsible for managing the single layered cache infrastructure. This mostly details for now managing Varnish, but is likely to expand in the future.

Puppet
Puppet is mainly managed by Site Reliability Engineers but there is also a Puppet User ('puppet-users') group. The purpose of this group is to allow non-ops system administrators to manage SSL certificates.