User Details
- User Since
- Apr 1 2015, 4:33 PM (502 w, 2 d)
- Availability
- Available
- LDAP User
- Moritz Mühlenhoff
- MediaWiki User
- MMuhlenhoff (WMF) [ Global Accounts ]
Today
This could a great test case for the new apt staging repo (to easily e.g. upgrade the Hadoop test cluster)! If you're interested, we can figure out the details next week.
Yesterday
I had a closer look and I can confirm that the use of openssl is self-contained within the gitlab monorepo package:
Wed, Nov 13
Final status update: The VMs with the legacy setup have been removed and the obsolete Puppet code removed.
I created https://gerrit.wikimedia.org/r/c/operations/puppet/+/1090807 and also added the three new groups to
https://wikitech.wikimedia.org/w/index.php?title=SRE/LDAP/Groups&diff=prev&oldid=2243774 (which is our canonical list of NDA-sensitive groups)
BTW, there is also a much simpler option than writing LDIFs, running the following on ldap-maint1001 would have the same effect:
Tue, Nov 12
Thanks for the update, there's is no hurry, since we still have the old server(s), which ganeti2042 would eventually replace. I was just curious :-)
Will Supermicro send a replacement CPU for this server?
This was a lingering issue caused by an interface name change caused by the update to bookworm, now resolved.
Mon, Nov 11
Happened again on ganeti2031 today.
This behaviour hasn't changed compared to the legacy implementation: Every channel only gets created once there is an edit event for a given combination of language and wiki. Hence, #en.wikimedia will usually be instantly available after a restart of ircstream (the software powering irc.wikimedia.org), while less active wikis might take a little longer.
Fri, Nov 8
The replacement mainboard probably shipped a newer BIOS revision which now by default enables SGX. The state doesn't really affects us either way, so we can also simply close the task (and anyone who ever runs into it finds a reference).
We don't use or need SGX for virtualisation servers. It's a feature invented by Intel (AMD never adopted it, which is telling by itself) which provides an encrypted storage (in their terminology an "enclave") which is also inaccesible to the OS. In theory this would allow some interesting use cases, but in practice the predominant use case is DRM (4k UHD BluRays need it).
One more update: The upstream author (Faidon) of ircstream fixed the underlying bug in https://github.com/paravoid/ircstream/commit/7ef7acea12020189dd450c2de6a91d8baaa18942
Thu, Nov 7
Wed, Nov 6
The irc.wikimedia.org recently switched to ircstream, which has a different architecture, marking this task as declined.
All done!
I had been running into issues with moving VMs to ganeti1041 this morning (which is already added to the Ganeti cluster) and after debugging various OS-level aspects I finally realised that ganeti1041 also lost /dev/kvm? Was it also re-re-reprovisioned? It's not mentioned on this task at all.
Tue, Nov 5
Although Moritz was saying that the Intel Linux driver development cycle leaves a lot to be desired, with frequent updates and breaks in backwards compatibility.
Mon, Nov 4
This is just some log spam from an ongoing Ganeti installation.
And then set up some daily auto-deploy or similar
The data set is really small, I'd suggest to simply pull in the data with rsync during the container build/deploy.
The generation of the reports already lives outside of the miscweb hosts; it runs on puppetdb2003 and needs to continue to run there as it needs direct access to the puppet database. profile::microsites::os_reports basically just rsyncs these files from puppetdb2003 to the local vhost. If you want to move it to k8s, you can simply adapt the sync so that it deploys to Kubernetes instead.
@tappof: thanks for opening a task, but we usually deal with these via Phab tasks. These two both relate to hosts being setup, so some churn is to be expected. To reduce confusion I've just merged a patch to send these mails only to the SRE IF team alias: https://gerrit.wikimedia.org/r/c/operations/puppet/+/1087133
I had a look at the IPMI logs and there are still two more of these errors logged after you reseated the memory on Friday, so it seems this wasn't the memory:
Wed, Oct 30
For transparency: The ssotest03 user is used by myself for tests and has been temporarily added to cn=logstash-access.
irc.wikimedia.org is powered by ircstream 1.0 with no known bugs, marking this as resolved. The old VMs will be removed in two weeks.