Infra Team Meeting Agenda for November 2, 2021

Clark Boylan cboylan at sapwetik.org
Mon Nov 1 22:43:17 UTC 2021


We will meet on November 2, 2021 at 19:00 UTC in #opendev-meeting with this agenda:

== Agenda for next meeting ==

* Announcements
** Gerrit User Summit details arriving soon. I've been told they would be interested to hear from us on how we do automated testing and management of Gerrit.

* Actions from last meeting

* Specs Review
** Mailman 3 spec https://review.opendev.org/810990

* Topics
** Improving OpenDev's CD throughput (clarkb 20211102)
*** We can run many of our jobs in parallel in all of our CD pipelines. But this requires we properly document/address dependencies
**** Need to understand our job dependencies and properly note them in Zuul config or address them by combining jobs.
***** Example 1: Combine service-gitea-lb and service-gitea jobs.
***** Example 2: Combine letsencrypt and nameserver jobs
***** Example 3: Have all jobs with webserver config express a dependency on the letsencrypt job
**** Suggest we document the known job dependencies in a human readable format, then encode this into zuul, then we can switch to parallel runs.
**** https://review.opendev.org/c/opendev/system-config/+/807672
***** should list dependencies for all jobs
***** zuul doesn't trigger on this?  not sure on best approach to make it mergable
**** https://review.opendev.org/c/opendev/base-jobs/+/807807
***** currently every executor adds keys for bridge, then logs in and clones system-config before running playbooks
***** this change makes split jobs to do this.  however, production remains the same as both are called.
**** https://review.opendev.org/c/opendev/system-config/+/807808
***** this is a follow-on that adds a base job to clone system-config, and stops the other production jobs re-cloning.
***** this job must run first, but then all other jobs can run in parallel, as they are all in the same buildset and using the same "view" of system-config for that particular run
** Gerrit Account cleanups (clarkb 20211102)
*** 33 conflicts remain. Clarkb has written notes on proposed plans for each user in the comments of review02:~clarkb/gerrit_user_cleanups/audit-results-annotated.yaml
** Fedora 34 test node booting problems (clarkb 20211102)
*** Changes to Fedora's kernel packaging broke Xen
*** Not yet sure if that may have also somehow broken OVH and iweb.
** Zuul multi scheduler setup (clarkb 20211102)
*** Zuul ran with two schedulers for the first time over the last weekend.
*** First jobs started by one scheduler and completed by another ran.
*** Had to revert due to bugs in caching.
*** Expect Zuul restarts as the scale out scheduler work in Zuul progresses.
** FIPS testing in our CI system (clarkb 20211102)
*** There is interest in testing various pieces of software against FIPS enabled systems in our Zuul.
*** We are not building special FIPS images instead a Zuul role exists to update supported platforms and boot them into FIPS mode.
*** Some tests have problems with ephemeral state being lost if the FIPS role runs too late. Then tests fail for unexpected reasons. Worth checking on reboot ordering relative to other test setup if there are problems.

* Open discussion



More information about the service-discuss mailing list