I've therefore "pinned" the PR-6.1 jobs to Node 1 right now in order to get stable successful results but I wanted to know if the persons in charge of running Jenkins are aware of this?
I am not aware of differences between the nodes, except that node 3 has indeed been down for a while now.
But I have noticed often that parallel runs on the same node can easily go wrong. So there is no full isolation. I try not to start many jobs at once, if I can avoid it.
Moving to running all tests in a Docker container could help, then they would surely be isolated. But that takes effort.
Or move everything over to GitHub, but that takes effort as well, rewiring all the nice things that we currently have with Jenkins and mr.roboto.
Or move all 100 or so plone packages to a mono repo, but this also takes effort.
I restarted node 3.
Node 3 and node 4 are VMs hosted on machines that also have other things to do and they are also quite old, but last time I checked they were actually faster than node 1.
The situation might have changed.
Thanks for the info ... I've started the robottest builds once again sequentially and all is green now. I've also unpinned the 6.1 Jobs in order to let them choose on which node they are running ...
IIRC during the jenkins rfbrowser config session with @gforcada last week we saw a full disk on Node3 ... Node4 is maybe full too. Not sure about that but maybe the jobs need much more space on disk because of the multiple rfbrowser init commands.
I think there is something not configured properly there, as every time a job with robot tests is run browsers get downloaded over and over again.
I remember configuring it to keep the browsers in a global folder, but maybe with the last changes that has to be updated.
I'm having problems ssh'ing into Node3 or Node4 as of late, and usually are rather slow (while doing apt-get update for example one can notice that right away).
And I added our machine as Node5 to Jenkins with 8 workers so the Jenkins CI setup and the robottests are now more stable than ever before
A note to the Node installation: using the ansible playbook above wasn't working for me because of many deprecation warnings. I took the configs as a basis and installed the needed things manually.