<html><head><meta http-equiv="Content-Type" content="text/html charset=us-ascii"><meta http-equiv="Content-Type" content="text/html charset=us-ascii"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><div class="">Thanks -- what you did was great, but FWIW for these runs, which were exploratory, it wouldn't have mattered much either way. When it gets messy is when we're collecting statistics over large numbers of runs to get solid evidence that something works better than something else, etc. Then we have to worry about whether the death/restarting of runs introduces bias, which it often can... but not here, so all is well!</div><div class=""><br class=""></div><div class="">Thanks!</div><div class=""><br class=""></div><div class=""> -Lee</div><div class=""><br class=""></div><div class=""><br class=""></div><br class=""><div><blockquote type="cite" class=""><div class="">On Oct 12, 2017, at 9:28 AM, Wm. Josiah Erikson <<a href="mailto:wjerikson@hampshire.edu" class="">wjerikson@hampshire.edu</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div class="">OK, we should be safe to restart jobs and not have them crash. Lee, I<br class="">restarted all error tasks on the jobs that I knew were due to this crash<br class="">and then realized maybe you wouldn't have wanted me to do that - sorry<br class="">if that's the case.<br class=""><br class="">I noticed you hadn't done retry all error tasks on the ones that died<br class="">because compute-0-2's hard drive died, which made me think maybe I had<br class="">made a mistake in restarting the other ones. If not, you can just<br class="">right-click the "retry all error tasks" on those jobs.<br class=""><br class=""> -Josiah<br class=""><br class=""><br class="">On 10/12/17 7:41 AM, Wm. Josiah Erikson wrote:<br class=""><blockquote type="cite" class="">Hi all,<br class=""><br class=""> In the past 24 hours, compute-1-3 (yesterday), and then last night,<br class="">compute-2-9 through compute-2-12 and compute-2-30 all rebooted<br class="">themselves and reinstalled due to overheating and/or overwhelming the<br class="">UPSes they are plugged into.<br class=""><br class=""> The A/C in the room can't keep up and neither can the UPSes, so I<br class="">think I'll try at least temporarily retiring the oldest and least<br class="">performance/watt-efficient nodes, which is I think at this point rack 4,<br class="">and see if that solves both problems, since we aren't using them for<br class="">much anyway.<br class=""><br class=""></blockquote><br class="">-- <br class="">Wm. Josiah Erikson<br class="">Assistant Director of IT, Infrastructure Group<br class="">System Administrator, School of CS<br class="">Hampshire College<br class="">Amherst, MA 01002<br class="">(413) 559-6091<br class="">pronouns: he/him/his<br class=""><br class="">_______________________________________________<br class="">Clusterusers mailing list<br class=""><a href="mailto:Clusterusers@lists.hampshire.edu" class="">Clusterusers@lists.hampshire.edu</a><br class=""><a href="https://lists.hampshire.edu/mailman/listinfo/clusterusers" class="">https://lists.hampshire.edu/mailman/listinfo/clusterusers</a><br class=""></div></div></blockquote></div><br class=""><div class="">
<div style="color: rgb(0, 0, 0); letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><div style="color: rgb(0, 0, 0); letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><div style="color: rgb(0, 0, 0); letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><div style="color: rgb(0, 0, 0); letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><span class="Apple-style-span" style="border-collapse: separate; color: rgb(0, 0, 0); font-family: Helvetica; font-style: normal; font-variant-ligatures: normal; font-variant-position: normal; font-variant-caps: normal; font-variant-numeric: normal; font-variant-alternates: normal; font-variant-east-asian: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; border-spacing: 0px; -webkit-text-decorations-in-effect: none; -webkit-text-stroke-width: 0px;"><span class="Apple-style-span" style="font-size: 12px;"><div style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><span class="Apple-style-span" style="border-collapse: separate; color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-ligatures: normal; font-variant-position: normal; font-variant-caps: normal; font-variant-numeric: normal; font-variant-alternates: normal; font-variant-east-asian: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; border-spacing: 0px; -webkit-text-decorations-in-effect: none; -webkit-text-stroke-width: 0px;"><div style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><span class="Apple-style-span" style="border-collapse: separate; color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-ligatures: normal; font-variant-position: normal; font-variant-caps: normal; font-variant-numeric: normal; font-variant-alternates: normal; font-variant-east-asian: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; border-spacing: 0px; -webkit-text-decorations-in-effect: none; -webkit-text-stroke-width: 0px;"><div style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><span class="Apple-style-span" style="border-collapse: separate; color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-ligatures: normal; font-variant-position: normal; font-variant-caps: normal; font-variant-numeric: normal; font-variant-alternates: normal; font-variant-east-asian: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; border-spacing: 0px; -webkit-text-decorations-in-effect: none; -webkit-text-stroke-width: 0px;"><div style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><div style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><p style="margin: 0px;" class=""></p><div class=""><div apple-content-edited="true" style="orphans: auto; widows: auto;" class="">--</div><div apple-content-edited="true" style="orphans: auto; widows: auto;" class="">Lee Spector, Professor of Computer Science</div><div apple-content-edited="true" style="orphans: auto; widows: auto;" class="">Director, Institute for Computational Intelligence<br class="">Hampshire College, Amherst, Massachusetts, 01002, USA<br class=""><a href="mailto:lspector@hampshire.edu" class="">lspector@hampshire.edu</a>, <a href="http://hampshire.edu/lspector/" class="">http://hampshire.edu/lspector/</a>, 413-559-5352</div></div></div></div></span></div></span></div></span></div></span></span></div></div></div></div>
</div>
<br class=""></body></html>