<div dir="ltr">If they are the latest runs, that might take awhile, since those are the runs of string differences and pig latin, never solved ones if I remember correctly.</div><div class="gmail_extra"><br clear="all"><div><div class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><div dir="ltr">Best,<div>Eva</div></div></div></div></div></div>
<br><div class="gmail_quote">On Tue, Jan 31, 2017 at 11:50 AM, Lee Spector <span dir="ltr"><<a href="mailto:lspector@hampshire.edu" target="_blank">lspector@hampshire.edu</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><br>
Thanks,<br>
<br>
I'll just let Eva's runs keep going for the moment.<br>
<span class="HOEnZb"><font color="#888888"><br>
-Lee<br>
</font></span><div class="HOEnZb"><div class="h5"><br>
<br>
<br>
> On Jan 31, 2017, at 11:33 AM, Wm. Josiah Erikson <<a href="mailto:wjerikson@hampshire.edu">wjerikson@hampshire.edu</a>> wrote:<br>
><br>
> ah OK - yeah it wasn't running tractor so that's why. Unfortunately I<br>
> just screwed that up for you. Compute-1-18 is one of the faster nodes,<br>
> so you chose well. Four of Eva's jobs just got dispatched to that node,<br>
> but I think if you "eject and reschedule" them, it probably won't screw<br>
> things up for her? I have already NIMBYed the node. Sorry about that!<br>
><br>
> If you need me to do that because it says you don't have permission, let<br>
> me know.<br>
><br>
> -Josiah<br>
><br>
><br>
><br>
> On 1/31/17 11:23 AM, Lee Spector wrote:<br>
>> Thanks Josiah,<br>
>><br>
>> FWIW I'm currently and for the last few days running jobs via ssh on compute-1-18... and it seems to have been working fine.<br>
>><br>
>> If you'd rather that I use another node then let me know. I chose it more or less at random, but I think it was lightly loaded when I did so... maybe because of the reboot?<br>
>><br>
>> Ideally I'd use the fastest nodes available for these "manual" runs, and dabbling on rack 4 leads me to believe that the nodes with large numbers of cores are a bad idea for this.<br>
>><br>
>> -Lee<br>
>><br>
>>> On Jan 31, 2017, at 11:10 AM, Wm. Josiah Erikson <<a href="mailto:wjerikson@hampshire.edu">wjerikson@hampshire.edu</a>> wrote:<br>
>>><br>
>>> These two nodes weren't running tractor - they had rebooted themselves<br>
>>> and reinstalled. Compute-1-18 5 days, 11:39 uptime and compute-1-3 1<br>
>>> day, 5:18. Not sure why - neither has anything in the logs nor shows<br>
>>> anything particularly suspicious in ganglia. I have restarted tractor<br>
>>> and noted this occurrance to see if it's a pattern or random. Sometimes<br>
>>> some jobs do just trigger random reboots if they randomly use up all the<br>
>>> RAM, invoking the oom-killer, but usually that will leave something in<br>
>>> the logs.<br>
>>><br>
>>><br>
>>> --<br>
>>> Wm. Josiah Erikson<br>
>>> Assistant Director of IT, Infrastructure Group<br>
>>> System Administrator, School of CS<br>
>>> Hampshire College<br>
>>> Amherst, MA 01002<br>
>>> (413) 559-6091<br>
>>><br>
>>> ______________________________<wbr>_________________<br>
>>> Clusterusers mailing list<br>
>>> <a href="mailto:Clusterusers@lists.hampshire.edu">Clusterusers@lists.hampshire.<wbr>edu</a><br>
>>> <a href="https://lists.hampshire.edu/mailman/listinfo/clusterusers" rel="noreferrer" target="_blank">https://lists.hampshire.edu/<wbr>mailman/listinfo/clusterusers</a><br>
>> --<br>
>> Lee Spector, Professor of Computer Science<br>
>> Director, Institute for Computational Intelligence<br>
>> Hampshire College, Amherst, Massachusetts, USA<br>
>> <a href="mailto:lspector@hampshire.edu">lspector@hampshire.edu</a>, <a href="http://hampshire.edu/lspector/" rel="noreferrer" target="_blank">http://hampshire.edu/lspector/</a><wbr>, <a href="tel:413-559-5352" value="+14135595352">413-559-5352</a><br>
>><br>
>> ______________________________<wbr>_________________<br>
>> Clusterusers mailing list<br>
>> <a href="mailto:Clusterusers@lists.hampshire.edu">Clusterusers@lists.hampshire.<wbr>edu</a><br>
>> <a href="https://lists.hampshire.edu/mailman/listinfo/clusterusers" rel="noreferrer" target="_blank">https://lists.hampshire.edu/<wbr>mailman/listinfo/clusterusers</a><br>
><br>
> --<br>
> Wm. Josiah Erikson<br>
> Assistant Director of IT, Infrastructure Group<br>
> System Administrator, School of CS<br>
> Hampshire College<br>
> Amherst, MA 01002<br>
> <a href="tel:%28413%29%20559-6091" value="+14135596091">(413) 559-6091</a><br>
><br>
> ______________________________<wbr>_________________<br>
> Clusterusers mailing list<br>
> <a href="mailto:Clusterusers@lists.hampshire.edu">Clusterusers@lists.hampshire.<wbr>edu</a><br>
> <a href="https://lists.hampshire.edu/mailman/listinfo/clusterusers" rel="noreferrer" target="_blank">https://lists.hampshire.edu/<wbr>mailman/listinfo/clusterusers</a><br>
<br>
--<br>
Lee Spector, Professor of Computer Science<br>
Director, Institute for Computational Intelligence<br>
Hampshire College, Amherst, Massachusetts, USA<br>
<a href="mailto:lspector@hampshire.edu">lspector@hampshire.edu</a>, <a href="http://hampshire.edu/lspector/" rel="noreferrer" target="_blank">http://hampshire.edu/lspector/</a><wbr>, <a href="tel:413-559-5352" value="+14135595352">413-559-5352</a><br>
<br>
______________________________<wbr>_________________<br>
Clusterusers mailing list<br>
<a href="mailto:Clusterusers@lists.hampshire.edu">Clusterusers@lists.hampshire.<wbr>edu</a><br>
<a href="https://lists.hampshire.edu/mailman/listinfo/clusterusers" rel="noreferrer" target="_blank">https://lists.hampshire.edu/<wbr>mailman/listinfo/clusterusers</a><br>
</div></div></blockquote></div><br></div>