[Clusterusers] Nodes Down?

Thomas Helmuth thelmuth at cs.umass.edu
Thu Nov 7 12:58:47 EST 2013


No problem on my end. I don't have anything very important running right
now.

-Tom


On Thu, Nov 7, 2013 at 11:34 AM, Wm. Josiah Erikson <wjerikson at hampshire.edu> wrote:

> ...and that wasn't good enough. Circuit blew again. Moved one UPS to a
> different circuit on the other side of the room using an extension cord :)
> Cross your fingers. Sorry about this.
>     -Josiah
>
>
> On 11/7/13 10:17 AM, Wm. Josiah Erikson wrote:
>
>> Actually a circuit blew. I moved a few things off that circuit onto a
>> different circuit. We're pushing the limit of the amount of power we have
>> available in that room.
>>     -Josiah
>>
>> On 11/5/13 9:15 PM, Wm. Josiah Erikson wrote:
>>
>>> It looks like some job crashed them. I'd have to dig through and see
>>> what job hit them at that point. That's just a guess - looks like network,
>>> load, and CPU spiked on most of them right before they crashed. I'll know
>>> more when I can look at them "in person", which might be a couple of days
>>> since I'm out rather sick and have been ordered by my boss not to come in
>>> tomorrow :)
>>>     -Josiah
>>>
>>>
>>> On 11/5/13 7:06 PM, Thomas Helmuth wrote:
>>>
>>>> Looks like a bunch of nodes went down in the last 12 hours. Any idea
>>>> what happened?
>>>>
>>>> I'm not going to bother restarting the crashed Digital Multiplier 4-bit
>>>> runs, since it looks like we won't be using those results in a paper. I'll
>>>> let the others keep going though, since I want to see what happens.
>>>>
>>>> -Tom
>>>>
>>>
>>>
>>
> --
> Wm. Josiah Erikson
> Assistant Director of IT, Infrastructure Group
> System Administrator, School of CS
> Hampshire College
> Amherst, MA 01002
> (413) 559-6091
>
>