[TBR] Viva La Roast! (Perpetual Thread V)

EG

Ars Scholae Palatinae
626
[url=http://arstechnica.com/civis/viewtopic.php?p=28753693#p28753693:ne0m3la6 said:
APSBiker[/url]":ne0m3la6]Yesterday (27 Mar) we had 3 Ars users in the top 10 for Moo! 3 in the top 20 in dnetc stats too. Still 211 years out from completion though.

Really sad to see you leave the team, but good luck with your new team :)
Happy Crunchin.
 
Well I just re added Moo to try to add for a little while. running on a R9 270x at about 26-27 min/wu.

though I have noticed that for the work units that I have tried to process the dnetc_r72_1433334486_12_768_0 complete without issue but the dnetc_r72_1433334486_14_768_0 will start and just hang there and don't seem to be using any GPU. I am currently cruncing a wu with a 13 in the name and so far it looks good at 20% into the wu. the 14's would stop using the GPU very early if at all at around the 1.4% mark
 
well it looks like the issue is not just the 14 wu's. the sample size was small to begin with but out of the first 8 wu's the only ones that complted were the 12's and the fails were both 12's. But later this afternoon I found that a 12 got stuck also where it was not using the gpu. this time instead of aborting the wu I decided to see if exiting BOINC and restarting would make a difference ... It did .. the stuck wu restarted and began to use the GPU.... bad thing is that out of 14 wu's so far I had to abort or restart 3 different wu's. So unless I figure something out where this dows not happen anymore I may have to just complete the wu's in my queue and move on for now.
 

EG

Ars Scholae Palatinae
626
Have you checked the Moo website? anything there on the problem?

I've had WU's lock up on me before I just restart and they usually finish, if they have gone for too long I delete them. But mine is only occasionally like one in a thousand WU's.

I was considering popping out the 780's doing GPUGrid and popping in a couple more 7970's for a bit just to pump a few more WU's into the total and see if they quit chasin.
They can't catch us but they are recrutin trying to.
 
I thing its my GPU running too hot.....I see it at 85-88c and I think it may be shutting down at 90 or so.... I have not checked if I have that see in my config or not but that is pretty high anyways and I may not want to go higher. My gpu crunching will be tapering off somewhat now that it is getting hot. will be in the 90's this week and my upstairs ac just cant keep up ... especially with my downstairs ac totally not working. I would replace it but I have not been working for a while now and just don't have the cash.

I may try and swap out the 270x with a 750ti. I saw on the site that they have optimized their code for Nvidia cards ... so I might see how that one performs.
 

EG

Ars Scholae Palatinae
626
[url=http://arstechnica.com/civis/viewtopic.php?p=29152017#p29152017:1pvmmlpm said:
MechWarrior[/url]":1pvmmlpm]I thing its my GPU running too hot.....I see it at 85-88c and I think it may be shutting down at 90 or so.... I have not checked if I have that see in my config or not but that is pretty high anyways and I may not want to go higher. My gpu crunching will be tapering off somewhat now that it is getting hot. will be in the 90's this week and my upstairs ac just cant keep up ... especially with my downstairs ac totally not working. I would replace it but I have not been working for a while now and just don't have the cash.

I may try and swap out the 270x with a 750ti. I saw on the site that they have optimized their code for Nvidia cards ... so I might see how that one performs.

Yeah, Thermal throttling will give the appearance of a "stuck" WU also. What I did to resolve the heat issues with dual Maltas is devolt and declock the card in Afterburner. Normally they run at 1.2 volt and 1K clock speed. Running Moo at full bore they shoot right to 99 degrees and throttle topping out at 102 degrees. You can run them as low as 500 hertz and .95 volt. My malta's and tahiti's refuse to run under .95 I suspect the Hawaii's are the same being based upon the same chip.
I've managed to keep them all running now for three months with little to no interference at maximum devolt and medium declock settings. It doesn't seem to hurt the output any, they still crank out WU's at a 20 to 24 minute clip 95+ % usage.

If your running at 85-88 degrees you might want to try slight devolting from stock. It helps a lot. Stock operating temp is 85 degrees, keep them under 84 and your good to go.
 

EG

Ars Scholae Palatinae
626
[url=http://arstechnica.com/civis/viewtopic.php?p=29180973#p29180973:4nodasch said:
MechWarrior[/url]":4nodasch]Congrats EG.....

On my end I will have to postpone any Moo work. I have dropped the voltage down 8% and the the clock speed down to 950 and I still see my card hit 95C. And of course still getting the stalled workunits. I will have to wait till it gets cooler outside....

I can completely understand, If it wasn't for the 14k BTU in the window I would be down to three boxes right about now. Back when I was doing Collatz I didn't know any better, and suffered thru 105 deg days in the room where they were at, not anymore.

Haven't reached full summer yet here in the PNW but when late July/August comes around I expect I'll have to take a few down myself. Not worth burning anything out.

Take care brother...
 
[url=http://arstechnica.com/civis/viewtopic.php?p=29376883#p29376883:3dggk79x said:
EG[/url]":3dggk79x]Well I guess hardware issues means that they are not producing WU's until they get it fixed.
Anyone notice that the dnet stats server hasn't changed/updated in 6 days?

Yeah, staff blogs say that stats will update when Keymaster works again.
My box ran dry 2 days ago.
 

LeeF

Ars Centurion
387
Subscriptor
Looks like resellers are dumping Xeon Phis. You can get the higher-end 5110p for just a bit more than that. I'm kinda tempted to give it a try.

(Edit: I should probably clarify that it's the older 22nm ones getting dumped, Intel will be releasing a newer 14nm one soon which is supposed to be much more powerful.)

Guru, did you ever try running OGR-NG on a Xeon Phi?
 
[url=http://arstechnica.com/civis/viewtopic.php?p=29481225#p29481225:s7e6crat said:
LeeF[/url]":s7e6crat]Looks like resellers are dumping Xeon Phis. You can get the higher-end 5110p for just a bit more than that. I'm kinda tempted to give it a try.

(Edit: I should probably clarify that it's the older 22nm ones getting dumped, Intel will be releasing a newer 14nm one soon which is supposed to be much more powerful.)

Guru, did you ever try running OGR-NG on a Xeon Phi?
I wish but there is little software support. The other issue is they require specific Intel server boards for running.
 

EG

Ars Scholae Palatinae
626
Well Guys, It's been a nice run, but alas I have need for my boxen to generate some financial returns. I've picked up an opportunity to make some income off the power these things consume.

My participation will drop for a while. I will keep what ever I can running as long as I can.

It's a contract job with a termination at some point, how long I don't know. But gotta go where they do the most good.

It's been a fun run....

EG
 

LeeF

Ars Centurion
387
Subscriptor
Well, I rescued a toy from getting scrapped at work the other day. Four 12-core Opterons and 320GB of RAM, and they were literally going to scrap it! Unbelievable.

Not sure how long I can keep it running though. I can hear it anywhere in the house even through two closed doors, and it's not a steady white noise, either -- the fans are constantly cycling up an down. Also it noticeably heats my entire basement.