2014-04-11, 18:14  #1
(loop (#_fork))
Feb 2006
Cambridge, England
2·3·29·37 Posts 
Performance of cudaecm on newer hardware?
On each of my GTX580 cards, I see 512 stage-1 curves run to B1=1e8 in six hours. Each card burns about 200 watts while doing this; my outbuilding is getting moderately warm and my electricity bill quite substantial.
Has anyone tried cudaecm on newer cards? A 750Ti would be very interesting, whilst Kepler-series cards (6xx or 7xx) would have the advantage that they'll be readily available on eBay once the 8xx start to appear.
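For scale, here is what those figures work out to per curve. This is only a back-of-envelope sketch in Python; it assumes the 200 W figure holds for the whole six hours, and `stage1_cost` is a made-up helper name. Since the GPU runs all 512 curves in parallel, "seconds per curve" is an amortised number, not the latency of a single curve.

```python
def stage1_cost(curves, hours, watts):
    """Return (amortised seconds per curve, watt-hours per curve) for one batch."""
    seconds_per_curve = hours * 3600 / curves
    wh_per_curve = watts * hours / curves
    return seconds_per_curve, wh_per_curve


# Figures quoted in the post: 512 curves to B1=1e8 in six hours at about 200 W.
secs, wh = stage1_cost(curves=512, hours=6, watts=200)
print(f"{secs:.1f} s per curve, {wh:.2f} Wh per curve")
```

Plugging in the wattage and batch time of a newer card gives a like-for-like energy comparison, provided B1 and the input size are the same.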
2014-04-11, 18:16  #2
I moo ablest echo power!
May 2013
2^{3}·223 Posts 
The 750 Ti should be interesting, as it is supposed to have the same computational power as older cards but a significantly lower power draw (around 60 to 70 W, I think?).

2014-04-11, 18:38  #3
Apr 2010
Over the rainbow
5126_{8} Posts 
60 watt; I just got one, will receive it next week.

2014-04-11, 19:07  #4
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2
2×7×683 Posts 
Quote:
Via nvidia-smi, it self-reports the "wattage" (whether that's true or not is another question). What test case number (or at least size) would you propose? I guess I'll take some newer Cunningham extension composites.
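If you'd rather log the self-reported wattage than eyeball the table, something along these lines should do. It is a rough sketch, assuming a driver whose nvidia-smi supports the `--query-gpu` CSV interface; the helper names (`parse_gpu_csv`, `to_float`, `read_gpus`) are my own. Fields the card doesn't report come back as text rather than a number, so they are turned into `None`.

```python
import subprocess

# CSV query; assumes an nvidia-smi build that supports --query-gpu.
QUERY = [
    "nvidia-smi",
    "--query-gpu=index,name,power.draw,temperature.gpu",
    "--format=csv,noheader,nounits",
]


def to_float(field):
    """Convert a CSV field to float; unsupported fields become None."""
    try:
        return float(field)
    except ValueError:
        return None


def parse_gpu_csv(text):
    """Parse 'index, name, power, temp' rows into a list of dicts."""
    rows = []
    for line in text.strip().splitlines():
        index, name, power, temp = [f.strip() for f in line.split(",")]
        rows.append({
            "index": int(index),
            "name": name,
            "power_w": to_float(power),
            "temp_c": to_float(temp),
        })
    return rows


def read_gpus():
    """Run nvidia-smi once and return the parsed rows."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_gpu_csv(result.stdout)
```

Calling `read_gpus()` once a minute from a small loop and summing `power_w` over time gives a rough energy figure per batch, as far as the self-reported numbers can be trusted.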

2014-04-11, 19:38  #5
(loop (#_fork))
Feb 2006
Cambridge, England
2×3×29×37 Posts 
I'm currently running a couple of 17x-digit composites from aliquot sequences; why not try 5748.1526?
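Code:
1472719685748502449386007028264915543018048506185130332225484383890618193011424409284808724273216014754870734101746897136834531846846664397798081882555660328886965331001835843
Code:
Fri Apr 11 20:39:37 2014
+------------------------------------------------------+
| NVIDIA-SMI 331.20     Driver Version: 331.20         |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce GTX 580     Off  | 0000:02:00.0     N/A |                  N/A |
| 78%   83C  N/A     N/A /  N/A |    237MiB /  1535MiB |     N/A      Default |
+-------------------------------+----------------------+----------------------+
|   1  GeForce GTX 580     Off  | 0000:03:00.0     N/A |                  N/A |
| 65%   75C  N/A     N/A /  N/A |     62MiB /  1535MiB |     N/A      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Compute processes:                                               GPU Memory |
|  GPU       PID  Process name                                     Usage      |
|=============================================================================|
|    0            Not Supported                                               |
|    1            Not Supported                                               |
+-----------------------------------------------------------------------------+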
2014-04-11, 21:31  #6
I moo ablest echo power!
May 2013
3370_{8} Posts 
Is there something akin to EVGA Precision X or MSI Afterburner for Linux? I use those to adjust the fan curve so the fan kicks up to higher speeds at lower temperatures, which seems to help a bit in keeping the temperature down. Still, yours seem a bit high.

2014-04-11, 22:42  #7
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2
2×7×683 Posts 
No, there isn't. NVIDIA specifically disabled it in their Linux drivers. (It used to be called the "Coolbits" option.)
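EDIT: ...or maybe they did restore it? See here.
Here's what temperatures look like on a quad-780 server:
Code:
+------------------------------------------------------+
| NVIDIA-SMI 5.319.37   Driver Version: 319.37         |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce GTX 780     Off  | 0000:02:00.0     N/A |                  N/A |
| 60%   70C  N/A     N/A /  N/A |      132MB /  3071MB |     N/A      Default |
+-------------------------------+----------------------+----------------------+
|   1  GeForce GTX 780     Off  | 0000:03:00.0     N/A |                  N/A |
| 71%   80C  N/A     N/A /  N/A |       72MB /  3071MB |     N/A      Default |
+-------------------------------+----------------------+----------------------+
|   2  GeForce GTX 780     Off  | 0000:83:00.0     N/A |                  N/A |
| 71%   80C  N/A     N/A /  N/A |      132MB /  3071MB |     N/A      Default |
+-------------------------------+----------------------+----------------------+
|   3  GeForce GTX 780     Off  | 0000:84:00.0     N/A |                  N/A |
| 71%   80C  N/A     N/A /  N/A |       72MB /  3071MB |     N/A      Default |
+-------------------------------+----------------------+----------------------+
I am running 4x768 curves. Let's see what the cat might drag in...
Last fiddled with by Batalov on 2014-04-11 at 22:46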
2014-04-11, 22:45  #8
I moo ablest echo power!
May 2013
11011111000_{2} Posts 
Man, that's disappointing...

2014-04-12, 00:39  #9
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2
10010101011010_{2} Posts 
On a Tesla:
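Code:
+------------------------------------------------------+
| NVIDIA-SMI 4.304.54   Driver Version: 304.54         |
|-------------------------------+----------------------+----------------------+
| GPU  Name                     | Bus-Id        Disp.  | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap| Memory-Usage         | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla M2075              | 0000:42:00.0     Off |                  Off |
| N/A   N/A    P0   148W / 225W |   1%   66MB / 6143MB |     99%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Compute processes:                                               GPU Memory |
|  GPU       PID  Process name                                     Usage      |
|=============================================================================|
|    0     22022  ecmgpu20                                               53MB |
+-----------------------------------------------------------------------------+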
2014-04-12, 00:44  #10
Aug 2002
2^{4}·3·173 Posts 

2014-04-25, 07:15  #11
"Serge"
Mar 2008
Phi(4,2^7658614+1)/2
2×7×683 Posts 
Quote:
Code:
GMP-ECM 7.0-dev [configured with GMP 6.0.0, --enable-asm-redc, --enable-gpu, --enable-assert, --enable-openmp] [ECM]
Running on avva
Input number is 1472719685748502449386007028264915543018048506185130332225484383890618193011424409284808724273216014754870734101746897136834531846846664397798081882555660328886965331001835843 (175 digits)
Using MODMULN [mulredc:0, sqrredc:1]
Computing batch product (of 144266969 bits) of primes below B1=100000000 took 7804ms
GPU: compiled for a NVIDIA GPU with compute capability 3.5.
GPU: device 3 is required.
GPU: will use device 3: GeForce GTX 780, compute capability 3.5, 12 MPs.
GPU: Selection and initialization of the device took 18ms
Using B1=100000000, B2=110131572, sigma=3:1249387146-3:1249387913 (768 curves)
dF=512, k=4, d=4620, d2=13, i0=21633
Expected number of curves to find a factor of n digits:
35      40      45      50      55      60      65      70      75      80
326     1658    9378    58245   393298  2863792 2.2e+07 1.9e+08 1.7e+09 1.8e+10
Computing 768 Step 1 took 1469430ms of CPU time / 32719214ms of GPU time
Throughput: 0.023 curves by second (on average 42603.14ms by Step 1)
Using 21 small primes for NTT
Estimated memory usage: 2272K
Initializing tables of differences for F took 0ms
Computing roots of F took 9ms
Building F from its roots took 9ms
Computing 1/F took 10ms
...
Code:
GMP-ECM 7.0-dev [configured with GMP 6.0.0, --enable-asm-redc, --enable-gpu, --enable-assert, --enable-openmp] [ECM]
Running on avva
Input number is 1472719685748502449386007028264915543018048506185130332225484383890618193011424409284808724273216014754870734101746897136834531846846664397798081882555660328886965331001835843 (175 digits)
Computing batch product (of 144266969 bits) of primes below B1=100000000 took 7786ms
Using MODMULN [mulredc:0, sqrredc:1]
Using B1=100000000, B2=110000000, polynomial x^2, sigma=2:3383285554610732763
dF=512, k=4, d=4620, d2=13, i0=21633
Expected number of curves to find a factor of n digits:
35      40      45      50      55      60      65      70      75      80
284     1436    8069    49838   334914  2428189 1.9e+07 1.6e+08 1.3e+09 7.9e+09
Step 1 took 532986ms
Using 21 small primes for NTT
Estimated memory usage: 2272K
Initializing tables of differences for F took 1ms
Computing roots of F took 7ms
Building F from its roots took 8ms
...
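To put the two logs side by side, the arithmetic can be sketched in a few lines of Python. All the numbers are copied from the logs above; the variable names are mine, and the comparison assumes the CPU log's "Step 1 took 532986ms" is one curve on one core. The 45-digit curve count is taken from the GPU log's table at face value.

```python
# Timings copied from the two logs above, in milliseconds.
GPU_TOTAL_MS = 32719214   # "Computing 768 Step 1 took ... of GPU time"
GPU_CURVES = 768
CPU_CURVE_MS = 532986     # "Step 1 took 532986ms" (assumed: one curve, one core)

# Amortised GPU cost per curve, and how many CPU cores that is worth.
gpu_ms_per_curve = GPU_TOTAL_MS / GPU_CURVES
speedup = CPU_CURVE_MS / gpu_ms_per_curve

# Wall-clock length of one 768-curve batch on the GTX 780.
batch_hours = GPU_TOTAL_MS / 3_600_000

# Expected curves for a 45-digit factor, from the GPU log's table.
T45_CURVES = 9378
batches_for_t45 = -(-T45_CURVES // GPU_CURVES)  # ceiling division

print(f"GPU: {gpu_ms_per_curve:.0f} ms per curve, batch of {GPU_CURVES} in {batch_hours:.1f} h")
print(f"One GTX 780 is worth about {speedup:.1f} CPU cores on stage 1")
print(f"Batches needed for the 45-digit level: {batches_for_t45}")
```

The per-curve figure matches the "on average 42603.14ms by Step 1" line in the GPU log, which is a useful sanity check that the totals were read correctly.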
