
CS SLURM Cluster Report - 8 weeks

Report generated for jobs run on the CS SLURM cluster from 2024-09-22 through 2024-11-16.

Job total during this query range: 294,951

Job total since August 1st 2024: 627,060

This page is updated every Sunday at 5:00pm EST.


SLURM Scheduler System Output

--------------------------------------------------------------------------------
Cluster Utilization 2024-09-22T00:00:00 - 2024-11-16T23:59:59
Usage reported in TRES Hours/Percentage of Total
--------------------------------------------------------------------------------
  Cluster      TRES Name               Allocated                  Down         PLND Down                     Idle             Planned                 Reported 
--------- -------------- ----------------------- --------------------- ----------------- ------------------------ ------------------- ------------------------ 
       cs            cpu          701045(13.53%)          20377(0.39%)          0(0.00%)          3920364(75.66%)      539909(10.42%)          5181696(85.65%) 
       cs            mem      6583051083(16.26%)       49964695(0.12%)          0(0.00%)      33851329000(83.62%)            0(0.00%)      40484344778(85.62%) 
       cs       gres/gpu           91948(38.56%)            105(0.04%)          0(0.00%)           146411(61.40%)            0(0.00%)           238464(85.65%) 
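
The percentage shown next to each TRES-hour figure appears to be that figure divided by the Reported total of the same row. The Python sketch below reproduces the cpu row under that assumption. The helper name `tres_share` is illustrative only and is not part of SLURM or sreport.

```python
def tres_share(tres_hours: int, reported_hours: int) -> float:
    """Return tres_hours as a percentage of reported_hours."""
    return 100.0 * tres_hours / reported_hours


# cpu row of the utilization table (values are TRES hours)
cpu = {
    "Allocated": 701045,
    "Down": 20377,
    "PLND Down": 0,
    "Idle": 3920364,
    "Planned": 539909,
}
cpu_reported = 5181696

for state, hours in cpu.items():
    print(f"{state:>10}: {tres_share(hours, cpu_reported):6.2f}%")
```

For the cpu row this gives 13.53% allocated and 75.66% idle, in line with the table.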

* Total Cluster Resources Available by Partition
 (Note: TRES is short for Trackable RESources)
PartitionName=cpu
   TRES=cpu=1680,mem=17094000M,node=44
PartitionName=gpu
   TRES=cpu=2226,mem=19710000M,node=37,gres/gpu=159
PartitionName=nolim
   TRES=cpu=168,mem=480000M,node=4
PartitionName=gnolim
   TRES=cpu=424,mem=2008000M,node=16,gres/gpu=48
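
Each TRES line is a comma-separated list of name=value pairs, with memory given in megabytes (the M suffix). A minimal Python sketch for turning such a line into a dictionary follows. The function name `parse_tres` is an assumption made for illustration, and the sketch only handles the M suffix seen in this listing.

```python
def parse_tres(tres: str) -> dict[str, int]:
    """Parse a string such as 'cpu=1680,mem=17094000M,node=44'."""
    out = {}
    for item in tres.split(","):
        key, value = item.split("=", 1)
        # mem carries an "M" suffix (megabytes); strip it and keep the number
        out[key] = int(value.rstrip("M"))
    return out


partitions = {
    "cpu": "cpu=1680,mem=17094000M,node=44",
    "gpu": "cpu=2226,mem=19710000M,node=37,gres/gpu=159",
    "nolim": "cpu=168,mem=480000M,node=4",
    "gnolim": "cpu=424,mem=2008000M,node=16,gres/gpu=48",
}

for name, tres in partitions.items():
    parsed = parse_tres(tres)
    # partitions without GPUs have no gres/gpu entry, so default to 0
    print(name, parsed["cpu"], parsed["node"], parsed.get("gres/gpu", 0))
```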

SLURM Usage by Partition

PartitionName total_jobs cputime(HH:MM:SS) completed cancelled running failed preempted requeued pending timeout out_of_memory suspended boot_fail deadline node_fail resizing revoked
gpu 271712 494663:37:58 17683 2243 0 251602 0 0 0 119 56 0 0 0 9 0 0
cpu 18603 134362:54:13 15959 906 0 1595 0 0 0 100 43 0 0 0 0 0 0
gnolim 4578 53229:20:02 2000 1545 0 1025 0 0 0 3 5 0 0 0 0 0 0
nolim 58 542:46:16 42 11 0 4 0 0 0 0 1 0 0 0 0 0 0
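
The cputime column is written as hours:minutes:seconds, where the hours field can run well past 24. The Python sketch below converts it to fractional hours and derives two summary figures for the gpu partition from its total_jobs (271712), failed (251602) and cputime values. The names `cputime_to_hours`, `failed_share` and `hours_per_job` are illustrative assumptions, not part of the report tooling.

```python
def cputime_to_hours(cputime: str) -> float:
    """Convert 'HHHH:MM:SS' (hours may exceed 24) to fractional hours."""
    hours, minutes, seconds = (int(part) for part in cputime.split(":"))
    return hours + minutes / 60 + seconds / 3600


# gpu partition row
gpu_total_jobs = 271712
gpu_failed = 251602
gpu_cpu_hours = cputime_to_hours("494663:37:58")

failed_share = 100 * gpu_failed / gpu_total_jobs
hours_per_job = gpu_cpu_hours / gpu_total_jobs

print(f"failed: {failed_share:.1f}% of gpu jobs")
print(f"mean CPU time per gpu job: {hours_per_job:.2f} h")
```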

SLURM Usage by Advisor Group

  • slurm-cs-undefined: users who have CS accounts but are not CS students
  • slurm-cs-unassigned: users who are CS students but do not have a listed CS advisor
GroupName total_jobs cputime(HH:MM:SS) cpu gpu nolim gnolim completed cancelled running failed preempted requeued pending timeout out_of_memory suspended boot_fail deadline node_fail resizing revoked
slurm-cs-zezhou-cheng 602 147423:48:14 0 602 0 0 489 63 0 45 0 0 0 5 0 0 0 0 0 0 0
slurm-cs-yu-meng 89 105163:01:46 0 89 0 0 32 20 0 17 0 0 0 14 2 0 0 0 4 0 0
slurm-cs-tianhao-wang 397 81871:57:14 7 309 0 81 92 133 0 155 0 0 0 12 4 0 0 0 1 0 0
slurm-cs-chen-yu-wei 8030 74281:29:30 8030 0 0 0 6665 117 0 1186 0 0 0 62 0 0 0 0 0 0 0
slurm-cs-lu-feng 7553 70240:59:18 0 4591 0 2962 3395 1857 0 2300 0 0 0 1 0 0 0 0 0 0 0
slurm-cs-hongning-wang 46 36804:04:14 0 46 0 0 11 17 0 8 0 0 0 8 0 0 0 0 2 0 0
slurm-cs-undefined 23694 30500:34:42 6820 15480 4 1390 17319 877 0 5471 0 0 0 7 18 0 0 0 2 0 0
slurm-cs-haiying-shen 4005 24763:29:54 1158 2847 0 0 3916 88 0 0 0 0 0 1 0 0 0 0 0 0 0
slurm-cs-shangtong-zhang 53 23005:50:04 28 25 0 0 9 8 0 23 0 0 0 11 2 0 0 0 0 0 0
slurm-cs-kevin-skadron 3085 18883:31:24 653 2396 34 2 2292 166 0 595 0 0 0 29 3 0 0 0 0 0 0
slurm-cs-yen-ling-kuo 181 18506:51:52 4 177 0 0 42 34 0 89 0 0 0 11 5 0 0 0 0 0 0
slurm-cs-yangfeng-ji 244455 9347:59:36 304 244151 0 0 45 625 0 243766 0 0 0 18 1 0 0 0 0 0 0
slurm-cs-adwait-jog 660 8203:54:48 459 156 10 35 211 398 0 25 0 0 0 5 21 0 0 0 0 0 0
slurm-cs-daniel-graham 330 7570:39:52 81 249 0 0 161 55 0 95 0 0 0 17 2 0 0 0 0 0 0
slurm-cs-farzad-farnoud 53 7569:43:22 4 49 0 0 10 22 0 13 0 0 0 0 8 0 0 0 0 0 0
slurm-cs-madhur-behl 203 4838:06:34 102 79 10 12 95 31 0 56 0 0 0 7 14 0 0 0 0 0 0
slurm-cs-ferdinando-fioretto 520 4636:41:55 455 65 0 0 295 153 0 58 0 0 0 9 5 0 0 0 0 0 0
slurm-cs-unassigned 98 3811:50:44 19 79 0 0 28 4 0 62 0 0 0 1 3 0 0 0 0 0 0
slurm-cs-sebastian-elbaum 730 3571:38:42 465 176 0 89 542 20 0 158 0 0 0 2 8 0 0 0 0 0 0
slurm-cs-ashish-venkat 12 858:57:12 12 0 0 0 6 0 0 5 0 0 0 0 1 0 0 0 0 0 0
slurm-cs-xiaozhu-lin 38 731:51:56 0 38 0 0 17 2 0 16 0 0 0 1 2 0 0 0 0 0 0
slurm-cs-geoffrey-fox 25 192:05:16 0 25 0 0 4 4 0 13 0 0 0 1 3 0 0 0 0 0 0
slurm-cs-mary-lou-soffa 58 12:07:58 0 58 0 0 0 11 0 44 0 0 0 0 3 0 0 0 0 0 0
slurm-cs-miaomiao-zhang 27 07:16:10 2 25 0 0 8 0 0 19 0 0 0 0 0 0 0 0 0 0 0
slurm-cs-rich-nguyen 7 00:06:12 0 0 0 7 0 0 0 7 0 0 0 0 0 0 0 0 0 0 0
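
Each row of the advisor-group table appears to count the same jobs twice: once by partition (cpu, gpu, nolim, gnolim) and once by final job state (completed through revoked), so both sets of columns add up to total_jobs. The Python sketch below checks this for the slurm-cs-kevin-skadron row. The function name `is_consistent` and the dictionary layout are assumptions made for illustration.

```python
def is_consistent(total: int, by_partition: dict, by_state: dict) -> bool:
    """True when partition counts and state counts both sum to total."""
    return sum(by_partition.values()) == total == sum(by_state.values())


# slurm-cs-kevin-skadron row; states with a count of 0 are omitted
total_jobs = 3085
by_partition = {"cpu": 653, "gpu": 2396, "nolim": 34, "gnolim": 2}
by_state = {
    "completed": 2292,
    "cancelled": 166,
    "failed": 595,
    "timeout": 29,
    "out_of_memory": 3,
}

print(is_consistent(total_jobs, by_partition, by_state))
```

The same check can be applied to any other row when reading the table by hand.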

SLURM Usage by NodeName

Nodename total_jobs cputime(HH:MM:SS) completed cancelled running failed preempted requeued pending timeout out_of_memory suspended boot_fail deadline node_fail resizing revoked
serval01 162 143394:59:18 50 39 0 41 0 0 0 24 2 0 0 0 6 0 0
cheetah06 591 141379:11:58 501 58 0 30 0 0 0 2 0 0 0 0 0 0 0
cheetah04 1199 30640:25:22 210 29 0 946 0 0 0 11 1 0 0 0 2 0 0
jaguar03 10340 29549:25:44 833 127 0 9365 0 0 0 8 7 0 0 0 0 0 0
cortado01 3266 26231:00:22 2764 55 0 426 0 0 0 21 0 0 0 0 0 0 0
cortado08 1880 26071:13:06 1337 30 0 489 0 0 0 23 1 0 0 0 0 0 0
bigcat01 2984 21021:07:08 2385 256 0 318 0 0 0 17 8 0 0 0 0 0 0
jaguar01 136 20935:52:48 36 37 0 53 0 0 0 5 5 0 0 0 0 0 0
puma01 389 19139:44:38 366 15 0 1 0 0 0 5 2 0 0 0 0 0 0
cheetah05 2021 13363:03:38 1131 80 0 796 0 0 0 11 3 0 0 0 0 0 0
jaguar04 87 13286:10:12 39 25 0 17 0 0 0 6 0 0 0 0 0 0 0
lotus 6242 11847:16:16 622 78 0 5532 0 0 0 4 5 0 0 0 1 0 0
sds01 8036 10083:33:56 693 99 0 7242 0 0 0 2 0 0 0 0 0 0 0
sds02 13222 7994:50:10 972 157 0 12086 0 0 0 5 2 0 0 0 0 0 0
hydro 1384 7304:32:18 1304 38 0 24 0 0 0 12 6 0 0 0 0 0 0
lynx08 1980 6912:38:14 1791 54 0 113 0 0 0 12 10 0 0 0 0 0 0
puma02 572 6600:28:38 189 42 0 331 0 0 0 4 6 0 0 0 0 0 0
jaguar02 5969 5669:31:54 1238 149 0 4576 0 0 0 6 0 0 0 0 0 0 0
ai03 240 5311:43:46 96 98 0 46 0 0 0 0 0 0 0 0 0 0 0
ai02 245 5264:04:42 97 104 0 44 0 0 0 0 0 0 0 0 0 0 0
ai04 247 4888:29:10 97 109 0 41 0 0 0 0 0 0 0 0 0 0 0
cheetah02 4363 4778:41:52 486 63 0 3812 0 0 0 0 2 0 0 0 0 0 0
ai01 245 4630:04:54 110 83 0 50 0 0 0 1 1 0 0 0 0 0 0
jaguar05 12026 4585:25:28 731 136 0 11145 0 0 0 9 5 0 0 0 0 0 0
ai05 342 4497:42:24 162 117 0 62 0 0 0 1 0 0 0 0 0 0 0
ai08 327 4335:50:04 155 131 0 41 0 0 0 0 0 0 0 0 0 0 0
cheetah01 1228 4116:42:06 889 78 0 246 0 0 0 11 4 0 0 0 0 0 0
ai07 353 3927:00:22 177 119 0 57 0 0 0 0 0 0 0 0 0 0 0
ai06 315 3719:44:14 147 108 0 59 0 0 0 1 0 0 0 0 0 0 0
panther01 501 3628:46:12 416 59 0 20 0 0 0 1 5 0 0 0 0 0 0
ai09 478 3484:13:02 189 122 0 167 0 0 0 0 0 0 0 0 0 0 0
adriatic01 11329 3435:58:44 822 80 0 10426 0 0 0 1 0 0 0 0 0 0 0
adriatic02 10688 3425:44:48 821 65 0 9802 0 0 0 0 0 0 0 0 0 0 0
ai10 454 3284:08:00 175 119 0 160 0 0 0 0 0 0 0 0 0 0 0
adriatic03 10093 2941:33:52 813 61 0 9219 0 0 0 0 0 0 0 0 0 0 0
adriatic04 9852 2785:39:04 727 56 0 9069 0 0 0 0 0 0 0 0 0 0 0
jaguar06 1829 2776:21:28 488 39 0 1300 0 0 0 2 0 0 0 0 0 0 0
cheetah03 4032 2677:59:54 488 38 0 3505 0 0 0 1 0 0 0 0 0 0 0
adriatic06 9676 2630:12:18 733 53 0 8888 0 0 0 2 0 0 0 0 0 0 0
adriatic05 10058 2509:33:28 747 51 0 9259 0 0 0 0 1 0 0 0 0 0 0
jinx01 329 2481:50:24 149 110 0 70 0 0 0 0 0 0 0 0 0 0 0
affogato13 11620 2445:23:00 378 67 0 11173 0 0 0 1 1 0 0 0 0 0 0
affogato11 11947 2424:31:56 372 75 0 11497 0 0 0 0 3 0 0 0 0 0 0
jinx02 304 2418:09:22 143 107 0 53 0 0 0 0 1 0 0 0 0 0 0
affogato15 11837 2313:32:56 365 69 0 11402 0 0 0 1 0 0 0 0 0 0 0
affogato14 11663 2310:45:58 361 68 0 11233 0 0 0 0 1 0 0 0 0 0 0
lynx09 1599 2305:07:06 1511 50 0 35 0 0 0 3 0 0 0 0 0 0 0
lynx02 6594 1997:41:50 173 20 0 6400 0 0 0 0 1 0 0 0 0 0 0
lynx01 8011 1805:32:44 197 22 0 7791 0 0 0 1 0 0 0 0 0 0 0
slurm3 167 1566:58:26 141 22 0 4 0 0 0 0 0 0 0 0 0 0 0
lynx03 7076 1516:35:38 153 23 0 6898 0 0 0 0 2 0 0 0 0 0 0
lynx05 5651 1481:18:18 155 21 0 5474 0 0 0 1 0 0 0 0 0 0 0
titanx03 175 1436:45:10 81 57 0 36 0 0 0 0 1 0 0 0 0 0 0
lynx11 15425 1348:41:52 239 50 0 15136 0 0 0 0 0 0 0 0 0 0 0
cortado03 145 1343:16:50 112 26 0 7 0 0 0 0 0 0 0 0 0 0 0
lynx12 16252 1325:57:02 229 55 0 15968 0 0 0 0 0 0 0 0 0 0 0
cortado09 708 1323:58:24 664 36 0 8 0 0 0 0 0 0 0 0 0 0 0
cortado02 157 1322:20:52 122 27 0 8 0 0 0 0 0 0 0 0 0 0 0
lynx10 11602 1301:57:46 188 50 0 11362 0 0 0 1 1 0 0 0 0 0 0
lynx04 6929 1299:43:52 193 23 0 6711 0 0 0 0 2 0 0 0 0 0 0
cortado04 144 1288:37:12 115 22 0 6 0 0 0 1 0 0 0 0 0 0 0
titanx02 181 1252:41:14 66 58 0 55 0 0 0 0 2 0 0 0 0 0 0
slurm4 147 1197:42:00 126 19 0 1 0 0 0 0 1 0 0 0 0 0 0
titanx04 156 1189:52:30 76 49 0 31 0 0 0 0 0 0 0 0 0 0 0
slurm1 610 1117:00:39 543 47 0 16 0 0 0 0 4 0 0 0 0 0 0
titanx05 169 1107:00:44 80 50 0 39 0 0 0 0 0 0 0 0 0 0 0
cortado05 125 1087:11:24 101 17 0 6 0 0 0 1 0 0 0 0 0 0 0
affogato04 75 1086:21:44 60 9 0 6 0 0 0 0 0 0 0 0 0 0 0
slurm2 98 991:08:34 83 11 0 2 0 0 0 0 2 0 0 0 0 0 0
cortado10 305 924:17:22 273 26 0 5 0 0 0 0 1 0 0 0 0 0 0
lynx06 6742 866:23:30 216 26 0 6499 0 0 0 0 1 0 0 0 0 0 0
lynx07 6554 825:41:12 207 19 0 6327 0 0 0 0 1 0 0 0 0 0 0
affogato02 186 791:53:04 178 4 0 4 0 0 0 0 0 0 0 0 0 0 0
cortado06 82 769:09:24 57 15 0 8 0 0 0 2 0 0 0 0 0 0 0
slurm5 68 573:33:58 57 10 0 0 0 0 0 0 1 0 0 0 0 0 0
affogato01 252 514:04:40 232 12 0 7 0 0 0 1 0 0 0 0 0 0 0
cortado07 86 474:11:56 60 11 0 14 0 0 0 1 0 0 0 0 0 0 0
struct08 230 426:41:54 221 9 0 0 0 0 0 0 0 0 0 0 0 0 0
struct09 228 413:13:34 222 6 0 0 0 0 0 0 0 0 0 0 0 0 0
struct03 21 399:53:50 20 0 0 0 0 0 0 0 1 0 0 0 0 0 0
struct10 223 385:05:26 217 6 0 0 0 0 0 0 0 0 0 0 0 0 0
affogato05 43 374:30:40 37 3 0 3 0 0 0 0 0 0 0 0 0 0 0
affogato10 49 335:18:38 42 3 0 3 0 0 0 0 1 0 0 0 0 0 0
affogato06 44 316:26:16 34 2 0 8 0 0 0 0 0 0 0 0 0 0 0
struct02 26 313:31:50 26 0 0 0 0 0 0 0 0 0 0 0 0 0 0
affogato07 50 307:51:42 42 2 0 6 0 0 0 0 0 0 0 0 0 0 0
struct01 25 307:06:34 25 0 0 0 0 0 0 0 0 0 0 0 0 0 0
struct07 22 296:32:06 22 0 0 0 0 0 0 0 0 0 0 0 0 0 0
struct04 23 294:21:34 23 0 0 0 0 0 0 0 0 0 0 0 0 0 0
struct05 24 293:45:06 24 0 0 0 0 0 0 0 0 0 0 0 0 0 0
struct06 22 291:48:36 22 0 0 0 0 0 0 0 0 0 0 0 0 0 0
affogato08 42 283:43:00 37 2 0 3 0 0 0 0 0 0 0 0 0 0 0
affogato09 41 274:39:50 36 2 0 3 0 0 0 0 0 0 0 0 0 0 0
heartpiece 17 264:22:56 6 9 0 1 0 0 0 0 1 0 0 0 0 0 0
doppio01 7 199:47:18 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0
affogato03 114 62:28:04 111 0 0 3 0 0 0 0 0 0 0 0 0 0 0
doppio02 8 33:02:00 7 1 0 0 0 0 0 0 0 0 0 0 0 0 0
epona 9 03:37:36 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0
bigcat04 0 00:00:00 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
bigcat05 0 00:00:00 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
bigcat06 0 00:00:00 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

slurm_report_two_months.txt · Last modified: 2024/11/25 12:44 by 127.0.0.1