Announcements

Read announcements for NREL’s high-performance computing (HPC) system users. 

Eagle Emergency Maintenance July 12, 2021

July 7, 2021

Eagle will be taking an emergency outage on Monday, July 12th at 8:00am to fix an issue with the underlying storage for a portion of the Lustre file system. This outage is expected to last approximately five hours.

Eagle and all associated systems including login and DAV/FastX nodes, Globus, Lustre, and the home filesystems will be inaccessible during this time.

New jobs submitted to the queue that will not complete by July 12, 8:00 AM will not start prior to this outage. Please set your job walltime (-t or --time) accordingly.
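One way to choose a walltime that fits before the outage is to compute the time remaining until it starts. The sketch below assumes GNU date (as found on Linux systems) and that both timestamps are given in the machine's local time zone. The function name walltime_before_outage is illustrative and is not an Eagle or Slurm tool.

```shell
#!/usr/bin/env bash
# Print the time remaining before an outage in Slurm's D-HH:MM:SS format.
# Usage: walltime_before_outage "<outage start>" ["<current time>"]
# Assumes GNU date; both timestamps are parsed in the local time zone.
walltime_before_outage() {
    local outage="$1" now="${2:-now}"
    local outage_s now_s left

    # Convert both timestamps to seconds since the epoch.
    outage_s=$(date -d "$outage" +%s) || return 1
    now_s=$(date -d "$now" +%s) || return 1

    left=$(( outage_s - now_s ))
    if [ "$left" -le 0 ]; then
        echo "outage has already started" >&2
        return 1
    fi

    # Split the remaining seconds into days, hours, minutes, and seconds.
    printf '%d-%02d:%02d:%02d\n' \
        $(( left / 86400 )) \
        $(( (left % 86400) / 3600 )) \
        $(( (left % 3600) / 60 )) \
        $(( left % 60 ))
}

# Example with fixed timestamps: prints 1-11:30:00
walltime_before_outage '2021-07-12 08:00' '2021-07-10 20:30'
```

The result is an upper bound, because a job also spends time waiting in the queue before it starts. It can be passed to Slurm directly, for example sbatch --time="$(walltime_before_outage '2021-07-12 08:00')" job.sh, where job.sh stands in for your own batch script.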

A separate scheduled outage for ITS infrastructure upgrades will occur from 6:00pm-10:00pm on the same day. The hardware for the firewalls in the ESIF data center will be upgraded during this time. All network access to the data center from the internet and from inside of NREL will be disrupted until work is complete.

Eagle Job Queue

July 7, 2021

The number of running jobs on Eagle has been dipping on the weekends. Please think about submitting jobs to run over the weekend, especially long weekends, so we can keep the system full.

Eagle System Time August 2nd

July 7, 2021

The next Eagle system time is scheduled for the week of August 2nd. This will be a multi-day outage to update the parallel file system and to address some hardware issues affecting the compute nodes. Eagle and related file systems will be unavailable during this system time.

CSC User & Applications Support

July 7, 2021

We will be making the conda/4.9.2 module the default module for loading (i.e., without a version number specified). A few reminders:

  • If you need to reference the existing default Anaconda installation, add the version number to your module load statement in job scripts, i.e., module load conda/mini_py37_4.8.3 rather than just module load conda. Custom environments should interoperate with either version, though.
  • This module permits "conda activate" and "conda deactivate" functionality without conda init. Don't use conda init, as it breaks login shell setup.
  • Consider trying mamba instead of conda when setting up environments. For example, mamba install to add a package to a custom environment.
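The reminders above can be sketched as a job script that pins the Conda module version and activates a custom environment. The sketch wraps the script in a small function that prints it, so it can be inspected before submitting. The function name conda_job_script, the environment name myenv, the script analysis.py, and the walltime are all hypothetical placeholders, and scheduler options such as the account are left out.

```shell
#!/usr/bin/env bash
# Print a Slurm job script that loads a specific Conda module version.
# Arguments (hypothetical defaults, adjust for your own project):
#   $1 = Conda environment name, $2 = Conda module to load, $3 = walltime
conda_job_script() {
    local env_name="${1:-myenv}"
    local conda_module="${2:-conda/4.9.2}"
    local walltime="${3:-01:00:00}"
    cat <<EOF
#!/bin/bash
#SBATCH --time=${walltime}
#SBATCH --nodes=1

# Name the module version explicitly so the job does not depend on the default.
module load ${conda_module}

# This works without "conda init" when the conda/4.9.2 module is loaded.
conda activate ${env_name}

# Hypothetical workload; replace with your own command.
python analysis.py
EOF
}

# Example: print a script that uses the conda/4.9.2 module.
conda_job_script myenv conda/4.9.2
```

To keep using the previous installation, pass conda/mini_py37_4.8.3 as the second argument. Packages are added to the environment ahead of time from an interactive shell, for example with mamba install after activating the environment.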

Julia modules are now available on Eagle. The module files are available at /nopt/nrel/ecom/modulefiles. Currently, versions 1.5.4 and 1.6.1 are available.

If you plan on using these module files regularly, you may wish to add this directory to your module search path with the command:
module use -a /nopt/nrel/ecom/modulefiles

You can add this command to your .bash_profile or .bashrc file with the following command:
echo 'module use -a /nopt/nrel/ecom/modulefiles' >> ~/.bash_profile

(or .bashrc in place of .bash_profile).
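Note that the echo command above appends the line every time it is run, so repeating it leaves duplicate entries in the file. A minimal sketch of a guard against that follows; the helper name add_line_once is hypothetical and not part of the module system.

```shell
#!/usr/bin/env bash
# Append a line to a file only if that exact line is not already present.
# Usage: add_line_once <file> <line>
add_line_once() {
    local file="$1" line="$2"
    # -q: quiet, -x: match the whole line, -F: treat the pattern as a fixed string.
    # A missing file counts as "line not present", so the file is created.
    if ! grep -qxF -- "$line" "$file" 2>/dev/null; then
        printf '%s\n' "$line" >> "$file"
    fi
}

# Example call (commented out so that running this sketch changes nothing):
# add_line_once "$HOME/.bash_profile" 'module use -a /nopt/nrel/ecom/modulefiles'
```

Calling the helper a second time with the same arguments leaves the file unchanged.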

Once your module path is updated, simply load the desired Julia version module:
module load julia

Questions or problems regarding Julia on Eagle can be sent to jonathan.maack@nrel.gov.

CSC User & Applications Support

June 8, 2021

  • A new Anaconda installation is in testing, and will be put into production shortly. To access the test installation, as always just enable the Test modules collection via
    module use /nopt/nrel/apps/modules/test/modulefiles
    and you should then see a conda/4.9.2 module
  • Unlike with previous Anaconda installations, we have enabled the conda activate and conda deactivate syntax without requiring conda init (which creates "environmental" problems, something we're all against). The "source"ing syntax still works, but you now have the option to use either. Our hope is that enabling the conda commands will permit greater interoperability with scripts developed elsewhere (where conda activate may have worked), perhaps prove slightly faster, and eliminate awkward error messages.
    • This Conda deployment includes a new faster command for setting up new environments and installing packages. Instead of conda ..., you can try mamba ..., e.g., mamba install tensorflow
  • Other application upgrades have been or will be deployed shortly. If they are not already in production, you may access the installations via the module use statement above.
  • Applications and new versions:
    • ANSYS 2021R1
    • CMake 3.18.2
    • COMSOL 5.6
    • CUDA 11, includes cudnn and development tools and libraries
    • GAMS 34.3.0
    • MATLAB R2020b
    • MPT 2.23
    • OpenMPI 4.1.0, including Java support

AUP Renewals

June 8, 2021

You may have received an email from DocuSign (dse_NA3@docusign.net) requesting that you renew your NREL HPC Appropriate Use Policy (AUP). We are required to maintain these agreements and, should you receive one, it will be necessary for you to complete it within 30 days in order to continue accessing HPC systems. We appreciate your cooperation.

Fiscal Year 2022 HPC Allocation Requests Due Monday, June 7th

June 8, 2021

The deadline for HPC allocation requests is TODAY, Monday, June 7, at midnight Mountain Time. Thank you to everyone who has already submitted a request.

Please submit a request if you are a researcher at any national laboratory, university, or other organization pursuing EERE-funded research, or if you are an NREL-affiliated researcher performing research aligned with the EERE mission funded through other organizations. Requests are welcome for current projects, projects where a funding request has been submitted, and projects where a funding request for FY22 is in preparation.

Additional information on the Eagle allocation process is available at https://www.nrel.gov/hpc/resource-allocation-requests.html. Please e-mail hpc-requests@nrel.gov if you have any additional questions.

Thank you,
NREL HPC Operations

FY22 NREL HPC Annual Call for Allocation Requests

May 7, 2021

The National Renewable Energy Laboratory (NREL) provides high-performance computing and related capabilities to support the mission of the Department of Energy's Office of Energy Efficiency and Renewable Energy (EERE). Research and development projects that are funded by EERE offices or aligned with the EERE mission are eligible to use these resources.

Slurm Fairshare Refresher

May 7, 2021

FY21 saw the introduction of the "fairshare" priority algorithm in Eagle's job scheduler, Slurm. Queue times have been high during the Q2-Q3 rush and we've received some questions, so here's a quick refresher on fairshare and what it means for job scheduling.

Queue Times

May 7, 2021

The allocation year transitioned from Q2 to Q3 on April 1st. In the run-up to the end of Q2, the number of jobs submitted spiked sharply, and queue depth (job wait time) rose accordingly. A few projects saw some effect of fairshare, but much of the pressure came from over a third of all jobs being submitted as qos=high. Because of the surge in submitted jobs, its interactions with fairshare, and a few projects that have used up their allocations, we have been analyzing the scheduling algorithms. Based on recommendations from SchedMD and internal analysis, we have made a few adjustments to the Slurm configuration. So far, those changes appear to have alleviated some of the pressure on the queues, and the number of jobs submitted with qos=high has also dropped.