Obtaining the Eastern Wind Dataset
To obtain the Eastern Wind Dataset, follow the download instructions below. Additional information about the dataset files follows those instructions.
How To Download and Unzip the Data
Before downloading the Eastern Wind Dataset, you must read and accept the data use disclaimer agreement. Once you accept the agreement, you will be directed to an FTP (file transfer protocol) site, where you must enter the username "eastwind" (without quotation marks). No password is necessary.
Most Web browsers will allow you to download files from an FTP site. If you experience technical problems using a Web browser to access the FTP site, you can use an FTP client program. The data files have been compressed or "zipped" to take up less space. After you download a zip file, you will need to uncompress it with a zip/unzip program.
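If you prefer to uncompress the files with a script rather than a desktop zip/unzip program, the step can be sketched with Python's standard zipfile module. This is a minimal sketch, not part of the original instructions; the function name unzip_archive and the example paths are hypothetical.

```python
import zipfile
from pathlib import Path


def unzip_archive(zip_path, dest_dir):
    """Extract every member of a zip archive into dest_dir.

    Returns the list of member names found in the archive.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)  # create the target folder if needed
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
        return archive.namelist()


# Example call (hypothetical paths; substitute the zip file you downloaded):
# unzip_archive("downloads/onshore.zip", "data/actuals")
```

The function returns the member names so you can confirm which .csv files were extracted.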
The current Eastern Wind Dataset is an update to the Eastern Wind Integration and Transmission Study data, not an expansion. It covers the same geographic area at the same 10-minute time resolution for the same three years (2004, 2005, and 2006), and it includes the same onshore and offshore (Great Lakes and Atlantic Coast) sites.
The Eastern Wind Dataset Site Summary spreadsheet summarizes the onshore and offshore site data. The onshore information (Onshore_Sites tab) includes ID number, state, longitude, latitude, elevation, plant capacity, net capacity factor, IEC class, and region. Region is denoted by P for those sites inside PJM territory or within 30 km of PJM, and E for all other sites in the East. There is also a summary of the number of onshore sites in each state. The offshore site information (Offshore_Sites tab) includes ID number, state, latitude, longitude, plant capacity, and net capacity factor. There is also a summary of the number of offshore sites associated with each coastal or Great Lakes state.
Each site has one .csv file for the synthesized 10-minute wind power data and one .csv file for each type of hourly forecasted wind power. The file naming convention for the synthesized wind power data is site_#####.csv, where ##### is the site number. Each .csv contains four columns of data: date, time (in UTC, or coordinated universal time), wind speed at 80 m (in meters per second), and power output (in megawatts). All three years of data are in a single file.
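The naming convention and four-column layout described above can be sketched in Python as follows. This is an illustration under stated assumptions, not a documented reader for the dataset: zero-padding the site number to five digits, the presence of a header row, and the sample date and time strings are all assumptions, and the function names are hypothetical. Date and time are kept as plain strings because their exact format is not specified here.

```python
import csv
import io


def actuals_filename(site_id):
    # Assumption: "#####" means the site number is zero-padded to five digits.
    return f"site_{site_id:05d}.csv"


def read_actuals(handle, has_header=True):
    """Yield (date, time_utc, wind_speed_mps, power_mw) from an open actuals file."""
    reader = csv.reader(handle)
    if has_header:
        next(reader, None)  # Assumption: the first row holds column names.
    for row in reader:
        if len(row) < 4:
            continue  # skip blank or short lines
        date, time_utc, speed, power = row[:4]
        yield date, time_utc, float(speed), float(power)


# Usage with made-up sample values (not real data from the dataset):
sample = io.StringIO("date,time,speed,power\n20040101,0010,7.5,12.3\n")
for record in read_actuals(sample):
    print(record)
```

For a real file, pass an open file handle, for example open(actuals_filename(site_id), newline=""), and set has_header=False if the file turns out to have no header row.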
The file naming convention for the hourly forecasted wind power data is site_#####_type_fcts.csv, where ##### is the site number and type is 4HR, 6HR, or ND. 4HR is a four-hour-ahead forecast, 6HR is a six-hour-ahead forecast, and ND is the next-day or day-ahead forecast. Each file contains five columns of data: date, time (in UTC), hourly average of the synthesized 10-minute wind output (in megawatts), forecasted power (in megawatts), and the error between the hourly average wind output and forecasted power (in megawatts). Again, all three years of data are in one file.
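To make the relationship between the forecast columns concrete, here is a minimal Python sketch of the forecast file naming convention, the hourly averaging of 10-minute output, and the error column. Several details are assumptions rather than facts from the dataset documentation: the five-digit zero-padding of the site number, which six 10-minute samples belong to each hour, and the sign of the error (taken here as hourly average minus forecast). All function names are hypothetical.

```python
FORECAST_TYPES = ("4HR", "6HR", "ND")  # four-hour-ahead, six-hour-ahead, next-day


def forecast_filename(site_id, forecast_type):
    """Build a forecast file name such as site_00042_ND_fcts.csv."""
    if forecast_type not in FORECAST_TYPES:
        raise ValueError(f"forecast_type must be one of {FORECAST_TYPES}")
    # Assumption: the site number is zero-padded to five digits.
    return f"site_{site_id:05d}_{forecast_type}_fcts.csv"


def hourly_average(ten_minute_mw):
    """Average consecutive groups of six 10-minute values (MW) into hourly values.

    Assumption: the input starts on an hour boundary and has no gaps.
    """
    if len(ten_minute_mw) % 6 != 0:
        raise ValueError("expected a whole number of hours (multiples of six samples)")
    return [sum(ten_minute_mw[i:i + 6]) / 6 for i in range(0, len(ten_minute_mw), 6)]


def forecast_error(hourly_avg_mw, forecast_mw):
    # Assumption: error = hourly average - forecast; the sign convention is not stated.
    return hourly_avg_mw - forecast_mw


# Usage with made-up values: two hours of 10-minute output in MW.
hours = hourly_average([6, 6, 6, 6, 6, 6, 0, 12, 0, 12, 0, 12])
print(forecast_filename(42, "ND"), hours, forecast_error(hours[0], 5.0))
```

Comparing a recomputed hourly average and error against the values stored in a forecast file is one way to check which alignment and sign convention the files actually use.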
The FTP folder contains three subfolders: Actuals, PJM_Forecasts, and Non_PJM_Forecasts.
The Actuals folder has five zip files (one onshore and four offshore) that contain .csv files of each site's synthesized power production.
The Non_PJM_Forecasts folder has four zip files (one onshore and three offshore) that contain .csv files of each non-PJM site's hourly forecast for three forecast periods: four-hour-ahead, six-hour-ahead, and next-day (or day-ahead). For the onshore sites, all forecast periods are in one zip file. For the offshore sites, each forecast period has a separate zip file.
The PJM_Forecasts folder has six zip files (three onshore and three offshore) that contain .csv files of each PJM site's hourly forecast for three forecast periods: four-hour-ahead, six-hour-ahead, and next-day (or day-ahead). Each forecast period has a separate zip file for the onshore and offshore sites.