Weather Module

Weather data is fundamental to agricultural simulation, providing environmental inputs crucial for modeling crop growth and yield. Reliable weather data ensures realistic simulation outcomes. The EPIC model specifically requires weather data formatted into two key input files:

Daily Weather Files (.DLY): Contain day-to-day meteorological data (solar radiation, temperature max/min, precipitation, relative humidity, wind speed). These drive the daily simulation processes.
Monthly Weather Files (.WP1): Summarize climate statistics, typically monthly averages or totals derived from daily data.

soilg

1. Data Sources

Weather files can be created from various sources. GeoEPIC provides support for google earth engine sources.

Google Earth Engine (GEE): GEE is a cloud-based platform for planetary-scale geospatial analysis. It hosts a vast public data archive that includes numerous satellite imagery and climate datasets (like AgERA5, GRIDMET, CHIRPS, etc.), alongside capabilities to use your own private GEE assets. GeoEPIC leverages GEE to fetch data from these collections, allowing access to global or regional weather data often derived from satellite observations or climate reanalysis models.
- Reference: Google Earth Engine dataset catalog
- Reference: GEE Community Catalog

2. Creating Weather File

This section outlines the process for generating EPIC-compatible weather files (.DLY) using GeoEPIC.

2.1 Requirements and Configuration

When sourcing weather data from Google Earth Engine (GEE) collections (like AgERA5, GRIDMET, CHIRPS, or your private assets), a configuration file (typically config.yml) is essential. This file directs GeoEPIC on what data to retrieve, defining the time period, specific variables, target resolution, the GEE collection path, and how the source data bands should be mapped and potentially converted (e.g., units) to match EPIC's requirements.

Example Configuration File (config.yml) using AgERA5:

# Global parameters
global_scope:
# Define the time period for data fetching
time_range: ['2002-01-01', '2022-12-31']
# List of standard EPIC weather variable names
variables: ['srad', 'tmax', 'tmin', 'prcp', 'rh', 'ws']
# Target resolution in meters (AgERA5 native is ~9km or 9600m)
resolution: 9600

# Specify Earth Engine (EE) collections and map their bands to EPIC variables
collections:
AgEra5:
# Path to the EE ImageCollection Asset
collection: 'projects/climate-engine-pro/assets/ce-ag-era5/daily'
# Define how EE bands map to EPIC variables
# b() refers to the image band
# Note: Unit conversions can be applied directly (e.g., Kelvin to Celsius)
variables:
srad: b('Solar_Radiation_Flux') # MJ/m^2/day (assuming source is daily total flux)
tmax: b('Temperature_Air_2m_Max_24h') - 273.15 # Convert K to °C
tmin: b('Temperature_Air_2m_Min_24h') - 273.15 # Convert K to °C
prcp: b('Precipitation_Flux') # mm/day (assuming source is daily total)
rh: b('Relative_Humidity_2m_06h') # % (using 6 AM value as representative)
ws: b('Wind_Speed_10m_Mean') # m/s

2.2 Command Line (CLI)

Use the geo_epic weather command with your configuration file and specify the location(s).

Fetch for a Single Location (Latitude, Longitude):

# Provide lat, lon coordinates directly
geo_epic weather config.yml --fetch 40.71 -74.00 --out ./output/NewYorkCity.DLY

Fetch for Multiple Locations from a CSV File:

# Specify input CSV and the column containing output file paths
# CSV must contain 'lat', 'lon', and the output path column (e.g., 'OutputFileColumn')
# GeoEPIC adds a 'wthgridid' column to the CSV
geo_epic weather config.yml --fetch locations.csv --out OutputFileColumn

Fetch for Regions from a Shapefile:

# Specify input Shapefile and the output directory for .DLY files
# GeoEPIC adds a 'wthgridid' attribute to the shapefile
geo_epic weather config.yml --fetch ./boundaries/fields.shp --out ./weather_output/

2.3 Python API

Use the geoEpic.io.fetch_list function for programmatic fetching.

Fetch for a Single Location:

from geoEpic.io import fetch_list

# Define parameters
config = 'path/to/your/config.yml'
output = './weather_output/NewYorkCity.DLY' # Specific file path for single location
single_location = "40.71,-74.00" # Latitude,Longitude as a string

# Fetch weather data
fetch_list(
config_file=config,
input_data=single_location,
output_dir=output, # For single location, this is the full output file path
raw=False # False saves as .DLY (default)
)
print(f"Weather data fetched for {single_location} and saved to {output}")

Fetch for Multiple Locations (CSV or Shapefile):

from geoEpic.io import fetch_list

# Define parameters
config = 'path/to/your/config.yml'
# Input can be a path to a CSV or a Shapefile
input_data = 'path/to/locations.csv' # or 'path/to/area_of_interest.shp'
output = './weather_output' # Base output directory

# Fetch weather data for all locations/features in the input file
# Note: If input is CSV, it must contain columns like 'lat', 'lon', and potentially output paths
# output_dir acts as a base path if paths in CSV are relative, or target dir for shapefile outputs.
# GeoEPIC adds 'wthgridid' column/attribute to the input file.
fetch_list(
config_file=config,
input_data=input_data,
output_dir=output,
raw=False # False saves as .DLY (default)
)
print(f"Weather data fetched for locations/regions in {input_data}.")
print("Input file has been updated with 'wthgridid'.")

3. Editing Weather File

Once .DLY files are created (or if you have existing ones), the geoEpic.io.DLY class in the Python API allows loading, manipulation, and further processing. It acts like a pandas.DataFrame with added EPIC-specific features.

import pandas as pd
from geoEpic.io import DLY # Import the DLY class

# --- Loading Data ---
# Replace './output/NewYorkCity.DLY' with the path to an actual .DLY file
dly_filepath = './output/NewYorkCity.DLY'
try:
dly_data = DLY.load(dly_filepath) # Use the class method 'load'
print(f"Successfully loaded weather data from {dly_filepath}")

# --- Inspecting and Manipulating Data (using pandas methods) ---
print("\nFirst 5 rows of weather data:")
print(dly_data.head())

print("\nDaily Solar Radiation (srad) head:")
print(dly_data['srad'].head()) # Access columns like a DataFrame

print("\nWeather data summary statistics:")
print(dly_data.describe())

# Example manipulation: Calculate Temperature Range
# dly_data['Trange'] = dly_data['tmax'] - dly_data['tmin']
# print("\nDaily Temperature Range (first 5 days):")
# print(dly_data['Trange'].head())

# --- EPIC-specific Functionality: Creating Monthly Files ---
# Aggregates daily data to monthly stats and saves as .WP1
print("\nGenerating monthly weather file (.WP1)...")
wp1_filepath = dly_data.to_monthly() # Returns the path of the created WP1 file

if wp1_filepath:
print(f"Monthly weather file created successfully: {wp1_filepath}")
else:
print("Failed to create monthly weather file.")

except FileNotFoundError:
print(f"Error: The specified .DLY file was not found: {dly_filepath}")
except Exception as e:
print(f"An error occurred while processing {dly_filepath}: {e}")

Key DLY Class Features:

Loading: DLY.load(filepath) reads a .DLY file into a DLY object (DataFrame subclass).
Data Access/Manipulation: Utilize standard pandas.DataFrame methods for analysis and modification.
Monthly Aggregation: dly_data.to_monthly() calculates monthly statistics (averages/totals as appropriate for EPIC) and saves the results to a .WP1 file (typically in the same directory with the same base name as the .DLY file).