Validation¶

Validate with ORNL DAAC Subset¶

Request a subset¶

You can get a subset like the one below (or for 20+ other MODIS/VIIRS land products) by visiting https://modis.ornl.gov and clicking the Get Data link. The subset that I used comes from the Global Tool; the daily MCD43 products are not available through the web service. I requested a random point somewhere in/near the everglades and the ORNL DAAC's backend computed the black, white, and blue sky albedos from the MCD43A1 parameters just like we did. Input parameters called for solar zenith angles calculated for local noon and the optical depth parameter was set to 0.20:
https://modis.ornl.gov/subsetdata/03Jul2019_13:43:49_346167250L28.27149L-81.9668S1L1_MCD43A/

In [1]:

import requests

request_url = (
    "https://modis.ornl.gov/rst/api/v1/MOD13Q1/subsetOrder?"
    "latitude={lat}&"
    "longitude={lon}&"
    "email=jjmcnelis%40outlook.com&"
    "uid={uid}&"
    "startDate={stdt}&"
    "endDate={enddt}&"
    "kmAboveBelow={ab}&"
    "kmLeftRight={lr}"
).format(
    lat   =  28.27149,
    lon   = -81.9668,
    uid   = "beta",
    stdt  = "A2012001",
    enddt = "A2012111",
    ab    =  0,
    lr    =  0
)

print("Request URL:\n"+request_url)

Request URL:
https://modis.ornl.gov/rst/api/v1/MOD13Q1/subsetOrder?latitude=28.27149&longitude=-81.9668&email=jjmcnelis%40outlook.com&uid=beta&startDate=A2012001&endDate=A2012111&kmAboveBelow=0&kmLeftRight=0

Request the subset:

In [2]:

response = requests.get(request_url)

if response.status_code==200:
    orderid = response.text
    print("Order ID:"); print(orderid)
else:
    print(response.text)

Order ID:
{
  "order_id": "04Jul2019_15:27:19_009479664L28.27149L-81.9668S1L1_MOD13Q1_beta"
}

This might help you make sense of that "order_id" string:

order_id = (
    "https://modis.ornl.gov/subsetdata/"     # global tool orders dir
    "03Jul2019_13:43:49_346167250"           # order id
    "L28.27149"                              # latitude of center
    "L-81.9668"                              # longitude of center
    "S1L1_MCD43A/")                          # black, white, blue albedos

Subset CSVs¶

Download the CSVs for near-infrared black, white, and blue albedos and parse them to data frames:

In [3]:

import numpy as np
import pandas as pd
import xarray as xr
from io import StringIO
from datetime import datetime
from datetime import timedelta
from pyproj import Proj, transform

order_nir = {
    "nir_black": None, 
    "nir_white": None, 
    "nir_actual": None}


def mdate2dt(modis_date):
    """AYYYYDOY::A2019001 >> yyyy-mm-dd"""
    idate = datetime(int(modis_date[1:5]),1,1)
    idelt = timedelta(int(modis_date[5:])-1)    
    return((idate+idelt).strftime('%Y-%m-%d'))

colnames = ["id", "prod", "mdate", "loc", "pdate", "var", "data"]
for csv in list(order_nir.keys()):               # iterate over list of nir
    
    csv_url = (
        "https://modis.ornl.gov/subsetdata/"     # global tool orders dir
        "03Jul2019_13:43:49_346167250"           # order id
        "L28.27149"                              # latitude of center
        "L-81.9668"                              # longitude of center
        "S1L1_MCD43A/"+csv+".csv")               
    r = requests.get(csv_url)                    # download with requests.get
    
    order_nir[csv] = pd.read_csv(                # parse response text to pd 
        StringIO(r.text),                        #  StringIO pseudofile
        names=colnames,
        header=None, 
        sep=",") 
    
    # set the data column to mask the fill values (32767.0)
    order_nir[csv].loc[order_nir[csv].data==32767.0, "data"] = np.nan
    
    # get dates from the third column (base zero); set to index
    order_nir[csv].index = [mdate2dt(md) for md in order_nir[csv]["mdate"]] 

    
order_nir["nir_black"].head(5)               # print five rows of nir black

Out[3]:

	id	prod	mdate	loc	pdate	var	data
2018-01-01	MCD43A.A2018001.h10v06.006.2018010031010.nir_b...	MCD43A	A2018001	Lat28.27149Lon-81.9668Samp1Line1	2018010031010	nir_black	0.188892
2018-01-02	MCD43A.A2018002.h10v06.006.2018011031030.nir_b...	MCD43A	A2018002	Lat28.27149Lon-81.9668Samp1Line1	2018011031030	nir_black	0.183267
2018-01-03	MCD43A.A2018003.h10v06.006.2018012030820.nir_b...	MCD43A	A2018003	Lat28.27149Lon-81.9668Samp1Line1	2018012030820	nir_black	0.183281
2018-01-04	MCD43A.A2018004.h10v06.006.2018013032552.nir_b...	MCD43A	A2018004	Lat28.27149Lon-81.9668Samp1Line1	2018013032552	nir_black	0.183250
2018-01-05	MCD43A.A2018005.h10v06.006.2018014032359.nir_b...	MCD43A	A2018005	Lat28.27149Lon-81.9668Samp1Line1	2018014032359	nir_black	0.182639

Get lists of outputs for black, white, and blue:

In [4]:

import glob

bsa_result = glob.glob("result/black_albedo*.nc")
wsa_result = glob.glob("result/white_albedo*.nc")
alb_result = glob.glob("result/blue_albedo*.nc")

print("Black sky albedo outputs for 2018:"); bsa_result

Black sky albedo outputs for 2018:

Out[4]:

['result/black_albedo_MCD43A1.2018_3.nc',
 'result/black_albedo_MCD43A1.2018_7.nc',
 'result/black_albedo_MCD43A1.2018_5.nc',
 'result/black_albedo_MCD43A1.2018_9.nc',
 'result/black_albedo_MCD43A1.2018_8.nc',
 'result/black_albedo_MCD43A1.2018_6.nc',
 'result/black_albedo_MCD43A1.2018_11.nc',
 'result/black_albedo_MCD43A1.2018_4.nc',
 'result/black_albedo_MCD43A1.2018_1.nc',
 'result/black_albedo_MCD43A1.2018_12.nc',
 'result/black_albedo_MCD43A1.2018_10.nc',
 'result/black_albedo_MCD43A1.2018_2.nc']

Ooops, sort those files. File lists passed to xr.open_mfdataset must be sequential:

In [5]:

bsa_result = sorted(bsa_result, key=lambda x: int(x.split("_")[-1][:-3]))
wsa_result = sorted(wsa_result, key=lambda x: int(x.split("_")[-1][:-3]))
alb_result = sorted(alb_result, key=lambda x: int(x.split("_")[-1][:-3]))

bsa_result

Out[5]:

['result/black_albedo_MCD43A1.2018_1.nc',
 'result/black_albedo_MCD43A1.2018_2.nc',
 'result/black_albedo_MCD43A1.2018_3.nc',
 'result/black_albedo_MCD43A1.2018_4.nc',
 'result/black_albedo_MCD43A1.2018_5.nc',
 'result/black_albedo_MCD43A1.2018_6.nc',
 'result/black_albedo_MCD43A1.2018_7.nc',
 'result/black_albedo_MCD43A1.2018_8.nc',
 'result/black_albedo_MCD43A1.2018_9.nc',
 'result/black_albedo_MCD43A1.2018_10.nc',
 'result/black_albedo_MCD43A1.2018_11.nc',
 'result/black_albedo_MCD43A1.2018_12.nc']

Open the black albedo outputs as a single dataset (using xr.open_mfdataset):

In [6]:

bsa = xr.open_mfdataset(bsa_result)

Now extract the time series for the same pixel that was requested from ORNL DAAC. First we need to get the lat,lon coordinate in Sinusoidal meters so that we can index the array (please excuse the absurd string parse job):

In [7]:

ll = order_nir["nir_black"].iloc[0]["loc"].split("Samp")[0][3:].split("Lon")
ll

Out[7]:

['28.27149', '-81.9668']

We need to use Sinusoidal coordinates to safely index our dataset. The following transform is the exact opposite of the one that we used to generate latitude and longitude arrays in 1_Workflow.ipynb:

In [8]:

# EPSG code for geographic WGS84 is our input projection this time
inproj = Proj(init="epsg:4326")

# build the MODIS sinusoidal proj4 string again
getpar = lambda a: str(bsa.crs.attrs[a])
outproj = Proj(" ".join([
    "+proj=sinu",
    "+lon_0="+getpar("longitude_of_central_meridian"),
    "+x_0="+getpar("false_easting"),
    "+y_0="+getpar("false_northing"),
    "+a="+getpar("semi_major_axis"),
    "+b="+getpar("semi_minor_axis"),
    "+units="+"meter +no_defs"]))

# pass the two projection definitions to transform along with the lat,lon
x,y = transform(inproj, outproj, float(ll[1]), float(ll[0]))

print("Sinusoidal equivalent for the lat, lon from the DAAC subset:"); x,y

Sinusoidal equivalent for the lat, lon from the DAAC subset:

Out[8]:

(-8027086.119432877, 3143649.800007406)

Get the coordinate from the x and y dimensions that is nearest to the newly transformed coordinate:

In [9]:

import numpy as np

find_nearest = lambda array, value: array[(np.abs(array - value)).argmin()]
x_near = find_nearest(bsa.x, x).data.item()
y_near = find_nearest(bsa.y, y).data.item()

print("The nearest x,y coordinate in our dataset:"); x_near, y_near

The nearest x,y coordinate in our dataset:

Out[9]:

(-8027124.470202017, 3143808.4379992373)

Finally, index the array and plot:

our calculated black sky albedo time series for the near-infrared band,
the time series calculated by ORNL DAAC, and,
the difference of the two time series

Select black sky albedo arrays, select pixel, put it all in a data frame. Include SZA:

In [10]:

our_nir_bsa  = bsa["BRDF_Albedo_Parameters_nir"]  # get nir BSA for our ds
ornl_nir_bsa = order_nir["nir_black"]["data"]     # get nir BSA for subset

# select only the numpy arrays
sza = bsa["solar_zenith_angle"].sel(x=x_near, y=y_near).data.compute()
ours  = our_nir_bsa.sel(x=x_near, y=y_near).data.compute()
ornls = ornl_nir_bsa.values
diff  = ours-ornls

# make a df
df = pd.DataFrame({
    "ORNLDAAC": ornls, 
    "OURWORKFLOW": ours, 
    "DIFFERENCE": diff,
    "SZA": sza
}, index=bsa.time.dt.dayofyear)

df.head(5)

Out[10]:

	ORNLDAAC	OURWORKFLOW	DIFFERENCE	SZA
dayofyear
1	0.188892	0.188894	0.000002	51.303762
2	0.183267	0.183269	0.000002	51.224372
3	0.183281	0.183284	0.000003	51.138180
4	0.183250	0.183253	0.000003	51.045213
5	0.182639	0.182642	0.000003	50.945498

Plot the data frame:

In [11]:

%matplotlib inline
import matplotlib.pyplot as plt
plt.rcParams['font.size'] = 14 

fig, axs = plt.subplots(3, 1, sharex=True, figsize=(14,8))
fig.suptitle("Computed black sky albedo (MCD43A1): comparison "
             "ORNL DAAC versus Workflow")

df["ORNLDAAC"].plot(ax=axs[0], color="black")
axs[0].set_ylabel("ORNL DAAC")
axs[0].grid(True)

df["OURWORKFLOW"].plot(ax=axs[1], color="black")
axs[1].set_ylabel("OUR WORKFLOW")
axs[1].grid(True)

df["DIFFERENCE"].plot(ax=axs[2])
axs[2].set_ylabel("DIFFERENCE")
axs[2].grid(True)

axs3 = axs[2].twinx()
df["SZA"].plot(ax=axs3, color="orange")
axs3.set_ylabel("ZENITH ANGLE")

plt.show()

Better than I expected.

It looks like the difference is related to the solar zenith angle because the concave time series resembles the solar zenith angle curve through the year.

I can tell you with a lot of confidence that the tiny disparity (see y-axis precision) between the two time series comes from the ORNL DAAC's solar zenith angle calculator because it returns five decimal places. We're using the full precision solar zenith angle calculated by:

# parameters

doy = 166                         # day of the year    
diy = 365                         # number of days in the year
deg_rot_per_day = 360/diy         # degree of rotation per day
abs_max_declination = 23.45       # unsigned max declination

# declination is calculated by:
declination = cos(radians((doy+10)*deg_rot_per_day))*-abs_max_declination

# solar altitude (when given in angular units) and zenith are complementary angles
solar_altitude = 90 - latitude + declination
solar_zenith = 90 - solar_altitude

So, I'm satisfied with this and won't do any further validation for now. But I will plot a browse image for the repo in the cell below:

In [12]:

# we will plot the NIR band
band_name = "BRDF_Albedo_Parameters_nir"
xyd = dict(x=x_near, y=y_near)

# get pixel from black sky albedo results
with xr.open_mfdataset(bsa_result) as bsa:
    sza = bsa["solar_zenith_angle"].sel(**xyd).data.compute()
    ours_nir_bsa  = bsa[band_name].sel(**xyd).data.compute()
    ornls_nir_bsa  = order_nir["nir_black"]["data"].values
    
# get pixel from white sky albedo results
with xr.open_mfdataset(wsa_result) as wsa:
    ours_nir_wsa  = wsa[band_name].sel(**xyd).data.compute()

# get pixel from blue sky albedo results
with xr.open_mfdataset(alb_result) as alb: 
    ours_nir_alb  = alb[band_name].sel(**xyd).data.compute()

# select only the numpy arrays
ornls = ornl_nir_bsa.values
diff  = ours-ornls

# make a df
df = pd.DataFrame({
    "SZA": sza,
    "ORNLDAAC_BLACK": ornls_nir_bsa, 
    "WORKFLOW_BLACK": ours_nir_bsa, 
    "WORKFLOW_WHITE": ours_nir_wsa, 
    "WORKFLOW_BLUE": ours_nir_alb, 
    "DIFFERENCE": ours_nir_bsa-ornls_nir_bsa
}, index=bsa.time.dt.dayofyear)

# and display the first five rows
df.head(5)

Out[12]:

	SZA	ORNLDAAC_BLACK	WORKFLOW_BLACK	WORKFLOW_WHITE	WORKFLOW_BLUE	DIFFERENCE
dayofyear
1	51.303762	0.188892	0.188894	0.193498	0.190270	0.000002
2	51.224372	0.183267	0.183269	0.189028	0.184991	0.000002
3	51.138180	0.183281	0.183284	0.189811	0.185236	0.000003
4	51.045213	0.183250	0.183253	0.190001	0.185271	0.000003
5	50.945498	0.182639	0.182642	0.190485	0.184987	0.000003

Validate with MCD43A3 Albedos¶

coming soon

Plots¶

Blue, black, white albedo time series for one pixel¶

In [13]:

%matplotlib inline
import matplotlib.pyplot as plt
plt.rcParams['font.size'] = 14  
plt.style.use('seaborn-muted')

fig = plt.figure(num=1, figsize=(14, 4))

lnstyle = dict(color="black", alpha=0.75,)
df["WORKFLOW_BLUE"].plot(label="blue", linewidth=2.5)
df["WORKFLOW_BLACK"].plot(linestyle='dashed', label="black", **lnstyle)
df["WORKFLOW_WHITE"].plot(linestyle='dotted', label="white", **lnstyle)
location = "[ %s, %s ]" % (ll[0], ll[1])

plt.grid(alpha=0.5)
plt.legend(loc="upper right")
plt.xlabel("DAY OF THE YEAR")
plt.ylabel("ALBEDO [ 0.0 - 1.0 ]")
plt.title("Blue, black, white sky albedo for 2018 at "+location)

plt.show()

Time series comparison against ORNL DAAC¶

In [14]:

%matplotlib inline
import matplotlib.pyplot as plt
plt.rcParams['font.size'] = 14  
plt.style.use('seaborn-muted')

fig = plt.figure(num=2, figsize=(14, 4))

df["DIFFERENCE"].plot(label="difference")
plt.ticklabel_format(axis='y', style='sci', scilimits=(-2,2))
plt.xlabel("DAY OF THE YEAR", labelpad=10)
plt.ylim(-0.000001, 0.000005)
plt.ylabel("DIFFERENCE")
ax = plt.gca()

ax2 = ax.twinx()
plt.ylim(0, 55)
ax2.fill_between(df.index, df["SZA"], color="gray", alpha=0.1)
ax2.set_ylabel("ZENITH ANGLE [ DEG ]", labelpad=10)
location = "[ %s, %s ]" % (ll[0], ll[1])

plt.grid(alpha=0.5)
plt.title("Albedo difference [ WORKFLOW - ORNLDAAC ] at "+location)

plt.show()

Table of Contents