Contents:
This lab covers:
(1) User-defined functions;
(2) Loops;
(3) Linear algebra applications;
(4) Modeling finite-state Markov chains.
In this section we will cover the basics of creating custom functions.
A function is essentially an object that takes a set of inputs, applies some sort of procedure to said inputs, and spits out a result.
Functions can be handy for organizing code that is likely to be routinely re-used in the future with varying inputs.
Let's define a function named add that takes two variables, x and y, as inputs, and returns their sum.
function add(x, y)
    z = x + y
    return z
end
add (generic function with 1 method)
To check, we call add() with the inputs x=2 and y=3 -- obviously the output is supposed to be 5.
add(2,3)
5
Success!
Recall that we stored the output in a variable z that was defined inside the function. If we try to access z in the global scope (outside the function), we get an error, since z only lives within the function:
z
UndefVarError: z not defined
Stacktrace:
[1] top-level scope
[2] include_string(::Function, ::Module, ::String, ::String) at .\loading.jl:1091
Now let's define a function called all_operations that takes two variables, x and y, as inputs, and returns their sum, difference, product, and quotient.
function all_operations(x, y)
    sum = x + y
    difference = x - y
    product = x * y
    quotient = x / y
    result = (sum, difference, product, quotient)
    return result
end
all_operations (generic function with 1 method)
To check whether all_operations works, we call it with the inputs x=1 and y=2:
all_operations(1, 2)
(3, -1, 2, 0.5)
Notice that the output of all_operations() is a tuple with four entries. Tuples are useful as function outputs because we can easily unpack their entries into separate variables:
# Store output of `all_operations(1,2)` as separate variables
xy_sum, xy_difference, xy_product, xy_quotient = all_operations(1,2)
# Print all collected variables
@show xy_sum
@show xy_difference
@show xy_product
@show xy_quotient;
xy_sum = 3 xy_difference = -1 xy_product = 2 xy_quotient = 0.5
Here is a simpler, alternative way of defining the all_operations function, written as an equivalent all_operations_v2:
function all_operations_v2(x, y)
    (x + y, x - y, x * y, x / y)
end
all_operations_v2 (generic function with 1 method)
What did we do differently? We didn't use return at the end of the function to return the output. When we called the function, Julia noticed the lack of a return statement and chose the last expression it evaluated as the output -- in our case, the unassigned tuple.
Is this alternative way of defining all_operations() better? Not necessarily -- it depends on the context. For example, you might find that shorter code isn't necessarily easier to read.
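Julia also offers an even terser assignment-form syntax for one-expression functions. A quick sketch (the name all_operations_v3 is introduced here just for illustration):

```julia
# Assignment-form definition: the whole body is a single expression,
# here the tuple of all four operations
all_operations_v3(x, y) = (x + y, x - y, x * y, x / y)

all_operations_v3(1, 2)  # (3, -1, 2, 0.5), as before
```

Which form to prefer is, again, a matter of readability in context.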
Let's just apply all_operations() and all_operations_v2() to the same inputs and check whether the outputs match using a custom-defined function check():
# Create var. `condition` that tests whether outputs are equivalent
condition = all_operations(1,2) == all_operations_v2(1,2)
# Create fun. `check` w/ input `condition`
function check(condition)
    if condition == true
        result = "The two functions are the same!"
    end
    if condition != true
        result = "The two functions are not the same!"
    end
    return result
end
# Run `check` on `condition`
check(condition)
"The two functions are the same!"
The check() function, as defined in the previous cell, is pretty clunky -- let's simplify it:
# Re-define function `check()`
function check(condition)
    if condition == true
        return "The two functions are the same!"
    else
        return "The two functions are not the same!"
    end
end
# Call on `check()`
check(condition)
"The two functions are the same!"
Or alternatively:
# Re-define function `check()`
function check(condition)
    if condition == true
        return "The two functions are the same!"
    end
    "The two functions are not the same!"
end
# Call on `check()`
check(condition)
"The two functions are the same!"
Again -- for simple functions such as the ones shown above, being super efficient is not necessary.
But clunky code can make larger scripts hard to read, and potentially even run slowly!
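For what it's worth, a two-way branch like this can shrink to a single expression with Julia's ternary operator ?: -- a sketch equivalent to the check() variants above (named check_v2 here to avoid clobbering check):

```julia
# `cond ? a : b` evaluates to `a` when cond is true, and to `b` otherwise
check_v2(condition) =
    condition ? "The two functions are the same!" :
                "The two functions are not the same!"

check_v2(true)  # "The two functions are the same!"
```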
Now let's talk math.
Defining mathematical functions in Julia is easy.
Let's define the polynomial mapping $f:\mathbb{R} \rightarrow \mathbb{R}$ such that $f(x) = x^2 - 3x + 2$:
f(x) = x^2 - 3x + 2
f (generic function with 1 method)
Suppose we're interested in knowing the value of $f(\pi)$:
f(pi)
2.4448264403199786
Alternatively:
f(π)
2.4448264403199786
There are a lot of details on user-defined functions that we haven't been able to cover here, but I think the above should be enough to get you started.
Visit the official Julia manual section on functions to learn more.
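As a small taste of what that manual section covers, here is a hedged sketch of two such features: default argument values and multiple dispatch (several methods of one function, selected by input type). The names increment and describe are invented here purely for illustration:

```julia
# Default value: `y` is optional and defaults to 1
increment(x, y = 1) = x + y

# Multiple dispatch: one method of `describe` per input type
describe(x::Number) = "a number"
describe(x::String) = "a string"

increment(10)      # 11
increment(10, 5)   # 15
describe(3.14)     # "a number"
describe("hi")     # "a string"
```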
Let's print every integer between 1 and 5 using a while loop:
# Initial value
i = 1
# While loop:
while i <= 5 # Run until i exceeds 5
    println(i) # Print i
    i = i + 1 # Add 1 to i for the next iteration of the loop
end
1 2 3 4 5
The above can be accomplished more easily using a for loop:
for i in 1:5
    println(i)
end
1 2 3 4 5
We can pass any kind of sequence to a for loop. For example, we can print the odd numbers between 1 and 5 by defining a sequence called sequence and then iterating over its values:
sequence = [1.0,3.0,5.0]
for i in sequence
    println(i)
end
1.0 3.0 5.0
What if we instead want to print the indices associated with the entries of sequence rather than the entry values themselves?
for i in eachindex(sequence)
    println(i)
end
1 2 3
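If we want both the index and the value at once, we can iterate over pairs(sequence), which yields (index, value) tuples -- a sketch:

```julia
sequence = [1.0, 3.0, 5.0]

# `pairs` pairs each index with the corresponding entry
for (i, value) in pairs(sequence)
    println("entry ", i, " is ", value)
end
```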
Suppose we want to square all values of sequence and store the results in a separate array called seq_out. We can accomplish this using a for loop that goes through each entry of sequence, squares it, and stores it in the corresponding entry of seq_out:
# Declare `seq_out` as a vector
# w/ the same number of entries as `sequence`
seq_out = zeros(length(sequence))
# Run a for-loop that goes through
# the indexes of `sequence`
for i in eachindex(sequence)
    seq_out[i] = sequence[i]^2
end
# Print `seq_out`
seq_out
3-element Array{Float64,1}: 1.0 9.0 25.0
Alternatively, we can use a for loop in a comprehension:
seq_out = [sequence[i]^2 for i in eachindex(sequence)]
3-element Array{Float64,1}: 1.0 9.0 25.0
Even better -- we can broadcast (remember this from last lab?) ^2 across sequence:
seq_out = sequence.^2
3-element Array{Float64,1}: 1.0 9.0 25.0
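Yet another equivalent route is map, which applies a function to each element -- a sketch:

```julia
sequence = [1.0, 3.0, 5.0]

# Apply the anonymous function x -> x^2 to every entry
seq_out = map(x -> x^2, sequence)  # [1.0, 9.0, 25.0]
```

All four versions produce the same array; for a simple elementwise operation, broadcasting is usually the most idiomatic.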
First, we load up the LinearAlgebra package.
using LinearAlgebra
Let's assume we have vectors $a_1 = (1, 2, 3)'$ and $a_2 = (4, 5, 6)'$.
We start by defining these two column vectors:
a_1 = [1; 2; 3]
a_2 = [4, 5, 6];
Recall that whether we use semicolons or commas to separate the entries, we get column vectors either way.
Obviously $a_1$ and $a_2$ are linearly independent (neither is a scalar multiple of the other), but let's just check to be sure:
A = [a_1 a_2]
b = A \ zeros(3)
2-element Array{Float64,1}: -0.0 -0.0
Since the zero vector is the only solution for $x$ in $A \, x = b$ where $A = [ a_1 a_2 ]$, then $a_1$ and $a_2$ must be linearly independent.
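A more direct check, for the curious, is rank() from the LinearAlgebra package: the columns of $A$ are linearly independent exactly when the rank equals the number of columns. A sketch:

```julia
using LinearAlgebra

a_1 = [1; 2; 3]
a_2 = [4, 5, 6]
A = [a_1 a_2]

# Rank 2 with 2 columns => linearly independent columns
rank(A)  # 2
```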
Find the dot product of $a_1$ and $a_2$:
a_1' * a_2
32
Alternatively:
dot(a_1, a_2)
32
Now let's find $a_1 a_2'$:
a_1 * a_2'
3×3 Array{Int64,2}: 4 5 6 8 10 12 12 15 18
Now let's add the two vectors:
a_1 + a_2
3-element Array{Int64,1}: 5 7 9
Subtract $a_2$ from $a_1$:
a_1 - a_2
3-element Array{Int64,1}: -3 -3 -3
Let's scale vector $a_1$ by 3:
3a_1
3-element Array{Int64,1}: 3 6 9
Equivalently:
3 * a_1
3-element Array{Int64,1}: 3 6 9
Equivalently:
3 .* a_1
3-element Array{Int64,1}: 3 6 9
The norm of vector $a_1$:
norm(a_1)
3.7416573867739413
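Dividing a vector by its norm gives a unit vector pointing in the same direction; LinearAlgebra's normalize does this in one call. A sketch:

```julia
using LinearAlgebra

a_1 = [1; 2; 3]

u = a_1 / norm(a_1)   # unit vector in the direction of a_1
norm(u)               # 1.0 (up to floating-point error)
normalize(a_1) ≈ u    # true
```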
Since $a_1$ and $a_2$ alone cannot span $\mathbb{R}^3$, we can find a third vector $a_3$ that is linearly independent of them.
Is $a_3 = a_1 + a_2$ linearly independent? (Obviously not, by construction, but let's practice checking.)
a_3 = a_1 + a_2
b = A \ a_3
2-element Array{Float64,1}: 0.9999999999999979 1.000000000000001
Since there exists a non-trivial solution for $x$ in $[a_1 \, a_2] \, x = A \, x = a_3$, $a_3$ is not linearly independent of $a_1$ and $a_2$.
We can find a linearly independent $a_3$ by guessing some initial vector $b_3$, projecting it onto the column space of $A$ to obtain the projection $\hat{b}_3$, and then extracting the orthogonal component $a_3 = b_3 - \hat{b}_3$:
b_3 = [2, 3, 4]
a_3 = b_3 - (A * inv(A'A) * A' * b_3)
a_3
3-element Array{Float64,1}: -8.881784197001252e-16 -1.3322676295501878e-15 2.6645352591003757e-15
Notice that $a_3$ came out numerically equal to the zero vector: our guess $b_3 = (2,3,4)'$ happens to lie in the column space of $A$ (indeed, $b_3 = \tfrac{2}{3} a_1 + \tfrac{1}{3} a_2$), so its orthogonal component vanishes. A guess outside the span, such as $b_3 = (1,0,0)'$, would have produced a usable $a_3$. Let's see what happens anyway when we redefine $A$ as $A = [a_1 \, a_2 \, a_3]$ and solve for $x$ in $A \, x = 0$:
A = [a_1 a_2 a_3]
A \ zeros(3)
3-element Array{Float64,1}: 0.0 0.0 -0.0
The solver returns the trivial solution, but be careful: because $a_3$ is numerically zero, $A$ is actually rank-deficient, so we cannot conclude from this output alone that $A$ is full-rank. The near-zero eigenvalue in the next cell is another symptom of this.
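We can make the diagnosis concrete with rank(): the guess $b_3 = (2,3,4)'$ lies in the column space of $A$, so its orthogonal component is numerically zero, while a guess outside the span yields a genuinely independent third vector. A self-contained sketch:

```julia
using LinearAlgebra

a_1 = [1; 2; 3]
a_2 = [4, 5, 6]
A = [a_1 a_2]

# b_3 = (2/3)a_1 + (1/3)a_2 lies in the column space of A,
# so its orthogonal component is numerically zero
b_3 = [2, 3, 4]
a_3 = b_3 - A * inv(A'A) * A' * b_3
norm(a_3)             # ~0: this guess buys us nothing

# A guess outside the span yields a genuinely independent a_3
b_3 = [1, 0, 0]
a_3 = b_3 - A * inv(A'A) * A' * b_3
rank([a_1 a_2 a_3])   # 3 -- full rank
```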
Now let's check the eigenvalues and eigenvectors of $A$:
eigenv_A, eigenvec_A = eigen(A);
eigenv_A
3-element Array{Float64,1}: -0.4641016151377548 4.440892098500625e-15 6.464101615137752
eigenvec_A
3×3 Array{Float64,2}: -0.491831 2.96059e-16 -0.412884 0.180023 1.4803e-16 -0.56401 0.851877 1.0 -0.715136
Suppose that for a Markov process with three states we are given an initial state distribution and a stochastic matrix.
We are told to compute the state density in 10 periods (at $t=10$).
The given initial distribution is $P_0 = (1/3, 1/3, 1/3)'$, while the stochastic matrix is
$$M = \begin{bmatrix} 0.95 & 0.05 & 0 \\ 0.15 & 0.75 & 0.1 \\ 0 & 0.5 & 0.5 \end{bmatrix} \, .$$
# Define initial state distr.
P0 = [1/3, 1/3, 1/3]
# Define stochastic matrix
M = [0.95 0.05 0 ; 0.15 0.75 0.1 ; 0.0 0.5 0.5]
# Compute state distr. at t = 10
P10 = (P0' * M^(10))'
3-element Array{Float64,1}: 0.6415045988833332 0.2941181890520833 0.06437721206458333
More generally, what if we're interested in computing the state density for a variety of $n \in \mathbb{N}$ periods?
This is when writing custom functions comes in handy!
Let's create a function named markov_chain with inputs for $P_0$, $M$, and $n$:
function markov_chain(P0, M, n)
    # Start by creating a copy of P0
    # to later feed into the loop
    P = copy(P0)
    # Run a loop that computes
    # P n-steps ahead
    for i in 1:n
        new_P = (P' * M)'
        P = new_P
    end
    # Return the final state distr.
    return P
end
markov_chain (generic function with 1 method)
We can now try our new function out using the previously-defined P0 and M with n set to 10:
markov_chain(P0, M, 10)
3-element Array{Float64,1}: 0.641504598883333 0.29411818905208326 0.06437721206458331
Does this match P10?
markov_chain(P0, M, 10) ≈ P10
true
It does!
We can now use this function to do a whole bunch of useful things.
For example, given the same $P_0$ and $M$, we can now create a list of $P_n$ for $n = 1,2,\ldots,20$:
probabilities = [markov_chain(P0, M, n) for n in 1:20];
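Another handy payoff: pushing $n$ far out approximates the chain's stationary distribution $\pi$, which satisfies $\pi' M = \pi'$. A self-contained sketch (redefining the objects from above so it runs on its own):

```julia
# Same P0, M, and markov_chain as defined earlier in the lab
P0 = [1/3, 1/3, 1/3]
M = [0.95 0.05 0.0 ; 0.15 0.75 0.1 ; 0.0 0.5 0.5]

function markov_chain(P0, M, n)
    P = copy(P0)
    for i in 1:n
        P = (P' * M)'
    end
    return P
end

# Iterate far ahead to approximate the stationary distribution
pi_hat = markov_chain(P0, M, 1000)

# Stationarity check: applying M once more changes nothing
(pi_hat' * M)' ≈ pi_hat   # true
```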
We may visualize the probability of each state across time by gathering and plotting the corresponding series for all three states:
state1 = zeros(20)
state2 = zeros(20)
state3 = zeros(20)
for i in 1:20
    state1[i] = probabilities[i][1]
    state2[i] = probabilities[i][2]
    state3[i] = probabilities[i][3]
end
Let's load the Plots package to make a couple of plots.
using Plots
First, let's make a simple plot that contains all series:
time = 1:20
plot(time, state1) # Plot state 1 density
plot!(time, state2) # Plot state 2 density
plot!(time, state3) # Plot state 3 density
Notice that, as time passes, being in state 1 becomes more likely, while states 2 and 3 become less likely.
We may also create a separate plot for each series, but include them in a single composition:
p1 = plot(time, state1, title = "Normal Growth") # Plot state 1 probability
p2 = plot(time, state2, title = "Recession") # Plot state 2 probability
p3 = plot(time, state3, title = "Deep Recession") # Plot state 3 probability
plot(p1, p2, p3, layout = (3,1))
Now let's write a function that computes the probability of a given sequence of outcomes.
(You should have seen something like this on Assignment 1.)
function outcome_prob(outcome, P0, M)
    # Make sure `outcome` contains integers
    # to allow for indexing
    outcome = floor.(Int64, outcome)
    # Store probability of initial state
    probability = P0[outcome[1]]
    # Compute probability of `outcome` sequence
    for i in 2:length(outcome)
        probability = probability * M[outcome[i-1], outcome[i]]
    end
    # Return `probability` -- prob. of sequence `outcome`
    return probability
end
outcome_prob (generic function with 1 method)
Let's keep using our previously-defined initial distribution $P_0$ and stochastic matrix $M$.
To check that the function works, we can feed it the sequence of states $11$ -- the probability of this sequence of outcomes should obviously be $(1/3)(19/20)$ according to our defined $P_0$ and $M$.
outcome_prob(ones(2), P0, M)
0.31666666666666665
What if we want to find the probability of observing one of the following outcomes: (1) $11$, or (2) $22$?
We can use outcome_prob() to compute the probability of each outcome and then sum them.
outcome_prob([1,1], P0, M) + outcome_prob([2,2], P0, M)
0.5666666666666667
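Since distinct sequences are mutually exclusive events, this pattern generalizes to any collection of outcomes via a sum over a generator. A self-contained sketch (redefining P0, M, and outcome_prob so it runs on its own):

```julia
P0 = [1/3, 1/3, 1/3]
M = [0.95 0.05 0.0 ; 0.15 0.75 0.1 ; 0.0 0.5 0.5]

function outcome_prob(outcome, P0, M)
    outcome = floor.(Int64, outcome)
    probability = P0[outcome[1]]
    for i in 2:length(outcome)
        probability *= M[outcome[i-1], outcome[i]]
    end
    return probability
end

# Probability of observing 11, 22, or 33:
outcomes = [[1, 1], [2, 2], [3, 3]]
sum(outcome_prob(o, P0, M) for o in outcomes)  # (0.95 + 0.75 + 0.5)/3
```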