Neural Network IV with Simulated Data


By Bastiaan


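We start with some simulated data, borrowed from this post: an outcome y, an observed regressor x, an unobserved covariate c, and an instrument z.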

{% highlight r %}
# library for generating multivariate distributions
library(MASS)

# always use the same random numbers
set.seed(123)

# the means and covariance matrix for the multivariate distribution
MUs <- c(10, 15)
SIGMAs <- matrix(c(1, 0.5, 0.5, 2), nrow = 2, ncol = 2)

# the multivariate distribution
mdist <- mvrnorm(n = 1000, mu = MUs, Sigma = SIGMAs)

# create unobserved covariate
c <- mdist[ , 2]

# create the instrumental variable
z <- rnorm(1000)

# create observed variable
x <- mdist[ , 1] + z

# construct the dependent variable
y <- 1 + x + c + rnorm(1000, 0, 0.5)
{% endhighlight %}
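Written out, the simulation above corresponds to the following data-generating process. This is read directly from the code; the symbols $m_i$ (the first column of `mdist`) and $\varepsilon_i$ (the noise term) are notation introduced here, and $z_i$ is drawn independently of $(m_i, c_i)$.

```latex
\begin{aligned}
(m_i, c_i) &\sim \mathcal{N}\!\left(
  \begin{pmatrix} 10 \\ 15 \end{pmatrix},
  \begin{pmatrix} 1 & 0.5 \\ 0.5 & 2 \end{pmatrix}
\right), \\
z_i &\sim \mathcal{N}(0, 1), \\
x_i &= m_i + z_i, \\
y_i &= 1 + x_i + c_i + \varepsilon_i,
\qquad \varepsilon_i \sim \mathcal{N}(0, 0.5^2).
\end{aligned}
```

Because x and c share the correlated component $m_i$, leaving c out of a regression makes x endogenous, while z only enters through x.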

Check that the variables behave as expected: x should be correlated with the unobserved c, while the instrument z should not be.

{% highlight r %}
cor(x, c)
{% endhighlight %}

{% highlight text %}
[1] 0.1986307
{% endhighlight %}

{% highlight r %}
cor(z, c)
{% endhighlight %}

{% highlight text %}
[1] -0.0120011
{% endhighlight %}

Let’s look at the true model, which includes the covariate c that we normally would not observe.

{% highlight r %}
lm(y ~ x + c)
{% endhighlight %}

{% highlight text %}
Call:
lm(formula = y ~ x + c)

Coefficients:
(Intercept)            x            c
     0.9079       1.0156       0.9955
{% endhighlight %}

Estimate using OLS, leaving out c. The coefficient on x is now biased upward.

{% highlight r %}
lm(y ~ x)
{% endhighlight %}

{% highlight text %}
Call:
lm(formula = y ~ x)

Coefficients:
(Intercept)            x
     13.787        1.226
{% endhighlight %}
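The size of the bias in the OLS slope is roughly what the omitted-variable formula predicts. A short worked version, using the population moments from the simulation setup rather than the sample values (so it is only approximate for this particular draw):

```latex
\operatorname{plim} \hat{\beta}_{\mathrm{OLS}}
  = \beta_x + \beta_c \, \frac{\operatorname{Cov}(x, c)}{\operatorname{Var}(x)}
  = 1 + 1 \cdot \frac{0.5}{1 + 1}
  = 1.25
```

Here $\operatorname{Var}(x) = 1 + 1$ because x is the sum of the first multivariate-normal component and the independent instrument z, each with variance 1. The estimated slope of 1.226 is close to this value.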

Now using instrumental variables, with z as the instrument for x.

{% highlight r %}
library(AER)
ivreg(y ~ x | z)
{% endhighlight %}

{% highlight text %}
Call:
ivreg(formula = y ~ x | z)

Coefficients:
(Intercept)            x
     15.949        1.008
{% endhighlight %}
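With one instrument and one endogenous regressor, the IV slope reduces to a ratio of covariances, cov(z, y) / cov(z, x). The following is a minimal sketch of that calculation, not what `ivreg` does internally. It regenerates the data from the top of the post so it runs on its own; `beta_iv` and `alpha_iv` are names introduced here.

```r
library(MASS)

# regenerate the simulated data with the same seed and draw order
set.seed(123)
mdist <- mvrnorm(n = 1000, mu = c(10, 15),
                 Sigma = matrix(c(1, 0.5, 0.5, 2), nrow = 2, ncol = 2))
c <- mdist[, 2]
z <- rnorm(1000)
x <- mdist[, 1] + z
y <- 1 + x + c + rnorm(1000, 0, 0.5)

# IV slope as a ratio of covariances (just-identified case)
beta_iv <- cov(z, y) / cov(z, x)

# the intercept follows from the sample means
alpha_iv <- mean(y) - beta_iv * mean(x)

cat("intercept:", alpha_iv, "slope:", beta_iv, "\n")
```

Since the seed and the order of the random draws are the same as above, these values should agree with the `ivreg` coefficients up to rounding.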

Now the same two-stage estimate by hand, using the lm function.

{% highlight r %}
# first stage
lms1 <- lm(x ~ z)

# manually obtain fitted values
lmXhat <- lms1$coefficients[2]*z + lms1$coefficients[1]

# estimate second stage using Xhat
(lms2 <- lm(y ~ lmXhat))
{% endhighlight %}

{% highlight text %}
Call:
lm(formula = y ~ lmXhat)

Coefficients:
(Intercept)       lmXhat
     15.949        1.008
{% endhighlight %}
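The manual fitted-value computation can also be written with `fitted()`, which returns the same values. A small check, again regenerating the data so it runs on its own; `manual_xhat`, `builtin_xhat` and `max_diff` are names introduced here. One caveat when doing the second stage by hand: the standard errors that `lm` reports for it are not valid 2SLS standard errors, because the residuals are computed with the fitted rather than the observed x.

```r
library(MASS)

# regenerate the simulated data with the same seed and draw order
set.seed(123)
mdist <- mvrnorm(n = 1000, mu = c(10, 15),
                 Sigma = matrix(c(1, 0.5, 0.5, 2), nrow = 2, ncol = 2))
c <- mdist[, 2]
z <- rnorm(1000)
x <- mdist[, 1] + z
y <- 1 + x + c + rnorm(1000, 0, 0.5)

# first stage
lms1 <- lm(x ~ z)

# fitted values computed by hand and via fitted()
manual_xhat  <- lms1$coefficients[2] * z + lms1$coefficients[1]
builtin_xhat <- fitted(lms1)

# largest absolute difference between the two versions
max_diff <- max(abs(manual_xhat - builtin_xhat))
cat("max difference:", max_diff, "\n")
```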

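Now we can do the same using a neural network. With size=0 and skip=TRUE the network has no hidden units, only a direct connection from input to output, so with linout=TRUE it is a linear regression.

{% highlight r %}
library(nnet)

# first stage
nns1 <- nnet(x ~ z, size=0, skip=TRUE, linout=TRUE)
{% endhighlight %}

{% highlight text %}
# weights: 2
initial value 100832.781903
final value 924.804075
converged
{% endhighlight %}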
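For a linear-output network without weight decay, the "final value" reported above should be the residual sum of squares, which is the same criterion `lm` minimizes. A minimal sketch that checks this on the first stage, assuming the default nnet settings (no weight decay); it regenerates the data so it runs on its own, and `sse_lm` and `sse_nn` are names introduced here.

```r
library(MASS)
library(nnet)

# regenerate the simulated data with the same seed and draw order
set.seed(123)
mdist <- mvrnorm(n = 1000, mu = c(10, 15),
                 Sigma = matrix(c(1, 0.5, 0.5, 2), nrow = 2, ncol = 2))
c <- mdist[, 2]
z <- rnorm(1000)
x <- mdist[, 1] + z

# first stage, once with lm and once with a network without hidden units
lms1 <- lm(x ~ z)
nns1 <- nnet(x ~ z, size = 0, skip = TRUE, linout = TRUE, trace = FALSE)

# residual sum of squares from lm versus the fitting criterion from nnet
sse_lm <- sum(resid(lms1)^2)
sse_nn <- nns1$value

cat("lm SSE:", sse_lm, "nnet value:", sse_nn, "\n")
```

If the two numbers agree, the network and the linear model are solving the same least-squares problem, which is why the coefficients later match so closely.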

{% highlight r %}
# manually obtain fitted values
nnXhat <- nns1$fitted.values

# estimate second stage using Xhat
nns2 <- nnet(y ~ nnXhat, size=0, skip=TRUE, linout=TRUE)
{% endhighlight %}

{% highlight text %}
# weights: 2
initial value 528901.038261
final value 4019.409973
converged
{% endhighlight %}

{% highlight r %}
summary(nns2)
{% endhighlight %}

{% highlight text %}
a 1-0-1 network with 2 weights
options were - skip-layer connections  linear output units
 b->o i1->o
15.95  1.01
{% endhighlight %}

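Compare the second-stage coefficients of the linear model with the weights of the network. The differences are on the order of 1e-9, so the two approaches give the same estimates.

{% highlight r %}
lms2$coefficients - nns2$wts
{% endhighlight %}

{% highlight text %}
  (Intercept)        lmXhat
-1.749729e-10 -2.814797e-09
{% endhighlight %}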

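Compare the fitted values of the two second stages.

{% highlight r %}
library(ggplot2)

# as.vector() drops the matrix shape of the nnet fitted values before plotting
qplot(as.vector(lms2$fitted.values - nns2$fitted.values))
{% endhighlight %}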

Next step: redo this using a non-linearity.


This post is licensed under CC BY 4.0 by the author.
