Exponential Random Graph Models (ERGMs) using statnet

Last updated 2025-06-21

install.packages('ergm')
library(ergm)
Loading required package: network

'network' 1.19.0 (2024-12-08), part of the Statnet Project
* 'news(package="network")' for changes since last version
* 'citation("network")' for citation information
* 'https://statnet.org' for help, support, and other information

'ergm' 4.9.0 (2025-06-09), part of the Statnet Project
* 'news(package="ergm")' for changes since last version
* 'citation("ergm")' for citation information
* 'https://statnet.org' for help, support, and other information
'ergm' 4 is a major update that introduces some backwards-incompatible
changes. Please type 'news(package="ergm")' for a list of major
changes.
packageVersion("ergm")
[1] '4.9.0'
?ergmTerms
help("[name]-ergmTerm")
ergmTerm?[name]
?ergmKeyword`
search.ergmTerms(keyword='curved')
Found  8  matching ergm terms:
altkstar(lambda, fixed=FALSE) (binary)
    Alternating k-star

gwb1degree(decay, fixed=FALSE, attr=NULL, cutoff=30, levels=NULL) (binary)
    Geometrically weighted degree distribution for the first mode in a bipartite network

gwb1dsp(decay=0, fixed=FALSE, cutoff=30) (binary)
    Geometrically weighted dyadwise shared partner distribution for dyads in the first bipartition

gwb2degree(decay, fixed=FALSE, attr=NULL, cutoff=30, levels=NULL) (binary)
    Geometrically weighted degree distribution for the second mode in a bipartite network

gwb2dsp(decay=0, fixed=FALSE, cutoff=30) (binary)
    Geometrically weighted dyadwise shared partner distribution for dyads in the second bipartition

gwdegree(decay, fixed=FALSE, attr=NULL, cutoff=30, levels=NULL) (binary)
    Geometrically weighted degree distribution

gwidegree(decay, fixed=FALSE, attr=NULL, cutoff=30, levels=NULL) (binary)
    Geometrically weighted in-degree distribution

gwodegree(decay, fixed=FALSE, attr=NULL, cutoff=30, levels=NULL) (binary)
    Geometrically weighted out-degree distribution
vignette("ergm-term-crossRef")
?read.paj
?read.paj.simplify
?loading.attributes
?network
data(package='ergm') # tells us the datasets in our packages
set.seed(1) # The plot.network function uses random values
data(florentine) # loads flomarriage and flobusiness data
flomarriage # Equivalent to print(flomarriage): Examine properties
 Network attributes:
  vertices = 16 
  directed = FALSE 
  hyper = FALSE 
  loops = FALSE 
  multiple = FALSE 
  bipartite = FALSE 
  total edges= 20 
    missing edges= 0 
    non-missing edges= 20 

 Vertex attribute names: 
    priorates totalties vertex.names wealth 

No edge attributes
par(mfrow=c(1,2)) # Set up a 2-column (and 1-row) plot area

# Plot the network, saving the coordinates for the next plot
coords <- plot(flomarriage, 
     main="Florentine Marriage", 
     cex.main=0.8, 
     label = network.vertex.names(flomarriage), 
     label.cex=0.4,
     pad=3) # Equivalent to plot.network(...)

wealth <- flomarriage %v% 'wealth' # %v% references vertex attributes, equivalent to get.vertex.attribute(flomarriage, "wealth")
wealth
 [1]  10  36  55  44  20  32   8  42 103  48  49   3  27  10 146  48
# Plot with vertex size proportional to wealth
plot(flomarriage, coord=coords, jitter=FALSE,
     vertex.cex=wealth/25, 
     main="Florentine marriage by wealth", cex.main=0.8,
     pad=0.5) 
summary(flomarriage ~ edges) # Calculate the edges statistic for this network
edges 
   20 
flomodel.01 <- ergm(flomarriage ~ edges) # Estimate the model 
Starting maximum pseudolikelihood estimation (MPLE):
Obtaining the responsible dyads.
Evaluating the predictor and response matrix.
Maximizing the pseudolikelihood.
Finished MPLE.
Evaluating log-likelihood at the estimate. 
summary(flomodel.01) # Look at the fitted model object
Call:
ergm(formula = flomarriage ~ edges)

Maximum Likelihood Results:

      Estimate Std. Error MCMC % z value Pr(>|z|)    
edges  -1.6094     0.2449      0  -6.571   <1e-04 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

     Null Deviance: 166.4  on 120  degrees of freedom
 Residual Deviance: 108.1  on 119  degrees of freedom

AIC: 110.1  BIC: 112.9  (Smaller is better. MC Std. Err. = 0)
(20/120)^3 * choose(16,3) 
[1] 2.592593
set.seed(321)
summary(flomarriage~edges+triangle) # Look at the g(y) statistics for this model
   edges triangle 
      20        3 
flomodel.02 <- ergm(flomarriage~edges+triangle) # Estimate the theta coefficients
summary(flomodel.02)
Call:
ergm(formula = flomarriage ~ edges + triangle)

Monte Carlo Maximum Likelihood Results:

         Estimate Std. Error MCMC % z value Pr(>|z|)    
edges     -1.6652     0.3474      0  -4.793   <1e-04 ***
triangle   0.1614     0.5592      0   0.289    0.773    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

     Null Deviance: 166.4  on 120  degrees of freedom
 Residual Deviance: 108.1  on 118  degrees of freedom

AIC: 112.1  BIC: 117.6  (Smaller is better. MC Std. Err. = 0.009804)
plogis(coef(flomodel.02)[[1]] + (0:2) * coef(flomodel.02)[[2]])
[1] 0.1590709 0.1818639 0.2071184
class(flomodel.02) # this has the class ergm
[1] "ergm"
names(flomodel.02) # the ERGM object contains lots of components.
 [1] "coefficients"    "sample"          "iterations"      "MCMCtheta"      
 [5] "loglikelihood"   "gradient"        "hessian"         "covar"          
 [9] "failure"         "newnetwork"      "coef.init"       "est.cov"        
[13] "coef.hist"       "stats.hist"      "steplen.hist"    "control"        
[17] "etamap"          "MCMCflag"        "nw.stats"        "call"           
[21] "network"         "ergm_version"    "info"            "MPLE_is_MLE"    
[25] "drop"            "offset"          "estimable"       "formula"        
[29] "reference"       "constraints"     "obs.constraints" "estimate"       
[33] "estimate.desc"   "null.lik"        "mle.lik"        
flomodel.02$MCMCflag # all can be inspected/extracted
[1] TRUE
coef(flomodel.02) # key components have accessor functions
    edges  triangle 
-1.665157  0.161387 
summary(wealth) # summarize the distribution of wealth
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
   3.00   17.50   39.00   42.56   48.25  146.00 
# plot(flomarriage, 
#      vertex.cex=wealth/25, 
#      main="Florentine marriage by wealth", 
#      cex.main=0.8) # network plot with vertex size proportional to wealth

summary(flomarriage~edges+nodecov('wealth')) # observed statistics for the model
         edges nodecov.wealth 
            20           2168 
flomodel.03 <- ergm(flomarriage~edges+nodecov('wealth'))
Starting maximum pseudolikelihood estimation (MPLE):
Obtaining the responsible dyads.
Evaluating the predictor and response matrix.
Maximizing the pseudolikelihood.
Finished MPLE.
Evaluating log-likelihood at the estimate. 
summary(flomodel.03)
Call:
ergm(formula = flomarriage ~ edges + nodecov("wealth"))

Maximum Likelihood Results:

                Estimate Std. Error MCMC % z value Pr(>|z|)    
edges          -2.594929   0.536056      0  -4.841   <1e-04 ***
nodecov.wealth  0.010546   0.004674      0   2.256   0.0241 *  
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

     Null Deviance: 166.4  on 120  degrees of freedom
 Residual Deviance: 103.1  on 118  degrees of freedom

AIC: 107.1  BIC: 112.7  (Smaller is better. MC Std. Err. = 0)
data(faux.mesa.high) 
mesa <- faux.mesa.high
set.seed(1)
mesa
 Network attributes:
  vertices = 205 
  directed = FALSE 
  hyper = FALSE 
  loops = FALSE 
  multiple = FALSE 
  bipartite = FALSE 
  total edges= 203 
    missing edges= 0 
    non-missing edges= 203 

 Vertex attribute names: 
    Grade Race Sex 

No edge attributes
par(mfrow=c(1,1)) # Back to 1-panel plots
plot(mesa, vertex.col='Grade')
legend('bottomleft',fill=7:12,
       legend=paste('Grade',7:12),cex=0.75)
fauxmodel.01 <- ergm(mesa ~edges + 
        nodefactor('Grade') + nodematch('Grade',diff=T) +
        nodefactor('Race') + nodematch('Race',diff=T))
Observed statistic(s) nodematch.Race.Black and nodematch.Race.Other are at their smallest attainable values. Their coefficients will be fixed at -Inf.
Starting maximum pseudolikelihood estimation (MPLE):
Obtaining the responsible dyads.
Evaluating the predictor and response matrix.
Maximizing the pseudolikelihood.
Finished MPLE.
Evaluating log-likelihood at the estimate. 
summary(fauxmodel.01)
Call:
ergm(formula = mesa ~ edges + nodefactor("Grade") + nodematch("Grade", 
    diff = T) + nodefactor("Race") + nodematch("Race", diff = T))

Maximum Likelihood Results:

                      Estimate Std. Error MCMC % z value Pr(>|z|)    
edges                  -8.0538     1.2561      0  -6.412  < 1e-04 ***
nodefactor.Grade.8      1.5201     0.6858      0   2.216 0.026663 *  
nodefactor.Grade.9      2.5284     0.6493      0   3.894  < 1e-04 ***
nodefactor.Grade.10     2.8652     0.6512      0   4.400  < 1e-04 ***
nodefactor.Grade.11     2.6291     0.6563      0   4.006  < 1e-04 ***
nodefactor.Grade.12     3.4629     0.6566      0   5.274  < 1e-04 ***
nodematch.Grade.7       7.4662     1.1730      0   6.365  < 1e-04 ***
nodematch.Grade.8       4.2882     0.7150      0   5.997  < 1e-04 ***
nodematch.Grade.9       2.0371     0.5538      0   3.678 0.000235 ***
nodematch.Grade.10      1.2489     0.6233      0   2.004 0.045111 *  
nodematch.Grade.11      2.4521     0.6124      0   4.004  < 1e-04 ***
nodematch.Grade.12      1.2987     0.6981      0   1.860 0.062824 .  
nodefactor.Race.Hisp   -1.6659     0.2963      0  -5.622  < 1e-04 ***
nodefactor.Race.NatAm  -1.4725     0.2869      0  -5.132  < 1e-04 ***
nodefactor.Race.Other  -2.9618     1.0372      0  -2.856 0.004296 ** 
nodefactor.Race.White  -0.8488     0.2958      0  -2.869 0.004112 ** 
nodematch.Race.Black      -Inf     0.0000      0    -Inf  < 1e-04 ***
nodematch.Race.Hisp     0.6912     0.3451      0   2.003 0.045153 *  
nodematch.Race.NatAm    1.2482     0.3550      0   3.517 0.000437 ***
nodematch.Race.Other      -Inf     0.0000      0    -Inf  < 1e-04 ***
nodematch.Race.White    0.3140     0.6405      0   0.490 0.623947    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

     Null Deviance: 28987  on 20910  degrees of freedom
 Residual Deviance:  1798  on 20889  degrees of freedom

AIC: 1836  BIC: 1987  (Smaller is better. MC Std. Err. = 0)

 Warning: The following terms have infinite coefficient estimates:
  nodematch.Race.Black nodematch.Race.Other 
ergmTerm?nodematch
table(mesa %v% 'Race') # Frequencies of race

Black  Hisp NatAm Other White 
    6   109    68     4    18 
mixingmatrix(mesa, "Race")
      Black Hisp NatAm Other White
Black     0    8    13     0     5
Hisp      8   53    41     1    22
NatAm    13   41    46     0    10
Other     0    1     0     0     0
White     5   22    10     0     4
Note:  Marginal totals can be misleading for undirected mixing matrices.
summary(mesa ~edges  + 
          nodefactor('Grade') + nodematch('Grade',diff=T) +
          nodefactor('Race') + nodematch('Race',diff=T))
                edges    nodefactor.Grade.8    nodefactor.Grade.9 
                  203                    75                    65 
  nodefactor.Grade.10   nodefactor.Grade.11   nodefactor.Grade.12 
                   36                    49                    28 
    nodematch.Grade.7     nodematch.Grade.8     nodematch.Grade.9 
                   75                    33                    23 
   nodematch.Grade.10    nodematch.Grade.11    nodematch.Grade.12 
                    9                    17                     6 
 nodefactor.Race.Hisp nodefactor.Race.NatAm nodefactor.Race.Other 
                  178                   156                     1 
nodefactor.Race.White  nodematch.Race.Black   nodematch.Race.Hisp 
                   45                     0                    53 
 nodematch.Race.NatAm  nodematch.Race.Other  nodematch.Race.White 
                   46                     0                     4 
set.seed(2)
data(samplk) # directed data: Sampson's Monks
ls() 
 [1] "coords"         "faux.mesa.high" "fauxmodel.01"   "flobusiness"   
 [5] "flomarriage"    "flomodel.01"    "flomodel.02"    "flomodel.03"   
 [9] "mesa"           "samplk1"        "samplk2"        "samplk3"       
[13] "wealth"        
samplk3
 Network attributes:
  vertices = 18 
  directed = TRUE 
  hyper = FALSE 
  loops = FALSE 
  multiple = FALSE 
  bipartite = FALSE 
  total edges= 56 
    missing edges= 0 
    non-missing edges= 56 

 Vertex attribute names: 
    cloisterville group vertex.names 

No edge attributes
plot(samplk3)
summary(samplk3~edges+mutual)
 edges mutual 
    56     15 
set.seed(3)
sampmodel.01 <- ergm(samplk3~edges+mutual)
summary(sampmodel.01)
Call:
ergm(formula = samplk3 ~ edges + mutual)

Monte Carlo Maximum Likelihood Results:

       Estimate Std. Error MCMC % z value Pr(>|z|)    
edges   -2.1606     0.2177      0  -9.925   <1e-04 ***
mutual   2.3152     0.4826      0   4.797   <1e-04 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

     Null Deviance: 424.2  on 306  degrees of freedom
 Residual Deviance: 266.9  on 304  degrees of freedom

AIC: 270.9  BIC: 278.3  (Smaller is better. MC Std. Err. = 0.3281)
set.seed(4)
missnet <- network.initialize(10,directed=F) # initialize an empty net with 10 nodes
missnet[1,2] <- missnet[2,7] <- missnet[3,6] <- 1 # add a few ties
missnet[4,6] <- missnet[4,9] <- missnet[5,6] <- NA # mark a few dyads missing
summary(missnet)
Network attributes:
  vertices = 10
  directed = FALSE
  hyper = FALSE
  loops = FALSE
  multiple = FALSE
  bipartite = FALSE
 total edges = 6 
   missing edges = 3 
   non-missing edges = 3 
 density = 0.06666667 

Vertex attributes:
  vertex.names:
   character valued attribute
   10 valid vertex names

No edge attributes

Network adjacency matrix:
   1 2 3  4  5  6 7 8  9 10
1  0 1 0  0  0  0 0 0  0  0
2  1 0 0  0  0  0 1 0  0  0
3  0 0 0  0  0  1 0 0  0  0
4  0 0 0  0  0 NA 0 0 NA  0
5  0 0 0  0  0 NA 0 0  0  0
6  0 0 1 NA NA  0 0 0  0  0
7  0 1 0  0  0  0 0 0  0  0
8  0 0 0  0  0  0 0 0  0  0
9  0 0 0 NA  0  0 0 0  0  0
10 0 0 0  0  0  0 0 0  0  0
# plot missnet with missing dyads colored red. 
tempnet <- missnet
tempnet[4,6] <- tempnet[4,9] <- tempnet[5,6] <- 1
missnetmat <- as.matrix(missnet)
missnetmat[is.na(missnetmat)] <- 2
plot(tempnet,label = network.vertex.names(tempnet),
     edge.col = missnetmat)
# fit an ergm to the network with missing data identified
summary(missnet~edges)
edges 
    3 
summary(ergm(missnet~edges))
Starting maximum pseudolikelihood estimation (MPLE):
Obtaining the responsible dyads.
Evaluating the predictor and response matrix.
Maximizing the pseudolikelihood.
Finished MPLE.
Evaluating log-likelihood at the estimate. 
Call:
ergm(formula = missnet ~ edges)

Maximum Likelihood Results:

      Estimate Std. Error MCMC % z value Pr(>|z|)    
edges  -2.5649     0.5991      0  -4.281   <1e-04 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

     Null Deviance: 58.22  on 42  degrees of freedom
 Residual Deviance: 21.61  on 41  degrees of freedom

AIC: 23.61  BIC: 25.35  (Smaller is better. MC Std. Err. = 0)
missnet_bad <- missnet # create network with missing dyads set to 0
missnet_bad[4,6] <- missnet_bad[4,9] <- missnet_bad[5,6] <- 0

# fit an ergm to the network with missing dyads set to 0
summary(missnet_bad)
Network attributes:
  vertices = 10
  directed = FALSE
  hyper = FALSE
  loops = FALSE
  multiple = FALSE
  bipartite = FALSE
 total edges = 3 
   missing edges = 0 
   non-missing edges = 3 
 density = 0.06666667 

Vertex attributes:
  vertex.names:
   character valued attribute
   10 valid vertex names

No edge attributes

Network adjacency matrix:
   1 2 3 4 5 6 7 8 9 10
1  0 1 0 0 0 0 0 0 0  0
2  1 0 0 0 0 0 1 0 0  0
3  0 0 0 0 0 1 0 0 0  0
4  0 0 0 0 0 0 0 0 0  0
5  0 0 0 0 0 0 0 0 0  0
6  0 0 1 0 0 0 0 0 0  0
7  0 1 0 0 0 0 0 0 0  0
8  0 0 0 0 0 0 0 0 0  0
9  0 0 0 0 0 0 0 0 0  0
10 0 0 0 0 0 0 0 0 0  0
summary(ergm(missnet_bad~edges))
Starting maximum pseudolikelihood estimation (MPLE):
Obtaining the responsible dyads.
Evaluating the predictor and response matrix.
Maximizing the pseudolikelihood.
Finished MPLE.
Evaluating log-likelihood at the estimate. 
Call:
ergm(formula = missnet_bad ~ edges)

Maximum Likelihood Results:

      Estimate Std. Error MCMC % z value Pr(>|z|)    
edges  -2.6391     0.5976      0  -4.416   <1e-04 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

     Null Deviance: 62.38  on 45  degrees of freedom
 Residual Deviance: 22.04  on 44  degrees of freedom

AIC: 24.04  BIC: 25.85  (Smaller is better. MC Std. Err. = 0)
set.seed(314159)
summary(flobusiness~edges+degree(1))
  edges degree1 
     15       3 
fit <- ergm(flobusiness~edges+degree(1))
summary(fit)
Call:
ergm(formula = flobusiness ~ edges + degree(1))

Monte Carlo Maximum Likelihood Results:

        Estimate Std. Error MCMC % z value Pr(>|z|)    
edges    -2.1161     0.2874      0  -7.363   <1e-04 ***
degree1  -0.6076     0.7220      0  -0.842      0.4    
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

     Null Deviance: 166.36  on 120  degrees of freedom
 Residual Deviance:  89.37  on 118  degrees of freedom

AIC: 93.37  BIC: 98.95  (Smaller is better. MC Std. Err. = 0.03104)
mcmc.diagnostics(fit)
Sample statistics summary:

Iterations = 14336:262144
Thinning interval = 1024 
Number of chains = 1 
Sample size per chain = 243 

1. Empirical mean and standard deviation for each variable,
   plus standard error of the mean:

           Mean    SD Naive SE Time-series SE
edges   -0.1523 3.908   0.2507        0.21354
degree1 -0.2058 1.556   0.0998        0.09053

2. Quantiles for each variable:

         2.5% 25% 50% 75% 97.5%
edges   -7.95  -3   0   3     7
degree1 -3.00  -1   0   1     3

Are sample statistics significantly different from observed?
                edges    degree1     (Omni)
diff.      -0.1522634 -0.2057613         NA
test stat. -0.7130408 -2.2728669 7.19056384
P-val.      0.4758205  0.0230342 0.02904169

Sample statistics cross-correlations:
             edges    degree1
edges    1.0000000 -0.4483514
degree1 -0.4483514  1.0000000

Sample statistics auto-correlation:
Chain 1 
               edges     degree1
Lag 0     1.00000000  1.00000000
Lag 1024 -0.16100203 -0.09923759
Lag 2048 -0.03448682 -0.03578759
Lag 3072 -0.03232265 -0.06282576
Lag 4096  0.13234288  0.12661492
Lag 5120  0.03208760  0.02686442

Sample statistics burn-in diagnostic (Geweke):
Chain 1 

Fraction in 1st window = 0.1
Fraction in 2nd window = 0.5 

    edges   degree1 
-1.357457 -1.308064 

Individual P-values (lower = worse):
    edges   degree1 
0.1746359 0.1908517 
Joint P-value (lower = worse):  0.03956785 

Note: MCMC diagnostics shown here are from the last round of
  simulation, prior to computation of final parameter estimates.
  Because the final estimates are refinements of those used for this
  simulation run, these diagnostics may understate model performance.
  To directly assess the performance of the final model on in-model
  statistics, please use the GOF command: gof(ergmFitObject,
  GOF=~model).
set.seed(271828)
fit.1step <- ergm(flobusiness~edges+degree(1),
            control=snctrl(MCMC.interval=1))
set.seed(101)
flomodel.03.sim <- simulate(flomodel.03,nsim=10)
class(flomodel.03.sim) # Reveal the class of the object created
[1] "network.list"
summary(flomodel.03.sim) # quick summary of a network.list object
List of  10  Networks
Model: flomarriage ~ edges + nodecov("wealth") 
Reference: ~Bernoulli 
Constraints: ~. ~. - observed 
Stored network statistics:
      edges nodecov.wealth
 [1,]    26           2998
 [2,]    23           2311
 [3,]    22           2121
 [4,]    28           3127
 [5,]    29           3189
 [6,]    20           2423
 [7,]    36           3772
 [8,]    23           2570
 [9,]    31           3364
[10,]    23           2376
attr(,"monitored")
[1] FALSE FALSE
List of  10  Networks
Model: flomarriage ~ edges + nodecov("wealth") 
Reference: ~Bernoulli 
Constraints: ~. ~. - observed 
attributes(flomodel.03.sim) # Reveal the various attributes of this network.list
$coefficients
         edges nodecov.wealth 
   -2.59492903     0.01054591 

$control
Control parameter list generated by 'control.simulate.formula' or equivalent. Non-empty parameters:
MCMC.burnin: 16384
MCMC.interval: 1024
MCMC.scale: 1
MCMC.prop: ~sparse + .triadic
MCMC.prop.weights: "default"
MCMC.batch: 0
MCMC.effectiveSize.damp: 10
MCMC.effectiveSize.maxruns: 1000
MCMC.effectiveSize.burnin.pval: 0.2
MCMC.effectiveSize.burnin.min: 0.05
MCMC.effectiveSize.burnin.max: 0.5
MCMC.effectiveSize.burnin.nmin: 16
MCMC.effectiveSize.burnin.nmax: 128
MCMC.effectiveSize.burnin.PC: FALSE
MCMC.effectiveSize.burnin.scl: 1024
MCMC.maxedges: Inf
MCMC.runtime.traceplot: FALSE
network.output: "network"
parallel: 0
parallel.version.check: TRUE
parallel.inherit.MT: FALSE
MCMC.samplesize: 10
obs.MCMC.mul: 0.25
obs.MCMC.samplesize.mul: 0.5
obs.MCMC.interval.mul: 0.5
obs.MCMC.burnin.mul: 0.5
obs.MCMC.prop: ~sparse + .triadic
obs.MCMC.prop.weights: "default"
MCMC.save_networks: TRUE

$response
[1] NA

$class
[1] "network.list"

$stats
      edges nodecov.wealth
 [1,]    26           2998
 [2,]    23           2311
 [3,]    22           2121
 [4,]    28           3127
 [5,]    29           3189
 [6,]    20           2423
 [7,]    36           3772
 [8,]    23           2570
 [9,]    31           3364
[10,]    23           2376
attr(,"monitored")
[1] FALSE FALSE

$formula
flomarriage ~ edges + nodecov("wealth")
attr(,".Basis")
 Network attributes:
  vertices = 16 
  directed = FALSE 
  hyper = FALSE 
  loops = FALSE 
  multiple = FALSE 
  bipartite = FALSE 
  total edges= 20 
    missing edges= 0 
    non-missing edges= 20 

 Vertex attribute names: 
    priorates totalties vertex.names wealth 

No edge attributes

$constraints
$constraints[[1]]
~.
<environment: base>

$constraints[[2]]
~. - observed
<environment: base>

$reference
~Bernoulli
<environment: base>
rbind("obs"=summary(flomarriage~edges+nodecov("wealth")),
      "sim mean"=colMeans(attr(flomodel.03.sim, "stats"))) 
         edges nodecov.wealth
obs       20.0         2168.0
sim mean  26.1         2825.1
# we can also plot individual simulations
flomodel.03.sim[[7]]
 Network attributes:
  vertices = 16 
  directed = FALSE 
  hyper = FALSE 
  loops = FALSE 
  multiple = FALSE 
  bipartite = FALSE 
  total edges= 36 
    missing edges= 0 
    non-missing edges= 36 

 Vertex attribute names: 
    priorates totalties vertex.names wealth 

No edge attributes
plot(flomodel.03.sim[[7]], 
     label= flomodel.03.sim[[7]] %v% "vertex.names",
     label.cex = 0.5,
     vertex.cex = (flomodel.03.sim[[7]] %v% "wealth")/25)
set.seed(54321) # The gof function uses random values
flomodel.03.gof <- gof(flomodel.03)
flomodel.03.gof

Goodness-of-fit for degree 

         obs min mean max MC p-value
degree0    1   0 1.20   5       1.00
degree1    4   0 3.64   8       1.00
degree2    2   0 3.98   9       0.44
degree3    6   0 3.43   7       0.20
degree4    2   0 1.86   7       1.00
degree5    0   0 1.03   5       0.68
degree6    1   0 0.48   4       0.70
degree7    0   0 0.24   2       1.00
degree8    0   0 0.11   1       1.00
degree9    0   0 0.02   1       1.00
degree10   0   0 0.01   1       1.00

Goodness-of-fit for edgewise shared partner 

     obs min  mean max MC p-value
esp0  12   5 12.65  19       0.86
esp1   7   0  5.49  15       0.72
esp2   1   0  1.71   8       1.00
esp3   0   0  0.22   5       1.00
esp4   0   0  0.03   2       1.00

Goodness-of-fit for minimum geodesic distance 

    obs min  mean max MC p-value
1    20  13 20.10  37       1.00
2    35  17 35.34  67       1.00
3    32  11 27.79  41       0.58
4    15   2 12.20  26       0.76
5     3   0  3.68  13       0.94
6     0   0  0.88  11       1.00
7     0   0  0.19   8       1.00
8     0   0  0.03   2       1.00
Inf  15   0 19.79  65       1.00

Goodness-of-fit for model statistics 

                obs  min    mean  max MC p-value
edges            20   13   20.10   37        1.0
nodecov.wealth 2168 1287 2201.89 3467        0.9
plot(flomodel.03.gof)
set.seed(12345)
mesamodel.02 <- ergm(mesa~edges)
Starting maximum pseudolikelihood estimation (MPLE):
Obtaining the responsible dyads.
Evaluating the predictor and response matrix.
Maximizing the pseudolikelihood.
Finished MPLE.
Evaluating log-likelihood at the estimate. 
mesamodel.02.gof <- gof(mesamodel.02~degree + esp + distance, 
                        control = snctrl(nsim=10))
Warning in gof.formula(object = object$formula, coef = coef, GOF = GOF, : No
parameter values given, using 0.
plot(mesamodel.02.gof)
set.seed(10)
data('faux.magnolia.high')
magnolia <- faux.magnolia.high
magnolia
 Network attributes:
  vertices = 1461 
  directed = FALSE 
  hyper = FALSE 
  loops = FALSE 
  multiple = FALSE 
  bipartite = FALSE 
  total edges= 974 
    missing edges= 0 
    non-missing edges= 974 

 Vertex attribute names: 
    Grade Race Sex vertex.names 

 Edge attribute names not shown 
plot(magnolia, vertex.cex=.5)
summary(magnolia~edges+triangle) # Simple model for triad closure
   edges triangle 
     974      169 
set.seed(100)
fit <- ergm(magnolia~edges+triangle,
            control=snctrl(MCMLE.effectiveSize=NULL))
Starting maximum pseudolikelihood estimation (MPLE):
Evaluating the predictor and response matrix.
Maximizing the pseudolikelihood.
Finished MPLE.
Starting Monte Carlo maximum likelihood estimation (MCMLE):
...
Iteration 4 of at most 60:
Optimizing with step length 0.3963.
The log-likelihood improved by 1.1568.
Estimating equations are not within tolerance region.
Iteration 5 of at most 60:
Error in ergm.MCMLE(init, nw, model, initialfit = (initialfit <- NULL),  : 
  Number of edges in a simulated network exceeds that in the observed by a factor of more than 20. This is a strong indicator of model degeneracy or a very poor starting parameter configuration. If you are reasonably certain that neither of these is the case, increase the MCMLE.density.guard control.ergm() parameter.
set.seed(1000)
fit <- ergm(
  magnolia ~ edges + triangle,
  control = snctrl(
    MCMLE.maxit = 2,
    MCMLE.effectiveSize = NULL,
    MCMLE.density.guard.min = 10^6
  )
)
Starting maximum pseudolikelihood estimation (MPLE):
Obtaining the responsible dyads.
Evaluating the predictor and response matrix.
Maximizing the pseudolikelihood.
Finished MPLE.
Starting Monte Carlo maximum likelihood estimation (MCMLE):
Iteration 1 of at most 2:
1 Optimizing with step length 0.0300.
The log-likelihood improved by 3.4201.
Estimating equations are not within tolerance region.
Iteration 2 of at most 2:
1 Optimizing with step length 0.9502.
The log-likelihood improved by 0.9473.
Estimating equations are not within tolerance region.
MCMLE estimation did not converge after 2 iterations. The estimated coefficients may not be accurate. Estimation may be resumed by passing the coefficients as initial values; see 'init' under ?control.ergm for details.
Finished MCMLE.
Evaluating log-likelihood at the estimate. Fitting the dyad-independent submodel...
Bridging between the dyad-independent submodel and the full model...
Setting up bridge sampling...
Using 16 bridges: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 .
Bridging finished.

This model was fit using MCMC.  To examine model diagnostics and check
for degeneracy, use the mcmc.diagnostics() function.
mcmc.diagnostics(fit)
set.seed(10101)
fit <- ergm(magnolia~edges+gwesp(0.25, fixed=T), 
            control=snctrl(MCMC.interval = 10000),
            verbose=T)
Evaluating network in model.
Initializing unconstrained Metropolis-Hastings proposal: ‘ergm:MH_TNT’.
Initializing model...
Model initialized.
Using initial method 'MPLE'.
Fitting initial model.
Starting maximum pseudolikelihood estimation (MPLE):
Evaluating the predictor and response matrix.
Maximizing the pseudolikelihood.
Finished MPLE.
Starting Monte Carlo maximum likelihood estimation (MCMLE):

 ... (output snipped)

Bridging finished.
This model was fit using MCMC.  To examine model diagnostics and check for degeneracy, use the mcmc.diagnostics() function.
mcmc.diagnostics(fit)
Sample statistics summary:

Iterations = 72500:1425000
Thinning interval = 2500 
Number of chains = 1 
Sample size per chain = 542 

1. Empirical mean and standard deviation for each variable,
   plus standard error of the mean:

                   Mean    SD Naive SE Time-series SE
edges            -8.450 41.20    1.770          3.851
gwesp.fixed.0.25 -6.509 35.27    1.515          3.202

2. Quantiles for each variable:

                   2.5%    25%    50%  75% 97.5%
edges            -84.47 -37.75 -7.500 18.0 74.00
gwesp.fixed.0.25 -78.86 -29.54 -4.699 15.7 61.95

Are sample statistics significantly different from observed?
                 edges gwesp.fixed.0.25     (Omni)
diff.      -8.45018450      -6.50904126         NA
test stat. -2.19417566      -2.03297788 4.68731815
P-val.      0.02822278       0.04205476 0.09964556

Sample statistics cross-correlations:
                     edges gwesp.fixed.0.25
edges            1.0000000        0.8123595
gwesp.fixed.0.25 0.8123595        1.0000000

Sample statistics auto-correlation:
Chain 1 
               edges gwesp.fixed.0.25
Lag 0     1.00000000       1.00000000
Lag 2500  0.65081600       0.56300040
Lag 5000  0.41773366       0.38460119
Lag 7500  0.28184861       0.26302064
Lag 10000 0.19815660       0.21016863
Lag 12500 0.07557243       0.08598698

Sample statistics burn-in diagnostic (Geweke):
Chain 1 

Fraction in 1st window = 0.1
Fraction in 2nd window = 0.5 

           edges gwesp.fixed.0.25 
      0.87041540      -0.05755468 

Individual P-values (lower = worse):
           edges gwesp.fixed.0.25 
       0.3840734        0.9541033 
Joint P-value (lower = worse):  0.02361807 

Note: MCMC diagnostics shown here are from the last round of
  simulation, prior to computation of final parameter estimates.
  Because the final estimates are refinements of those used for this
  simulation run, these diagnostics may understate model performance.
  To directly assess the performance of the final model on in-model
  statistics, please use the GOF command: gof(ergmFitObject,
  GOF=~model).

Exponential Random Graph Models (ERGMs) using statnet

Statnet Development Team

The Statnet Project

Introduction to this workshop/tutorial.

Prerequisites

Software installation

1. Statistical network modeling with ERGMs

The general form for an ERGM

The model statistics \(g(y)\): ERGM terms

Help for ERGM terms

ERGM probabilities: at the tie level

Loading network data

The `ergm` and `summary` functions

Some simple models

A Bernoulli (“Erdős/Rényi”) model

Triad formation

Nodal covariates: effects on mean degree

Nodal covariates: Homophily

Directed ties

2. Missing data

3. Model terms available for ergm estimation and simulation

Terms provided with ergm

Coding new ergm-terms

4. Assessing convergence for dyad dependent models: MCMC Diagnostics

What it looks like when a model converges properly

5. Network simulation: the simulate command and network.list objects

6. Examining the quality of model fit — GOF

7. Diagnostics: troubleshooting and checking for model degeneracy

What it looks like when a model fails

8. Working with egocentrically sampled network data

9. Additional functionality in statnet and other packages

Additional functionality in base `ergm`

Extensions by other developers

Appendix A: Clarifying the terms “ergm” and “network”

References

Exponential Random Graph Models (ERGMs) using statnet

Statnet Development Team

The Statnet Project

Introduction to this workshop/tutorial.

Prerequisites

Software installation

1. Statistical network modeling with ERGMs

The general form for an ERGM

The model statistics \(g(y)\): ERGM terms

Help for ERGM terms

ERGM probabilities: at the tie level

Loading network data

The ergm and summary functions

Some simple models

A Bernoulli (“Erdős/Rényi”) model

Triad formation

Nodal covariates: effects on mean degree

Nodal covariates: Homophily

Directed ties

2. Missing data

3. Model terms available for ergm estimation and simulation

Terms provided with ergm

Coding new ergm-terms

4. Assessing convergence for dyad dependent models: MCMC Diagnostics

What it looks like when a model converges properly

5. Network simulation: the simulate command and network.list objects

6. Examining the quality of model fit — GOF

7. Diagnostics: troubleshooting and checking for model degeneracy

What it looks like when a model fails

8. Working with egocentrically sampled network data

9. Additional functionality in statnet and other packages

Additional functionality in base ergm

Extensions by other developers

Appendix A: Clarifying the terms “ergm” and “network”

References

The `ergm` and `summary` functions

Additional functionality in base `ergm`