Skip to content

gn0/tomltable

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

68 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

tomltable: generate LaTeX tables from TOML specifications and JSON data

This Python package provides a command-line utility called tomltable. This utility generates a LaTeX table from a TOML formatted table specification (read from stdin) and a set of JSON files (specified as arguments).

tomltable can accommodate JSON files regardless of how they are structured. However, its specification language also provides a convenience shortcut for regression tables. There are two related packages that save regression results in a structure that is compatible with tomltable:

  1. jsonwriter for R and
  2. json_this for Stata.

The spiritual ancestor of this package is coeftable which performed a similar task but without the TOML-based specification language.

Installation

tomltable is available through PyPI so you can install it with pip:

$ python3 -m pip install tomltable

You can also install it directly from the Codeberg Git repository by typing either

$ python3 -m pip install --user git+https://github.com/gn0/tomltable

or

$ git clone https://github.com/gn0/tomltable
$ python3 -m pip install --user ./tomltable

Usage

Generating a regression table

We will generate the following table:

Preview of example_mag.tex

To start with, it is convenient to save regression results with jsonwriter or json_this. An example that uses the former in R:

# example_mag.R

install.packages("devtools")
devtools::install_git(url = "https://github.com/gn0/jsonwriter")

library(fixest)
library(jsonwriter)

data(quakes)

# Regression on all earthquakes.
#
write_json(
    feols(depth ~ mag, quakes),
    "example_model_1.json")

# Regression on earthquakes that were reported by a below-median number
# of stations.
#
write_json(
    feols(depth ~ mag, quakes[quakes$stations <= 27, ]),
    "example_model_2.json")

# Regression on earthquakes that were reported by an above-median number
# of stations.
#
write_json(
    feols(depth ~ mag, quakes[quakes$stations > 27, ]),
    "example_model_3.json")

Let's specify a table with three columns, one for each JSON file that is generated by the R script above. We add column numbers and the name of the outcome variable to the header, and the R-squared along with the number of observations to the footer:

# example_mag.toml

[header]
add-column-numbers = true

[[header.row]]
cell = ["Depth", "Depth", "Depth"]

[[body.cell]]
label = "Magnitude"
coef = "mag"

[[footer.cell]]
label = "Observations"
cell = "%(n::nobs)d"

[[footer.cell]]
label = "$R^2$"
cell = "%(n::r_squared).03f"

[[footer.row]]
label = "Sample"
cell = ["full", 'stations $\leq 27$', "stations $> 27$"]

Now we can run the R script and generate the regression table with tomltable:

$ Rscript example_mag.R
$ cat example_mag.toml \
    | tomltable \
        -j example_model_1.json \
        -j example_model_2.json \
        -j example_model_3.json \
        --title "Earthquake depth and magnitude" \
        --label tab:quakes \
        --human-readable-numbers \
    > example_mag.tex

Only generating a template

Use the --only-table option if you only want to generate the template, not the final table:

$ cat example_mag.toml \
    | tomltable \
        -j example_model_1.json \
        -j example_model_2.json \
        -j example_model_3.json \
        --title "Earthquake depth and magnitude" \
        --label tab:quakes \
        --only-template \
    > example_mag.tmpl

Note that the --human-readable-numbers option is dropped. This option is only used for generating the final table.

Using a template to generate a table

Use the --from-template option if you want to use a pre-specified template instead of generating the template from a TOML specification:

$ cat example_mag.tmpl \
    | tomltable \
        -j example_model_1.json \
        -j example_model_2.json \
        -j example_model_3.json \
        --from-template \
        --human-readable-numbers \
    > example_mag.tex

Note that the --title and the --label options are dropped. These options are only used for template generation.

Generating a regression table with column-specific coefficients

We will generate the following table:

Preview of example_mag_squared.tex

In the above table, the coefficient for Magnitude squared is only present in column (2). Because it is missing in the JSON file for column (1), we will need to call tomltable with the --ignore-missing-keys option.

To make the example complete, the following R script generates the results for column (2):

# example_mag_squared.R

library(fixest)
library(jsonwriter)

data(quakes)

write_json(
    feols(depth ~ mag + mag^2, quakes),
    "example_model_4.json")

The TOML specification for the table is similar to before. The most notable difference is in the new [[body.cell]] block for Magnitude squared:

# example_mag_squared.toml

[header]
add-column-numbers = true

[[header.row]]
cell = ["Depth", "Depth"]

[[body.cell]]
label = "Magnitude"
coef = "mag"

[[body.cell]]
label = "Magnitude squared"
coef = "I(mag^2)"

[[footer.cell]]
label = "$R^2$"
cell = "%(n::r_squared).03f"

[[footer.cell]]
label = "Observations"
cell = "%(n::nobs)d"

Running the R script and generating the table, tomltable gives us warning messages because Magnitude squared is missing for column (1). The missing elements in the table are simply omitted in the output, leaving some dangling text in the affected cells. To remove these, we use sed:

$ Rscript example_mag_squared.R
$ cat example_mag_squared.toml \
    | tomltable \
        -j example_model_1.json \
        -j example_model_4.json \
        --title "Earthquake depth and magnitude" \
        --label tab:quakes \
        --human-readable-numbers \
        --ignore-missing-keys \
    | sed \
        -e "s/ & \$\$/ \& /" \
        -e "s/ & ()/ \& /" \
    > example_mag_squared.tex
warning: Specifier '%(1::coef::I(mag^2)::est).03f' refers to key '1::coef::I(mag^2)::est' but this key is not in the JSON object.
warning: Specifier '%(1::coef::I(mag^2)::stars)s' refers to key '1::coef::I(mag^2)::stars' but this key is not in the JSON object.
warning: Specifier '%(1::coef::I(mag^2)::se).04f' refers to key '1::coef::I(mag^2)::se' but this key is not in the JSON object.

Notice that to generate this table, we only had to estimate one new regression specification, the one whose results we saved in example_model_4.json for column (2). For column (1), we reused the results that we already had in example_model_1.json.

Author

Gabor Nyeki. Contact information is on https://www.gabornyeki.com/.

About

Generate LaTeX tables from TOML specifications and JSON data

Topics

Resources

Stars

Watchers

Forks