The artificial directory holds artificially generated test data, often to explicitly test corner cases of various file formats. The files there usually come in families whose names have a common prefix, such as three files
foo.bar.commentary.txt all prefixed by
foo.bar. The file whose name is that prefix is usually the canonical (typically hand-crafted) file, and the other files are then derived from that. Outside of the artificial directory, the other files in this directory are typically real world examples of various file formats, or deriviations of them.
The non-ascii directory holds trivial UTF-8 (but not ASCII) text files.
archive.* archives a subset of other files in this directory.
animated-red-blue.gif is an original animation by Nigel Tao email@example.com.
australian-abc-local-stations.json was crawled from data.gov.au.
bricks-* are various encodings of an original photo by Nigel Tao firstname.lastname@example.org.
cbor-rfc-7049-examples.cbor is the concatenated examples from RFC 7049. The
cbor-rfc-7049-examples.*.json files were then generated by
crude-flag.* is an original animation by Nigel Tao email@example.com. See the
file-sizes.json was created by running
script/print-file-sizes-json.go in this repository's root directory.
gifplayer-muybridge.gif is an original animation by Nigel Tao firstname.lastname@example.org.
harvesters.* are various encodings of a photo of “The Harvesters” by Pieter Bruegel the Elder, held by the Metropolitan Museum of Art. www.metmuseum.org lists that image as in the public domain.
github-tags.json was crawled from api.github.com.
hat.* are various encodings of a photo of “Self-Portrait with a Straw Hat (obverse: The Potato Peeler)” by Vincent van Gogh, held by the Metropolitan Museum of art. www.metmuseum.org lists that image as in the public domain.
hello.sh is a trivial shell script.
hibiscus.regular.* are various encodings of a photo of “Hibiscus and Parrots” by Louis Comfort Tiffany, held by the Metropolitan Museum of Art. www.metmuseum.org lists that image as in the public domain.
hibiscus.primitive.* is the result of running the
hibiscus.regular.png image through
github.com/fogleman/primitive and other programs:
primitive -i hibiscus.regular.png -o hibiscus.primitive.svg -s 442 -n 32
inkscape -z -e hibiscus.primitive.png hibiscus.primitive.svg
hippopotamus.* are various encodings of a cropping of a photo of “Hippopotamus (William)”, held by the Metropolitan Museum of Art. www.metmuseum.org lists that image as in the public domain.
www.metmuseum.org says that “You are welcome to use images of artworks in The Met collection that the Museum believes to be in the public domain, or those to which the Museum waives any copyright it might have, for any purpose, including commercial and noncommercial use, free of charge and without requiring permission from the Museum.”
json-things.* are original JSON objects by Nigel Tao email@example.com.
midsummer.txt is an excerpt of Shakespeare‘s "A Midsummer Night’s Dream", copied from shakespeare.mit.edu.
muybridge.gif is derived from en.wikipedia.org which is in the public domain.
nobel-prizes.json was crawled from api.nobelprize.org.
pi.txt contains the digits of pi.
pjw-thumbnail.* are various encodings of an image derived from an iconic, original photo of Peter J. Weinberger by Rob Pike firstname.lastname@example.org.
rgb24png.bmp comes from BMP Suite, which states that its generated images “are in the public domain”.
romeo.txt is an excerpt of Shakespeare's “Romeo and Juliet”, copied from shakespeare.mit.edu.
romeo.txt.fixed-huff.deflate was derived from
romeo.txt by a custom program to use fixed (not dynamic) Huffman tables for the deflate encoding.
sheep-more.rac is a RAC-compression of original text by Nigel Tao email@example.com.