AIP structure¶
This page documents the structure of the AIP produced by Archivematica.
Name¶
The AIP name is composed of the following:
1. Either the name of the original transfer if no new name has been assigned to the SIP upon formation or the name of the SIP or SIPs created from the transfer and
- a UUID assigned during SIP formation
example: Pictures_of_my_cat-aebbfc44-9f2e-4351-bcfb-bb80d4914112
“Pictures_of_my_cat” is the name assigned by the user and “aebbfc44-9f2e-4351-bcfb-bb80d4914112” is the UUID generated during SIP formation.
Directory structure¶
The AIP is zipped in the AIPsStore. The AIP directories are broken down into UUID quad directories for efficient storage and retrieval.
Note
UUID quad directories: Some file systems limit the number of items allowed in a directory, so Archivematica uses a directory tree structure to store AIPs. The tree is based on the AIP UUIDs. The UUID is broken down into manageable 4 character pieces, or “UUID quads”, each quad representing a directory. The first four characters (UUID quad) of the AIP UUID will compose a sub directory of the AIP storage. The second UUID quad will be the name of a sub directory of the first, and so on and so forth, until the last four characters (last UUID Quad) create the leaf of the AIP store directory tree, and the AIP with that UUID resides in that directory.)
Bagit documentation¶
The AIP is packaged in accordance with the Library of Congress Bagit specification (PDF, 84KB) As shown below, the BagIt files are bag-info.txt, bagit.txt, manifest-sha512.txt and tagmanifest-md5.txt:
The following describes the contents of the AIP once extracted:
Data directory
The data directory consists of the METS file for the AIP and three folders: logs, objects. and thumbnails.
METS file
/data/METS.uuid.xml
contains the full PREMIS implementation (see PREMIS
metadata for original file, PREMIS metadata: normalized files, PREMIS
metadata: events, and PREMIS metadata: rights The role of the METS file is to
link original objects to their preservation copies and to their descriptions
and submission documentation, as well as to link PREMIS metadata to the
objects in the AIP.
Logs
/data/logs
contains the /transfers directory, normalization log, malware scan
log, and the extraction log (from unpackaging packages) generated during SIP
creation.
The /transfers
directory contains the logs from processing that occurred
to each transfer which is part of the SIP in the transfer workflow in the
dashboard.
Objects
/data/objects
contains original objects, normalized objects, /metadata
and
/submissionDocumentation
. If there were any lower level directories within
the SIP, that directory structure is maintained.
/metadata
contains /transfers
, which contains any metadata which may have
been imported with the transfers
/submissionDocumentation
contains submission documentation for each
transfer which is part of the SIP and each transfer’s METS.xml file. The
structmap for the transfer is the closest approximation of original order
for the transfer, along with the tree diagram if the user chose to create one
during transfer.
Thumbnails
/data/thumbnails
contains any thumbnails generated for viewing in the AIP
search interface of the dashboard.