Installing Archivematica¶
On this page
- Overview
- Technical requirements
- Instructions for new installations
- Instructions for upgrades
- Advanced installation options
Overview¶
The following table lists various ways to install and use Archivematica. Further down the page, you will find more detailed information about new installations and advanced installation options.
If you need assistance or clarification regarding the installation instructions, the Archivematica user forum is a good place ask questions.
New installations for production deployment | ||
---|---|---|
Ubuntu 22.04 64-bit Server Edition | Automated install using Ansible Ubuntu 22.04 (Jammy) | These instructions will not work if you are using a Windows machine as the host environment. |
Rocky Linux 9 x86-64 | Installing Archivematica on Rocky Linux 9 (Blue Onyx) | Archivematica versions 1.15.0 and higher support installation on Rocky Linux 9. |
Windows OS | Not supported |
Test-driving the latest release | ||
---|---|---|
Archivematica sandbox | Using the sandbox | The sandbox will automatically reset daily. Any packages created will not be permanently saved. Additionally, there may be more than one demo user logged in at the same time, so you may see changes made by others while using the software. |
Local virtual machine using Vagrant and VirtualBox | Installing on a virtual machine using Vagrant | This virtual machine is not intended to be used in production. |
Development environments | ||
---|---|---|
Docker and Linux | Archivematica development on Docker Compose | Docker will provide instructions on how to use it as a non-root user. This may not be desirable for all. |
Spreading Archivematica’s processing load across several machines | ||
---|---|---|
Base on your customized setup and network configuration | Scaling Archivematica | When installing Archivematica on multiple machines, the various Archivematica processes must be able to reach each other on the relevant ports. Your firewall configuration must allow for this. |
Technical requirements¶
Operating system¶
Archivematica 1.15.1 installation instructions are provided here for the following operating systems:
- Ubuntu 22.04 64-bit Server Edition
- Rocky Linux 9 x86-64
Other Linux distributions should work, but will require customization of these installation instructions.
Support for macOS is possibly in theory, but is not being tested, and would require more significant deviation from these instructions.
Archivematica is unlikely to ever run directly in a Windows environment. Consider the use of a virtualization platform to run Linux VMs.
Dependencies¶
Archivematica has a long list of software it depends on. All of these dependencies are installed when following the instructions below.
Note that it is possible to install some of the components on separate machines in order to improve performance, such as:
- MySQL
- Gearman
- Elasticsearch (optional as of Archivematica 1.7, see below)
Using additional machines will require additional configuration. For more information, see Advanced.
Note
Archivematica 1.15 has been tested with MySQL 8.0, including the Percona and MariaDB alternatives.
Some of the tools run by Archivematica require Java to be installed (primarily Elasticsearch and fits). On Ubuntu 22.04, Open JDK 8 is used, but Open JDK 11 is the default. It is possible to use Oracle Java 8 instead.
The remaining dependencies should be kept at the versions installed by Archivematica.
Elasticsearch¶
As of Archivematica 1.7, installing Elasticsearch is optional. Elasticsearch powers the indexes that are used for searching in the Backlog, Appraisal, and/or Archival Storage. Installing Archivematica without Elasticsearch results in reduced consumption of compute resources and lower operational complexity. Disabling Elasticsearch means that the Backlog, Appraisal, and/or Archival Storage tabs will not appear in the user interface and their functionality will not be available.
By setting the archivematica_src_search_enabled
configuration attribute, administrators can define how many things Elasticsearch
is indexing, if any. This can impact searching across several different
dashboard pages.
Possible archivematica_src_search_enabled
configuration attribute values:
transfers
: Only transfers are indexed. Search is enabled on the Backlog and Appraisal tabs, but not the Archival Storage tab.aips
: Only AIPs are indexed. Search is enabled on the Archival Storage tab, but not the Backlog or Appraisal tabs.aips,transfers
, ortrue
: Both AIPs and transfers are indexed. Search works on the Backlog, Appraisal, and Archival Storage tabs.false
: Indexless mode. Neither AIPs nor transfers are indexed. The Backlog, Appraisal, and Archival Storage tabs will be non-functional.
When Elasticsearch is used, Archivematica 1.15.1 requires version 6.x (tested with 6.8.21).
For more information on disabling Elasticsearch, please see the README for Archivematica’s ansible role,
Hardware¶
Archivematica is capable of running on almost any hardware supported by Linux; however, processing large collections will require better hardware.
Minimum hardware requirements¶
For small-scale functionality testing using small collections (transfers with 100 files or less, total file size 1 GB or smaller), we recommend the following minimum hardware requirements:
- Processor: 2 CPU cores
- Memory: 2GB+
- Disk space (processing): 7GB plus two to three times the disk space required for the collection being processed (e.g., 3GB to process a 1GB transfer)
Recommended minimum production requirements¶
For production processing, the hardware requirements depend almost entirely on the size and number of files being processed. These recommendations should be considered the minimum for a viable production system:
- Processor: 2 CPU cores
- Memory: 4GB
- Disk space (processing): 200GB
More commonly, we deploy the following:
- Processor: 8 CPU cores
- Memory: 16GB
For processing disk space, we recommend allocating 20GB plus four times the disk space required for the largest transfer that you expect to process. If your largest transfer is 50GB, allocation at least 220GBs of disk space.
The amount of transfer source disk space needed is subjective, and depends on individual workflows.
The amount of storage disk space needed will depend on how much material you intend to store, as well as how it is stored (compressed or uncompressed).
These requirements may not be suitable for certain types of material - for example, audio-visual material requires more processing power than images or documents.
Browser compatability¶
Archivematica has been tested most extensively with Firefox and Chrome. Minimal, but successful, testing has been done with Microsoft Edge.
Instructions for new installations¶
Archivematica can be installed using packages or Ansible scripts in either Rocky Linux/Red Hat or Ubuntu environments. At this time, installation instructions are provided for officially tested and supported installation environments:
- Automated install on Ubuntu 22.04 using Ansible.
- Manual install of OS packages on Rocky Linux 9/Red Hat
Note that manual install of OS packages on Ubuntu 22.04 is documented but not officially supported.
Installing Archivematica using Docker is not officially supported for production deployments. However, it is the preferred development environment for those who work on Archivematica’s code.
For more information about installation environments, please see the ansible-archivematica-src repo, the deploy-pub repo, and ask on the Archivematica user forum for more details.
Instructions for upgrades¶
If you are upgrading from a previous version of Archivematica, please see the upgrading instructions.
Advanced installation options¶
There are many ways to install Archivematica, depending on the needs of the individual user. We have documented some common advanced installation setups.