Additional settings and use cases¶
Setting the location of the staging area¶
When indexing a dataset, Dac-Man uses a staging area to save all metadata and index information. Each directory in the staging area uniquely identifies each dataaset (using a hash representation of the dataset path) indexed by Dac-Man. Users can customize this location when analyzing read-only datasets or comparing files on different systems.
The default staging area is located in $HOME/.dacman/data
.
Users can change the staging area path through the command-line with this command:
dacman index mydir/ -s mystage
The command above creates the indexes inside mystage
directory.
You can copy or move these indexes to compare and calculate the changes,
without necessarily copying or moving the data.
This is useful for cases when access to the datasets is limited or restricted, as illustrated in the following examples.
Comparing datasets with read-only access¶
If the data directories have read-only access,
the metadata and indexes can be stored in a user-defined location my_staging_dir
using the -s
option:
dacman index datadir -s my_staging_dir
Comparing datasets at two different sites¶
To compare datasets at two different sites,
one strategy is to create indexes in a user-defined location (as shown in the previous step),
and copy the staged indexes to a common location, e.g. my_shared_index_location
.
The changes can then be retrieved using:
dacman diff local_dir remote_dir -s my_shared_index_location