10.13.Configuration of File system archive job
In this section we will configure a file system archive job, which will run every day from 6 PM, will archive our 1-2 year old files, and the archived files will be replaced with shortcuts. Our database, storage, root folder, retention and scheduler are already configured. Navigate to File Archive ⇒ Archive ⇒ Jobs button, click on +new. In the Add new job instance dialog select File system archive job from the Available jobs: dropdown list. Select the node where the job should run from the Run on node list. Now we you would not like to specify a node, so we select “Any available”. Enter a Display name, and click on Add.
On the job’s configuration page go through the following sections; most of them are mandatory, filtering settings are optional:
✓ Scheduling settings:
In this step the running times of the archive job must be selected. It is possible either to select a scheduler from the list or to create a new scheduler via create new option. For archiving job it is recommended to set up a scheduler which will run always, or in certain times during the week. Our job will run every day from 6 PM till midnight, after business hours. (For more information about how to set schedulers refer to the section Schedules above.)
✓ Retention settings:
Under retention settings the administrator may select a previously configured time period. During this time it will be disabled to delete the archived items from the storage. It is recommended to set here a time interval based on data recording regulations required either by the law of the country, or by internal company policies. In our country the required retention time is 10 years, so we set this here. (For more information how to set retentions check section Retentions above.)
✓ Shortcut settings in File archive:
The user may decide, if the files will be archived using shortcuts or will be simply archived, creating a backup file to the original. It is also possible to set automatic deletion of the original document from the file system location. There are 3 shortcut types available in file system archive (the state before and after the archiving process is illustrated on the respective screenshots):
- Keep original method (backup files with versioning) is often used on a daily basis. The main advantage of this processing type is, that the documents are ready at hand at any time, and the versions can be accessed from the archive at any time, too. This method keeps the original item in the original file location.
- Create shortcut from original item: Files older than 1 year are usually archived with this shortcutting method. The main advantage of this method is the spared space on the computer. In case of need, the documents can be easily retrieved from the archive with clicking on the respective shortcut.
If the user modifies the file and saves it to the original location again, then the next archive job will create a new version from it into the archive, and will synchronize the shortcut with this new version.
- Delete original: with this method the user deletes the file from the original location when archiving it. This processing type is often used when the document is not in use any more, but it must be restorable until a definite date required by the general records retentions of the country.
It depends on a particular situation, which method is advisable to use. You can use all methods simultaneously to have your daily backup — create versions when files are changed, or archive with shortcutting or delete original method in case of old(er) files.
In this use case we will archive by using shortcuts. If you decide to use shortcutting, it is also necessary to specify the Virtual drive server name. This is the name of the server where the virtual drive is installed. In our case it is tanews. It is strongly recommended to use a name of the server under which it will be always accessible (DNS alias at best). This server name is then included in the generated shortcuts. By using the Use default checkbox you may automatically select the Virtual Drive, which is already configured in the System ⇒ Client Applications ⇒ Virtual Drive configuration section. It is very useful to use this option, as if the Virtual Drive server name has been changed, then you have to change these settings on the Virtual Drive configuration page only. (For more information refer to section “Virtual drive”.) On the jobs’ configuration page the default Virtual Drive will be prepopulated.
With clicking on the Test button it is allowed to check if the connection to the Virtual drive server was established.Note: Virtual drive is included in the content ACCESS setup package. It needs to be configured with navigating to System tab ⇒ Client applications ⇒ Virtual drive configuration button on the ribbon. For further information how to install and launch it please refer to the Virtual Drive Installation Manual uploaded here. After these necessary steps it will be shown among another drives (by default the Virtual drive will be the Z drive).Note: The shortcuts used in file archive system are compatible with Macintosh operating system, too. It is a must to install TECH-ARROW’s WinShortcutter application on every client machine. It can be downloaded either from the Tools of the contentACCESS setup, or from our website, here.Note: It is important to bear in mind that archiving an HTML file with the shortcutting method in contentACCESS File Archive has certain technical limitations. If an HTML file to be archived contains pictures, then after the archiving process these pictures will not be visible when trying to open the HTML over the shortcut in the original location. Nevertheless these pictures will be still accessible with opening the HTML from the Virtual Drive.
✓ Folders to process:
In this section the user is required to select the local folder(s) or remote share(s) where the system data are located and will be archived. There are two available options how the list of folders may be specified:
- The folder(s) may be added via +new option. In this case the folder(s) must be added manually one-by-one. Click on +new to open the File Path dialog.
Click on “new” to specify the folder to be archived manually. In the “Folder to process” dialog unroll the list of already created roots and select the one that should be applied. This dropdown lists the roots that are already configured (in File Archive settings: File Archive ⇒ Settings ⇒ Root folders). Enter the Relative path if necessary, test the path via the “Test” button and click “OK”
- The folder list may be also imported from a file. The file to be imported must contain the list of folders that should be processed. Click the import button and the Import file location dialog opens. Specify here the required settings according to chapter Importing contentACCESS configurations from files of this guide.
✓ Exclude folders
With this function you may set, which folders/subfolders should not be archived. If you would not like to exclude any folder from the archive process, you can skip these settings.
The folders may be excluded using one of the available methods: a) The folders to exclude may be added manually via the + new option or; b) may be imported in one of the available file formats via the import option.
- Add the folders to exclude manually:
Click on + new option. In the Exclude folder dialog’s textbox specify the folder(s). Use the hints that are listed in the same dialog.
Test pattern button
This button was developed to check, if the entered pattern will be suitable for the path that the user would like to exclude from the archiving process. Click the Test pattern button to open the Manual test of pattern matching window. Enter the folder path into the Folder path field, and click on Test button.
The test will be successful, if the Pattern was entered correctly into the Exclude folder window.
- Import the list of exclude folders:
Folders to exclude may be imported as files in one of the available formats: CSV, XML, Tab delimited files or Space delimited file. The file to be imported must contain the list of folders to exclude. To import such a file with the list of folders refer to chapter Importing contentACCESS configurations from files of this guide.
✓ Item filtering:
This configuration enables for the user to filter among items by 3 types of criteria: by age, size or file type. It is also enabled to use more filters simultaneously. For filtering documents according to any of these criteria activate the filter button by checking the checkbox next to it, then click on the activated button.
- Age filter:
With this filter the user may select files to process with a specific age. The user has multiple choices here; he can select the filtration type from the list, and a) use either the Use dates option and pick up a date from the calendar, or b) he can enter a value into days/months/years textbox and search for the files which are younger than/older than or fall into time interval set here.
Using the creation date or modification date for filtering: By default the modification date is the determinative date, but this can be changed by checking the “If creation date is younger than the modification date, use creation date for filtration” checkbox. However, in certain cases the creation date of a file is set to younger date than the modification date of a file. This happens, if a file is copied to another external disk (for example from disk G to disk I). In this case the creation date will be set to the date of the copy action. If the user would like to use this file in the future and would not like to archive it, this enhancement that can solve this problem. With this feature it is possible to exclude the file with the newer creation date from the archiving process. To create a proper archiving policy take into consideration these rules: https://support.microsoft.com/en-us/kb/299648.
Examples of using dates:
Examples of filtering using days/months/years:
If the Use dates checkbox is not selected, then the date selection option becomes inactive. In this case it is allowed to filter between items with Modification date is younger than/Modification date is older than a certain number of days/month/years. Enter the value into the From, To checkboxes next to the Days/Months/Years radio button (screenshot A). It is also allowed to filter out items which belong to a certain period of time, e.g. items that are 1-3 months old (screenshot B). This age filtering type should be used if you would like to archive items, which are older/younger than a certain number of days/months/years (e.g. items older than 1 year). If the Age filter has been set click OK.
Screenshot A Screenshot B
- Size filter: This filter can be very useful because it enables to find the biggest files to archive and save space. For filtering files according to this criteria activate Size filter button by checking the checkbox next to it, then click on the active button.
In the Size filter dialog choose from the filtering types (Bigger then, Smaller then, or Between values), enter the size (in kilobytes) and click OK.
Screenshot: Files bigger than 64 kB will be processedImportant!!!: When the user changes the file content of an already archived item, e.g. deletes a picture from the file and saves it to the original location, (and thus the file size will be reduced e.g. from 110 to 85 kB) and file size filter is set on >100 kB, then File archive job will not archive the newly created item by the next processing. The newly created item will be saved in the original location, and the old shortcut will be present in the original location, which will still link to the old file without any changes.
From this reason, it is recommended to always check the size filter settings, if a file has not been archived by the next archiving process.
- File type filter: This filter was improved to select certain file types for processing. For filtering between documents according to this criteria activate File Type filter button by checking the checkbox next to it, then click on the active button.
In the File Type filter dialog the user may select the file types that need to be processed (option Process ONLY selected file types), or he may specify file types, that should be excluded from the processing (DO NOT process selected file types).
In our use case we will set only age filter for files older than 1 year, but younger than 2 years:
✓ Notification settings. Select here, when you will need to get notification emails from the File Archive job. It is possible to set here, if you need a notification email always/never, or, you need it only if an error/warning occurred during the archiving process − for example, if the archive of one or more items failed, then you may get a notification email about this error. Into the Recipient list textbox insert the email addresses of the persons, to whom you would like to send these emails. It may happen, that the files are modified too often, and too many version are created. With the Notify if version count reaches option it is possible to define a maximum number of versions, over which the notifications will be sent. It is recommended to set here a higher value. This feature might also be used for troubleshooting purposes. E.g. if the number of versions gets higher than the value specified here, then an error might have occurred during the processing. In this use case the notifications will be sent to Walter White, if any errors or warning occur and/or version count reaches 50.
✓ Resource settings. Resource settings. Set the value, which will determine how many items will be processed simultaneously by the file archive job. The recommended value is 2, as in our case.
At the end never forget to save your job configurations: click on Save in the left upper corner of UI.
Now you need to wait till 6 PM, when the scheduler starts the job. Its running can be checked in the file archive jobs’ grid.
Where an end user may find his file system archive?
An end user has multiple choices where to view his archived files. He may either browse for them on the Virtual drive, or he can view them in contentWEB (contentWEB is the archive interface of contentACCESS). Virtual drive is a special contentACCESS extension developed by TECH-ARROW. Even the archived item will not be present here on this drive (it will be present in contentACCESS), but the user can browse for them here. A file opened through the Virtual drive is opened in read-only mode. This drive can be shared with any other users, so they can have a very easy access to the archived files. The shortcuts that replace the originals can be opened from the original file location. These shortcuts link to the virtual drive.