Categories

Versions

You are viewing the RapidMiner Hub documentation for version 9.10 - Check here for latest version

Automatic Job Cleanup

If you’re not on 9.10.4, but any version from 9.10.0 to 9.10.3, please head over to the older documentation for Job Cleanup.

RapidMiner Server automatically saves information related to recently executed jobs. This includes the user who triggered the execution, the job's state, the queue on which the job has been executed and also the date on which the process was executed - this information can be reviewed on the Executions page within RapidMiner Server. Additionally, the Job Agent which was responsible for executing the job, created a dedicated working directory for it.

Those stored pieces of information and also the working directories can grow large. In order to avoid this, RapidMiner Server provides a job cleanup mechanism to wipe old jobs. There are two alternative methods of configuration:

  1. Configure docker-compose.yml
  2. Configure execution.properties

Configure docker-compose.yml

Add these lines to the rm-server-svc container, under environment:

  1. JOBSERVICE_SCHEDULED_ARCHIVE_JOBCLEANUP_ENABLED=(true or false): Enables the job cleanup

  2. JOBSERVICE_SCHEDULED_ARCHIVE_JOBCLEANUP_CRON_EXPRESSION=(cron expression): This property defines the point in time when the automatic job cleanup will be executed. It follows the cron pattern <second> <minute> <hour> <day> <month> <weekday>. So 0 */30 * * * * would run the job cleanup every 30 minute whereas 0 0 0 * * * would run it daily at midnight. You can use the scheduling dialog in RapidMiner Studio to create cron expressions graphically.

  3. JOBSERVICE_SCHEDULED_ARCHIVE_JOBCLEANUP_MAXAGE=(number of minutes): This property defines the maximum age of jobs in minutes. Jobs older than the value set in the property will be cleaned up. Set this to any number greater than zero, according to your organization's policies and the available disk space.

Configure execution.properties

The configuration of docker-compose.yml is probably simpler, but an alternative scheme is to add the following properties to the execution.properties file within the <rapidminer-home>/configuration folder of your RapidMiner Server home directory.

  1. jobservice.scheduled.archive.jobCleanup.enabled: Enable the job cleanup.

  2. jobservice.scheduled.archive.jobCleanup.cronExpression: This property defines the point in time when the automatic job cleanup will be executed with the help of a cron expression. By default, the cleanup task is configured to run hourly with the cron expression 0 0 * * * *. It follows the cron pattern <second> <minute> <hour> <day> <month> <weekday>. So 0 */30 * * * * would run the job cleanup every 30 minute whereas 0 0 0 * * * would run it daily.

  3. jobservice.scheduled.archive.jobCleanup.maxAge: This property defines the maximum age of jobs in minutes. By default, this is not configured. If configured, jobs older than the value set in the property will be cleaned up. Set this to any arbitrary number greater than zero. Please note that the property value needs to reflect the execution time. If you like to clean up all jobs which are older than 10 minutes, you also need to run the job every 10 minutes.