Test file existance.

Parameters:
See dedicated page for more information.
testFileExists is a scripted action. Embedded code is accessible and customizable through this tab.
See dedicated page for more information.
The testFileExists action is used to verify the existence of files specified in the input dataset. It is typically used as a validation step in ETL pipelines before performing operations such as copying, deleting, or transforming files. This action ensures that subsequent processes only proceed with records referencing existing files.
Additionally, the action can optionally validate file size and remove rows from the dataset where the referenced file does not exist. This makes it a valuable utility for building robust file-handling pipelines.
| ID | Description | Type | Required | Example Value |
|---|---|---|---|---|
Path |
Path to the file to check | Column | Yes | C1 |
Optional: file size |
Optional: Expected file size | Column | No | C2 (optional) |
remove row if file is missing |
Remove row if file is missing | Switch | No | true or false |
FileAlreadyExists) to the output dataset.1 if the file exists (and matches the size if specified).0 if the file does not exist or the size is mismatched.remove row if file is missing is enabled, rows with missing or mismatched files are removed.

With remove row if file is missing = false

With remove row if file is missing = true

Optional: file size is specified, the action will additionally compare the actual file size with the provided value. If mismatched, the file is treated as non-existent.remove row if file is missing helps sanitize datasets early, avoiding conditional logic downstream.The testFileExists action ensures that only valid, accessible files are processed in your ETL pipeline. Its flexibility to optionally remove invalid records or validate file sizes makes it a powerful tool for defensive data integration workflows.
