
Data Sources


Data sources are the central elements of Qalita. The platform only references them: the data itself is never copied to or stored within the platform.

Add a Source

Currently, a source can be added only through the qalita command line.

When you add a source, it creates a configuration file qalita-conf.yaml in the ~/.qalita/ directory, which contains the connection information for the source.
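As a rough illustration of where that file lives, the following Python sketch resolves its location and reports whether it exists yet. The helper name qalita_config_path is hypothetical and is not part of the qalita CLI; only the directory and file name come from the text above.

```python
from pathlib import Path
from typing import Optional


def qalita_config_path(home: Optional[Path] = None) -> Path:
    """Return the expected location of qalita-conf.yaml.

    `home` defaults to the current user's home directory; passing another
    directory is only a convenience for testing this sketch.
    """
    base = home if home is not None else Path.home()
    return base / ".qalita" / "qalita-conf.yaml"


path = qalita_config_path()
# The file is only present once at least one source has been added.
print(path, "exists" if path.exists() else "not found")
```

On a machine where no source has been added yet, the file is expected to be reported as not found.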

Example:

version: 1
sources:
- config:
    path: ~/desktop/data/heart/
  description: 11 clinical features for predicting heart disease events.
  id: 1
  name: Heart Failure Prediction Dataset
  owner: admin
  owner_id: 1
  reference: true
  sensitive: true
  type: file
  validate: valid
  visibility: internal

In this example, the source is a local file, but it could also be a database, a remote file, a data stream, etc.

Property      Description
version       Version of the configuration
sources       List of sources
config        Configuration of the source; properties vary depending on the type of source.
id            Unique identifier of the source; this identifier is obtained from the platform after finalizing the source referencing with the qalita source push command.
name          Name of the source
owner         Name of the user who owns the source
owner_id      Identifier of the user who owns the source
type          Type of the source
reference     Whether the source is a reference, allowing comparison with other sources.
sensitive     Whether the source is sensitive; special authorization will be required to access it (see authorizations).
validate      Whether the source is valid, meaning the client has access to the source and it can be referenced on the platform.
visibility    Visibility of the source, can be public, internal, or private. Allows restricting access to different user profiles on the platform.
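To make the property list concrete, here is a minimal Python sketch that checks one source entry (already loaded as a dictionary) against the properties described above. The function check_source is hypothetical and not part of Qalita, and treating every listed property as required, or reference and sensitive as strict booleans, is an assumption of this sketch rather than a documented rule.

```python
from typing import Any

# Property names taken from the table above. Treating all of them as
# required is an assumption of this sketch, not a documented rule.
EXPECTED_KEYS = {
    "config", "id", "name", "owner", "owner_id", "type",
    "reference", "sensitive", "validate", "visibility",
}
# The three visibility values named in the table.
VISIBILITY_VALUES = {"public", "internal", "private"}


def check_source(entry: dict[str, Any]) -> list[str]:
    """Return a list of human-readable problems; empty if none were found."""
    problems = []
    for key in sorted(EXPECTED_KEYS - set(entry)):
        problems.append(f"missing property: {key}")
    visibility = entry.get("visibility")
    if visibility is not None and visibility not in VISIBILITY_VALUES:
        problems.append(f"unexpected visibility: {visibility!r}")
    # Assumption: reference and sensitive are plain true/false flags.
    for flag in ("reference", "sensitive"):
        if flag in entry and not isinstance(entry[flag], bool):
            problems.append(f"{flag} should be true or false")
    return problems


# The example source from this page, written as a Python dictionary.
heart = {
    "config": {"path": "~/desktop/data/heart/"},
    "description": "11 clinical features for predicting heart disease events.",
    "id": 1,
    "name": "Heart Failure Prediction Dataset",
    "owner": "admin",
    "owner_id": 1,
    "reference": True,
    "sensitive": True,
    "type": "file",
    "validate": "valid",
    "visibility": "internal",
}

print(check_source(heart))  # → []
```

Returning a list of problems instead of raising on the first one lets a caller report every issue in an entry at once.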

Source Types

Sources can be of any type, as long as there are packs available to process them. QALITA Platform does not impose any limitations on the types of sources.