Data Sources
Data sources are the central elements of Qalita; they are referenced in the platform, and the data is never copied and stored within the platform.
Add a Source
Adding a source is currently done only through the qalita command line.
When you add a source, it creates a configuration file qalita-conf.yaml
in the ~/.qalita/
directory, which contains the connection information for the source.
Example:
version: 1
sources:
- config:
path: ~/desktop/data/heart/
description: 11 clinical features for predicting heart disease events.
id: 1
name: Heart Failure Prediction Dataset
owner: admin
owner_id: 1
reference: true
sensitive: true
type: file
validate: valid
visibility: internal
In this example, the source is a local file, but it could also be a database, a remote file, a data stream, etc.
Property | Description |
---|---|
version | Version of the configuration |
sources | List of sources |
config | Configuration of the source; properties vary depending on the type of source. |
id | Unique identifier of the source; this identifier is obtained from the platform after finalizing the source referencing with the qalita source push command. |
name | Name of the source |
owner | Name of the user who owns the source |
owner_id | Identifier of the user who owns the source |
type | Type of the source |
reference | Whether the source is a reference, allowing comparison with other sources. |
sensitive | Whether the source is sensitive; special authorization will be required to access it see authorizations. |
validate | Whether the source is valid, meaning the client has access to the source and it can be referenced on the platform. |
visibility | Visibility of the source, can be public , internal , or private . Allows restricting access to different user profiles on the platform. |
Source Types
Sources can be of any type, as long as there are packs available to process them. QALITA Platform does not impose any limitations on the types of sources.