Databricks
Time-travel support
Time-travel is supported by DirectQuery with Databricks, but is limited to tables (and not views, as it is not supported by Databricks).
This means that updating applications based on this Database are subject to data desynchronization in case of changes in a view of the database.
To choose the time-travel behavior, DatabricksDirectQuerySettings#timeTravelPolicy
can be configured with the following values:
DatabricksDirectQuerySettings.TimeTravelPolicy#DISABLED
: time-travel will not be used.DatabricksDirectQuerySettings.TimeTravelPolicy#STRICT
: time-travel will be used if there are only tables and no views, and otherwise will throw an error. This is the default mode.DatabricksDirectQuerySettings.TimeTravelPolicy#LIGHT
: time-travel will be used only for tables and views will be queried without it.
Read this page to learn more about the visible effects of this desynchronization.
Vectors support
Databricks handles vectors natively, via the Array
type in the model and thanks to special functions at query time.
Thus, in DirectQuery, this is the only supported mode for dealing with this data type.
Gotchas
Nullable fields
Databricks can define fields with nullable types.
DirectQuery is capable of detecting this and defines its internal model accordingly.
While this is not a problem in itself, it can conflict with the rule in Atoti cubes that all levels must be based on non-nullable values.
This does not create an issue of any sort as the local model is updated behind the scene, assigning a default value based on the type.
It has a side effect on the query performance, as DirectQuery must convert on-the-fly null values to their assigned default values, as well as adding extra conditions during joins to handle null values on both sides.