dbt (data build tool) interview questions


21. What tests are available to use in dbt?
Out of the box, dbt ships with the following tests:
unique
not_null
accepted_values
relationships (i.e. referential integrity)
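As a rough sketch, the four built-in tests could be declared in a schema .yml file like the one below. The model names (orders, customers), the column names, and the list of accepted status values are hypothetical and only there for illustration; substitute the ones from your own project.

```yaml
# models/schema.yml (hypothetical file name and location)
version: 2

models:
  - name: orders            # hypothetical model
    columns:
      - name: order_id
        tests:
          - unique          # no duplicate values in this column
          - not_null        # no NULL values in this column
      - name: status
        tests:
          - accepted_values:  # every value must be one of the listed values
              values: ['placed', 'shipped', 'completed', 'returned']
      - name: customer_id
        tests:
          - relationships:    # every customer_id must exist in customers.id
              to: ref('customers')
              field: id
```

Running dbt test would then execute each of these tests against the warehouse.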

22. How do I build one seed at a time?
As of v0.16.0, you can use a --select option with the dbt seed command, like so:
$ dbt seed --select country_codes
There is also an --exclude option.


23. How can I see the SQL that dbt is running?
To check out the SQL that dbt is running, you can look in:
dbt Cloud:
Within the run output, click on a model name, and then select “Details”
dbt CLI:
The target/compiled/ directory for compiled select statements
The target/run/ directory for compiled create statements
The logs/dbt.log file for verbose logging.

24. What is the difference between dbt Core, the dbt CLI and dbt Cloud?
dbt Core is the software that takes a dbt project (.sql and .yml files) and a command and then creates tables/views in your warehouse. dbt Core includes a command line interface (CLI) so that users can execute dbt commands using a terminal program. dbt Core is open source and free to use.
dbt Cloud is an application that helps teams use dbt. dbt Cloud provides a web-based IDE to develop dbt projects, a purpose-built scheduler, and a way to share dbt documentation with your team. dbt Cloud offers a number of features for free, as well as additional features in paid tiers.

25. Can I store my seeds in a directory other than the `data` directory in my project?
By default, dbt expects your seed files to be located in the data subdirectory of your project.
To change this, update the data-paths configuration in your dbt_project.yml file, like so:
dbt_project.yml
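data-paths: ["seeds"]
Note that in dbt v1.0 and later, this configuration is named seed-paths, and the default directory is seeds rather than data.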

Author: user
