-
Notifications
You must be signed in to change notification settings - Fork 17
Can't run "soda analyze" or "soda scan" on new installation - module not found error #338
Comments
Hi @dcape42 - unfortunately I am not able to reproduce it.. here's the log:
Can you please try only installing soda-spark ? and not both soda-spark-sql and soda-spark? |
SOLVED: Test : created a python test file test.py with import sodasql.dialects.spark_dialect(.venv) sveera@SVEER1ML1 soda-test % python test.py Resolution:
|
Describe the bug
On a new installation, I am able to run the soda command but I am unable to run any subcommands that access my data warehouse. I get an error saying
Module sodasql.dialects.spark_dialect not found
even though it is installed.This is a shared project and I am attempting to use the warehouse and scan YAML files created by my colleague and shared in a git repo.
To Reproduce
Steps to reproduce the behavior:
~/.soda/config.yml
and~/.soda/env_vars.yml
with the values I need.soda --help
to confirm Soda is installed - success.soda scan warehouse_riot_data.yml tables/accounts_alias.yml
(or any YAML files) - this fails.Expected result:
Soda should run the scan command.
Actual result:
Error
Context
This happens with any warehouse.yml file and any scan.yml file. It also happens when trying to run
soda analyze
with any warehouse.yml file. The contents of the YAML file don't matter because the failure occurs when trying to create a parser object.I can confirm the python files are present under my site-packages directory with what appears to be the correct relative path.
But the warehouse_yml_parser.py file is found and the spark_dialect.py file is not found (as shown by the module not found error).
I can also confirm the packages are installed at the correct versions.
My colleague has run the same installation steps, and we are showing the same packages installed. He can run soda commands but I can't.
additional info: If I also install
soda-sql-hive
I am able to runsoda create -f test.yml hive
but notsoda create -f test2.yml spark
. So it seems there is something specific to the spark libraries.OS: MacOS Big Sur 11.6.5
Python Version: 3.8.12 (also tried with 3.7.8 and 3.7.10)
Soda SQL Version: 2.1.8
Warehouse Type: Spark
The text was updated successfully, but these errors were encountered: