-
deequ Public
Forked from awslabs/deequDeequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Scala Apache License 2.0 UpdatedFeb 10, 2024 -
amundsen Public
Forked from amundsen-io/amundsenAmundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Python Apache License 2.0 UpdatedOct 27, 2023 -
spark Public
Forked from apache/sparkApache Spark - A unified analytics engine for large-scale data processing
Scala Apache License 2.0 UpdatedOct 6, 2023 -
-
ml-design-patterns Public
Forked from GoogleCloudPlatform/ml-design-patternsSource code accompanying O'Reilly book: Machine Learning Design Patterns
Jupyter Notebook Apache License 2.0 UpdatedOct 26, 2020 -
-
qds-sdk-java Public
Forked from qubole/qds-sdk-javaA Java library that provides the tools you need to authenticate with, and use the Qubole Data Service API.
Java Apache License 2.0 UpdatedJun 12, 2020 -
LintCode Public
Forked from terrytong0876/LintCode-1Java Solutions to problems on LintCode/LeetCode
Java UpdatedNov 17, 2017 -
message-hub-samples Public
Forked from ibm-messaging/event-streams-samplesJava Apache License 2.0 UpdatedSep 20, 2017 -
-
-
-