2021.10
Collibra DIC Integration
Powered By GitBook
DQ Job HDFS
Run Data Quality on a file in HDFS. Owl will automatically infer the schema and create an internal training model.
1
-f "hdfs:///demo/ssn_test2.csv" \
2
-d "," \
3
-rd "2018-01-08" \
4
-ds "ssn_hdfs_file" \
5
-master yarn \
6
-deploymode cluster \
7
-numexecutors 2 \
8
-executormemory 2g
Copied!
Last modified 2yr ago
Copy link