2021.10
DQ Job Back Run
Replay 30 days of data

How to Replay a Data Test

You will often want to see how a dataset behaves over time, whether across 5 days or 5 months. Using this slider, the tool automatically creates training sets and profiles, and runs any rules or outliers you have put in place.

Quickly Replay 30 days of data, -br 30

Add -br to any owlcheck to replay it in time order: Jan 1, Jan 2, Jan 3, and so on. To do this, use the ${rd} variable that Owl provides as a run_date replacement for job control and templates. Note that if you run from the command line you need to escape the $, so use \${rd}. If you are running from a Notebook, Java, Scala, or the REST API, you do not need to escape the ${rd} variable.
./owlcheck \
-ds OWLDB2.NYSE_STOCKS3 -rd "2018-01-14" \
-lib "/opt/owl/drivers/db2/" \
-cxn db2 \
-q "select * from OWLDB2.NYSE_STOCKS where TRADE_DATE = '\${rd}'" \
-br 4
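The escaping rule above can be checked without running owlcheck at all. The following is a minimal sketch that only uses the shell itself: it builds the same query string twice, once with ${rd} unescaped and once with \${rd}, and prints what would actually be passed to -q. The variable names unescaped and escaped are illustrative and are not part of owlcheck.

```shell
#!/bin/sh
# Simulate a shell in which rd has no value (the usual case on a command line).
rd=""

# Unescaped: the shell substitutes ${rd} itself, so the placeholder is lost
# before the query string ever reaches the tool.
unescaped="select * from OWLDB2.NYSE_STOCKS where TRADE_DATE = '${rd}'"

# Escaped: the backslash keeps the literal ${rd} text inside the double quotes,
# leaving it available as a run_date placeholder.
escaped="select * from OWLDB2.NYSE_STOCKS where TRADE_DATE = '\${rd}'"

echo "$unescaped"   # prints: select * from OWLDB2.NYSE_STOCKS where TRADE_DATE = ''
echo "$escaped"     # prints: select * from OWLDB2.NYSE_STOCKS where TRADE_DATE = '${rd}'
```

Only the second form still contains the ${rd} placeholder, which is why the command-line examples on this page write \${rd} inside the double-quoted -q argument.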

Replay 4 Months of data, -br 4 -tbin MONTH

In situations where your data rolls up into months, you may want to re-run several months of data, but not one day at a time. In this case, use -br together with -tbin.
./owlcheck \
-ds OWLDB2.NYSE_STOCKS3 \
-rd "2018-01-01" \
-q "select * from OWLDB2.NYSE_STOCKS where TRADE_DATE = '\${rd}'" \
-br 4 \
-tbin MONTH \
-lib "/opt/owl/drivers/db2/" \
-cxn db2

Monthly using a range for the entire Month

./owlcheck \
-ds OWLDB2.NYSE_STOCKS3 \
-rd "2018-01-01" \
-rdEnd "2018-02-01" \
-q "select * from OWLDB2.NYSE_STOCKS where TRADE_DATE >= '\${rd}' and TRADE_DATE < '\${rdEnd}'" \
-br 4 \
-tbin MONTH \
-lib "/opt/owl/drivers/db2/" \
-cxn db2