Directory structure:

~/git/Data
    aa_codes.csv
~/git/Data//input
~/git/Data//output

data_extraction.py
    The directories above must exist; otherwise the script creates them in the current directory.
    Requires reference_dict.py and tidy_split.py.
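
The directory fallback noted above (use the expected directory if it exists, otherwise create one in the current directory) could look roughly like the sketch below. This is not taken from data_extraction.py: the function name resolve_dir and its parameters are hypothetical, and the actual script may handle this differently.

```python
from pathlib import Path


def resolve_dir(preferred, fallback_name, base=None):
    """Return `preferred` if it is an existing directory.

    Otherwise create (if needed) and return a directory called
    `fallback_name` under `base`, which defaults to the current
    working directory. Hypothetical helper, for illustration only.
    """
    preferred = Path(preferred).expanduser()  # expands a leading "~"
    if preferred.is_dir():
        return preferred

    base = Path.cwd() if base is None else Path(base)
    fallback = base / fallback_name
    # exist_ok avoids an error when the fallback is already there
    fallback.mkdir(parents=True, exist_ok=True)
    return fallback
```

For example, resolve_dir("~/git/Data//input", "input") would return the expected input directory when it is present, and otherwise an input directory under the current working directory.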