reads the initial transaction CSV, lightly cleans the data, and writes transactions_cleaned.csv and exits.csv
reads transactions_cleaned.csv, drops every field except the basic company descriptions, and removes duplicates, leaving one entry per company in companies.csv
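
a minimal sketch of the company-list step, using only Python's standard csv module. The column names in COMPANY_FIELDS are hypothetical (the real company-description columns are not listed in these notes), and stripping surrounding whitespace before comparing rows is also an assumption, not necessarily what the actual script does.

```python
import csv

# Hypothetical column names: replace with the actual company-description columns.
COMPANY_FIELDS = ["company_name", "sector", "country"]


def unique_companies(rows, fields=COMPANY_FIELDS):
    """Keep only `fields` from each row and drop repeats, preserving first-seen order."""
    seen = set()
    companies = []
    for row in rows:
        # Assumption: values differing only by surrounding whitespace are the same company.
        key = tuple(row[f].strip() for f in fields)
        if key not in seen:
            seen.add(key)
            companies.append(dict(zip(fields, key)))
    return companies


def write_companies(src="transactions_cleaned.csv", dst="companies.csv",
                    fields=COMPANY_FIELDS):
    """Read the cleaned transactions and write one row per distinct company."""
    with open(src, newline="", encoding="utf-8") as f:
        companies = unique_companies(csv.DictReader(f), fields)
    with open(dst, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fields)
        writer.writeheader()
        writer.writerows(companies)
    return len(companies)
```

calling write_companies() with the default arguments would read transactions_cleaned.csv from the working directory and return the number of distinct companies written. Deduplicating on the full tuple of description fields treats two rows as the same company only when every kept field matches.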