To help inform the ongoing work being done by BEACON, a company I co-founded, I wanted to do an inventory and assessment of all the privacy policies of Fortune 500 companies.
How it Worked
- Create a List of Fortune 500 Companies + Website Links
First the system would visit the Fortune 500 list and capture the names, ranking, website links and available labor market and financial data for each company. This output an ordered CSV list.
- Pull All Links from Each Company Website
Next, the system would pull the website URL for each company from the CSV file and use Apple Automator to scan the website and create a list of every link on the site.
- Filter Link List to Identify Privacy URL's
This list of links pulled from each company website was then run through a filtering mechanism that would identify and select only links that contained "privacy" or "policy" related keywords.
- Assess Privacy Readability with Hemingway App
- Identifying Errors & Cleanup
Finally, once all of the data had been consolidated and organized in the master CSV, the system ran a final check through each entry and identified any items that had not successfully captured the desired data results.