Forum & Social Media Data, Custom Data Scraper

Closed
Food Trucks Association of Canada
Waterloo, Ontario, Canada
Mackenzie P
Project Manager
(14)
4
Preferred learners
Anywhere
Academic experience
Categories
Software development Machine learning Data science
Skills
data validation conceptual design data extraction user experience (ux) requirements elicitation python (programming language) ethical standards and conduct data privacy laws data structures quality assurance
Project scope
What is the main goal for this project?

The Food Trucks Association of Canada (FTAC) seeks to dive deep into the insights and prevalent issues of the food truck industry, exploring public forums and online communities where operators gather (such as Facebook, reddit, etc.). The goal of this project is to create a custom data scraping program in Python, as to collect large sets of data from shortlisted group forums. This scraper will be used as a pivotal tool, intended to automate and streamline the intricate data extraction process from multiple online forums. Thereby providing a robust dataset for FTAC to use in our existing data projects.

What tasks will learners need to complete to achieve the project goal?

1. Preliminary Research and Requirements Gathering:

  • Investigate various forums and platforms to understand their data structure and accessibility.
  • Establish detailed requirements concerning the data types, formats, and categories needed.


2. Legal and Ethical Compliance Understanding:

  • Research and comprehend data privacy laws relevant to the websites used.
  • Establish strategies to ensure full compliance with these legal and ethical parameters throughout the project.


3. Designing the Data Scraper Architecture:

  • Develop a conceptual design for the scraper, considering the varied structures of targeted platforms.
  • Envision and layout the data flow, ensuring organized extraction, processing, and output.


4. Developing the Data Scraper:

  • Code the scraper, ensuring it aligns with the established design and requirements.


5. Testing and Quality Assurance:

  • Perform systematic testing of the scraper, ensuring it effectively extracts and categorizes data across all identified platforms.


6. Data Validation and Preliminary Analysis:

  • Conduct a preliminary analysis of the data to identify any gaps, inconsistencies, or areas of improvement in the scraping process.


7. User Interface (UI) and User Experience (UX) Development:

  • Develop a user-friendly interface that allows easy configuration and operation of the scraper.


8. Documentation and User Manual Creation:

  • Compile comprehensive documentation detailing the scraper’s architecture, functionalities, and operation.
  • Develop a user manual that provides clear instructions and use-cases for end-users, ensuring they can effectively utilize the tool.


About the company
2 - 10 employees
Food & beverage, Non-profit, philanthropic & civil society

The Food Trucks Association of Canada (FTAC) is a national, nonprofit organization which was first registered in Canada in the late summer of 2020, in the earlier period of onset of the pandemic.

https://www150.statcan.gc.ca/n1/pub/45-28-0001/2021001/article/00010-eng.htm

An agile approach has been taken and we are now looking to redefine how we can best start and grow to support the industry. It is critically important to us to provide real and lasting value to our members.

Projects that are taken on by students and courses in the Riipen platform will be instrumental in our ability to build capacity to deliver that value.
To date, the work of the Food Trucks Association of Canada has been led by a volunteer Executive Director who is a passionate advocate in this space, and has leveraged a 75% student body of employees made available through various employment subsidies. It is a key part of our mandate to support student learning.

The NAICS code for the Food Trucks is 7223 and other code subsets.