ESSnet Web Intelligence Network (WIN) Project: 2022-2023 Webinar Series | Enhancing the Quality of Statistical Business Registers with Scraped Data & Methods of Processing and Analysing of Web Scraped Tourism Data | 24 January & 23 February 2023

logo Web Intelligence Network

2022-2023 Webinar series 

The ESSnet Web Intelligence Network (WIN) project is creating an environment where an array of non-traditional data sources can be accessed by members of the European Statistics System (ESS) and beyond.  To help build capability in this community, the project will be delivering a series of webinars and a face-to-face workshop covering the following subjects:

  • Architecture, Methodology and Quality – available to watch on
  • Business Registers Quality Enhancements
  • Tourism Statistics
  • Web Intelligence in Practice – Satellite event at the NTTS Conference (face-to-face workshop)

 

Our next two webinars are now available to book on Eventbrite:

24 January 2023 at 10:00 CET

Enhancing the Quality of Statistical Business Registers with Scraped Data

This webinar will aim to inspire and equip participants keen to use web-scraped information to enhance the quality of the Statistical Business Registers. The webinar will discuss some of the different approaches to web scraping business information and the automatic prediction of NACE codes via text mining.  To book your place visit our Eventbrite page.

23 February 2023 at 14:00 CET

Methods of Processing and Analysing of Web Scraped Tourism Data

This webinar will discuss the issues of data sources available in tourism statistics. We will present how to search for new data sources and how to analyse them.  We will review and apply methods for merging and combining the web scraped data with other sources, using various programming environments. We will also look at different methods for removing any duplicates found in the merged dataset.  To book your place visit our Eventbrite page.

For additional information, please access the project’s work packagesblogs and trainings pages.