Preparation of data extraction processes; Analysis of format and content of ad hoc open data sets at different spatial and temporal scales; Collect the data such as though dedicated services/APIs, ftp or generic web repositories; Analysis of the data format that could be CSV, tables, PDF, HTML, XML,