Courses in Statistics, Computing, Data Analytics and Modeling
During its first cycle (a five-year period), the Department of Excellence started actively promoting the development and coordination of courses related to its core research mission. These included courses on programming, data analytics and AI (which build upon Python courses first offered by Andrea Vandin and Daniele Licari in the Spring of 2020) and courses on statistical methods (which build upon prior courses offered by Francesca Chiaromonte and Chiara Seghieri, stressing applications and the use of large contemporary datasets).
Since a.y. 2020-21, we regularly offered a sequence of coordinated courses entitled "Stats & Computing" - which utilized online or blended modalities during the pandemic, and had consistently large enrollments by undergraduate and graduate students of the Sant’Anna School, as well as by students from other programs in the Pisan academic community. Some of the materials on data analytics was also taught in different formats and venues, e.g., for the PhD Program in Computer Science at GSSI (Gran Sasso Science Institute of L'Aquila) and for the ARTES4 Industry 4.0 Competence Center.
During the second cycle of L'EMbeDS, we are further expanding the sequence of coordinated courses -- which is now called "Computing, Data Analysis & Modeling for the Social Sciences". Materials and details for a.y. 2024-25 can be found here. At present, the sequence includes the following courses:
- ASM: Applied Statistical Modeling (taught by Chiara Seghieri) which, through one module of 20 hours, aims at providing students with methodological and applied background on statistical models for analysing data with different types of response variables. The course has a practice-oriented approach with applications in the context of social sciences and practical examples using R software.
- DMPD: Dynamic Models for Panel Data (taught by Laura Magazzini) which, through one module of 10 hours, aims at providing students advanced econometric tools for the empirical analysis of panel data models in a dynamic framework.
- PDAI: Programming, Data Analytics and AI (taught by Andrea Vandin) which, through two modules of 20 hours each, introduces the students to structured computer programming and various data processing, manipulation, visualization and analysis techniques -- using Python as reference language.
- SLLD: Statistical Learning and Large Data (taught by Francesca Chiaromonte) which, through two modules of 20 hours each, introduces the students to key topics in contemporary Statistical Learning and approaches to the analysis of high dimensional, ultra-high dimensional, and ultra-large datasets -- using R as a reference language.
- ISE: Introduction to Search Engines (taught by Paolo Ferragina) which, through one module of 20 hours, introduces the main methodologies, algorithms, and AI techniques underlying the design of modern search engines and, more generally, Information Retrieval systems.
- CSIR: Case studies of Information Retrieval (taught by Paolo Ferragina) in which, through a 10-hour hands-on course, students work in groups to explore a chosen topic based on their disciplinary background, where Search Engines or Information Retrieval, in general, play a key role.
We expect the number and scope of courses to grow over time, as L'EMbeDS contributes to the design of the educational offer of the Sant'Anna School's II level Diploma in Data Science.