Giovanni Simonini

MIT CSAIL, Cambridge MA · giovanni _at_ csail.mit.edu

Short Bio

I am a postdoctoral associate at MIT CSAIL - QCRI. I received my PhD in Computer Science from the University of Modena in 2016. My doctoral dissertation won the PhD Thesis Award from the IEEE Computer Society Italy Section. My research interests include data integration and big data management.

News

  • [01 Oct 18] Starting a postdoc at MIT CSAIL, joining the database group.
  • [28 Sept 18] My research project on application-driven data cleaning has been funded by UniMoRe FAR (35,000 €).
  • [26 Jun 18] Paper accepted at IEEE TKDE: "Schema-agnostic Progressive Entity Resolution" (extended version) -- G. Simonini, G. Papadakis, T. Palpanas, S. Bergamaschi
  • [28 Feb 18] Paper accepted at Inf. Sys.: "Computing inter-document similarity with Context Semantic Analysis" -- F. Benedetti, D. Beneventano, S. Bergamaschi, G. Simonini
  • [23 Dec 17] Paper accepted at ICDE: "Schema-agnostic Progressive Entity Resolution" -- G. Simonini, G. Papadakis, T. Palpanas, S. Bergamaschi

Service

  • Program Chair of BDAA 2018 (as part of IEEE HPCS 2018)
  • Technical Session Chair “Big Data Integration and IoT for Smart Health Care”, IEEE RTSI 2017
  • Reviewer for: IEEE TKDE, MEDI 2018, ICDE 2018 (external), BDAA/HPCS 2014/2015/2017, VLDB 2014 (external)

Publications

    [j] Journal -- [c] Conference -- [b] Book Chapter -- [t] Thesis
    • [j5] Schema-agnostic Progressive Entity Resolution. - IEEE TKDE. - G. Simonini, G. Papadakis, T. Palpanas, S. Bergamaschi https://ieeexplore.ieee.org/document/8403302/
    • [j4] Computing inter-document similarity with Context Semantic Analysis. - Information Systems - F. Benedetti, D. Beneventano, S. Bergamaschi, G. Simonini https://doi.org/10.1016/j.is.2018.02.009. [PDF]
    • [c10] Schema-agnostic Progressive Entity Resolution. - IEEE International Conf. on Data Engineering, ICDE 2018. - G. Simonini, G. Papadakis, T. Palpanas, S. Bergamaschi. [PDF]
    • [c9] Enhancing Loosely Schema-aware Entity Resolution with User Interaction. - IEEE International Conf. on High Performance Computing & Simulation, HPCS 2018. - G. Simonini, L. Gagliardelli, S. Zhu, S. Bergamaschi. [PDF]
    • [c8] How improve Set Similarity Join based on prefix approach in distributed environment. - IEEE International Conf. on High Performance Computing & Simulation, HPCS 2018. - S. Zhu, L. Gagliardelli, D. Beneventano, G. Simonini. [PDF]
    • [b2] Enhancing Big Data Exploration with Faceted Browsing. - “Classification, (Big) Data Analysis and Statistical Learning”, Springer. - G. Simonini, S. Zhu, S. Bergamaschi. [PDF]
    • [b1] From Data Integration to Big Data Integration. - “A Comprehensive Guide Through the Italian Database Research 2018”, Springer. - S. Bergamaschi, D. Beneventano, F. Mandreoli, R. Martoglia, F. Guerra, M. Orsini, L. Po, M. Vincini, G. Simonini, S. Zhu, L. Gagliardelli, L. Magnotta. [PDF]
    • [c7] SOPJ: A Scalable Online Provenance Join for Data Integration. - IEEE International Conf. on High Performance Computing & Simulation, HPCS 2017. - S. Zhu, G. Fiameni, G. Simonini, S. Bergamaschi. [PDF]
    • [c6] BigBench workload executed by using Apache Flink - Procedia Manufacturing 11 (2017): 695-702. - S. Bergamaschi, L. Gagliardelli, G. Simonini, S. Zhu [PDF]
    • [t] Loosely Schema-aware Techniques for Big Data Integration - PhD Thesis (2016) - G. Simonini [PDF]
    • [j3] BLAST: a loosely schema-aware meta-blocking approach for entity resolution - PVLDB 9.12 (2016): 1173-1184. - G. Simonini, S. Bergamaschi, H. V. Jagadish [PDF] [CODE]
    • [j2] Providing Insight into Data Source Topics - Journal on Data Semantics (2016): 1-18. - S. Bergamaschi, F. Guerra, D. Ferrari, G. Simonini, Y. Velegrakis [PDF]
    • [c5] Big data exploration with faceted browsing - IEEE International Conf. on High Performance Computing & Simulation, HPCS 2015 - G. Simonini, Z. Song [PDF]
    • [c4] Discovering the topics of a data source: a statistical approach - SWSD Workshop @ISWC 2014 - G. Simonini, F. Guerra, S. Bergamaschi [PDF]
    • [c3] Towards Declarative Imperative Data-parallel Systems - Italian Symposium on Advanced Database Systems, SEBD 2014 - M. Interlandi, G. Simonini, S. Bergamaschi [PDF]
    • [c2] Using big data to support automatic Word Sense Disambiguation - IEEE International Conf. on High Performance Computing & Simulation, HPCS 2014 - G. Simonini, F. Guerra [PDF]
    • [j1] Supporting Image Search with Tag Clouds: a Preliminary Approach - Advances in Multimedia, 2015 - G. Simonini, F. Guerra, M. Vincini [PDF]
    • [c1] Keyword Search over Relational Databases: Issues, Approaches and Open Challenges - Bridging Between Information Retrieval and Databases. Springer, Berlin, Heidelberg, 2014. 54-73. - S. Bergamaschi, F. Guerra, G. Simonini [PDF]

    Awards & Grants

    • 35,000 € research grant from the University of Modena and Reggio Emilia (FAR Junior call), for my research project on application-driven data cleaning. (2018)
    • IEEE Computer Society Italy Section Chapter PhD Thesis Award (2017)
    • Certificate of merit for national and international research from the University of Modena and Reggio Emilia (2017)
    • VLDB 2016 Travel Fellowship (2016)
    • Spinner 2013 Scholarship -- one year grant from "Regione Emilia Romagna" (2013)

    Teaching

    For Students [NO LONGER AT UNIMORE]

    Office Loc.: Building MO27, 1st floor - Via P. Vivarelli 10, Modena (Dipartimento di Ingegneria "Enzo Ferrari")
    Office Hours: Thu 3pm-5pm (please email in advance to schedule)


    Università degli studi di Modena e Reggio Emilia

    Bachelor of Science in Computer Engineering
    Adjunct Professor for the course: "Database Principles" (Basi di Dati e Lab.)
    2017 - 2018

    Università degli studi di Modena e Reggio Emilia

    Bachelor of Science in Management Engineering
    Adjunct Professor for the course: "Database Principles" (Basi di Dati e Lab.)
    2016 - 2017

    Università degli studi di Modena e Reggio Emilia

    Master of Science in Computer Engineering
    Teaching Assistant for the course: "Advanced Database Technologies" (Tecnologia delle Basi di Dati / Data Management and Governance)
    2013 - 2018