Francesco Tosoni at UNESCO Paris

UNESCO Paris, 2026 • 10th anniversary of Software Heritage

Photo by Elarionne via Wikimedia Commons CC BY-SA 4.0 Source
Research Contractor

Francesco Tosoni

PhD, Computer Science

|

I make data more efficient, more accessible, and more sustainable.
My work blends lossless compression algorithms with open data infrastructures to create greener computing solutions.

Based at Sant'Anna School of Advanced Studies in Pisa, Italy.

Lossless Compression Open Data Green Algorithms Matrix Computation String Indexing Wikimedia Volunteer

About

I'm a computer science researcher and a Wikimedian passionate about making open data more efficient, accessible, and sustainable. My work sits at the intersection of lossless compression algorithms, open data infrastructures, and green computing.

By day, at Sant'Anna School of Advanced Studies I research compressed data structures, indexing and retrieval techniques for the Software Heritage archive, the "Library of Alexandria" of code. By weekends, I keep contributing to open-data, collaborative projects like Wikidata.

My core research area is lossless compression. I completed my PhD at the University of Pisa under the supervision of Professors P. Ferragina and G. Manzini, focusing on computation-friendly compression: techniques that allow us to operate directly on compressed data without decompression overhead. The challenge is to develop tools that make data processing more energy-efficient, too. Compression is not just about saving space on disk: the real challenge is to adapt compression schemes so that they allow to operate directly in main memory (without compression overhead) and in time proportional to the compressed representation size.

I'm actively involved in:

  • Software Heritage - Making source code archival more efficient and accessible
  • Wikimedia projects - Technical contributions to Wikidata and Meta-Wiki
  • Green algorithms - Developing energy-aware compression techniques

Outreach and Public Engagement

I occasionally contribute to Diff, the official Wikimedia Foundation blog, with articles on open data and free software.

2 June 2026 • Diff (Wikimedia Foundation)

Wikis for Everyone: Bridging the Accessibility Gap at the 2026 Hackathon

Italian wikimedians discussing web accessibility at the Wikimedia Hackathon 2026 Web accessibility is not merely a technical feature. It is a prerequisite for truly free knowledge. During the recen…

Read the article
Photo: Francesco Tosoni (SSSA) / Apache License 2.0
14 April 2026 • Diff (Wikimedia Foundation)

Introducing Mediawiki Code2Code Search: Semantic search to find code by under-the-surface similarity

The Telugu user interface of MediaWiki Code2Code Search Have you ever tried to find a specific function in a MediaWiki extension but only vaguely remembered what it does, not its name? If so, a new…

Read the article

Connect

Profiles

Publications

Fetching data from the University of Freiburg’s Wikidata mirror...

Research Activity

As an algorithmist, I primarily specialised in lossless data compression. Since July 2024, I have been working on optimising the compression and efficient indexing of large code archives in collaboration with the Software Heritage team.

Current Research Focus

  • Compressed formats for matrices and trie structures
  • Sparse matrix formats supporting matrix-vector multiplications (SpMV) in the compressed domain
  • Energy-efficient computation on compressed data

Pronunciation: For those familiar with the IPA, my name is pronounced
[fraÅ‹Ėˆt͔ʃesko toˈzoːni].

Research Topic Distribution

Co-author Network

Collaboration Geography

Contact & Location

Institutional Affiliation

Sant'Anna School of Advanced Studies

Research Contractor

Email

francescošŸ”“tosoni🐌santannapisašŸ”“it

(obfuscated for spam protection)

Location

Sant'Anna School of Advanced Studies

L'EMbeDS room

p.zza Martiri della LibertĆ  33

56127 Pisa PI

Italy

Map

View on OpenStreetMap