Here, we formalize the decentralized data collaboration problem and discuss example workloads and required abstractions.

To revisit the background of the problem, check Background.

Definition of the problem

✏️ Decentralized data collaborations often involve two or multiple participants cooperatively performing certain tasks together for greater data insights.

Here, we discuss examples of decentralized data collaboration workloads and identify a set of useful, shared and required abstractions.

Example workloads

Many collaborations focus on private data access.

There are also systems designed for more general and complicated collaboration goals.

In addition, there are also decentralized procedures that do not directly focus on gaining data insights but serve as building blocks for security or privacy.

Required abstractions

Going through examples of existing systems mentioned above, we identify the following list of shared abstractions that is often useful: