Why does open source matter for sign language?
Sign languages should be recognised and supported by technology just as much as spoken languages. But still, one of the biggest barriers to this innovation is the lack of openly available, high quality data sets. Without these data sets, it becomes very difficult for anyone to build reliable translation tools, educational resources, or accessible platforms.
That is where we, Project EDI, step in.
We are creating the world's first large-scale, OSSL (Open Source Sign Language) data set designed for everyone, from the deaf community, to researchers. The data set grows through video submissions of people signing, paired with community driven accuracy ratings to ensure transparency and quality.
Privacy first, always:
We are building this technology for real people. People who deserve respect, choice and protection. That is why privacy has shaped every part of our platform:
- You do not need an account to contribute.
- You get to choose whether your video is public or private.
- You decide how your data is used.
For many sign language users, visibility comes with risks. Whether that is social, professional or personal. We want everyone to feel safe and empowered when participating. Our privacy first design ensures that contributing to accessibility does not require sacrificing your own.
A community powered approach:
OSSL is more than just a data set, it's a movement.
Sign languages are beautifully varied across regions and cultures. No single organisation can capture this on its own. That is what makes us different. We rely on the people who know sign languages best: the communities who use them everyday.
By crowdsourcing both video submissions and accuracy ratings, we ensure that:
- Representation stays authentic.
- Different dialects and signing styles are included.
- Quality is continually improved by the community.
- The data set grows in a fair and unbiased way.
This approach helps us build tools that really reflect the diversity of real world signing, not just a narrow subset.
Building technology that doesn't hold anyone back:
Our goal is ambitious, real time sign language translation and accessible tools powered by AI and body tracking technology. But the truth is, none of this is possible without data - open data.
Right now the world doesn't have enough sign language resources to train accurate, ethical, and inclusive models. Dialects vary chaotically. Some words don't have universal signs.
By building OSSL openly from the start, we are not just creating a dataset, we are creating the foundation for future innovation. We are making sure that progress in sign language technology does not rely on closed systems or restricted access, but on collaboration, shared knowledge and community.
What comes next?
As our dataset grows, so will the possibilities:
- Better translation tools.
- More educational content.
- Greater accessibility across platforms.
- Representations of sign language users in technology.
And ultimately, a world where communication barriers continue to shrink, not because a single organisation decided so, but because thousands of people worked together to make it happen.
Join us in shaping the future:
Project EDI exists because we believe in an open, inclusive future, one where technology reflects the diverse ways we communicate. We are pushing boundaries and championing diversity.
Your contribution, small or big, helps build that future. Whether you want to submit a video, rate accuracy, or follow our journey, we would love to have you on board.
Together we can create a more inclusive world. One sign at a time.

.png)