Big Data Special Interest Group

Covers deploying and operating big data applications (Spark, Kafka, Hadoop, Flink, Storm, etc) on Kubernetes. We focus on integrations with big data applications and architecting the best ways to run them on Kubernetes.




The Chairs of the SIG run operations and processes governing the SIG.


GitHub Teams

The below teams can be mentioned on issues and PRs in order to get attention from the right people. Note that the links to display team membership will only work if you are a member of the org.

The google groups contain the archive of Github team notifications. Mentioning a team on Github will CC its group. Monitor these for Github activity if you are not a member of the team.

Team Name Details Google Groups Description
@kubernetes/sig-big-data-api-reviews link link API Changes and Reviews
@kubernetes/sig-big-data-bugs link link Bug Triage and Troubleshooting
@kubernetes/sig-big-data-feature-requests link link Feature Requests
@kubernetes/sig-big-data-misc link link General Discussion
@kubernetes/sig-big-data-pr-reviews link link PR Reviews
@kubernetes/sig-big-data-proposals link link Design Proposals
@kubernetes/sig-big-data-test-failures link link Test Failures and Triage


  • Design and architect ways to run big data applications effectively on Kubernetes
  • Discuss ongoing implementation efforts
  • Discuss resource sharing and multi-tenancy (in the context of big data applications)
  • Suggest Kubernetes features where we see a need


  • Endorsing any particular tool/framework