DataHub
DataHub is a versatile open-source metadata platform crafted to enhance data discovery, observability, and governance within various data environments. It empowers organizations to easily find reliable data, providing customized experiences for users while avoiding disruptions through precise lineage tracking at both the cross-platform and column levels. By offering a holistic view of business, operational, and technical contexts, DataHub instills trust in your data repository. The platform features automated data quality assessments along with AI-driven anomaly detection, alerting teams to emerging issues and consolidating incident management. With comprehensive lineage information, documentation, and ownership details, DataHub streamlines the resolution of problems. Furthermore, it automates governance processes by classifying evolving assets, significantly reducing manual effort with GenAI documentation, AI-based classification, and intelligent propagation mechanisms. Additionally, DataHub's flexible architecture accommodates more than 70 native integrations, making it a robust choice for organizations seeking to optimize their data ecosystems. This makes it an invaluable tool for any organization looking to enhance their data management capabilities.
Learn more
dbt
dbt Labs is redefining how data teams work with SQL. Instead of waiting on complex ETL processes, dbt lets data analysts and data engineers build production-ready transformations directly in the warehouse, using code, version control, and CI/CD. This community-driven approach puts power back in the hands of practitioners while maintaining governance and scalability for enterprise use.
With a rapidly growing open-source community and an enterprise-grade cloud platform, dbt is at the heart of the modern data stack. It’s the go-to solution for teams who want faster analytics, higher quality data, and the confidence that comes from transparent, testable transformations.
Learn more
Select Star
In just 15 minutes, you can set up your automated data catalogue and receive column-level lines, Entity Relationship diagrams, and auto-populated documentation in 24 hours. You can easily tag, find, and add documentation to data so everyone can find the right one for them. Select Star automatically detects your column-level data lineage and displays it. Now you can trust the data by knowing where it came. Select Star automatically displays how your company uses data. This allows you to identify relevant data fields without having to ask anyone else. Select Star ensures that your data is protected with AICPA SOC2 Security, Confidentiality and Availability standards.
Learn more
Decube
Decube is a comprehensive data management platform designed to help organizations manage their data observability, data catalog, and data governance needs. Our platform is designed to provide accurate, reliable, and timely data, enabling organizations to make better-informed decisions.
Our data observability tools provide end-to-end visibility into data, making it easier for organizations to track data origin and flow across different systems and departments. With our real-time monitoring capabilities, organizations can detect data incidents quickly and reduce their impact on business operations.
The data catalog component of our platform provides a centralized repository for all data assets, making it easier for organizations to manage and govern data usage and access. With our data classification tools, organizations can identify and manage sensitive data more effectively, ensuring compliance with data privacy regulations and policies.
The data governance component of our platform provides robust access controls, enabling organizations to manage data access and usage effectively. Our tools also allow organizations to generate audit reports, track user activity, and demonstrate compliance with regulatory requirements.
Learn more