ICYMI: CData Foundations 2024 Wrap-Up
CData Foundations 2024 was a resounding success, bringing together hundreds of data-focused professionals for an engaging two-day event! A sincere thank you to everyone who attended our inaugural conference and contributed to the vibrant discussions and activities.
This event created a space for data-focused professionals to learn how current insights in data connectivity can power future development. The two-day event featured industry leaders, tech innovators, and data professionals as they shared their insights, strategies, and forward-thinking ideas to tackle the issues facing today's data-centric enterprises. From thought-provoking sessions and keynote presentations to interactive trivia and great Q&A from attendees, Foundations 2024 delivered insights for every data professional.
If you missed this event—or just want to review what you learned—we've distilled some takeaways from the event's keynotes.
Get all Foundations sessions on-demand
The foundation is trustworthy data
Enterprises rely on advanced analytics and decision-making systems, but the effectiveness of these tools hinges entirely on the quality and accessibility of their data. Whether you're driving AI initiatives, predictive modeling, or even basic reporting, centralized, clean, and governed data is non-negotiable.
This foundation is built on three critical pillars: security, to protect data integrity; flexibility, to ensure interoperability and adaptability; and real-time access, to deliver actionable insights when they matter most. Together, these pillars enable organizations to create a single, trustworthy source of truth.
Several speakers highlighted the importance of high-quality, timely data as the foundation of accurate insights and efficient operations.
Chai Pydimukkala, Product Executive at Google BigQuery, emphasized the direct link between data quality and outcomes. His succinct observation highlighted the need for well-structured data for any use case, not just AI.
VIDEO
"AI is nothing without data. Without the right data to train these AI models, it's garbage in, garbage out." – Chai Pydimukkala, Google
Mike Gualtieri, VP & Principal Analyst at Forrester Research, expanded on this point, recommending combining live and historical data for contextual decision-making. While important for AI, the same principles apply to operational processes and predictive analytics.
Data governance and security: your data, your rules
Maintaining data governance and security in AI adoption are foundational concerns. Speakers highlighted that compliance, and security shouldn’t come at the cost of accessibility—seamless data access should support all data-related tasks, not just AI-specific ones.
Preserving data permissions with virtualization
Amit Sharma, Co-Founder and CEO at CData, pointed out an often-overlooked issue: Copying data from access-controlled sources into centralized warehouses or lakes strips away the permissions tied to the original source. This creates all manner of vulnerabilities and compliance risks. The solution: data virtualization. Semantic layers in data virtualization platforms help ensure that source-level permissions remain intact.
VIDEO
Amit further explains that semantic layers within a virtualization platform bypass that issue, allowing users to access the data they need without recreating permissions.
Centralized transparency and accountability
Dr. Nick Golovin, SVP of Enterprise Data Platform at CData, emphasized how a strong governance framework sets the stage for transparency and accountability in modern data architectures. Centralized auditing capabilities offer clear visibility into data access, modifications, and usage patterns. These features help organizations track who interacts with data, when changes are made, and how systems are being used.
Such transparency is particularly valuable in highly regulated industries, where adherence to data protection laws and standards is non-negotiable. By making audit trails and governance features an integral part of modern data platforms, businesses—especially those in regulated industries—can balance agility with the right compliance measures.
Safeguarding PII and other sensitive data for AI use
Nick emphasized the importance of preserving user permissions when handling sensitive data like personally identifiable information (PII) - especially if that data will feed AI. By working directly with live data through modern platforms, organizations can avoid the risks of manually recreating access controls as data moves between systems. He also highlighted the need for features like column masking and automated anonymization to ensure sensitive information remains protected in workflows like AI or analytics.
Manish Patel, Chief Product Officer at CData, addressed the challenge of complying with privacy regulations such as GDPR, pointing to solutions like regional data residency controls, end-to-end encryption, and centralized key management. These tools enable businesses to safeguard sensitive data while still accessing real-time insights, striking a balance between innovation and security.
"In today’s hybrid architectures, protecting PII requires compliance with regional data residency laws and encryption standards. Our platform enables organizations to define boundaries for data flow, ensuring that sensitive data stays within specified regions. Additionally, we enforce end-to-end encryption with centralized key management, which is critical for safeguarding personal data and adhering to privacy regulations like GDPR." – Dr. Nick Golovin, CData
Flexibility: data across ecosystems
Flexibility is an important aspect as enterprises navigate rapid changes in their respective industries while also controlling costs and driving innovation. Businesses are looking for integration solutions and strategies to provide reliable access to data across ecosystems for seamless connectivity with the tools they already use.
Amit pointed out the challenges when adopting new SaaS platforms or repositories, emphasizing that users often seek to solve multiple use cases simultaneously. For instance, an organization that adopts a platform for human resources may also need to address analytics, reporting, and real-time operations together rather than tackling them separately. A unified approach that supports varied integration patterns allows businesses to adapt to evolving needs and embrace hybrid or multi-cloud strategies without being tied to specific systems.
"Customers don’t think in terms of integration patterns—they want solutions that address multiple use cases at once." – Amit Sharma, CData
Manish highlighted the growing trend toward hybrid and multi-cloud strategies, prioritizing flexibility by supporting multiple vendor ecosystems and architectures. These approaches enable organizations to leverage the strengths of different platforms while avoiding "all-in" reliance on a single vendor. Features like cross-cloud pipelines, interoperability across systems, and regional data residency compliance enable seamless data integration across diverse environments. Companies can scale, optimize costs, and innovate with the freedom to choose the tools and platforms that best fit their needs.
Real-time data, real-world results
Smooth operations and informed decision-making need seamless access to data when and wherever it's needed. Foundations speakers explored how innovations in real-time data connectivity are reshaping how organizations interact with their data. For many businesses, data strategy is focused on reducing latency to ensure access to the freshest information for immediate response and AI-driven analytics and operations.
Simplifying how data moves from source to application is a promising innovation. Chai highlighted tools that streamline data ingestion—whether through real-time streaming or change data capture—making it easier for users of all skill levels to access and use the most recent and relevant data available.
Mike introduced the concept of a "live data intelligence layer," which bridges the gap between historical and real-time data. This approach ensures that decisions—whether for operational processes, analytics, or AI systems—are contextually grounded and relevant to the moment. By combining historical insights with up-to-the-minute data, enterprises can make decisions with precision and context.
The effectiveness of actionable outcomes rests solely on the timeliness of the data it consumes. Old data, which Mike refers to as "perishable," leads to flawed, inaccurate results. Live data, along with solid data governance and iterative refinement, will set your systems up for success.
"AI must think in the same time frame decisions are made—combining historical and live data for better outcomes." – Mike Gualtieri, Forrester Research
This rings true, especially in critical use cases—ranging from fraud detection to customer retention—that depend on the continuous availability and timeliness of the data. Without live data, decisions risk being outdated or even counterproductive.
VIDEO
CData Foundations: the future of connected data
The convergence of data connectivity and advanced analytics presents an unparalleled opportunity for enterprises to transform their decision-making and operational capabilities. Achieving this potential, however, requires a strong foundation of clean, accessible, and well-governed data for all modern data-driven processes.
The overall consensus is that centralized data is vital in fostering seamless operations and driving innovation. By prioritizing live, reliable data and ensuring collaboration between IT and business teams, enterprises can lay the groundwork for long-term success, no matter how rapidly technology evolves.
As Manish pointed out, “The future isn’t just about connecting data—it’s about empowering organizations to unlock their full potential.”
CData Foundations is over for 2024, but we've saved all of the tracks, sessions, and keynotes for you.
Watch on-demand