Big data clusters
SQL Server 2019 makes it easier to manage a big data environment. It provides key elements of a data lake—Hadoop Distributed File System (HDFS), Spark, and analytics tools—deeply integrated with SQL Server and fully supported by Microsoft. Easily deploy using Linux containers on a Kubernetes-managed cluster.
In SQL Server 2016, PolyBase enabled you to run a T-SQL query inside SQL Server to pull data from Hadoop and return it in a structured format—all without moving or copying the data. Now, we’re expanding that concept of data virtualization to additional data sources, including Oracle, Teradata, MongoDB, and other SQL Servers.