Unity Catalog
PulseAugur coverage of Unity Catalog — every cluster mentioning Unity Catalog across labs, papers, and developer communities, ranked by signal.
- 2026-05-13 product_launch Databricks announces the general availability of new data governance features in Unity Catalog. 来源
13 天有情绪数据
-
语义层架构:BI原生、平台原生和dbt方法
文章讨论了数据架构中语义层不断发展的格局,强调了三种主要模式而非单一的定义。文章对比了BI原生语义层(逻辑嵌入在Looker或Power BI等工具中)与平台原生语义层(存在于Snowflake或Databricks等数据平台内)。第三种模式,即dbt语义层,也被呈现为一种独特的方法。
-
Databricks 将 OpenTelemetry 跟踪集成到 Unity Catalog 中,用于 AI 代理
Databricks 推出了一项新功能,允许 AI 代理将 OpenTelemetry 跟踪直接写入 Unity Catalog 表。此集成旨在克服传统可观测性工具的局限性,这些工具难以应对 AI 跟踪数据的高流量和高成本。通过将跟踪数据存储在 Databricks Lakehouse 中,用户可以利用 SQL 等常用工具进行分析,应用治理,并将跟踪数据集成到评估和监控工作流中,以持续改进 AI 代理。
-
世界银行集团利用 Databricks 进行减贫数据统一
世界银行集团已实施一个统一的数据和人工智能平台,利用 Databricks 加强其减贫工作。该平台将结构化运营数据与数百万份非结构化文档整合起来,克服了以往的数据孤岛和手动研究瓶颈。通过利用 Unity Catalog 和 Genie 等工具,该组织现在能够进行自然语言查询,加速知识共享,并支持更快、更明智的决策以产生全球影响。
-
Databricks partners launch industry AI solutions on Genie platform
Databricks has launched new industry-specific conversational AI solutions built on its Genie platform. These solutions, developed by Databricks consulting and SI partners, aim to address sector-specific challenges acros…
-
Databricks 融合 Genie 和 TabPFN 以实现预测性 BI
Databricks 推出了一个新架构,集成了 Genie 和 TabPFN,以便在对话式商业智能工具中实现预测分析。该系统允许业务用户用自然语言提出预测性问题,无需数据科学家手动准备数据、选择模型或解释结果。这种融合架构能动态地将用户查询转换为 TabPFN 所需的输入数据,TabPFN 随后快速生成预测,提供统一且受治理的体验。
-
Databricks adds AI cost controls and safety guardrails to Unity AI Gateway
Databricks has introduced new AI governance features within its Unity AI Gateway, focusing on cost controls and safety. The platform now offers proactive budget alerts at various granularities, including user, workspace…
-
Databricks launches analytics engineer learning pathway for SQL pros
Databricks has launched a new learning pathway designed for SQL practitioners to become analytics engineers. This curriculum focuses on transforming raw data into governed, AI-ready semantic models and metric views with…
-
Databricks launches AI sales tool for prescriptive actions
Databricks has developed PipelineIQ, an AI-powered sales intelligence tool designed to move beyond traditional forecasting. Instead of relying on retrospective data, PipelineIQ analyzes messy CRM information to provide …
-
Databricks enables external engines to write to Unity Catalog tables
Databricks has introduced a beta feature allowing external engines like Apache Spark, Flink, and DuckDB to create, read, and write to Unity Catalog managed Delta tables. This expansion builds on the open APIs for Unity …
-
Databricks Unity Catalog 增加了自动数据保护和治理功能
Databricks 宣布其 Unity Catalog 中的新功能现已普遍可用,旨在增强数据保护和治理。这些功能包括用于行过滤和列掩码的基于属性的访问控制 (ABAC)、通过受管标签进行标准化数据分类以及自动数据检测和标记。其目标是为组织整个数据资产中的敏感数据提供可扩展、一致且实时的保护,从而减少手动开销并提高合规性。
-
Databricks MCP 让 AI 代理直接查询 Lakehouse 数据
Databricks 发布了一项名为 MCP 的集成,允许 Claude 和 Cursor 等 AI 代理直接访问和交互存储在 Databricks Lakehouse 中的数据。该工具使 AI 模型能够查询 Delta 表、执行笔记本、管理集群和检查数据沿袭,而不仅仅是访问文档。该集成旨在通过允许对话式命令在 Databricks 环境中触发操作,来简化数据分析、自动化和 MLOps 任务。
-
Databricks Catalog Commits GA unifies lakehouse data coordination
Databricks has announced the general availability of Catalog Commits for Unity Catalog managed tables, a significant platform upgrade. This feature aims to unify the lakehouse by aligning Delta Lake with Iceberg's catal…
-
Databricks 使用 MemAlign 改进 AI 生成的 ML 代码评估
Databricks 开发了 MemAlign,一个与 MLflow 集成的开源对齐框架,用于增强其 Genie Code 工具生成的机器学习代码的评估。初步的人类专家标注显示,LLM 裁判和人类专家之间存在显著差异,在 3 分制评分中平均误差高达 0.68。通过使用大约 50 个标注示例的 MemAlign,Databricks 在最不匹配的维度上成功将错误率降低了 74-89%,证明了该框架在缩小 AI 生成代码质量与专家标准之间…
-
Databricks 推动统一AI平台,应对AI影响的担忧
Databricks 正在强调一种统一的AI可扩展性平台方法,整合了Amazon SageMaker和Unity Catalog等工具,以简化模型训练和部署。与此同时,人们对AI对人类理解和决策能力的影响日益担忧,并呼吁加强具身AI代理的验证。该行业还面临AI驱动裁员的审查,可能对领导层产生财务影响。
-
Databricks Genie offers natural language access to federal data
Federal agencies possess vast amounts of data but struggle with accessibility due to siloed, legacy systems. This data modernization challenge requires technical intermediaries for non-expert staff to access insights, l…
-
Databricks and SAP sync semantic metadata for AI-ready SAP data
Databricks and SAP have partnered to automatically sync semantic metadata and governance tags from SAP Business Data Cloud into Databricks Unity Catalog. This integration aims to make SAP data more understandable and AI…
-
Databricks syncs operational Postgres data to Lakehouse natively
Databricks has introduced Native Lakehouse Sync, a feature that allows operational PostgreSQL data from its Lakebase to be automatically replicated into Unity Catalog managed tables. This eliminates the need for traditi…
-
Databricks integrates Stripe data for AI-powered payment analytics
Databricks has announced a new integration allowing users to access Stripe data directly within their Databricks workspace. This partnership enables AI-native analysis of payment and business data without the need for t…
-
Databricks and Google Cloud enable data interoperability via catalog federation
Databricks and Google Cloud have enhanced interoperability between their data platforms, Unity Catalog and BigQuery, respectively. This new integration allows customers to access the same data from either platform witho…
-
Databricks enables public sector AI for fraud prevention and operational efficiency
Databricks has published a blog post detailing how public sector agencies can leverage AI to combat the growing problem of fraud. The post outlines a new operating model that embeds intelligence into everyday workflows,…