BigQuery’s update will make it easier to prepare data for AI with speech-to-text and document processing.

Google is integrating its Gemini 1.0 Pro large language model with its AI and machine learning platform, Vertex AI, to help enterprises unlock new capabilities of large language models (LLMs), including analysis of text, image and video.

The Gemini API, which has been made generally available, can also be used in Google’s data warehouse, BigQuery, to develop generative AI-based analytical applications.

“The Gemini 1.0 Pro model is designed for higher input-output scale and better result quality across a wide range of tasks like text summarization and sentiment analysis. You can now access it using simple SQL statements or BigQuery’s embedded DataFrame API from right inside the BigQuery console,” Gerrit Kazmaier, general manager of data analytics at Google Cloud, said in a statement.

The company is also expected to integrate the vision version of the Gemini Pro model in the coming months.

In addition, Google is extending Vertex AI’s document processing and speech-to-text APIs to BigQuery to help enterprises analyze unstructured data, such as documents and audio.

Earlier this month, the company announced the preview of BigQuery vector search, which when integrated with Vertex AI can enable vector similarity search on data inside BigQuery along with other features such as retrieval augmented generation (RAG), text clustering and summarization.

Hyoun Park, principal analyst at Amalgam Insights, sees RAG support as table stakes for data warehouse vendors these days.

“Retrieval augmented generation is a capability every data warehouse will need to support, as it refers to accessing data from a third party source when someone asks a question,” Park said. “For instance, if someone asks an HR question, the RAG would also ask the employee’s HR system for relevant and current data to contextualize the question.
The relevant capability here is in accessing a real-time update of a specific table or data source when someone asks a question to an LLM.”

Other companies are moving in a similar direction. Steven Dickens, vice president and practice lead at The Futurum Group, said that warehouse stalwarts such as Teradata and Cloudera are also adding vector capabilities alongside players such as Oracle and Elastic.