Archives AI News

The Polestar 5 is an 884hp fastback sedan that should make Porsche nervous


It’s been five years since Polestar first introduced the Precept, a concept car that the electric automaker described as “a manifesto of things to come; a declaration.” Well, come they have, because today Polestar finally revealed the production car that’s based on this manifesto: the Polestar 5. And wow, these specs: up to 460 miles […]

Google admits the open web is in ‘rapid decline’


For months, Google has maintained that the web is “thriving,” AI isn’t tanking traffic, and its search engine is sending people to a wider variety of websites than ever. But in a court filing from last week, Google admitted that “the open web is already in rapid decline,” as spotted earlier by Jason Kint and […]

Registration now open: Our no-cost, generative AI training and certification program for veterans

Growing up in a Navy family instilled a strong sense of purpose in me. My father’s remarkable 42 years of naval service not only shaped my values, but inspired me to join the Navy myself. Today, as the leader of Google Public Sector, I'm honored to continue that tradition of service in support of our federal government.

Over the past year, Google Public Sector has made significant strides in delivering AI and secure cloud infrastructure to government agencies. We introduced a Google Workspace discount through GSA’s OneGov Strategy in April, and just last month expanded that partnership with the introduction of a “Gemini for Government” offering to provide a comprehensive set of AI tools to federal agencies at less than $0.50 per government agency for a year. In addition, Google Public Sector secured a $200 million contract with the Department of Defense's (DoD) Chief Digital and Artificial Intelligence Office (CDAO) to accelerate the adoption of AI and cloud capabilities. These are just a few examples of Google’s commitment to the federal workforce.

In addition to these efforts, my role at Google Public Sector affords me the opportunity to work closely with a community I know well: our nation's veterans. I've seen firsthand the immense value and leadership skills veterans bring to the table, yet the transition to civilian life can be difficult, with veterans facing challenges ranging from underemployment to difficulties securing meaningful work. At Google Public Sector, we’ve long been committed to changing this narrative by empowering the veteran community with the skills and resources needed for successful career transitions.

Sign up today for Google Launchpad for Veterans

I’m excited to share that registration is officially open for the next cohort of Google Launchpad for Veterans. Introduced in 2024, this three-week, no-cost, virtual training program provides veterans with the foundational skills necessary to jump-start rewarding careers with generative AI. The program is open to US and Canada military veterans and service members.

Last year, we trained over 4,000 veterans to help them transition into high-paying tech roles. This next cohort of learners will gain in-demand skills that enable them to thrive in both functional and technical positions, along with the knowledge to help drive digital transformation through AI within their organizations.

Become a leader in AI digital transformation with this virtual, no-cost program

The Gen AI Leader training, which does not require previous technical experience, kicks off with a two-day virtual event on November 13th and 14th. You’ll enjoy interactive training sessions and a panel discussion with veterans from Google and learn:

Foundational generative AI knowledge: Grasp the core concepts of generative AI, including large language models (LLMs), machine learning paradigms, and various data types.

AI ecosystem navigation: Learn to navigate the broader AI landscape, encompassing infrastructure, models, platforms, agents, and applications.

Practical business applications: Explore real-world uses of generative AI within business, with a focus on powerful Google Cloud tools like Gemini and NotebookLM.

Strategic perspective: Understand how generative AI agents can drive organizational transformation.

After completing the program, you’ll receive a complimentary voucher to take the Gen AI Leader exam. Attendees are encouraged to take the exam between November 21st and December 19th, 2025.
After successful completion, you’ll receive Google’s industry-recognized Gen AI Leader certification, a valuable credential to help you advance your career. If you’d like additional practice before taking the exam, we're offering optional exam preparation sessions on November 17th and 21st. As a bonus, the first 500 individuals to pass the exam will receive a voucher for their very own pair of Google socks!

Register today and get ready to translate your military experience into a powerful career where you can apply the latest in AI, security, and cloud technologies to public sector missions.

Learn more about Google Public Sector

To learn more about how Google Public Sector, our partners, and peers are building the future, join us for our Public Sector Summit, held in Washington, DC on October 29th. You can explore more content at publicsector.google.

BigQuery under the hood: The power of the Column Metadata index aka CMETA


While petabyte-scale data warehouses are becoming more common, getting the performance you need without escalating costs and effort remains a key challenge, even in a modern cloud data warehouse. While many data warehouse platform providers continue to work on these challenges, BigQuery has already moved past petabyte-scale data warehouses to petabyte-scale tables. In fact, some BigQuery users have single tables in excess of 200 petabytes and over 70 trillion rows. At this scale, even metadata is big data that requires an (almost) infinitely scalable design and high performance. We presented the Column Metadata (CMETA) index in a 2021 VLDB paper; as the name implies, it acts like an index for metadata. Compared to existing techniques, CMETA proved superior, meeting both our scalability and performance requirements. Further, BigQuery’s implementation requires no user effort to maintain, and in addition to transparently improving query performance, CMETA may also reduce overall slot usage. In this blog, we take a look at how CMETA works, the impact it can have on your workloads, and how to maximize its benefits. Let’s jump in.

How BigQuery stores data

All data in BigQuery tables is stored as data blocks that are organized in a columnar format. Data blocks also store metadata about all rows within the block. This includes min and max values for each column in the block and any other properties that may be used for query optimization. This metadata allows BigQuery to perform fine-grained dynamic pruning to improve both query performance and resource efficiency. The approach is well known and commonly applied in the data management industry. However, as noted above, BigQuery operates on a vast scale, routinely handling tables that have over a hundred petabytes of data spread across billions of blocks in storage. Metadata for these tables frequently reaches terabyte scale, larger than many organizations' entire data warehouses!

Enter the Column Metadata index

To optimize queries, especially when large tables are involved, BigQuery now leverages CMETA. This system table is automatically created and managed by BigQuery to maintain a snapshot of metadata for all data blocks of user tables that may benefit from the index. It gives BigQuery's planner additional information, allowing it to apply additional fine-grained pruning of data blocks and reducing both resource consumption (slot usage and/or bytes scanned) and query execution time. CMETA relies on a few key techniques.

Index generation

CMETA is automatically generated and refreshed in the background at no additional cost and does not impact user workloads. Creation and updates to the index occur automatically whenever BigQuery determines that a table will benefit from the index, based on its size and/or the volume of change to its data. BigQuery keeps the index up to date with block statistics and column-level attributes with no need for any user action. Using efficient storage and horizontally scalable techniques, BigQuery can maintain these indexes at scale, even for some of our performance-sensitive users with tables over 200 petabytes in size.
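Before looking at how the index serves queries, it can help to see the idea behind block-level metadata pruning in miniature. The sketch below is purely hypothetical: block_metadata is an illustrative table (not an actual BigQuery system view) holding one row of per-column min/max statistics for each storage block of samples.natality. A planner can decide which blocks might satisfy a predicate by scanning only this small table instead of opening every block.

-- Hypothetical illustration only; block_metadata is not a real BigQuery object.
-- Each row describes one storage block: its id plus min/max values per column.
SELECT block_id
FROM my_project.my_dataset.block_metadata
WHERE weight_pounds_max >= 7;
-- Only the surviving block_ids need to be read by the actual query.

CMETA plays the role of this small lookup structure for real tables: it is kept current automatically and consulted by the planner before any data block is opened.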
Query serving

To illustrate how the index serves queries in practice, let’s use the `natality` table from BigQuery's public dataset. Imagine this table's data is stored in three blocks (see Figure 1), committed at times 110, 120, and 130. Our column metadata index, with a snapshot taken at time 125, includes block- and column-level statistics for blocks 1 and 2.

SELECT * FROM samples.natality WHERE weight_pounds >= 7 AND is_male = false

Considering the query above, BigQuery first scans the index to identify relevant blocks. Since the maximum value of `weight_pounds` in block 2 is 6.56 and the query filters on `weight_pounds` >= 7, we know we can safely skip that block without even inspecting it. The original query then runs only against block 1 and any newer block(s) that haven’t been indexed yet, in this case block 3. The results are combined and returned to the user.

With rich column-level attributes in the index, BigQuery can prune efficiently at an early stage of query processing. Without the index, pruning occurs at later stages, when BigQuery opens the data blocks, which involves more computing resources. For large tables, skipping data blocks with this technique significantly benefits selective queries, enabling BigQuery to support much larger tables. Consider the above example but with a table that has billions of blocks, and imagine the time and slot usage savings from pruning unnecessary blocks without even needing to access each block’s header.

BigQuery’s CMETA index is unique in a few ways:

Zero maintenance cost or effort: The CMETA index is a fully automated background operation.

Applicable to all data tables: CMETA works transparently to improve performance whether the table size is measured in gigabytes or petabytes.

Integrated with other Google Cloud services: It works with BigQuery tables and BigLake external tables.

Safe: Queries always return correct results regardless of whether CMETA is available or up to date.

Measuring CMETA’s impact

Early adopters of CMETA have reported up to 60x improvement in query performance and up to 10x reduction in slot usage for some queries. The benefits are particularly pronounced for queries with more selective filters, especially filters on clustering columns, as CMETA minimizes the amount of data processed by query workers.

Maximizing CMETA’s benefits

BigQuery currently manages CMETA automatically at no additional cost to users and allocates resources to create or refresh the index in a round-robin way. If your tables grow or change very rapidly and you have strict performance requirements, you may choose to use your own resource pool (slots) for CMETA maintenance operations to maximize CMETA’s throughput. This will provide the most consistent query performance improvement via CMETA. To do this, simply create a reservation assignment and allocate slots for background jobs, and CMETA maintenance jobs will automatically use it; a sketch of the DDL involved appears at the end of this post. More details are available in the documentation.

More to come

While this first iteration of CMETA is now generally available, we’re already working on future iterations to further improve BigQuery’s autonomous query processing capabilities, without any extra effort or cost on your part. Stay tuned for more to come.
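If you want to try the reservation step described under "Maximizing CMETA’s benefits," here is a minimal sketch of the DDL involved. The project, region, reservation, assignment, and slot-capacity values are placeholders, and you should confirm the exact options, in particular the BACKGROUND job type, against the BigQuery reservations documentation before using them.

-- Sketch only; all names and the slot count are placeholders.
-- Create a reservation, then assign it to background jobs so that
-- maintenance work such as CMETA refreshes can draw slots from it.
CREATE RESERVATION `admin-project.region-us.background_res`
  OPTIONS (slot_capacity = 100);

CREATE ASSIGNMENT `admin-project.region-us.background_res.background_assignment`
  OPTIONS (
    assignee = 'projects/my-data-project',
    job_type = 'BACKGROUND');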

Meta curbed research about VR safety risks to kids, whistleblowers say


A new group of whistleblowers is coming forward to allege that Meta is restricting research into how its virtual reality offerings could negatively impact kids and teens, The Washington Post reports. Four current and former Meta staffers allege that after an earlier whistleblower, Frances Haugen, leaked internal research to Congress, the company called on its […]