Archives AI News

Conditional Wasserstein Distances with Applications in Bayesian OT Flow Matching

arXiv:2403.18705v3 Announce Type: replace Abstract: In inverse problems, many conditional generative models approximate the posterior measure by minimizing a distance between the joint measure and its learned approximation. While this approach also controls the distance between the posterior measures in the case of the Kullback-Leibler divergence, this does not hold in general for the Wasserstein distance. In this paper, we introduce a conditional Wasserstein distance via a set of restricted couplings that equals the expected Wasserstein distance of the posteriors. Interestingly, the dual formulation of the conditional Wasserstein-1 flow resembles losses in the conditional Wasserstein GAN literature in a quite natural way. We derive theoretical properties of the conditional Wasserstein distance, characterize the corresponding geodesics and velocity fields as well as the flow ODEs. Subsequently, we propose to approximate the velocity fields by relaxing the conditional Wasserstein distance. Based on this, we propose an extension of OT Flow Matching for solving Bayesian inverse problems and demonstrate its numerical advantages on an inverse problem and class-conditional image generation.

Experimental End-to-End Optimization of Directly Modulated Laser-based IM/DD Transmission

arXiv:2508.19910v1 Announce Type: cross Abstract: Directly modulated lasers (DMLs) are an attractive technology for short-reach intensity modulation and direct detection communication systems. However, their complex nonlinear dynamics make the modeling and optimization of DML-based systems challenging. In this paper, we study the end-to-end optimization of DML-based systems based on a data-driven surrogate model trained on experimental data. The end-to-end optimization includes the pulse shaping and equalizer filters, the bias current and the modulation radio-frequency (RF) power applied to the laser. The performance of the end-to-end optimization scheme is tested on the experimental setup and compared to 4 different benchmark schemes based on linear and nonlinear receiver-side equalization. The results show that the proposed end-to-end scheme is able to deliver better performance throughout the studied symbol rates and transmission distances while employing lower modulation RF power, fewer filter taps and utilizing a smaller signal bandwidth.

LLM-based feature generation from text for interpretable machine learning

arXiv:2409.07132v2 Announce Type: replace Abstract: Existing text representations such as embeddings and bag-of-words are not suitable for rule learning due to their high dimensionality and absent or questionable feature-level interpretability. This article explores whether large language models (LLMs) could address this by extracting a small number of interpretable features from text. We demonstrate this process on two datasets (CORD-19 and M17+) containing several thousand scientific articles from multiple disciplines and a target being a proxy for research impact. An evaluation based on testing for statistically significant correlation with research impact showed that Llama 2-generated features are semantically meaningful. We consequently used these generated features in text classification to predict the binary target variable representing the citation rate for the CORD-19 dataset and the ordinal 5-class target representing an expert-awarded grade in the M17+ dataset. Machine-learning models trained on the LLM-generated features provided similar predictive performance to the state-of-the-art embedding model SciBERT for scientific text. The LLM used only 62 features compared to 768 features in SciBERT embeddings, and these features were directly interpretable, corresponding to notions such as article methodological rigor, novelty, or grammatical correctness. As the final step, we extract a small number of well-interpretable action rules. Consistently competitive results obtained with the same LLM feature set across both thematically diverse datasets show that this approach generalizes across domains.

Samsung is Unpacking again in early September

Apple isn’t the only tech company to send out launch event invitations this week. If you’re keeping score at home, September is actually next week somehow, and Samsung is sneaking a virtual Unpacked in on September 4th before Apple hosts its annual iPhone event the following week. But it’s not just a convenient date to […]

FDA approves updated covid vaccines, but with severe new limits

On Wednesday, the FDA approved the new round of COVID-19 vaccines from Pfizer, Moderna, and Novavax for use by seniors over the age of 65. But for anyone younger than that, the FDA approval only mentions people who have “at least one underlying condition that puts them at high risk for severe outcomes from COVID-19.” […]

Microsoft fires two employee protesters who occupied its president’s office

Microsoft has fired two employees who were involved in a sit-in protest in vice chair and president Brad Smith’s office. Software engineers Riki Fameli and Anna Hattle were both dismissed today, after being part of a group of seven protesters that managed to get inside Smith’s office in Building 34 yesterday. Microsoft was forced to temporarily […]

Anthropic’s Claude Opus 4.1 Improves Refactoring and Safety, Scores 74.5% SWE-bench Verified

Anthropic has launched Claude Opus 4.1, an update that strengthens coding reliability in multi-file projects and improves reasoning across long interactions. The model also raised its SWE-bench Verified score to 74.5%, up from 72.5%. Building on Opus 4, the new version strengthens Claude’s ability to act as a coding assistant, particularly in multi-file contexts. By Hien Luu

Apple pulls iPhone torrent app from AltStore PAL in Europe

Apple has removed the iPhone torrenting client, iTorrent, from AltStore PAL’s alternative iOS marketplace in the EU, showing that it can still exert control over apps that aren’t listed on the official App Store. iTorrent developer Daniil Vinogradov told TorrentFreak that Apple has revoked his distribution rights to publish apps in any alternative iOS stores, […]

DJI’s Mic 3 crams more features into a smaller package

DJI is making its latest wireless lavalier microphone system even smaller without scrimping on features or battery life. The DJI Mic 3 is half the size and weight of its Mic 2 predecessor and introduces new capabilities, including two adaptive gain control modes, three voice tone presets, and a sizable increase in storage capacity for […]