Automatically curated collection of the latest research papers in Speech & Language Technology
📅 Updated on 2026.04.02
This repository provides a daily-updated collection of the latest research papers from arXiv in the following domains:
📖 Usage instructions: here 🌐 Web version: GitHub Pages
💡 This page is inspired by cv-arxiv-daily
📊 647 papers
| 📅 Publish Date | 📝 Title | 👥 Authors | 💻 Code | |
|---|---|---|---|---|
| 2026-04-01 | VisG AV-HuBERT: Viseme-Guided AV-HuBERT | Aristeidis Papadopoulos et.al. | 2604.00982 | link |
| 2026-04-01 | English to Central Kurdish Speech Translation: Corpus Creation, Evaluation, and Orthographic Standardization | Mohammad Mohammadamini et.al. | 2604.00613 | null |
| 2026-04-01 | Speech LLMs are Contextual Reasoning Transcribers | Keqi Deng et.al. | 2604.00610 | null |
| 2026-04-01 | Adapting Text LLMs to Speech via Multimodal Depth Up-Scaling | Kazuki Yano et.al. | 2604.00489 | null |
| 2026-03-31 | FLEURS-Kobani: Extending the FLEURS Dataset for Northern Kurdish | Daban Q. Jaff et.al. | 2603.29892 | null |
| 2026-03-31 | Can LLM Agents Identify Spoken Dialects like a Linguist? | Tobias Bystrich et.al. | 2603.29541 | null |
| 2026-03-31 | LLM Probe: Evaluating LLMs for Low-Resource Languages | Hailay Kidu Teklehaymanot et.al. | 2603.29517 | null |
| 2026-03-31 | Spoken Digit Recognition and Speaker Classification by Nonlinear Interfered Spin Wave-Based Physical Reservoir Computing | Sota Hikasa et.al. | 2603.29311 | null |
| 2026-03-31 | Advancing LLM-based phoneme-to-grapheme for multilingual speech recognition | Lukuang Dong et.al. | 2603.29217 | null |
| 2026-03-30 | EBuddy: a workflow orchestrator for industrial human-machine collaboration | Michele Banfi et.al. | 2603.28579 | null |
| 2026-03-30 | Users and Wizards in Conversations: How WoZ Interface Choices Define Human-Robot Interactions | Ekaterina Torubarova et.al. | 2603.28338 | null |
| 2026-03-30 | Voice-Controlled Scratch for Children with (Motor) Disabilities | Elias Goller et.al. | 2603.28246 | null |
| 2026-03-30 | On the Role of Encoder Depth: Pruning Whisper and LoRA Fine-Tuning in SLAM-ASR | Ganesh Pavan Kartikeya Bharadwaj Kolluri et.al. | 2603.27981 | null |
| 2026-03-29 | Investigation on the Robustness of Acoustic Foundation Models on Post Exercise Speech | Xiangyuan Xue et.al. | 2603.27508 | null |
| 2026-03-28 | Two-Stage Acoustic Adaptation with Gated Cross-Attention Adapters for LLM-Based Multi-Talker Speech Recognition | Hao Shi et.al. | 2603.27205 | null |
| 2026-03-27 | JAL-Turn: Joint Acoustic-Linguistic Modeling for Real-Time and Robust Turn-Taking Detection in Full-Duplex Spoken Dialogue Systems | Guangzhao Yang et.al. | 2603.26515 | null |
| 2026-03-27 | Automatic Speech Recognition for Documenting Endangered Languages: Case Study of Ikema Miyakoan | Chihiro Taguchi et.al. | 2603.26248 | null |
| 2026-03-27 | Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR | Shashi Kumar et.al. | 2603.26246 | null |
| 2026-03-30 | Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models | Kyudan Jung et.al. | 2603.25750 | null |
| 2026-03-26 | Back to Basics: Revisiting ASR in the Age of Voice Agents | Geeyang Tay et.al. | 2603.25727 | null |
| 2026-03-26 | CLAR: CIF-Localized Alignment for Retrieval-Augmented Speech LLM-Based Contextual ASR | Shangkun Huang et.al. | 2603.25460 | null |
| 2026-03-26 | Goodness-of-pronunciation without phoneme time alignment | Jeremy H. M. Wong et.al. | 2603.25150 | null |
| 2026-03-25 | A Sociolinguistic Analysis of Automatic Speech Recognition Bias in Newcastle English | Dana Serditova et.al. | 2603.24549 | null |
| 2026-03-25 | When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools | Xingming Li et.al. | 2603.24389 | null |
| 2026-03-25 | Bridging Biological Hearing and Neuromorphic Computing: End-to-End Time-Domain Audio Signal Processing with Reservoir Computing | Rinku Sebastian et.al. | 2603.24283 | null |
| 2026-03-25 | From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs | Xiaoyong Guo et.al. | 2603.24034 | null |
| 2026-03-24 | Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages | Badr M. Abdullah et.al. | 2603.23654 | null |
| 2026-03-24 | Evaluating a Multi-Agent Voice-Enabled Smart Speaker for Care Homes: A Safety-Focused Framework | Zeinab Dehghani et.al. | 2603.23625 | null |
| 2026-03-05 | Berta: an open-source, modular tool for AI-enabled clinical documentation | Samridhi Vaid et.al. | 2603.23513 | null |
| 2026-03-24 | MSR-HuBERT: Self-supervised Pre-training for Adaptation to Multiple Sampling Rates | Zikang Huang et.al. | 2603.23048 | null |
| 2026-03-24 | Who Spoke What When? Evaluating Spoken Language Models for Conversational ASR with Semantic and Overlap-Aware Metrics | Naohiro Tawara et.al. | 2603.22709 | null |
| 2026-03-23 | Precision-Varying Prediction (PVP): Robustifying ASR systems against adversarial attacks | Matías Pizarro et.al. | 2603.22590 | null |
| 2026-03-23 | SLURP-TN : Resource for Tunisian Dialect Spoken Language Understanding | Haroun Elleuch et.al. | 2603.21940 | null |
| 2026-03-23 | Ara-Best-RQ: Multi Dialectal Arabic SSL | Haroun Elleuch et.al. | 2603.21900 | null |
| 2026-03-23 | Cascade-Free Mandarin Visual Speech Recognition via Semantic-Guided Cross-Representation Alignment | Lei Yang et.al. | 2603.21808 | null |
| 2026-03-30 | RESPOND: Responsive Engagement Strategy for Predictive Orchestration and Dialogue | Meng-Chen Lee et.al. | 2603.21682 | null |
| 2026-03-20 | Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech | Niclas Pokel et.al. | 2603.20112 | null |
| 2026-03-20 | LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families | Jianan Chen et.al. | 2603.20042 | null |
| 2026-03-18 | Impact of automatic speech recognition quality on Alzheimer’s disease detection from spontaneous speech: a reproducible benchmark study with lexical modeling and statistical validation | Himadri Samanta et.al. | 2603.18239 | null |
| 2026-03-27 | LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation | Mohammad Robaitul Islam Bhuiyan et.al. | 2603.17576 | null |
| 2026-03-19 | Zipper-LoRA: Dynamic Parameter Decoupling for Speech-LLM based Multilingual Speech Recognition | Yuxiang Mei et.al. | 2603.17558 | null |
| 2026-03-17 | Over-the-air White-box Attack on the Wav2Vec Speech Recognition Neural Network | Protopopov Alexey et.al. | 2603.16972 | null |
| 2026-03-18 | Omnilingual SONAR: Cross-Lingual and Cross-Modal Sentence Embeddings Bridging Massively Multilingual Text and Speech | Omnilingual SONAR Team et.al. | 2603.16606 | null |
| 2026-03-17 | RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery | Abhishek Kumar et.al. | 2603.16411 | null |
| 2026-03-17 | Fanar 2.0: Arabic Generative AI Stack | FANAR TEAM et.al. | 2603.16397 | null |
| 2026-03-18 | Attention-guided Evidence Grounding for Spoken Question Answering | Ke Yang et.al. | 2603.16292 | null |
| 2026-03-17 | Is Semi-Automatic Transcription Useful in Corpus Creation? Preliminary Considerations on the KIParla Corpus | Martina Simonotti et.al. | 2603.16258 | null |
| 2026-03-17 | Polyglot-Lion: Efficient Multilingual ASR for Singapore via Balanced Fine-Tuning of Qwen3-ASR | Quy-Anh Dang et.al. | 2603.16184 | null |
| 2026-03-16 | Lost in Transcription: Subtitle Errors in Automatic Speech Recognition Reduce Speaker and Content Evaluations | Kowe Kadoma et.al. | 2603.15807 | null |
| 2026-03-16 | Two-Stage Adaptation for Non-Normative Speech Recognition: Revisiting Speaker-Independent Initialization for Personalization | Shan Jiang et.al. | 2603.15261 | null |
| 2026-03-16 | LLMs and Speech: Integration vs. Combination | Robin Schmitt et.al. | 2603.15045 | null |
| 2026-03-16 | SoulX-Duplug: Plug-and-Play Streaming State Prediction Module for Realtime Full-Duplex Speech Conversation | Ruiqi Yan et.al. | 2603.14877 | null |
| 2026-03-16 | Vietnamese Automatic Speech Recognition: A Revisit | Thi Vu et.al. | 2603.14779 | null |
| 2026-03-04 | BrainWhisperer: Leveraging Large-Scale ASR Models for Neural Speech Decoding | Tommaso Boccato et.al. | 2603.13321 | null |
| 2026-03-12 | TASTE-Streaming: Towards Streamable Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling | Liang-Hsuan Tseng et.al. | 2603.12350 | null |
| 2026-03-12 | Dr. SHAP-AV: Decoding Relative Modality Contributions via Shapley Attribution in Audio-Visual Speech Recognition | Umberto Cappellazzo et.al. | 2603.12046 | null |
| 2026-03-11 | Continued Pretraining for Low-Resource Swahili ASR: Achieving State-of-the-Art Performance with Minimal Labeled Data | Hillary Mutisya et.al. | 2603.11378 | null |
| 2026-03-11 | Duration Aware Scheduling for ASR Serving Under Workload Drift | Darshan Makwana et.al. | 2603.11273 | null |
| 2026-03-11 | Self-Speculative Decoding for LLM-based ASR with CTC Encoder Drafts | George Saon et.al. | 2603.11243 | null |
| 2026-03-11 | Huntington Disease Automatic Speech Recognition with Biomarker Supervision | Charles L. Wang et.al. | 2603.11168 | null |
| 2026-03-11 | Uni-ASR: Unified LLM-Based Architecture for Non-Streaming and Streaming Automatic Speech Recognition | Yinfeng Xia et.al. | 2603.11123 | null |
| 2026-03-11 | AlphaFlowTSE: One-Step Generative Target Speaker Extraction via Conditional AlphaFlow | Duojia Li et.al. | 2603.10701 | null |
| 2026-03-11 | Distilling LLM Semantic Priors into Encoder-Only Multi-Talker ASR with Talker-Count Routing | Hao Shi et.al. | 2603.10587 | null |
| 2026-03-11 | G-STAR: End-to-End Global Speaker-Tracking Attributed Recognition | Jing Peng et.al. | 2603.10468 | null |
| 2026-03-11 | FireRedASR2S: A State-of-the-Art Industrial-Grade All-in-One Automatic Speech Recognition System | Kaituo Xu et.al. | 2603.10420 | null |
| 2026-03-10 | SCENEBench: An Audio Understanding Benchmark Grounded in Assistive and Industrial Use Cases | Laya Iyer et.al. | 2603.09853 | null |
| 2026-03-10 | Finetuning a Text-to-Audio Model for Room Impulse Response Generation | Kirak Kim et.al. | 2603.09708 | null |
| 2026-03-12 | Logics-Parsing-Omni Technical Report | Xin An et.al. | 2603.09677 | null |
| 2026-03-10 | Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models | Dehua Tao et.al. | 2603.09627 | null |
| 2026-03-10 | SPAR-K: Scheduled Periodic Alternating Early Exit for Spoken Language Models | Hsiao-Ying Huang et.al. | 2603.09215 | null |
| 2026-03-10 | Trade-offs Between Capacity and Robustness in Neural Audio Codecs for Adversarially Robust Speech Recognition | Jordan Prescott et.al. | 2603.09034 | null |
| 2026-03-09 | NLE: Non-autoregressive LLM-based ASR by Transcript Editing | Avihu Dekel et.al. | 2603.08397 | null |
| 2026-03-09 | Bootstrapping Audiovisual Speech Recognition in Zero-AV-Resource Scenarios with Synthetic Visual Data | Pol Buitrago et.al. | 2603.08249 | null |
| 2026-03-09 | PathBench: Speech Intelligibility Benchmark for Automatic Pathological Speech Assessment | Bence Mark Halpern et.al. | 2603.08097 | null |
| 2026-03-09 | Listening with the Eyes: Benchmarking Egocentric Co-Speech Grounding across Space and Time | Weijie Zhou et.al. | 2603.07966 | null |
| 2026-03-08 | Nwāchā Munā: A Devanagari Speech Corpus and Proximal Transfer Benchmark for Nepal Bhasha ASR | Rishikesh Kumar Sharma et.al. | 2603.07554 | null |
| 2026-03-07 | Seeing the Context: Rich Visual Context-Aware Speech Recognition via Multimodal Reasoning | Wenjie Tian et.al. | 2603.07263 | null |
| 2026-03-06 | Speak in Context: Multilingual ASR with Speech Context Alignment via Contrastive Learning | Yuchen Zhang et.al. | 2603.06505 | null |
| 2026-03-06 | Doctor or Patient? Synergizing Diarization and ASR for Code-Switched Hinglish Medical Conditions Extraction | Séverin Baroudi et.al. | 2603.06373 | null |
| 2026-03-06 | Continual Adaptation for Pacific Indigenous Speech Recognition | Yang Xiao et.al. | 2603.06310 | null |
| 2026-03-06 | Whisper-CD: Accurate Long-Form Speech Recognition using Multi-Negative Contrastive Decoding | Hoseong Ahn et.al. | 2603.06193 | null |
| 2026-03-12 | Which Data Matter? Embedding-Based Data Selection for Speech Recognition | Zakaria Aldeneh et.al. | 2603.05819 | null |
| 2026-03-06 | Activation Steering for Accent Adaptation in Speech Foundation Models | Jinuo Sun et.al. | 2603.05813 | null |
| 2026-03-05 | Exploring the potential and limitations of Model Merging for Multi-Domain Adaptation in ASR | Carlos Carvalho et.al. | 2603.05354 | null |
| 2026-03-05 | PersianPunc: A Large-Scale Dataset and BERT-Based Approach for Persian Punctuation Restoration | Mohammad Javad Ranjbar Kalahroodi et.al. | 2603.05314 | null |
| 2026-03-05 | Beyond Word Error Rate: Auditing the Diversity Tax in Speech Recognition through Dataset Cartography | Ting-Hui Cheng et.al. | 2603.05267 | null |
| 2026-03-05 | Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards | Linghan Fang et.al. | 2603.05231 | null |
| 2026-03-05 | Measuring the Redundancy of Decoder Layers in SpeechLLMs | Adel Moumen et.al. | 2603.05121 | null |
| 2026-03-05 | TW-Sound580K: A Regional Audio-Text Dataset with Verification-Guided Curation for Localized Audio-Language Modeling | Hao-Hui Xie et.al. | 2603.05094 | null |
| 2026-03-05 | Federated Heterogeneous Language Model Optimization for Hybrid Automatic Speech Recognition | Mengze Hong et.al. | 2603.04945 | null |
| 2026-03-05 | Spectral dynamics reservoir computing for high-speed hardware-efficient neuromorphic processing | Jiaxuan Chen et.al. | 2603.04901 | null |
| 2026-02-16 | Generating Realistic, Protocol-Compliant Maritime Radio Dialogues using Self-Instruct and Low-Rank Adaptation | Gürsel Akdeniz et.al. | 2603.04423 | null |
| 2026-03-04 | FlowW2N: Whispered-to-Normal Speech Conversion via Flow-Matching | Fabian Ritter-Gutierrez et.al. | 2603.04296 | null |
| 2026-03-04 | Robust LLM-based Audio-Visual Speech Recognition with Sparse Modality Alignment and Visual Unit-Guided Refinement | Fei Su et.al. | 2603.03811 | null |
| 2026-03-05 | The PARLO Dementia Corpus: A German Multi-Center Resource for Alzheimer’s Disease | Franziska Braun et.al. | 2603.03471 | null |
| 2026-03-07 | ACES: Accent Subspaces for Coupling, Explanations, and Stress-Testing in Automatic Speech Recognition | Swapnil Parekh et.al. | 2603.03359 | null |
| 2026-03-03 | Speech recognition assisted by large language models to command software orally – Application to an augmented and virtual reality web app for immersive molecular graphics | Fabio Cortes Rodriguez et.al. | 2603.02901 | null |
| 2026-03-04 | SilentWear: an Ultra-Low Power Wearable System for EMG-based Silent Speech Recognition | Giusy Spacone et.al. | 2603.02847 | null |
| 2026-03-02 | GLoRIA: Gated Low-Rank Interpretable Adaptation for Dialectal ASR | Pouya Mehralian et.al. | 2603.02464 | null |
| 2026-03-02 | Sequence-Level Unsupervised Training in Speech Recognition: A Theoretical Study | Zijian Yang et.al. | 2603.02285 | null |
| 2026-03-15 | Whisper-RIR-Mega: A Paired Clean-Reverberant Speech Benchmark for ASR Robustness to Room Acoustics | Mandip Goswami et.al. | 2603.02252 | null |
| 2026-02-25 | Quality of Automatic Speech Recognition – Polish Language case study – from Wav2Vec to Scribe ElevenLabs | Marcin Pietroń et.al. | 2603.02246 | null |
| 2026-03-02 | VietSuperSpeech: A Large-Scale Vietnamese Conversational Speech Dataset for ASR Fine-Tuning in Chatbot, Customer Support, and Call Center Applications | Loan Do et.al. | 2603.01894 | null |
| 2026-03-02 | The USTC-NERCSLIP Systems for the CHiME-9 MCoRec Challenge | Ya Jiang et.al. | 2603.01415 | null |
| 2026-03-07 | Using Songs to Improve Kazakh Automatic Speech Recognition | Rustem Yeshpanov et.al. | 2603.00961 | null |
| 2026-03-01 | Towards Orthographically-Informed Evaluation of Speech Recognition Systems for Indian Languages | Kaushal Santosh Bhogale et.al. | 2603.00941 | null |
| 2026-02-28 | Polynomial Mixing for Efficient Self-supervised Speech Encoders | Eva Feillet et.al. | 2603.00683 | null |
| 2026-02-28 | Whisper-MLA: Reducing GPU Memory Consumption of ASR Models based on MHA2MLA Conversion | Sen Zhang et.al. | 2603.00563 | null |
| 2026-02-27 | Chunk-wise Attention Transducers for Fast and Accurate Streaming Speech-to-Text | Hainan Xu et.al. | 2602.24245 | null |
| 2026-02-27 | Dialect and Gender Bias in YouTube’s Spanish Captioning System | Iris Dania Jimenez et.al. | 2602.24002 | null |
| 2026-02-26 | Challenges in Automatic Speech Recognition for Adults with Cognitive Impairment | Michelle Cohn et.al. | 2602.23436 | null |
| 2026-02-16 | Hello-Chat: Towards Realistic Social Audio Interactions | Yueran Hou et.al. | 2602.23387 | null |
| 2026-02-26 | Align-Consistency: Improving Non-autoregressive and Semi-supervised ASR with Consistency Regularization | Wanting Huang et.al. | 2602.23171 | null |
| 2026-02-26 | Efficient Dialect-Aware Modeling and Conditioning for Low-Resource Taiwanese Hakka Speech Processing | An-Ci Peng et.al. | 2602.22522 | null |
| 2026-02-25 | TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition | Cheng-Yeh Yang et.al. | 2602.22039 | null |
| 2026-03-02 | Mitigating Structural Noise in Low-Resource S2TT: An Optimized Cascaded Nepali-English Pipeline with Punctuation Restoration | Tangsang Chongbang et.al. | 2602.21647 | null |
| 2026-02-23 | An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction | Guanting Shen et.al. | 2602.20219 | null |
| 2026-02-23 | Cross-lingual Matryoshka Representation Learning across Speech and Text | Yaya Sy et.al. | 2602.19991 | null |
| 2026-02-22 | Pay Attention to CTC: Fast and Robust Pseudo-Labelling for Unified Speech Recognition | Alexandros Haliassos et.al. | 2602.19316 | null |
| 2026-02-21 | Whisper: Courtside Edition Enhancing ASR Performance Through LLM-Driven Context Generation | Yonathan Ron et.al. | 2602.18966 | null |
| 2026-02-24 | MDM-ASR: Bridging Accuracy and Efficiency in ASR with Diffusion-Based Non-Autoregressive Decoding | Hao Yen et.al. | 2602.18952 | null |
| 2026-02-21 | ReHear: Iterative Pseudo-Label Refinement for Semi-Supervised Speech Recognition via Audio Large Language Models | Zefang Liu et.al. | 2602.18721 | null |
| 2026-02-18 | Fine-Pruning: A Biologically Inspired Algorithm for Personalization of Machine Learning Models | Joseph Bingham et.al. | 2602.18507 | null |
| 2026-03-05 | The Cascade Equivalence Hypothesis: When Do Speech LLMs Behave Like ASR $\rightarrow$ LLM Pipelines? | Jayadev Billa et.al. | 2602.17598 | null |
| 2026-02-17 | Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits | Gilad Nurko et.al. | 2602.15405 | null |
| 2026-02-16 | CLAP-Based Automatic Word Naming Recognition in Post-Stroke Aphasia | Yacouba Kaloga et.al. | 2602.14584 | null |
| 2026-02-15 | From Scarcity to Scale: A Release-Level Analysis of the Pashto Common Voice Dataset | Jandad Jahani et.al. | 2602.14062 | null |
| 2026-02-15 | Eureka-Audio: Triggering Audio Intelligence in Compact Language Models | Dan Zhang et.al. | 2602.13954 | null |
| 2026-02-14 | voice2mode: Phonation Mode Classification in Singing using Self-Supervised Speech Models | Aju Ani Justus et.al. | 2602.13928 | null |
| 2026-02-03 | Multimodal Consistency-Guided Reference-Free Data Selection for ASR Accent Adaptation | Ligong Lei et.al. | 2602.13263 | null |
| 2026-02-13 | Can we trust AI to detect healthy multilingual English speakers among the cognitively impaired cohort in the UK? An investigation using real-world conversational speech | Madhurananda Pahar et.al. | 2602.13047 | null |
| 2026-02-13 | ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark | Tung X. Nguyen et.al. | 2602.12911 | null |
| 2026-02-13 | Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting | Jing Xu et.al. | 2602.12746 | null |
| 2026-02-16 | Towards explainable reference-free speech intelligibility evaluation of people with pathological speech | Bence Mark Halpern et.al. | 2602.12723 | null |
| 2026-02-13 | Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR | Jaeyoung Lee et.al. | 2602.12546 | null |
| 2026-02-12 | Moonshine v2: Ergodic Streaming Encoder ASR for Latency-Critical Speech Applications | Manjunath Kudlur et.al. | 2602.12241 | null |
| 2026-02-12 | On the Sensitivity of Firing Rate-Based Federated Spiking Neural Networks to Differential Privacy | Luiz Pereira et.al. | 2602.12009 | null |
| 2026-02-28 | TC-BiMamba: Trans-Chunk bidirectionally within BiMamba for unified streaming and non-streaming ASR | Qingshun She et.al. | 2602.11546 | null |
| 2026-02-21 | Voxtral Realtime | Alexander H. Liu et.al. | 2602.11298 | null |
| 2026-02-10 | When Less Is More? Diagnosing ASR Predictions in Sardinian via Layer-Wise Decoding | Domenico De Cristofaro et.al. | 2602.10350 | null |
| 2026-02-10 | ViSpeechFormer: A Phonemic Approach for Vietnamese Automatic Speech Recognition | Khoa Anh Nguyen et.al. | 2602.10003 | null |
| 2026-02-10 | Where Are We At with Automatic Speech Recognition for the Bambara Language? | Seydou Diallo et.al. | 2602.09785 | null |
| 2026-02-04 | Beyond the Utterance: An Empirical Study of Very Long Context Speech Recognition | Robert Flynn et.al. | 2602.09044 | null |
| 2026-02-04 | Windowed SummaryMixing: An Efficient Fine-Tuning of Self-Supervised Learning Models for Low-resource Speech Recognition | Aditya Srinivas Menon et.al. | 2602.09043 | null |
| 2026-02-09 | Cross-Modal Bottleneck Fusion For Noise Robust Audio-Visual Speech Recognition | Seaone Ok et.al. | 2602.08293 | null |
| 2026-02-08 | D-ORCA: Dialogue-Centric Optimization for Robust Audio-Visual Captioning | Changli Tang et.al. | 2602.07960 | null |
| 2026-02-06 | Equipping LLM with Directional Multi-Talker Speech Understanding Capabilities | Ju Lin et.al. | 2602.07211 | null |
| 2026-02-05 | From Hallucination to Articulation: Language Model-Driven Losses for Ultra Low-Bitrate Neural Speech Coding | Jayeon Yi et.al. | 2602.06213 | null |
| 2026-02-05 | Enabling Automatic Disordered Speech Recognition: An Impaired Speech Dataset in the Akan Language | Isaac Wiafe et.al. | 2602.05406 | null |
| 2026-02-11 | Evaluating Kubernetes Performance for GenAI Inference: From Automatic Speech Recognition to LLM Summarization | Sai Sindhur Malleni et.al. | 2602.04900 | null |
| 2026-02-04 | Speaker-Aware Simulation Improves Conversational Speech Recognition | Máté Gedeon et.al. | 2602.04776 | null |
| 2026-02-04 | Linguistically Informed Evaluation of Multilingual ASR for African Languages | Fei-Yueh Chen et.al. | 2602.04716 | null |
| 2026-02-04 | Frontend Token Enhancement for Token-Based Speech Recognition | Takanori Ashihara et.al. | 2602.04217 | null |
| 2026-02-03 | Mići Princ – A Little Boy Teaching Speech Technologies the Chakavian Dialect | Nikola Ljubešić et.al. | 2602.03245 | null |
| 2026-02-02 | Mixture-of-Experts with Intermediate CTC Supervision for Accented Speech Recognition | Wonjun Lee et.al. | 2602.01967 | null |
| 2026-02-02 | BBPE16: UTF-16-based byte-level byte-pair encoding for improved multilingual speech recognition | Hyunsik Kim et.al. | 2602.01717 | null |
| 2026-02-01 | Adapting Where It Matters: Depth-Aware Adaptation for Efficient Multilingual Speech Recognition in Low-Resource Languages | Yang Xiao et.al. | 2602.01008 | null |
| 2026-02-01 | MedSpeak: A Knowledge Graph-Aided ASR Error Correction Framework for Spoken Medical QA | Yutong Song et.al. | 2602.00981 | null |
| 2026-01-30 | CALM: Joint Contextual Acoustic-Linguistic Modeling for Personalization of Multi-Speaker ASR | Muhammad Shakeel et.al. | 2601.22792 | null |
| 2026-01-30 | Streaming Speech Recognition with Decoder-Only Large Language Models and Latency Optimization | Genshun Wan et.al. | 2601.22779 | null |
| 2026-01-29 | Towards Robust Dysarthric Speech Recognition: LLM-Agent Post-ASR Correction Beyond WER | Xiuwen Zheng et.al. | 2601.21347 | null |
| 2026-01-30 | Qwen3-ASR Technical Report | Xian Shi et.al. | 2601.21337 | null |
| 2026-01-28 | asr_eval: Algorithms and tools for multi-reference and streaming speech recognition evaluation | Oleg Sedukhin et.al. | 2601.20992 | null |
| 2026-01-30 | Text-only adaptation in LLM-based ASR through text denoising | Sergio Burdisso et.al. | 2601.20900 | null |
| 2026-01-28 | Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection | Sergio Burdisso et.al. | 2601.20898 | null |
| 2026-01-28 | A Study of Data Selection Strategies for Pre-training Self-Supervised Speech Models | Ryan Whetten et.al. | 2601.20896 | null |
| 2026-01-28 | SW-ASR: A Context-Aware Hybrid ASR Pipeline for Robust Single Word Speech Recognition | Manali Sharma et.al. | 2601.20890 | null |
| 2026-01-27 | MA-LipNet: Multi-Dimensional Attention Networks for Robust Lipreading | Matteo Rossi et.al. | 2601.20881 | null |
| 2026-02-04 | SpeechMapper: Speech-to-text Embedding Projector for LLMs | Biswesh Mohapatra et.al. | 2601.20417 | null |
| 2026-01-28 | Mind the Shift: Using Delta SSL Embeddings to Enhance Child ASR | Zilai Wang et.al. | 2601.20142 | null |
| 2026-01-27 | Do we really need Self-Attention for Streaming Automatic Speech Recognition? | Youness Dkhissi et.al. | 2601.19960 | null |
| 2026-01-23 | Benchmarking von ASR-Modellen im deutschen medizinischen Kontext: Eine Leistungsanalyse anhand von Anamnesegesprächen | Thomas Schuster et.al. | 2601.19945 | null |
| 2026-01-08 | FastWhisper: Adaptive Self-knowledge Distillation for Real-time Automatic Speech Recognition | Junseok Lee et.al. | 2601.19919 | null |
| 2026-01-27 | Rethinking Discrete Speech Representation Tokens for Accent Generation | Jinzuomu Zhong et.al. | 2601.19786 | null |
| 2026-01-27 | Phonological Tokenizer: Prosody-Aware Phonetic Token via Multi-Objective Fine-Tuning with Differentiable K-Means | Kentaro Onda et.al. | 2601.19781 | null |
| 2026-01-27 | Advanced Modeling of Interlanguage Speech Intelligibility Benefit with L1-L2 Multi-Task Learning Using Differentiable K-Means for Accent-Robust Discrete Token-Based ASR | Kentaro Onda et.al. | 2601.19767 | null |
| 2026-01-27 | SLM-SS: Speech Language Model for Generative Speech Separation | Tianhua Li et.al. | 2601.19533 | null |
| 2026-01-27 | Dynamic Multi-Expert Projectors with Stabilized Routing for Multilingual Speech Recognition | Isha Pandey et.al. | 2601.19451 | null |
| 2026-02-02 | Language Family Matters: Evaluating LLM-Based ASR Across Linguistic Boundaries | Yuchen Zhang et.al. | 2601.18899 | null |
| 2026-01-29 | Unheard in the Digital Age: Rethinking AI Bias and Speech Diversity | Onyedikachi Hope Amaechi-Okorie et.al. | 2601.18641 | null |
| 2026-01-26 | Pisets: A Robust Speech Recognition System for Lectures and Interviews | Ivan Bondarenko et.al. | 2601.18415 | null |
| 2026-01-26 | Noise-Robust AV-ASR Using Visual Features Both in the Whisper Encoder and Decoder | Zhengyang Li et.al. | 2601.18396 | null |
| 2026-01-26 | OCR-Enhanced Multimodal ASR Can Read While Listening | Junli Chen et.al. | 2601.18393 | null |
| 2026-01-26 | Efficient Rehearsal for Continual Learning in ASR via Singular Value Tuning | Steven Vander Eeckt et.al. | 2601.18266 | null |
| 2026-01-30 | LLM-ForcedAligner: A Non-Autoregressive and Accurate LLM-Based Forced Aligner for Multilingual and Long-Form Speech | Bingshen Mu et.al. | 2601.18220 | null |
| 2026-01-25 | SpatialEmb: Extract and Encode Spatial Information for 1-Stage Multi-channel Multi-speaker ASR on Arbitrary Microphone Arrays | Yiwen Shao et.al. | 2601.18037 | null |
| 2026-01-25 | dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition | Wenjie Tian et.al. | 2601.17902 | null |
| 2026-01-25 | BanglaRobustNet: A Hybrid Denoising-Attention Architecture for Robust Bangla Speech Recognition | Md Sazzadul Islam Ridoy et.al. | 2601.17679 | null |
| 2026-01-24 | Window Size Versus Accuracy Experiments in Voice Activity Detectors | Max McKinnon et.al. | 2601.17270 | null |
| 2026-01-22 | Sink or SWIM: Tackling Real-Time ASR at Scale | Federico Bruzzone et.al. | 2601.17097 | null |
| 2026-01-16 | AI-based System for Transforming text and sound to Educational Videos | M. E. ElAlami et.al. | 2601.17022 | null |
| 2026-01-20 | SoundBreak: A Systematic Study of Audio-Only Adversarial Attacks on Trimodal Models | Aafiya Hussain et.al. | 2601.16231 | null |
| 2026-01-22 | Quantum Dimension Reduction of Hidden Markov Models | Rishi Sundar et.al. | 2601.16126 | null |
| 2026-01-27 | Distillation-based Layer Dropping (DLD): Effective End-to-end Framework for Dynamic Speech Networks | Abdul Hannan et.al. | 2601.16117 | null |
| 2026-01-20 | Lost in Transcription: How Speech-to-Text Errors Derail Code Understanding | Jayant Havare et.al. | 2601.15339 | null |
| 2026-01-22 | Deaf and Hard of Hearing Access to Intelligent Personal Assistants: Comparison of Voice-Based Options with an LLM-Powered Touch Interface | Paige S. DeVries et.al. | 2601.15209 | null |
| 2026-01-21 | Inverse-Hessian Regularization for Continual Learning in ASR | Steven Vander Eeckt et.al. | 2601.14751 | null |
| 2026-01-20 | HoverAI: An Embodied Aerial Agent for Natural Human-Drone Interaction | Yuhua Jin et.al. | 2601.13801 | null |
| 2026-01-20 | LongSpeech: A Scalable Benchmark for Transcription, Translation and Understanding in Long Speech | Fei Yang et.al. | 2601.13539 | null |
| 2026-01-28 | Arab Voices: Mapping Standard and Dialectal Arabic Speech Technology | Peter Sullivan et.al. | 2601.13319 | null |
| 2026-01-19 | Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition | Warit Sirichotedumrong et.al. | 2601.13044 | null |
| 2026-01-18 | SSVD-O: Parameter-Efficient Fine-Tuning with Structured SVD for Speech Recognition | Pu Wang et.al. | 2601.12600 | null |
| 2026-01-18 | Harmonizing the Arabic Audio Space with Data Scheduling | Hunzalah Hassan Bhatti et.al. | 2601.12494 | null |
| 2026-01-18 | CTC-DID: CTC-Based Arabic dialect identification for streaming applications | Muhammad Umar Farooq et.al. | 2601.12199 | null |
| 2025-12-23 | Multi-Level Embedding Conformer Framework for Bengali Automatic Speech Recognition | Md. Nazmus Sakib et.al. | 2601.09710 | null |
| 2026-01-14 | Linear Complexity Self-Supervised Learning for Music Understanding with Random Quantizer | Petros Vavaroutsos et.al. | 2601.09603 | null |
| 2026-01-14 | Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception | Zhen Wan et.al. | 2601.09413 | null |
| 2026-01-14 | SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing | Ziyang Ma et.al. | 2601.09385 | null |
| 2026-01-17 | MCGA: A Multi-task Classical Chinese Literary Genre Audio Corpus | Yexing Du et.al. | 2601.09270 | null |
| 2026-01-15 | DSA-Tokenizer: Disentangled Semantic-Acoustic Tokenization via Flow Matching-based Hierarchical Fusion | Hanlin Zhang et.al. | 2601.09239 | null |
| 2026-01-14 | SITA: Learning Speaker-Invariant and Tone-Aware Speech Representations for Low-Resource Tonal Languages | Tianyi Xu et.al. | 2601.09050 | null |
| 2026-01-13 | Robust CAPTCHA Using Audio Illusions in the Era of Large Language Models: from Evaluation to Advances | Ziqi Ding et.al. | 2601.08516 | null |
| 2026-01-12 | HiVid-Narrator: Hierarchical Video Narrative Generation with Scene-Primed ASR-anchored Compression | Haoxuan Li et.al. | 2601.07366 | null |
| 2026-01-12 | Towards Comprehensive Semantic Speech Embeddings for Chinese Dialects | Kalvin Chang et.al. | 2601.07274 | null |
| 2026-01-11 | Task Arithmetic with Support Languages for Low-Resource ASR | Emma Rafkin et.al. | 2601.07038 | null |
| 2026-01-11 | Categorize Early, Integrate Late: Divergent Processing Strategies in Automatic Speech Recognition | Nathan Roll et.al. | 2601.06972 | null |
| 2026-01-11 | TagSpeech: End-to-End Multi-Speaker ASR and Diarization with Fine-Grained Temporal Grounding | Mingyue Huo et.al. | 2601.06896 | null |
| 2026-01-11 | Variational decomposition autoencoding improves disentanglement of latent representations | Ioannis Ziogas et.al. | 2601.06844 | null |
| 2026-01-10 | QMAVIS: Long Video-Audio Understanding using Fusion of Large Multimodal Models | Zixing Lin et.al. | 2601.06573 | null |
| 2026-01-10 | Representing Sounds as Neural Amplitude Fields: A Benchmark of Coordinate-MLPs and A Fourier Kolmogorov-Arnold Framework | Linfei Li et.al. | 2601.06406 | null |
| 2026-01-09 | An Intelligent AI glasses System with Multi-Agent Architecture for Real-Time Voice Processing and Task Execution | Sheng-Kai Chen et.al. | 2601.06235 | null |
| 2026-01-13 | GenAITEd Ghana: A First-of-Its-Kind Context-Aware and Curriculum-Aligned Conversational AI Agent for Teacher Education | Matthew Nyaaba et.al. | 2601.06093 | null |
| 2025-12-31 | AzeroS: Extending LLM to Speech with Self-Generated Instruction-Free Tuning | Yiwen Shao et.al. | 2601.06086 | null |
| 2026-01-09 | Multimodal In-context Learning for ASR of Low-resource Languages | Zhaolin Li et.al. | 2601.05707 | null |
| 2026-01-08 | WESR: Scaling and Evaluating Word-level Event-Speech Recognition | Chenchen Yang et.al. | 2601.04508 | null |
| 2026-01-07 | Dialect Matters: Cross-Lingual ASR Transfer for Low-Resource Indic Language Varieties | Akriti Dhasmana et.al. | 2601.04373 | null |
| 2026-01-08 | TellWhisper: Tell Whisper Who Speaks When | Yifan Hu et.al. | 2601.03712 | null |
| 2026-01-06 | Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration | Ryan Soh-Eun Shim et.al. | 2601.02906 | null |
| 2026-01-06 | Multi-channel multi-speaker transformer for speech recognition | Guo Yifan et.al. | 2601.02688 | null |
| 2026-01-05 | Dynamic Quantization Error Propagation in Encoder-Decoder ASR Quantization | Xinyu Wang et.al. | 2601.02455 | null |
| 2026-01-14 | MORE: Multi-Objective Adversarial Attacks on Speech Recognition | Xiaoxue Gao et.al. | 2601.01852 | null |
| 2026-01-15 | Bridging the gap: A comparative exploration of Speech-LLM and end-to-end architecture for multilingual conversational ASR | Yuxiang Mei et.al. | 2601.01461 | null |
| 2026-01-03 | IO-RAE: Information-Obfuscation Reversible Adversarial Example for Audio Privacy Protection | Jiajie Zhu et.al. | 2601.01239 | null |
| 2025-12-31 | Index-ASR Technical Report | Zheshu Song et.al. | 2601.00890 | null |
| 2026-01-02 | Three factor delay learning rules for spiking neural networks | Luke Vassallo et.al. | 2601.00668 | null |
| 2026-01-02 | A Language-Agnostic Hierarchical LoRA-MoE Architecture for CTC-based Multilingual ASR | Yuang Zheng et.al. | 2601.00557 | null |
| 2026-01-01 | ROBIN: Incremental Oblique Interleaved ECC for Reliability Improvement in STT-MRAM Caches | Elham Cheshmikhani et.al. | 2601.00456 | null |
| 2026-01-01 | Enhancing Reliability of STT-MRAM Caches by Eliminating Read Disturbance Accumulation | Elham Cheshmikhani et.al. | 2601.00450 | null |
| 2026-01-01 | Unseen Risks of Clinical Speech-to-Text Systems: Transparency, Privacy, and Reliability Challenges in AI-Driven Documentation | Nelly Elsayed et.al. | 2601.00382 | null |
| 2026-01-01 | IKFST: IOO and KOO Algorithms for Accelerated and Precise WFST-based End-to-End Automatic Speech Recognition | Zhuoran Zhuang et.al. | 2601.00160 | null |
| 2025-12-31 | SLM-TTA: A Framework for Test-Time Adaptation of Generative Spoken Language Models | Yuan-Kuei Wu et.al. | 2512.24739 | null |
| 2025-12-29 | PROFASR-BENCH: A Benchmark for Context-Conditioned ASR in High-Stakes Professional Speech | Deepak Babu Piskala et.al. | 2512.23686 | null |
| 2025-12-17 | Marco-ASR: A Principled and Metric-Driven Framework for Fine-Tuning Large-Scale ASR Models for Domain Adaptation | Xuanfan Ni et.al. | 2512.22165 | null |
| 2025-12-14 | EEG-to-Voice Decoding of Spoken and Imagined speech Using Non-Invasive EEG | Hanbeot Park et.al. | 2512.22146 | null |
| 2025-12-26 | Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning | YuXiang Kong et.al. | 2512.21828 | null |
| 2025-12-25 | Broadband tunable microwave photonic radar for simultaneous detection of human respiration, heartbeat, and speech with deep learning-based speech recognition | Lei Gao et.al. | 2512.21566 | null |
| 2025-12-23 | Corpus of Cross-lingual Dialogues with Minutes and Detection of Misunderstandings | Marko Čechovič et.al. | 2512.20204 | null |
| 2025-12-29 | VALLR-Pin: Uncertainty-Factorized Visual Speech Recognition for Mandarin with Pinyin Guidance | Chang Sun et.al. | 2512.20032 | null |
| 2025-12-22 | Kunnafonidilaw ka Cadeau: an ASR dataset of present-day Bambara | Yacouba Diarra et.al. | 2512.19400 | null |
| 2025-12-22 | From Speech to Subtitles: Evaluating ASR Models in Subtitling Italian Television Programs | Alessandro Lucca et.al. | 2512.19161 | null |
| 2025-12-22 | Enhancing Fully Formatted End-to-End Speech Recognition with Knowledge Distillation via Multi-Codebook Vector Quantization | Jian You et.al. | 2512.18967 | null |
| 2025-12-20 | Phoneme-based speech recognition driven by large language models and sampling marginalization | Te Ma et.al. | 2512.18371 | null |
| 2025-12-20 | TICL+: A Case Study On Speech In-Context Learning for Children’s Speech Recognition | Haolong Zheng et.al. | 2512.18263 | null |
| 2025-11-27 | Supplementary Resources and Analysis for Automatic Speech Recognition Systems Trained on the Loquacious Dataset | Nick Rossenbach et.al. | 2512.17915 | null |
| 2025-12-19 | Peeking Into The Future For Contextual Biasing | Ramaneswaran Selvakumar et.al. | 2512.17657 | null |
| 2025-12-19 | Zero-Shot Recognition of Dysarthric Speech Using Commercial Automatic Speech Recognition and Multimodal Large Language Models | Ali Alsayegh et.al. | 2512.17474 | null |
| 2025-12-19 | Incorporating Error Level Noise Embedding for Improving LLM-Assisted Robustness in Persian Speech Recognition | Zahra Rahmani et.al. | 2512.17247 | null |
| 2026-01-14 | Navigating the Reality Gap: Privacy-Preserving On-Device Continual Adaptation of ASR for Clinical Telephony | Darshil Chauhan et.al. | 2512.16401 | null |
| 2025-12-16 | ComMark: Covert and Robust Black-Box Model Watermarking with Compressed Samples | Yunfei Yang et.al. | 2512.15641 | null |
| 2025-12-16 | Scalable Frameworks for Real-World Audio-Visual Speech Recognition | Sungnyun Kim et.al. | 2512.14083 | null |
| 2025-12-18 | Adaptive Edge-Cloud Inference for Speech-to-Action Systems Using ASR and Large Language Models | Mohammad Jalili Torkamani et.al. | 2512.12769 | null |
| 2025-12-13 | System X: A Mobile Voice-Based AI System for EMR Generation and Clinical Decision Support in Low-Resource Maternal Healthcare | Maryam Mustafa et.al. | 2512.12240 | null |
| 2025-12-12 | All-in-One ASR: Unifying Encoder-Decoder Models of CTC, Attention, and Transducer in Dual-Mode ASR | Takafumi Moriya et.al. | 2512.11543 | null |
| 2025-12-12 | The Affective Bridge: Unifying Feature Representations for Speech Deepfake Detection | Yupei Li et.al. | 2512.11241 | null |
| 2025-11-30 | Benchmarking Automatic Speech Recognition Models for African Languages | Alvin Nahabwe et.al. | 2512.10968 | null |
| 2025-11-30 | ASR Under the Stethoscope: Evaluating Biases in Clinical Speech Recognition across Indian Languages | Subham Kumar et.al. | 2512.10967 | null |
| 2025-12-11 | TRIDENT: A Redundant Architecture for Caribbean-Accented Emergency Speech Triage | Elroy Galbraith et.al. | 2512.10741 | null |
| 2025-12-10 | Robust Speech Activity Detection in the Presence of Singing Voice | Philipp Grundhuber et.al. | 2512.09713 | null |
| 2025-12-02 | Enhancing Automatic Speech Recognition Through Integrated Noise Detection Architecture | Karamvir Singh et.al. | 2512.08973 | null |
| 2025-12-08 | A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification | Nicolas Calbucura et.al. | 2512.07571 | null |
| 2025-12-08 | Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data | Srihari Bandarupalli et.al. | 2512.07277 | null |
| 2025-12-05 | Morphologically-Informed Tokenizers for Languages with Non-Concatenative Morphology: A case study of Yoloxóchtil Mixtec ASR | Chris Crawford et.al. | 2512.06169 | null |
| 2025-12-01 | KidSpeak: A General Multi-purpose LLM for Kids’ Speech Recognition and Screening | Rohan Sharma et.al. | 2512.05994 | null |
| 2025-12-02 | Comparing Unsupervised and Supervised Semantic Speech Tokens: A Case Study of Child ASR | Mohan Shi et.al. | 2512.03301 | null |
| 2025-12-02 | Bangla Hate Speech Classification with Fine-tuned Transformer Models | Yalda Keivan Jafari et.al. | 2512.02845 | null |
| 2025-12-02 | Spoken Conversational Agents with Large Language Models | Chao-Han Huck Yang et.al. | 2512.02593 | null |
| 2025-12-01 | See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models | Le Thien Phuc Nguyen et.al. | 2512.02231 | null |
| 2025-12-01 | Swivuriso: The South African Next Voices Multilingual Speech Dataset | Vukosi Marivatee et.al. | 2512.02201 | null |
| 2025-11-18 | On the Difficulty of Token-Level Modeling of Dysfluency and Fluency Shaping Artifacts | Kashaf Gulzar et.al. | 2512.02027 | null |
| 2025-12-01 | MAC-SLU: Multi-Intent Automotive Cabin Spoken Language Understanding Benchmark | Yuezhang Peng et.al. | 2512.01603 | null |
| 2025-12-01 | ZO-ASR: Zeroth-Order Fine-Tuning of Speech Foundation Models without Back-Propagation | Yuezhang Peng et.al. | 2512.01267 | null |
| 2025-12-11 | A Low-Complexity Speech Codec Using Parametric Dithering for ASR | Ellison Murray et.al. | 2512.00511 | null |
| 2025-11-28 | OmniFusion: Simultaneous Multilingual Multimodal Translations via Modular Fusion | Sai Koneru et.al. | 2512.00234 | null |
| 2025-11-28 | Scaling HuBERT for African Languages: From Base to Large and XL | Antoine Caubrière et.al. | 2511.23370 | null |
| 2025-11-28 | Group-Aware Partial Model Merging for Children’s Automatic Speech Recognition | Thomas Rolland et.al. | 2511.23098 | null |
| 2025-11-27 | Modeling Romanized Hindi and Bengali: Dataset Creation and Multilingual LLM Integration | Kanchon Gharami et.al. | 2511.22769 | null |
| 2025-11-27 | 3RSeT: Read Disturbance Rate Reduction in STT-MRAM Caches by Selective Tag Comparison | Elham Cheshmikhani et.al. | 2511.22551 | null |
| 2025-11-27 | Do You See What I Say? Generalizable Deepfake Detection based on Visual Speech Recognition | Maheswar Bora et.al. | 2511.22443 | null |
| 2025-11-16 | On the Cross-lingual Transferability of Pre-trained wav2vec2-based Models | Jonatas Grosman et.al. | 2511.21704 | null |
| 2025-11-26 | ASR Error Correction in Low-Resource Burmese with Alignment-Enhanced Transformers using Phonetic Features | Ye Bhone Lin et.al. | 2511.21088 | null |
| 2025-11-26 | RosettaSpeech: Zero-Shot Speech-to-Speech Translation from Monolingual Data | Zhisheng Zheng et.al. | 2511.20974 | null |
| 2025-11-26 | Towards Audio Token Compression in Large Audio Language Models | Saurabhchand Bhati et.al. | 2511.20973 | null |
| 2025-11-25 | Bridging the Language Gap: Synthetic Voice Diversity via Latent Mixup for Equitable Speech Recognition | Wesley Bian et.al. | 2511.20534 | null |
| 2025-11-25 | Mispronunciation Detection and Diagnosis Without Model Training: A Retrieval-Based Approach | Huu Tuong Tu et.al. | 2511.20107 | null |
| 2025-11-24 | Neural Architecture Search for Quantum Autoencoders | Hibah Agha et.al. | 2511.19246 | null |
| 2025-11-24 | AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization | Christos Koutlis et.al. | 2511.18993 | null |
| 2025-11-24 | Context-Aware Whisper for Arabic ASR Under Linguistic Varieties | Bashar Talafha et.al. | 2511.18774 | null |
| 2025-11-21 | Point of Order: Action-Aware LLM Persona Modeling for Realistic Civic Simulation | Scott Merrill et.al. | 2511.17813 | null |
| 2025-11-21 | Enhancing Quranic Learning: A Multimodal Deep Learning Approach for Arabic Phoneme Recognition | Ayhan Kucukmanisa et.al. | 2511.17477 | null |
| 2025-11-21 | WER is Unaware: Assessing How ASR Errors Distort Clinical Understanding in Patient Facing Dialogue | Zachary Ellis et.al. | 2511.16544 | null |
| 2025-12-03 | NLP Datasets for Idiom and Figurative Language Tasks | Blake Matheny et.al. | 2511.16345 | null |
| 2025-11-19 | Scriboora: Rethinking Human Pose Forecasting | Daniel Bermuth et.al. | 2511.15565 | null |
| 2025-11-19 | Building Robust and Scalable Multilingual ASR for Indian Languages | Arjun Gangwar et.al. | 2511.15418 | null |
| 2025-11-18 | Ground Truth Generation for Multilingual Historical NLP using LLMs | Clovis Gladstone et.al. | 2511.14688 | null |
| 2025-11-18 | TTA: Transcribe, Translate and Alignment for Cross-lingual Speech Representation | Wei Liu et.al. | 2511.14410 | null |
| 2025-11-18 | AfriSpeech-MultiBench: A Verticalized Multidomain Multicountry Benchmark Suite for African Accented English ASR | Gabrial Zencha Ashungafac et.al. | 2511.14255 | null |
| 2025-11-18 | Listen Like a Teacher: Mitigating Whisper Hallucinations using Adaptive Layer Attention and Knowledge Distillation | Kumud Tripathi et.al. | 2511.14219 | null |
| 2025-11-17 | Human-centric Maintenance Process Through Integration of AI, Speech, and AR | Parul Khanna et.al. | 2511.13918 | null |
| 2025-11-17 | Spatial Blind Spot: Auditory Motion Perception Deficits in Audio LLMs | Zhe Sun et.al. | 2511.13273 | null |
| 2025-11-17 | Distinguishing Repetition Disfluency from Morphological Reduplication in Bangla ASR Transcripts: A Novel Corpus and Benchmarking Analysis | Zaara Zabeen Arpa et.al. | 2511.13159 | null |
| 2025-11-15 | How Far Do SSL Speech Models Listen for Tone? Temporal Focus of Tone Representation under Low-resource Transfer | Minu Kim et.al. | 2511.12285 | null |
| 2025-11-15 | Fusionista2.0: Efficiency Retrieval System for Large-Scale Datasets | Huy M. Le et.al. | 2511.12255 | null |
| 2025-11-12 | Tighter Truncated Rectangular Prism Approximation for RNN Robustness Verification | Xingqi Lin et.al. | 2511.11699 | null |
| 2025-11-14 | Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition | Yiming Rong et.al. | 2511.11139 | null |
| 2025-11-13 | TEDxTN: A Three-way Speech Translation Corpus for Code-Switched Tunisian Arabic - English | Fethi Bougares et.al. | 2511.10780 | null |
| 2025-11-09 | Towards Fine-Grained Code-Switch Speech Translation with Semantic Space Alignment | Yan Gao et.al. | 2511.10670 | null |
| 2025-11-13 | ELYADATA & LIA at NADI 2025: ASR and ADI Subtasks | Haroun Elleuch et.al. | 2511.10090 | null |
| 2025-11-12 | Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages | Omnilingual ASR team et.al. | 2511.09690 | null |
| 2025-11-12 | End-to-end Contrastive Language-Speech Pretraining Model For Long-form Spoken Question Answering | Jiliang Hu et.al. | 2511.09282 | null |
| 2025-11-12 | Context-Aware Dynamic Chunking for Streaming Tibetan Speech Recognition | Chao Wang et.al. | 2511.09085 | null |
| 2025-11-12 | Towards Effective and Efficient Non-autoregressive decoders for Conformer and LLM-based ASR using Block-based Attention Mask | Tianzi Wang et.al. | 2511.09084 | null |
| 2025-11-11 | Unifying Model and Layer Fusion for Speech Foundation Models | Yi-Jen Shih et.al. | 2511.08389 | null |
| 2025-11-11 | Quantizing Whisper-small: How design choices affect ASR performance | Arthur Söhler et.al. | 2511.08093 | null |
| 2025-11-11 | Pruning as Regularization: Sensitivity-Aware One-Shot Pruning in ASR | Julian Irigoyen et.al. | 2511.08092 | null |
| 2025-11-13 | SpikCommander: A High-performance Spiking Transformer with Multi-view Learning for Efficient Speech Command Recognition | Jiaqi Wang et.al. | 2511.07883 | null |
| 2025-11-11 | Surgical Agent Orchestration Platform for Voice-directed Patient Data Interaction | Hyeryun Park et.al. | 2511.07392 | null |
| 2025-11-10 | Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models | Umberto Cappellazzo et.al. | 2511.07253 | null |
| 2025-11-24 | Privacy on the Fly: A Predictive Adversarial Transformation Network for Mobile Sensor Data | Tianle Song et.al. | 2511.07242 | null |
| 2025-11-10 | Improving Remote Patient Monitoring Systems Using a Fog-based IoT Platform with Speech Recognition | Marc Jayson Baucas et.al. | 2511.07189 | null |
| 2025-11-10 | CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition | Hung-Yang Sung et.al. | 2511.06860 | null |
| 2025-11-10 | MedVoiceBias: A Controlled Study of Audio LLM Behavior in Clinical Decision-Making | Zhi Rui Tam et.al. | 2511.06592 | null |
| 2025-11-09 | We Can Hear You with mmWave Radar! An End-to-End Eavesdropping System | Dachao Han et.al. | 2511.06205 | null |
| 2025-11-06 | CantoASR: Prosody-Aware ASR-LALM Collaboration for Low-Resource Cantonese | Dazhong Chen et.al. | 2511.04139 | null |
| 2025-11-06 | WST: Weakly Supervised Transducer for Automatic Speech Recognition | Dongji Gao et.al. | 2511.04035 | null |
| 2025-11-06 | Accelerating scientific discovery with the common task framework | J. Nathan Kutz et.al. | 2511.04001 | null |
| 2025-11-05 | Open Source State-Of-the-Art Solution for Romanian Speech Recognition | Gabriel Pirlogeanu et.al. | 2511.03361 | null |
| 2025-11-05 | TASU: Text-Only Alignment for Speech Understanding | Jing Peng et.al. | 2511.03310 | null |
| 2025-11-11 | How to Evaluate Speech Translation with Source-Aware Neural MT Metrics | Mauro Cettolo et.al. | 2511.03295 | null |
| 2025-11-04 | Energy-Efficient Hardware Acceleration of Whisper ASR on a CGLA | Takuto Ando et.al. | 2511.02269 | null |
| 2025-10-30 | Overview of the MEDIQA-OE 2025 Shared Task on Medical Order Extraction from Doctor-Patient Consultations | Jean-Philippe Corbeil et.al. | 2510.26974 | null |
| 2025-10-29 | Multi-Representation Attention Framework for Underwater Bioacoustic Denoising and Recognition | Amine Razig et.al. | 2510.26838 | null |
| 2025-10-28 | See the Speaker: Crafting High-Resolution Talking Faces from Speech with Prior Guidance and Region Refinement | Jinting Wang et.al. | 2510.26819 | null |
| 2025-10-30 | HMM for short independent sequences: Multiple sequence Baum-Welch application | Margarita Cabrera-Bean et.al. | 2510.26532 | null |
| 2025-10-29 | Learning Disentangled Speech- and Expression-Driven Blendshapes for 3D Talking Face Animation | Yuxiang Mao et.al. | 2510.25234 | null |
| 2025-10-29 | Explainable Disentanglement on Discrete Speech Representations for Noise-Robust ASR | Shreyas Gopal et.al. | 2510.25150 | null |
| 2025-10-28 | POWSM: A Phonetic Open Whisper-Style Speech Foundation Model | Chin-Jou Li et.al. | 2510.24992 | null |
| 2025-10-28 | Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation | Inclusion AI et.al. | 2510.24821 | null |
| 2025-10-28 | BEST-RQ-Based Self-Supervised Learning for Whisper Domain Adaptation | Raphaël Bagat et.al. | 2510.24570 | null |
| 2025-10-30 | Audio Signal Processing Using Time Domain Mel-Frequency Wavelet Coefficient | Rinku Sebastian et.al. | 2510.24519 | null |
| 2025-10-28 | V-SAT: Video Subtitle Annotation Tool | Arpita Kundu et.al. | 2510.24180 | null |
| 2025-10-28 | RegSpeech12: A Regional Corpus of Bengali Spontaneous Speech Across Dialects | Md. Rezuwan Hassan et.al. | 2510.24096 | null |
| 2025-10-27 | A Neural Model for Contextual Biasing Score Learning and Filtering | Wanting Huang et.al. | 2510.23849 | null |
| 2025-11-01 | RoboOmni: Proactive Robot Manipulation in Omni-modal Context | Siyin Wang et.al. | 2510.23763 | null |
| 2025-10-27 | Arabic Little STT: Arabic Children Speech Recognition Dataset | Mouhand Alkadri et.al. | 2510.23319 | null |
| 2025-10-27 | A Cocktail-Party Benchmark: Multi-Modal dataset and Comparative Evaluation Results | Thai-Binh Nguyen et.al. | 2510.23276 | null |
| 2025-10-29 | Are ASR foundation models generalized enough to capture features of regional dialects for low-resource languages? | Tawsif Tashwar Dipto et.al. | 2510.23252 | null |
| 2025-10-27 | Adapting Speech Foundation Models with Large Language Models for Unified Speech Recognition | Jing-Xuan Zhang et.al. | 2510.22961 | null |
| 2025-10-26 | EchoMind: An Interrelated Multi-level Benchmark for Evaluating Empathetic Speech Language Models | Li Zhou et.al. | 2510.22758 | null |
| 2025-10-26 | LRW-Persian: Lip-reading in the Wild Dataset for Persian Language | Zahra Taghizadeh et.al. | 2510.22716 | null |
| 2025-11-02 | Mitigating Attention Sinks and Massive Activations in Audio-Visual Speech Recognition with LLMs | Anand et.al. | 2510.22603 | null |
| 2025-10-26 | A Sociophonetic Analysis of Racial Bias in Commercial ASR Systems Using the Pacific Northwest English Corpus | Michael Scott et.al. | 2510.22495 | null |
| 2025-10-26 | The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR | Siyu Liang et.al. | 2510.22492 | null |
| 2025-10-26 | The Tonogenesis Continuum in Tibetan: A Computational Investigation | Siyu Liang et.al. | 2510.22485 | null |
| 2025-10-25 | Bridging the Perceptual-Statistical Gap in Dysarthria Assessment: Why Machine Learning Still Falls Short | Krishna Gurugubelli et.al. | 2510.22237 | null |
| 2025-10-25 | M-CIF: Multi-Scale Alignment For CIF-Based Non-Autoregressive ASR | Ruixiang Mao et.al. | 2510.22172 | null |
| 2025-10-23 | LSF-Animation: Label-Free Speech-Driven Facial Animation via Implicit Feature Representation | Xin Lu et.al. | 2510.21864 | null |
| 2025-10-24 | SindBERT, the Sailor: Charting the Seas of Turkish NLP | Raphael Scheible-Schmitt et.al. | 2510.21364 | null |
| 2025-10-27 | ReFESS-QI: Reference-Free Evaluation For Speech Separation With Joint Quality And Intelligibility Scoring | Ari Frummer et.al. | 2510.21014 | null |
| 2025-10-21 | Can large audio language models understand child stuttering speech? speech summarization, and source separation | Chibuzor Okocha et.al. | 2510.20850 | null |
| 2025-10-23 | Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding | Xin Zhang et.al. | 2510.20504 | null |
| 2025-10-22 | Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition | Yuu Jinnai et.al. | 2510.19471 | null |
| 2025-10-23 | FLASH Viterbi: Fast and Adaptive Viterbi Decoding for Modern Data Systems | Ziheng Deng et.al. | 2510.19301 | null |
| 2025-10-22 | Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges | Cheng Huang et.al. | 2510.19144 | null |
| 2025-10-28 | RIR-Mega: a large-scale simulated room impulse response dataset for machine learning and room acoustics modeling | Mandip Goswami et.al. | 2510.18917 | null |
| 2025-10-23 | MLMA: Towards Multilingual ASR With Mamba-based Architectures | Mohamed Nabih Ali et.al. | 2510.18684 | null |
| 2025-10-21 | Towards Fair ASR For Second Language Speakers Using Fairness Prompted Finetuning | Monorama Swain et.al. | 2510.18374 | null |
| 2025-10-19 | Zero- and One-Shot Data Augmentation for Sentence-Level Dysarthric Speech Recognition in Constrained Scenarios | Shiyao Wang et.al. | 2510.16700 | null |
| 2025-10-18 | Hallucination Benchmark for Speech Foundation Models | Alkis Koudounas et.al. | 2510.16567 | null |
| 2025-10-18 | Probing the Hidden Talent of ASR Foundation Models for L2 English Oral Assessment | Fu-An Chao et.al. | 2510.16387 | null |
| 2025-10-17 | SpeechLLMs for Large-scale Contextualized Zero-shot Slot Filling | Kadri Hacioglu et.al. | 2510.15851 | null |
| 2025-10-17 | SpikeVox: Towards Energy-Efficient Speech Therapy Framework with Spike-driven Generative Language Models | Rachmad Vidya Wicaksana Putra et.al. | 2510.15566 | null |
| 2025-10-15 | Do Slides Help? Multi-modal Context for Automatic Transcription of Conference Talks | Supriti Sinhamahapatra et.al. | 2510.13979 | null |
| 2025-10-15 | Personal Attribute Leakage in Federated Speech Models | Hamdan Al-Ali et.al. | 2510.13357 | null |
| 2025-10-15 | Two Heads Are Better Than One: Audio-Visual Speech Error Correction with Dual Hypotheses | Sungnyun Kim et.al. | 2510.13281 | null |
| 2025-10-15 | STT-GS: Sample-Then-Transmit Edge Gaussian Splatting with Joint Client Selection and Power Control | Zhen Li et.al. | 2510.13186 | null |
| 2025-10-14 | A Critical Review of the Need for Knowledge-Centric Evaluation of Quranic Recitation | Mohammed Hilal Al-Kharusi et.al. | 2510.12858 | null |
| 2025-10-14 | Adaptive vector steering: A training-free, layer-wise intervention for hallucination mitigation in large audio and multimodal models | Tsung-En Lin et.al. | 2510.12851 | null |
| 2025-10-11 | Automatic Speech Recognition in the Modern Era: Architectures, Training, and Evaluation | Md. Nayeem et.al. | 2510.12827 | null |
| 2025-10-14 | Cost Analysis of Human-corrected Transcription for Predominately Oral Languages | Yacouba Diarra et.al. | 2510.12781 | null |
| 2025-10-14 | Structured Sparsity and Weight-adaptive Pruning for Memory and Compute efficient Whisper models | Prasenjit K Mudi et.al. | 2510.12666 | null |
| 2025-10-12 | Dual Data Scaling for Robust Two-Stage User-Defined Keyword Spotting | Zhiqi Ai et.al. | 2510.10740 | null |
| 2025-10-12 | Proficiency-Aware Adaptation and Data Augmentation for Robust L2 ASR | Ling Sun et.al. | 2510.10738 | null |
| 2025-10-12 | End-to-end Speech Recognition with similar length speech and text | Peng Fan et.al. | 2510.10453 | null |
| 2025-10-12 | Knowledge-Decoupled Functionally Invariant Path with Synthetic Personal Data for Personalized ASR | Yue Gu et.al. | 2510.10401 | null |
| 2025-10-11 | End-to-end Automatic Speech Recognition and Speech Translation: Integration of Speech Foundational Models and LLMs | Nam Luu et.al. | 2510.10329 | null |
| 2025-10-11 | SyncLipMAE: Contrastive Masked Pretraining for Audio-Visual Talking-Face Representation | Zeyu Ling et.al. | 2510.10069 | null |
| 2025-10-10 | Accent-Invariant Automatic Speech Recognition via Saliency-Driven Spectrogram Masking | Mohammad Hossein Sameti et.al. | 2510.09528 | null |
| 2025-10-10 | WildElder: A Chinese Elderly Speech Dataset from the Wild with Fine-Grained Manual Annotations | Hui Wang et.al. | 2510.09344 | null |
| 2025-10-10 | SynthVC: Leveraging Synthetic Data for End-to-End Low Latency Streaming Voice Conversion | Zhao Guo et.al. | 2510.09245 | null |
| 2025-10-10 | Effects of automotive microphone frequency response characteristics and noise conditions on speech and ASR quality – an experimental evaluation | Michele Buccoli et.al. | 2510.09236 | null |
| 2025-10-10 | FLToP CTC: Frame-Level Token Pruning via Relative Threshold for Efficient and Memory-Saving Decoding on Diverse Platforms | Atul Shree et.al. | 2510.09085 | null |
| 2025-10-08 | Look before Transcription: End-to-End SlideASR with Visually-Anchored Policy Optimization | Rui Hu et.al. | 2510.08618 | null |
| 2025-10-01 | Articulation-Informed ASR: Integrating Articulatory Features into ASR via Auxiliary Speech Inversion and Cross-Attention Fusion | Ahmed Adel Attia et.al. | 2510.08585 | null |
| 2025-10-09 | Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition | Yi-Cheng Lin et.al. | 2510.08047 | null |
| 2025-10-09 | Bloodroot: When Watermarking Turns Poisonous For Stealthy Backdoor | Kuan-Yu Chen et.al. | 2510.07909 | null |
| 2025-10-08 | LASER: An LLM-based ASR Scoring and Evaluation Rubric | Amruta Parulekar et.al. | 2510.07437 | null |
| 2025-10-08 | How much speech data is necessary for ASR in African languages? An evaluation of data scaling in Kinyarwanda and Kikuyu | Benjamin Akera et.al. | 2510.07221 | null |
| 2025-10-09 | Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation | Vaibhav Srivastav et.al. | 2510.06961 | null |
| 2025-10-07 | Linguistically Informed Tokenization Improves ASR for Underresourced Languages | Massimo Daul et.al. | 2510.06461 | null |
| 2025-10-07 | BanglaTalk: Towards Real-Time Speech Assistance for Bengali Regional Dialects | Jakir Hasan et.al. | 2510.06188 | null |
| 2025-10-06 | How I Built ASR for Endangered Languages with a Spoken Dictionary | Christopher Bartley et.al. | 2510.04832 | null |
| 2025-10-06 | Evaluating Self-Supervised Speech Models via Text-Based LLMS | Takashi Maekaku et.al. | 2510.04463 | null |
| 2025-10-05 | Probing Whisper for Dysarthric Speech in Detection and Assessment | Zhengjun Yue et.al. | 2510.04219 | null |
| 2025-10-05 | Drax: Speech Recognition with Discrete Flow Matching | Aviv Navon et.al. | 2510.04162 | null |
| 2025-10-05 | MoME: Mixture of Matryoshka Experts for Audio-Visual Speech Recognition | Umberto Cappellazzo et.al. | 2510.04136 | null |
| 2025-10-04 | Adapting Diarization-Conditioned Whisper for End-to-End Multi-Talker Speech Recognition | Martin Kocour et.al. | 2510.03723 | null |
| 2025-10-04 | Towards Unsupervised Speech Recognition at the Syllable-Level | Liming Wang et.al. | 2510.03639 | null |
| 2025-10-04 | Scaling Multi-Talker ASR with Speaker-Agnostic Activity Streams | Xiluo He et.al. | 2510.03630 | null |
| 2025-10-03 | Listening or Reading? Evaluating Speech Awareness in Chain-of-Thought Speech-to-Text Translation | Jacobo Romero-Díaz et.al. | 2510.03115 | null |
| 2025-10-03 | Revisiting Direct Speech-to-Text Translation with Speech LLMs: Better Scaling than CoT Prompting? | Oriol Pareras et.al. | 2510.03093 | null |
| 2025-10-03 | Sequence-Preserving Dual-FoV Defense for Traffic Sign and Light Recognition in Autonomous Vehicles | Abhishek Joshi et.al. | 2510.02642 | null |
| 2025-10-02 | A Physical Unclonable Function Based on Variations of Write Times in STT-MRAM due to Manufacturing Defects | Jacob Huber et.al. | 2510.02574 | null |
| 2025-10-16 | Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models | Tolúlopé Ògúnrèmí et.al. | 2510.02569 | null |
| 2025-10-02 | EvolveCaptions: Empowering DHH Users Through Real-Time Collaborative Captioning | Liang-Yuan Wu et.al. | 2510.02181 | null |
| 2025-09-30 | An Analysis of the New EU AI Act and A Proposed Standardization Framework for Machine Learning Fairness | Mike Teodorescu et.al. | 2510.01281 | null |
| 2025-10-01 | Automatic Speech Recognition (ASR) for African Low-Resource Languages: A Systematic Literature Review | Sukairaj Hafiz Imam et.al. | 2510.01145 | null |
| 2025-10-01 | Spiralformer: Low Latency Encoder for Streaming Speech Recognition with Circular Layer Skipping and Early Exiting | Emiru Tsunoo et.al. | 2510.00982 | null |
| 2025-10-01 | EuroSpeech: A Multilingual Speech Corpus | Samuel Pfisterer et.al. | 2510.00514 | null |
| 2025-09-26 | Temporal-Aware Iterative Speech Model for Dementia Detection | Chukwuemeka Ugwu et.al. | 2510.00030 | null |
| 2025-09-30 | IR-UWB Radar-Based Contactless Silent Speech Recognition with Attention-Enhanced Temporal Convolutional Networks | Sunghwa Lee et.al. | 2509.26409 | null |
| 2025-09-30 | ASR Under Noise: Exploring Robustness for Sundanese and Javanese | Salsabila Zahirah Pranida et.al. | 2509.25878 | null |
| 2025-09-29 | Beyond WER: Probing Whisper’s Sub-token Decoder Across Diverse Language Resource Levels | Siyu Liang et.al. | 2509.25516 | null |
| 2025-09-29 | Confidence-Guided Error Correction for Disordered Speech Recognition | Abner Hernandez et.al. | 2509.25048 | null |
| 2025-10-05 | HiKE: Hierarchical Evaluation Framework for Korean-English Code-Switching Speech Recognition | Gio Paik et.al. | 2509.24613 | null |
| 2025-09-29 | A Text-To-Text Alignment Algorithm for Better Evaluation of Modern Speech Recognition Systems | Lasse Borgholt et.al. | 2509.24478 | null |
| 2025-09-28 | AISHELL6-whisper: A Chinese Mandarin Audio-visual Whisper Speech Dataset with Speech Recognition Baselines | Cancan Li et.al. | 2509.23833 | null |
| 2025-09-28 | Automatic Speech Recognition for Greek Medical Dictation | Vardis Georgilas et.al. | 2509.23550 | null |
| 2025-09-26 | Index-MSR: A high-efficiency multimodal fusion framework for speech recognition | Jinming Chen et.al. | 2509.22744 | null |
| 2025-10-10 | From Coarse to Fine: Recursive Audio-Visual Semantic Enhancement for Speech Separation | Ke Xue et.al. | 2509.22425 | null |
| 2025-09-26 | Decoding Deception: Understanding Automatic Speech Recognition Vulnerabilities in Evasion and Poisoning Attacks | Aravindhan G et.al. | 2509.22060 | null |
| 2025-09-26 | A Parallel Ultra-Low Power Silent Speech Interface based on a Wearable, Fully-dry EMG Neckband | Fiona Meier et.al. | 2509.21964 | null |
| 2025-09-25 | Visual Authority and the Rhetoric of Health Misinformation: A Multimodal Analysis of Social Media Videos | Mohammad Reza Zarei et.al. | 2509.20724 | null |
| 2025-09-23 | Variational Low-Rank Adaptation for Personalized Impaired Speech Recognition | Niclas Pokel et.al. | 2509.20397 | null |
| 2025-09-23 | Data-Efficient ASR Personalization for Non-Normative Speech Using an Uncertainty-Based Phoneme Difficulty Score for Guided Sampling | Niclas Pokel et.al. | 2509.20396 | null |
| 2025-09-24 | DRES: Benchmarking LLMs for Disfluency Removal | Maria Teleki et.al. | 2509.20321 | null |
| 2025-09-25 | From Text to Talk: Audio-Language Model Needs Non-Autoregressive Joint Training | Tianqiao Liu et.al. | 2509.20072 | null |
| 2025-09-24 | Discrete Diffusion for Generative Modeling of Text-Aligned Speech Tokens | Pin-Jui Ku et.al. | 2509.20060 | null |
| 2025-09-24 | Weakly Supervised Phonological Features for Pathological Speech Analysis | Jenthe Thienpondt et.al. | 2509.19879 | null |
| 2025-09-26 | MMedFD: A Real-world Healthcare Benchmark for Multi-turn Full-Duplex Automatic Speech Recognition | Hongzhao Chen et.al. | 2509.19817 | null |
| 2025-09-23 | Retrieval Augmented Generation based context discovery for ASR | Dimitrios Siskos et.al. | 2509.19567 | null |
| 2025-09-23 | WolBanking77: Wolof Banking Speech Intent Classification Dataset | Abdou Karim Kandji et.al. | 2509.19271 | null |
| 2025-09-23 | SloPalSpeech: A 2,8000-Hour Slovak Speech Corpus from Parliamentary Data | Erik Božík et.al. | 2509.19270 | null |
| 2025-09-23 | LOTUSDIS: A Thai far-field meeting corpus for robust conversational ASR | Pattara Tipaksorn et.al. | 2509.18722 | null |
| 2025-09-22 | Speech Vecalign: an Embedding-based Method for Aligning Parallel Speech Documents | Chutong Meng et.al. | 2509.18360 | null |
| 2025-09-20 | Conversational Orientation Reasoning: Egocentric-to-Allocentric Navigation with Multimodal Chain-of-Thought | Yu Ti Huang et.al. | 2509.18200 | null |
| 2025-09-24 | MNV-17: A High-Quality Performative Mandarin Dataset for Nonverbal Vocalization Recognition in Speech | Jialong Mai et.al. | 2509.18196 | null |
| 2025-09-22 | Transformer-Encoder Trees for Efficient Multilingual Machine Translation and Speech Translation | Yiwen Guan et.al. | 2509.17930 | null |
| 2025-09-22 | Qwen3-Omni Technical Report | Jin Xu et.al. | 2509.17765 | null |
| 2025-09-22 | Leveraging Audio-Visual Data to Reduce the Multilingual Gap in Self-Supervised Speech Models | María Andrea Cruz Blandón et.al. | 2509.17523 | null |
| 2025-09-20 | Idiosyncratic Versus Normative Modeling of Atypical Speech Recognition: Dysarthric Case Studies | Vishnu Raja et.al. | 2509.16718 | null |
| 2025-09-20 | Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing | Mengqi Wang et.al. | 2509.16622 | null |
| 2025-09-19 | Whisper-UT: A Unified Translation Framework for Speech and Text | Cihan Xiao et.al. | 2509.16375 | null |
| 2025-09-26 | GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition | Tianyue Wang et.al. | 2509.16031 | null |
| 2025-09-19 | Session-Level Spoken Language Assessment with a Multimodal Foundation Model via Multi-Target Learning | Hong-Yun Lin et.al. | 2509.16025 | null |
| 2025-09-22 | Interpreting the Role of Visemes in Audio-Visual Speech Recognition | Aristeidis Papadopoulos et.al. | 2509.16023 | null |
| 2025-09-19 | VOX-KRIKRI: Unifying Speech and Language through Continuous Fusion | Dimitrios Damianos et.al. | 2509.15667 | null |
| 2025-09-19 | Layer-wise Minimal Pair Probing Reveals Contextual Grammatical-Conceptual Hierarchy in Speech Representations | Linyang He et.al. | 2509.15655 | null |
| 2025-09-19 | Thinking in cocktail party: Chain-of-Thought and reinforcement learning for target speaker automatic speech recognition | Yiru Zhang et.al. | 2509.15612 | null |
| 2025-09-19 | Chunk Based Speech Pre-training with High Resolution Finite Scalar Quantization | Yun Tang et.al. | 2509.15579 | null |
| 2025-09-19 | State-of-the-Art Dysarthric Speech Recognition with MetaICL for on-the-fly Personalization | Dhruuv Agarwal et.al. | 2509.15516 | null |
| 2025-09-18 | BiRQ: Bi-Level Self-Labeling Random Quantization for Self-Supervised Speech Recognition | Liuyuan Jiang et.al. | 2509.15430 | null |
| 2025-09-25 | Speech Language Models for Under-Represented Languages: Insights from Wolof | Yaya Sy et.al. | 2509.15362 | null |
| 2025-09-20 | Listening, Imagining & Refining: A Heuristic Optimized ASR Correction Framework with LLMs | Yutong Liu et.al. | 2509.15095 | null |
| 2025-09-19 | From Hype to Insight: Rethinking Large Language Model Integration in Visual Speech Recognition | Rishabh Jain et.al. | 2509.14880 | null |
| 2025-09-18 | Towards Building Speech Large Language Models for Multitask Understanding in Low-Resource Languages | Mingchen Shao et.al. | 2509.14804 | null |
| 2025-09-18 | UMA-Split: unimodal aggregation for both English and Mandarin non-autoregressive speech recognition | Ying Fang et.al. | 2509.14653 | null |
| 2025-09-17 | Multi-Channel Differential ASR for Robust Wearer Speech Recognition on Smart Glasses | Yufeng Yang et.al. | 2509.14430 | null |
| 2025-09-13 | Context-Enhanced Granular Edit Representation for Efficient and Accurate ASR Post-editing | Luan Vejsiu et.al. | 2509.14263 | null |
| 2025-09-25 | Canary-1B-v2 & Parakeet-TDT-0.6B-v3: Efficient and High-Performance Models for Multilingual ASR and AST | Monica Sekoyan et.al. | 2509.14128 | null |
| 2025-09-17 | Language Conditioning Improves Accuracy of Aircraft Goal Prediction in Untowered Airspace | Sundhar Vinodh Sangeetha et.al. | 2509.14063 | null |
| 2025-09-17 | Conducting Mission-Critical Voice Experiments with Automated Speech Recognition and Crowdsourcing | Jan Janak et.al. | 2509.13724 | null |
| 2025-09-16 | Invisible Ears at Your Fingertips: Acoustic Eavesdropping via Mouse Sensors | Mohamad Fakih et.al. | 2509.13581 | null |
| 2025-09-16 | TICL: Text-Embedding KNN For Speech In-Context Learning Unlocks Speech Recognition Abilities of Large Multimodal Models | Haolong Zheng et.al. | 2509.13395 | null |
| 2025-09-22 | GLAD: Global-Local Aware Dynamic Mixture-of-Experts for Multi-Talker ASR | Yujie Guo et.al. | 2509.13093 | null |
| 2025-09-16 | PAC: Pronunciation-Aware Contextualized Large Language Model-based Automatic Speech Recognition | Li Fu et.al. | 2509.12647 | null |
| 2025-09-17 | FunAudio-ASR Technical Report | Keyu An et.al. | 2509.12508 | null |
| 2025-09-15 | In-domain SSL pre-training and streaming ASR | Jarod Duret et.al. | 2509.12101 | null |
| 2025-09-12 | Improving Audio Event Recognition with Consistency Regularization | Shanmuka Sadhu et.al. | 2509.10391 | null |
| 2025-09-12 | Data-independent Beamforming for End-to-end Multichannel Multi-speaker ASR | Can Cui et.al. | 2509.10234 | null |
| 2025-09-12 | Prominence-aware automatic speech recognition for conversational speech | Julian Linke et.al. | 2509.10116 | null |
| 2025-09-12 | Unified Learnable 2D Convolutional Feature Extraction for ASR | Peter Vieting et.al. | 2509.10031 | null |
| 2025-09-11 | Combining Textual and Spectral Features for Robust Classification of Pilot Communications | Abdullah All Tanvir et.al. | 2509.09752 | null |
| 2025-09-11 | Improving Synthetic Data Training for Contextual Biasing Models with a Keyword-Aware Cost Function | Chin Yuen Kwok et.al. | 2509.09197 | null |
| 2025-09-11 | Efficient Trie-based Biasing using K-step Prediction for Rare Word Recognition | Chin Yuen Kwok et.al. | 2509.09196 | null |
| 2025-09-09 | A Bottom-up Framework with Language-universal Speech Attribute Modeling for Syllable-based ASR | Hao Yen et.al. | 2509.08173 | null |
| 2025-09-09 | EnvX: Agentize Everything with Agentic AI | Linyao Chen et.al. | 2509.08088 | null |
| 2025-09-08 | Identifying and Calibrating Overconfidence in Noisy Speech Recognition | Mingyue Huo et.al. | 2509.07195 | null |
| 2025-09-08 | The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties | William Chen et.al. | 2509.07139 | null |
| 2025-09-20 | TSPC: A Two-Stage Phoneme-Centric Architecture for code-switching Vietnamese-English Speech Recognition | Minh N. H. Nguyen et.al. | 2509.05983 | null |
| 2025-09-07 | Enhancing the Robustness of Contextual ASR to Varying Biasing Information Volumes Through Purified Semantic Correlation Joint Modeling | Yue Gu et.al. | 2509.05908 | null |
| 2025-09-06 | New Insights into Optimal Alignment of Acoustic and Linguistic Representations for Knowledge Transfer in ASR | Xugang Lu et.al. | 2509.05609 | null |
| 2025-09-05 | Graph Connectionist Temporal Classification for Phoneme Recognition | Henry Grafé et.al. | 2509.05399 | null |
| 2025-09-05 | Layer-wise Analysis for Quality of Multilingual Synthesized Speech | Erica Cooper et.al. | 2509.04830 | null |
| 2025-09-02 | From Silent Signals to Natural Language: A Dual-Stage Transformer-LLM Approach | Nithyashree Sivasubramaniam et.al. | 2509.04507 | null |
| 2025-09-01 | Refining Transcripts With TV Subtitles by Prompt-Based Weakly Supervised Training of ASR | Xinnian Zhao et.al. | 2509.04491 | null |
| 2025-09-01 | Serialized Output Prompting for Large Language Model-based Multi-Talker Speech Recognition | Hao Shi et.al. | 2509.04488 | null |
| 2025-08-29 | SpeechLLM: Unified Speech and Language Model for Enhanced Multi-Task Understanding in Low Resource Settings | Jaekwon Yoo et.al. | 2509.04473 | null |
| 2025-09-04 | Contextualized Token Discrimination for Speech Search Query Correction | Junyu Lu et.al. | 2509.04393 | null |
| 2025-09-04 | Denoising GER: A Noise-Robust Generative Error Correction with LLM for Speech Recognition | Yanyan Liu et.al. | 2509.04392 | null |
| 2025-09-04 | PARCO: Phoneme-Augmented Robust Contextual ASR via Contrastive Entity Disambiguation | Jiajun He et.al. | 2509.04357 | null |
| 2025-09-04 | Enhancing Self-Supervised Speaker Verification Using Similarity-Connected Graphs and GCN | Zhaorui Sun et.al. | 2509.04147 | null |
| 2025-08-27 | An Effective Strategy for Modeling Score Ordinality and Non-uniform Intervals in Automated Speaking Assessment | Tien-Hong Lo et.al. | 2509.03372 | null |
| 2025-09-05 | Exploring persuasive interactions with generative social robots: An experimental framework | Stephan Vonschallen et.al. | 2509.03231 | null |
| 2025-09-03 | Beyond Words: Interjection Classification for Improved Human-Computer Interaction | Yaniv Goren et.al. | 2509.03181 | null |
| 2025-09-03 | A Study on Zero-Shot Non-Intrusive Speech Intelligibility for Hearing Aids Using Large Language Models | Ryandhimas E. Zezario et.al. | 2509.03021 | null |
| 2025-09-04 | Speech Intelligibility Assessment with Uncertainty-Aware Whisper Embeddings and sLSTM | Ryandhimas E. Zezario et.al. | 2509.03013 | null |
| 2025-09-02 | SSVD: Structured SVD for Parameter-Efficient Fine-Tuning and Benchmarking under Domain Shift in ASR | Pu Wang et.al. | 2509.02830 | null |
| 2025-09-02 | Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices | Evan King et.al. | 2509.02523 | null |
| 2025-09-04 | AudioCodecBench: A Comprehensive Benchmark for Audio Codec Evaluation | Lu Wang et.al. | 2509.02349 | null |
| 2025-09-03 | NADI 2025: The First Multidialectal Arabic Speech Processing Shared Task | Bashar Talafha et.al. | 2509.02038 | null |
| 2025-09-02 | Group Relative Policy Optimization for Speech Recognition | Prashanth Gurunath Shivakumar et.al. | 2509.01939 | null |
| 2025-09-02 | Multilingual Speech Recognition Using Discrete Tokens with a Two-step Training Strategy | Zehan Li et.al. | 2509.01900 | null |
| 2025-09-01 | Mic Drop or Data Flop? Evaluating the Fitness for Purpose of AI Voice Interviewers for Data Collection within Quantitative & Qualitative Research Contexts | Shreyas Tirumala et.al. | 2509.01814 | null |
| 2025-09-01 | Characterization of Speech Similarity Between Australian Aboriginal and High-Resource Languages: A Case Study on Dharawal | Ting Dang et.al. | 2509.01419 | null |
| 2025-09-01 | CabinSep: IR-Augmented Mask-Based MVDR for Real-Time In-Car Speech Separation with Distributed Heterogeneous Arrays | Runduo Han et.al. | 2509.01399 | null |
| 2025-09-01 | Analysing the Language of Neural Audio Codecs | Joonyong Park et.al. | 2509.01390 | null |
| 2025-09-01 | Noisy Disentanglement with Tri-stage Training for Noise-Robust Speech Recognition | Shuangyuan Chen et.al. | 2509.01087 | null |
| 2025-08-31 | A Unified Denoising and Adaptation Framework for Self-Supervised Bengali Dialectal ASR | Swadhin Biswas et.al. | 2509.00988 | null |
| 2025-08-30 | Entropy-based Coarse and Compressed Semantic Speech Representation Learning | Jialong Zuo et.al. | 2509.00503 | null |
| 2025-08-27 | Automatic Pronunciation Error Detection and Correction of the Holy Quran’s Learners Using Deep Learning | Abdullah Abdelfattah et.al. | 2509.00094 | null |
| 2025-08-29 | NSPDI-SNN: An efficient lightweight SNN based on nonlinear synaptic pruning and dendritic integration | Wuque Cai et.al. | 2508.21566 | null |
| 2025-09-02 | AHELM: A Holistic Evaluation of Audio-Language Models | Tony Lee et.al. | 2508.21376 | null |
| 2025-08-28 | Can Layer-wise SSL Features Improve Zero-Shot ASR Performance for Children’s Speech? | Abhijit Sinha et.al. | 2508.21225 | null |
| 2025-08-28 | Benchmarking Large Pretrained Multilingual Models on Québec French Speech Recognition | Coralie Serrand et.al. | 2508.21193 | null |
| 2025-08-28 | OLMoASR: Open Models and Data for Training Robust Speech Recognition Models | Huong Ngo et.al. | 2508.20869 | null |
| 2025-08-28 | Generative Annotation for ASR Named Entity Correction | Yuanchang Luo et.al. | 2508.20700 | null |
| 2025-08-28 | Towards Inclusive Communication: A Unified LLM-Based Framework for Sign Language, Lip Movements, and Audio Understanding | Jeong Hun Yeo et.al. | 2508.20476 | null |
| 2025-09-08 | Heterogeneous Self-Supervised Acoustic Pre-Training with Local Constraints | Xiaodong Cui et.al. | 2508.19990 | null |
| 2025-08-27 | TokenVerse++: Towards Flexible Multitask Learning with Dynamic Task Activation | Shashi Kumar et.al. | 2508.19856 | null |
| 2025-08-27 | CAMÕES: A Comprehensive Automatic Speech Recognition Benchmark for European Portuguese | Carlos Carvalho et.al. | 2508.19721 | null |
| 2025-08-27 | Hybrid Decoding: Rapid Pass and Selective Detailed Correction for Sequence Models | Yunkyu Lim et.al. | 2508.19671 | null |
| 2025-08-27 | Towards stable AI systems for Evaluating Arabic Pronunciations | Hadi Zaatiti et.al. | 2508.19587 | null |
| 2025-08-22 | Whisper based Cross-Lingual Phoneme Recognition between Vietnamese and English | Nguyen Huu Nhat Minh et.al. | 2508.19270 | null |
| 2025-08-26 | MOSA: Mixtures of Simple Adapters Outperform Monolithic Approaches in LLM-based Multilingual ASR | Junjie Li et.al. | 2508.18998 | null |
| 2025-08-26 | TaiBai: A fully programmable brain-inspired processor with topology-aware efficiency | Qianpeng Li et.al. | 2508.18961 | null |
| 2025-08-26 | DESAMO: A Device for Elder-Friendly Smart Homes Powered by Embedded LLM with Audio Modality | Youngwon Choi et.al. | 2508.18918 | null |
| 2025-08-26 | Improving Noise Robust Audio-Visual Speech Recognition via Router-Gated Cross-Modal Feature Fusion | DongHoon Lim et.al. | 2508.18734 | null |
| 2025-08-26 | Cross-Learning Fine-Tuning Strategy for Dysarthric Speech Recognition Via CDSD database | Qing Xiao et.al. | 2508.18732 | null |
| 2025-08-26 | Attention2Probability: Attention-Driven Terminology Probability Estimation for Robust Speech-to-Text System | Yanfan Du et.al. | 2508.18701 | null |
| 2025-08-22 | H-PRM: A Pluggable Hotword Pre-Retrieval Module for Various Speech Recognition Systems | Huangyu Dai et.al. | 2508.18295 | null |
| 2025-08-20 | Toward Responsible ASR for African American English Speakers: A Scoping Review of Bias and Equity in Speech Technology | Jay L. Cunningham et.al. | 2508.18288 | null |
| 2025-08-25 | Evaluating the Representation of Vowels in Wav2Vec Feature Extractor: A Layer-Wise Analysis Using MFCCs | Domenico De Cristofaro et.al. | 2508.17914 | null |
| 2025-08-25 | Designing Practical Models for Isolated Word Visual Speech Recognition | Iason Ioannis Panagos et.al. | 2508.17894 | null |
| 2025-08-25 | Talking to Robots: A Practical Examination of Speech Foundation Models for HRI Applications | Theresa Pekarek Rosin et.al. | 2508.17753 | null |
| 2025-08-24 | AI-Powered Legal Intelligence System Architecture: A Comprehensive Framework for Automated Legal Consultation and Analysis | Sean Kalaycioglu et.al. | 2508.17499 | null |
| 2025-08-22 | Benchmarking Training Paradigms, Dataset Composition, and Model Scaling for Child ASR in ESPnet | Anyu Ying et.al. | 2508.16576 | null |
| 2025-08-21 | Beyond Transcription: Mechanistic Interpretability in ASR | Neta Glazer et.al. | 2508.15882 | null |
| 2025-08-20 | MGSC: A Multi-granularity Consistency Framework for Robust End-to-end Asr | Xuwen Yang et.al. | 2508.15853 | null |
| 2025-08-21 | UniCoM: A Universal Code-Switching Speech Generator | Sangmin Lee et.al. | 2508.15244 | null |
| 2025-08-20 | A Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy References | Simon Dahl Jepsen et.al. | 2508.14623 | null |
| 2025-08-18 | Whispering Context: Distilling Syntax and Semantics for Long Speech Transcripts | Duygu Altinok et.al. | 2508.13376 | null |
| 2025-08-18 | Overcoming Latency Bottlenecks in On-Device Speech Translation: A Cascaded Approach with Alignment-Based Streaming MT | Zeeshan Ahmed et.al. | 2508.13358 | null |
| 2025-08-18 | Evaluating ASR robustness to spontaneous speech errors: A study of WhisperX using a Speech Error Database | John Alderete et.al. | 2508.13060 | null |
| 2025-08-18 | Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-Based Models | Branislav Gerazov et.al. | 2508.12968 | null |
| 2025-08-17 | CarelessWhisper: Turning Whisper into a Causal Streaming Model | Tomer Krichli et.al. | 2508.12301 | null |
| 2025-08-17 | HuBERT-VIC: Improving Noise-Robust Automatic Speech Recognition of Speech Foundation Model via Variance-Invariance-Covariance Regularization | Hyebin Ahn et.al. | 2508.12292 | null |
| 2025-08-17 | What do Speech Foundation Models Learn? Analysis and Applications | Ankita Pasad et.al. | 2508.12255 | null |
| 2025-11-06 | Omni-Router: Sharing Routing Decisions in Sparse Mixture-of-Experts for Speech Recognition | Zijin Gu et.al. | 2507.05724 | null |
| 2025-08-12 | Transfer Learning from Visual Speech Recognition to Mouthing Recognition in German Sign Language | Dinh Nam Pham et.al. | 2505.13784 | null |
| 2025-05-19 | Survey of End-to-End Multi-Speaker Automatic Speech Recognition for Monaural Audio | Xinlu He et.al. | 2505.10975 | null |
| 2025-02-26 | Exploring Gender Disparities in Automatic Speech Recognition Technology | Hend ElGhazaly et.al. | 2502.18434 | null |
| 2025-02-11 | Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models | Jing-Xuan Zhang et.al. | 2502.05766 | null |
| 2025-02-04 | Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language | Turi Abu et.al. | 2502.00421 | null |
| 2025-02-03 | Language Bias in Self-Supervised Learning For Automatic Speech Recognition | Edward Storey et.al. | 2501.19321 | null |
| 2025-01-20 | Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR | Karl El Hajal et.al. | 2501.10256 | null |
| 2024-09-25 | Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs | Yang Yuhang et.al. | 2409.16005 | null |
| 2024-08-26 | Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models | Adnan Haider et.al. | 2408.13008 | null |
| 2024-09-26 | Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations | Kunal Dhawan et.al. | 2407.03495 | null |
| 2025-01-10 | Towards Unsupervised Speech Recognition Without Pronunciation Models | Junrui Ni et.al. | 2406.08380 | null |
| 2024-09-12 | Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation | Muhammad Shakeel et.al. | 2405.13514 | null |
| 2024-04-26 | Developing Acoustic Models for Automatic Speech Recognition in Swedish | Giampiero Salvi et.al. | 2404.16547 | null |
| 2025-04-29 | SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition Evaluation | Jiayu Du et.al. | 2403.08196 | null |
| 2024-03-14 | Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children | Taekyung Ahn et.al. | 2403.08187 | null |
| 2025-11-04 | Aligning Speech to Languages to Enhance Code-switching Speech Recognition | Hexin Liu et.al. | 2403.05887 | null |
| 2024-02-22 | ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge | He Wang et.al. | 2401.03473 | null |
| 2024-02-12 | Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification | Anirudh S. Sundar et.al. | 2312.14378 | null |
| 2023-11-20 | Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition | Qijie Shao et.al. | 2311.07062 | null |
| 2024-01-29 | Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting | Chao-Han Huck Yang et.al. | 2309.15649 | null |
| 2024-02-23 | Training dynamic models using early exits for automatic speech recognition on resource-constrained devices | George August Wright et.al. | 2309.09546 | null |
| 2023-08-15 | Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition | Han Zhu et.al. | 2308.06547 | null |
| 2023-08-09 | Federated Representation Learning for Automatic Speech Recognition | Guruprasad V Ramesh et.al. | 2308.02013 | null |
| 2023-07-06 | Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture | Haoran Miao et.al. | 2307.02351 | null |
| 2023-07-06 | Boosting Norwegian Automatic Speech Recognition | Javier de la Rosa et.al. | 2307.01672 | null |
| 2023-04-18 | A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition | Ruchao Fan et.al. | 2304.07611 | null |
| 2023-03-07 | End-to-End Speech Recognition: A Survey | Rohit Prabhavalkar et.al. | 2303.03329 | null |
| 2023-03-07 | A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One | Lingwei Meng et.al. | 2302.09908 | null |
| 2023-02-03 | Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition | Minglun Han et.al. | 2302.01194 | null |
| 2022-09-23 | Assessing ASR Model Quality on Disordered Speech using BERTScore | Jimmy Tobin et.al. | 2209.10591 | null |
| 2023-08-25 | Automatic Speech Recognition for Speech Assessment of Persian Preschool Children | Amirhossein Abaskohi et.al. | 2203.12886 | null |
| 2022-03-18 | Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition | Mengzhe Geng et.al. | 2202.10290 | null |
| 2022-02-03 | Visualizing Automatic Speech Recognition – Means for a Better Understanding? | Karla Markert et.al. | 2202.00673 | null |
| 2022-01-31 | Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition | Piotr Żelasko et.al. | 2201.11207 | null |
| 2022-05-10 | A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition | Qiu-Shi Zhu et.al. | 2201.08930 | null |
| 2024-11-07 | Robustifying automatic speech recognition by extracting slowly varying features | Matías Pizarro et.al. | 2112.07400 | null |
| 2022-05-02 | Privacy attacks for automatic speech recognition acoustic models in a federated learning framework | Natalia Tomashenko et.al. | 2111.03777 | null |
| 2022-05-03 | Speech Pattern based Black-box Model Watermarking for Automatic Speech Recognition | Haozhe Chen et.al. | 2110.09814 | null |
| 2021-11-05 | Towards efficient end-to-end speech recognition with biologically-inspired neural networks | Thomas Bohnstingl et.al. | 2110.02743 | null |
| 2025-02-06 | Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch | Jakob Poncelet et.al. | 2109.14357 | null |
| 2021-10-19 | Adversarial Example Devastation and Detection on Speech Recognition System by Adding Random Noise | Mingyu Dong et.al. | 2108.13562 | null |
| 2021-07-06 | Arabic Code-Switching Speech Recognition using Monolingual Data | Ahmed Ali et.al. | 2107.01573 | null |
| 2021-07-05 | Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition | Qiujia Li et.al. | 2107.00764 | null |
| 2022-03-22 | Unsupervised Automatic Speech Recognition: A Review | Hanan Aldarmaki et.al. | 2106.04897 | link |
| 2021-05-06 | Accent Recognition with Hybrid Phonetic Features | Zhan Zhang et.al. | 2105.01920 | null |
| 2021-10-05 | Non-autoregressive Mandarin-English Code-switching Speech Recognition | Shun-Po Chuang et.al. | 2104.02258 | null |
| 2021-02-23 | Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model | Junwei Liao et.al. | 2102.11114 | null |
| 2021-11-30 | Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition | Aswin Shanmugam Subramanian et.al. | 2102.07955 | null |
| 2021-02-16 | Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition | Priyabrata Karmakar et.al. | 2102.07259 | null |
| 2021-02-10 | Sparsification via Compressed Sensing for Automatic Speech Recognition | Kai Zhen et.al. | 2102.04932 | null |
| 2021-02-01 | BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge | Martin Kocour et.al. | 2101.12729 | null |
| 2021-09-14 | Multi-task Language Modeling for Improving Speech Recognition of Rare Words | Chao-Han Huck Yang et.al. | 2011.11715 | null |
| 2020-11-09 | Multilingual Bottleneck Features for Improving ASR Performance of Code-Switched Speech in Under-Resourced Languages | Trideba Padhi et.al. | 2011.03118 | null |
| 2020-09-22 | Far-Field Automatic Speech Recognition | Reinhold Haeb-Umbach et.al. | 2009.09395 | null |
| 2020-10-06 | CTC-Segmentation of Large Corpora for German End-to-end Speech Recognition | Ludwig Kürzinger et.al. | 2007.09127 | null |
| 2020-06-04 | The NTNU System at the Interspeech 2020 Non-Native Children’s Speech ASR Challenge | Tien-Hong Lo et.al. | 2005.08433 | null |
| 2020-03-02 | A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition | Erik McDermott et.al. | 2002.11268 | null |
| 2021-10-11 | Submodular Rank Aggregation on Score-based Permutations for Distributed Automatic Speech Recognition | Jun Qi et.al. | 2001.10529 | null |
| 2023-05-23 | Leveraging End-to-End Speech Recognition with Neural Architecture Search | Ahmed Baruwa et.al. | 1912.05946 | null |
| 2019-11-21 | On using 2D sequence-to-sequence models for speech recognition | Parnia Bahar et.al. | 1911.08888 | null |
| 2019-11-13 | Recurrent Neural Network Transducer for Audio-Visual Speech Recognition | Takaki Makino et.al. | 1911.04890 | null |
| 2019-10-15 | VAIS ASR: Building a conversational speech recognition system using language model combination | Quang Minh Nguyen et.al. | 1910.05603 | null |
| 2020-03-17 | Advancing Speech Recognition With No Speech Or With Noisy Speech | Gautam Krishna et.al. | 1906.08871 | null |
| 2019-05-22 | Articulatory and bottleneck features for speaker-independent ASR of dysarthric speech | Emre Yılmaz et.al. | 1905.06533 | null |
| 2019-04-26 | Phonetically-Oriented Word Error Alignment for Speech Recognition Error Analysis in Speech Translation | Nicholas Ruiz et.al. | 1904.11024 | null |
| 2023-05-15 | End-to-end Audiovisual Speech Activity Detection with Bimodal Recurrent Neural Models | Fei Tao et.al. | 1809.04553 | null |
| 2018-09-13 | Isolated and Ensemble Audio Preprocessing Methods for Detecting Adversarial Examples against Automatic Speech Recognition | Krishan Rajaratnam et.al. | 1809.04397 | null |
| 2018-05-29 | Automatic context window composition for distant speech recognition | Mirco Ravanelli et.al. | 1805.10498 | null |
| 2018-04-27 | End-to-End Multimodal Speech Recognition | Shruti Palaskar et.al. | 1804.09713 | link |
| 2018-03-08 | Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition | Wei-Ning Hsu et.al. | 1803.02551 | null |
| 2019-05-01 | Unsupervised Adaptation with Domain Separation Networks for Robust Speech Recognition | Zhong Meng et.al. | 1711.08010 | null |
| 2018-02-23 | BridgeNets: Student-Teacher Transfer Learning Based on Recursive Neural Networks and its Application to Distant Speech Recognition | Jaeyoung Kim et.al. | 1710.10224 | null |
| 2018-04-26 | Resolution limits on visual speech recognition | Helen L. Bear et.al. | 1710.01073 | null |
| 2017-09-01 | Leveraging Deep Neural Network Activation Entropy to cope with Unseen Data in Speech Recognition | Vikramjit Mitra et.al. | 1708.09516 | null |
| 2018-12-06 | Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training | Yanmin Qian et.al. | 1707.06527 | null |
| 2017-04-27 | Towards Estimating the Upper Bound of Visual-Speech Recognition: The Visual Lip-Reading Feasibility Database | Adriana Fernandez-Lopez et.al. | 1704.08028 | null |
| 2016-12-07 | Invariant Representations for Noisy Speech Recognition | Dmitriy Serdyuk et.al. | 1612.01928 | null |
| 2016-11-10 | Automatic recognition of child speech for robotic applications in noisy environments | Samuel Fernando et.al. | 1611.02695 | null |
| 2014-02-12 | Modified SPLICE and its Extension to Non-Stereo Data for Noise Robust Speech Recognition | D. S. Pavan Kumar et.al. | 1307.4048 | null |
📊 517 papers
| 📅 Publish Date | 📝 Title | 👥 Authors | 💻 Code | |
|---|---|---|---|---|
| 2026-04-01 | OmniVoice: Towards Omnilingual Zero-Shot Text-to-Speech with Diffusion Language Models | Han Zhu et.al. | 2604.00688 | null |
| 2026-03-31 | MambaVoiceCloning: Efficient and Expressive Text-to-Speech via State-Space Modeling and Diffusion Control | Sahil Kumar et.al. | 2604.00292 | null |
| 2026-03-24 | Fast elementwise operations on tensor trains with alternating cross interpolation | Marc K. Ritter et.al. | 2604.00037 | null |
| 2026-03-31 | LongCat-AudioDiT: High-Fidelity Diffusion Text-to-Speech in the Waveform Latent Space | Detai Xin et.al. | 2603.29339 | null |
| 2026-03-31 | From Natural Alignment to Conditional Controllability in Multimodal Dialogue | Zeyu Jin et.al. | 2603.29162 | null |
| 2026-03-30 | ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining | Anuj Diwan et.al. | 2603.28737 | null |
| 2026-03-29 | VoxAnchor: Grounding Speech Authenticity in Throat Vibration via mmWave Radar | Mingda Han et.al. | 2603.27562 | null |
| 2026-03-27 | LLaDA-TTS: Unifying Speech Synthesis and Zero-Shot Editing via Masked Diffusion Modeling | Xiaoyu Fan et.al. | 2603.26364 | null |
| 2026-03-26 | Voxtral TTS | Alexander H. Liu et.al. | 2603.25551 | null |
| 2026-03-25 | YingMusic-Singer: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance | Chunbo Hao et.al. | 2603.24589 | null |
| 2026-03-25 | Iterate to Differentiate: Enhancing Discriminability and Reliability in Zero-Shot TTS Evaluation | Shengfan Shen et.al. | 2603.24430 | null |
| 2026-04-01 | How Open is Open TTS? A Practical Evaluation of Open Source TTS Tools | Teodora Răgman et.al. | 2603.24116 | null |
| 2026-03-23 | SelfTTS: cross-speaker style transfer through explicit embedding disentanglement and self-refinement using self-augmentation | Lucas H. Ueda et.al. | 2603.22252 | null |
| 2026-03-23 | Tuning Real-World Image Restoration at Inference: A Test-Time Scaling Paradigm for Flow Matching Models | Purui Bai et.al. | 2603.22027 | null |
| 2026-03-22 | Assessing the Ability of Neural TTS Systems to Model Consonant-Induced F0 Perturbation | Tianle Yang et.al. | 2603.21078 | null |
| 2026-03-21 | The Binding Effect: Analyzing How Multi-Dimensional Cues Form Gender Bias in Instruction TTS | Kuan-Yu Chen et.al. | 2603.20743 | null |
| 2026-03-21 | SNAP: Speaker Nulling for Artifact Projection in Speech Deepfake Detection | Kyudan Jung et.al. | 2603.20686 | null |
| 2026-03-24 | Tensor Train Representation of High-Dimensional Unsteady Flamelet Manifolds | Sinan Demir et.al. | 2603.20240 | null |
| 2026-03-20 | Audio Avatar Fingerprinting: An Approach for Authorized Use of Voice Cloning in the Era of Synthetic Audio | Candice R. Gerstner et.al. | 2603.20165 | null |
| 2026-03-20 | Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech? | Lokesh Kumar et.al. | 2603.19831 | null |
| 2026-03-20 | Borderless Long Speech Synthesis | Xingchen Song et.al. | 2603.19798 | null |
| 2026-03-20 | MOSS-TTS Technical Report | Yitian Gong et.al. | 2603.18090 | null |
| 2026-03-03 | EEG-Based Brain-LLM Interface for Human Preference Aligned Generation | Junzi Zhang et.al. | 2603.16897 | null |
| 2026-03-17 | From the Inside Out: Progressive Distribution Refinement for Confidence Calibration | Xizhong Yang et.al. | 2603.16500 | null |
| 2026-03-17 | On the Emotion Understanding of Synthesized Speech | Yuan Ge et.al. | 2603.16483 | null |
| 2026-03-17 | CAST-TTS: A Simple Cross-Attention Framework for Unified Timbre Control in TTS | Zihao Zheng et.al. | 2603.16280 | null |
| 2026-03-16 | Meta-TTRL: A Metacognitive Framework for Self-Improving Test-Time Reinforcement Learning in Unified Multimodal Models | Lit Sin Tan et.al. | 2603.15724 | null |
| 2026-03-18 | NV-Bench: Benchmark of Nonverbal Vocalization Synthesis for Expressive Text-to-Speech Generation | Qinke Ni et.al. | 2603.15352 | null |
| 2026-03-16 | PhonemeDF: A Synthetic Speech Dataset for Audio Deepfake Detection and Naturalness Evaluation | Vamshi Nallaguntla et.al. | 2603.15037 | null |
| 2026-03-16 | WhispSynth: Scaling Multilingual Whisper Corpus through Real Data Curation and A Novel Pitch-free Generative Framework | Tianyi Tan et.al. | 2603.14853 | null |
| 2026-03-16 | Investigating the Impact of Speech Enhancement on Audio Deepfake Detection in Noisy Environments | Anacin et.al. | 2603.14767 | null |
| 2026-03-15 | Affectron: Emotional Speech Synthesis with Affective and Contextually Aligned Nonverbal Vocalizations | Deok-Hyeon Cho et.al. | 2603.14432 | null |
| 2026-03-15 | CodecMOS-Accent: A MOS Benchmark of Resynthesized and TTS Speech from Neural Codecs Across English Accents | Wen-Chin Huang et.al. | 2603.14328 | null |
| 2026-03-27 | DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization | Ngoc-Son Nguyen et.al. | 2603.14267 | null |
| 2026-03-14 | Beyond Two-stage Diffusion TTS: Joint Structure and Content Refinement via Jump Diffusion | Jiabao Ai et.al. | 2603.14032 | null |
| 2026-03-13 | VoXtream2: Full-stream TTS with dynamic speaking rate control | Nikita Torgashov et.al. | 2603.13518 | null |
| 2026-03-12 | MamTra: A Hybrid Mamba-Transformer Backbone for Speech Synthesis | Tan Dat Nguyen et.al. | 2603.12342 | null |
| 2026-03-12 | Linking Perception, Confidence and Accuracy in MLLMs | Yuetian Du et.al. | 2603.12149 | null |
| 2026-03-12 | Causal Prosody Mediation for Text-to-Speech:Counterfactual Training of Duration, Pitch, and Energy in FastSpeech2 | Suvendu Sekhar Mohanty et.al. | 2603.11683 | null |
| 2026-03-12 | RAF: Relativistic Adversarial Feedback For Universal Speech Synthesis | Yongjoon Lee et.al. | 2603.11678 | null |
| 2026-03-11 | When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS | Anupam Purwar et.al. | 2603.10904 | null |
| 2026-03-12 | Probabilistic Verification of Voice Anti-Spoofing Models | Evgeny Kushnir et.al. | 2603.10713 | null |
| 2026-03-25 | MM-tau-p $^2$ : Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings | Anupam Purwar et.al. | 2603.09643 | null |
| 2026-03-10 | GeoSolver: Scaling Test-Time Reasoning in Remote Sensing with Fine-Grained Process Supervision | Lang Sun et.al. | 2603.09551 | null |
| 2026-03-12 | Multi-tasking through quantum annealing | Jargalsaikhan Artag et.al. | 2603.09468 | null |
| 2026-03-09 | MAPLE: Elevating Medical Reasoning from Statistical Consensus to Process-Led Alignment | Kailong Fan et.al. | 2603.08987 | null |
| 2026-03-11 | Fish Audio S2 Technical Report | Shijia Liao et.al. | 2603.08823 | null |
| 2026-03-09 | SWE-Fuse: Empowering Software Agents via Issue-free Trajectory Learning and Entropy-aware RLVR Training | Xin-Cheng Wen et.al. | 2603.07927 | null |
| 2026-03-08 | Targeted Speaker Poisoning Framework in Zero-Shot Text-to-Speech | Thanapat Trachu et.al. | 2603.07551 | null |
| 2026-03-08 | Learning-free L2-Accented Speech Generation using Phonological Rules | Thanathai Lertpetchpun et.al. | 2603.07550 | null |
| 2026-03-08 | Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data | Thanathai Lertpetchpun et.al. | 2603.07534 | null |
| 2026-03-08 | Bolbosh: Script-Aware Flow Matching for Kashmiri Text-to-Speech | Tajamul Ashraf et.al. | 2603.07513 | null |
| 2026-02-21 | Advances in GRPO for Generation Models: A Survey | Zexiang Liu et.al. | 2603.06623 | null |
| 2026-03-06 | Prosodic Boundary-Aware Streaming Generation for LLM-Based TTS with Streaming Text Input | Changsong Liu et.al. | 2603.06444 | null |
| 2026-03-06 | Is it Me? Toward Self-Extension to AI Avatars in Virtual Reality | Jieying Zhang et.al. | 2603.06030 | null |
| 2026-03-06 | Activation Steering for Accent-Neutralized Zero-Shot Text-To-Speech | Mu Yang et.al. | 2603.05977 | null |
| 2026-03-06 | How Well Do Current Speech Deepfake Detection Methods Generalize to the Real World? | Daixian Li et.al. | 2603.05852 | null |
| 2026-03-06 | StreamWise: Serving Multi-Modal Generation in Real-Time at Scale | Haoran Qiu et.al. | 2603.05800 | null |
| 2026-03-05 | Hierarchical Decoding for Discrete Speech Synthesis with Multi-Resolution Spoof Detection | Junchuan Zhao et.al. | 2603.05373 | null |
| 2026-03-04 | ZeSTA: Zero-Shot TTS Augmentation with Domain-Conditioned Training for Data-Efficient Personalized Speech Synthesis | Youngwon Choi et.al. | 2603.04219 | null |
| 2026-03-04 | VietNormalizer: An Open-Source, Dependency-Free Python Library for Vietnamese Text Normalization in TTS and NLP Applications | Hung Vu Nguyen et.al. | 2603.04145 | null |
| 2026-03-03 | DLIOS: An LLM-Augmented Real-Time Multi-Modal Interactive Enhancement Overlay System for Douyin Live Streaming | Shuide Wen et.al. | 2603.03060 | null |
| 2026-03-02 | When Spoof Detectors Travel: Evaluation Across 66 Languages in the Low-Resource Language Spoofing Corpus | Kirill Borodin et.al. | 2603.02364 | null |
| 2026-03-01 | MM-DeepResearch: A Simple and Effective Multimodal Agentic Search Baseline | Huanjin Yao et.al. | 2603.01050 | null |
| 2026-03-01 | S-VoCAL: A Dataset and Evaluation Framework for Inferring Speaking Voice Character Attributes in Literature | Abigail Berthe-Pardo et.al. | 2603.00958 | null |
| 2026-02-27 | Disentangled Mode-Specific Representations for Tensor Time Series via Contrastive Learning | Kohei Obata et.al. | 2602.23663 | null |
| 2026-02-26 | TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment | Trung Dang et.al. | 2602.23068 | null |
| 2026-02-24 | MIDI-Informed Singing Accompaniment Generation in a Compositional Song Pipeline | Fang-Duo Tsai et.al. | 2602.22029 | null |
| 2026-02-23 | Can You Tell It’s AI? Human Perception of Synthetic Voices in Vishing Scenarios | Zoha Hayat Bhatti et.al. | 2602.20061 | null |
| 2026-02-23 | CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment | Hanwen Liu et.al. | 2602.19574 | null |
| 2026-02-22 | CosyAccent: Duration-Controllable Accent Normalization Using Source-Synthesis Training Data | Qibing Bai et.al. | 2602.19166 | null |
| 2026-02-20 | Recursive Sketched Interpolation: Efficient Hadamard Products of Tensor Trains | Zhaonan Meng et.al. | 2602.17974 | null |
| 2026-02-19 | Financial time series augmentation using transformer based GAN architecture | Andrzej Podobiński et.al. | 2602.17865 | null |
| 2026-02-18 | How to Label Resynthesized Audio: The Dual Role of Neural Audio Codecs in Audio Deepfake Detection | Yixuan Xiao et.al. | 2602.16343 | null |
| 2026-03-03 | UniTAF: A Modular Framework for Joint Text-to-Speech and Audio-to-Face Modeling | Qiangong Zhou et.al. | 2602.15651 | null |
| 2026-02-16 | Disentangling Pitch and Creak for Speaker Identity Preservation in Speech Synthesis | Frederik Rautenberg et.al. | 2602.14686 | null |
| 2026-02-16 | Probing Human Articulatory Constraints in End-to-End TTS with Reverse and Mismatched Speech-Text Directions | Parth Khadse et.al. | 2602.14664 | null |
| 2026-02-15 | LogitsCoder: Towards Efficient Chain-of-Thought Path Search via Logits Preference Decoding for Code Generation | Jizheng Chen et.al. | 2602.14054 | null |
| 2026-02-27 | Learning Vocal-Tract Area and Radiation with a Physics-Informed Webster Model | Minhui Lu et.al. | 2602.13834 | null |
| 2026-02-14 | ELEAT-SAGA: Early & Late Integration with Evading Alternating Training for Spoof-Robust Speaker Verification | Amro Asali et.al. | 2602.13761 | null |
| 2026-02-12 | UniT: Unified Multimodal Chain-of-Thought Test-time Scaling | Leon Liangyu Chen et.al. | 2602.12279 | null |
| 2026-02-12 | SLD-L2S: Hierarchical Subspace Latent Diffusion for High-Fidelity Lip to Speech Synthesis | Yifan Liang et.al. | 2602.11477 | null |
| 2026-01-19 | Synthesizing the Virtual Advocate: A Multi-Persona Speech Generation Framework for Diverse Linguistic Jurisdictions in Indic Languages | Aniket Deroy et.al. | 2602.11172 | null |
| 2026-02-11 | Calliope: A TTS-based Narrated E-book Creator Ensuring Exact Synchronization, Privacy, and Layout Fidelity | Hugo L. Hammer et.al. | 2602.10735 | null |
| 2026-02-10 | Emotion-Coherent Speech Data Augmentation and Self-Supervised Contrastive Style Training for Enhancing Kids’s Story Speech Synthesis | Raymond Chung et.al. | 2602.10164 | null |
| 2026-02-10 | Covo-Audio Technical Report | Wenfu Wang et.al. | 2602.09823 | null |
| 2026-02-10 | TVTSyn: Content-Synchronous Time-Varying Timbre for Streaming Voice Conversion and Anonymization | Waris Quamer et.al. | 2602.09389 | null |
| 2026-02-03 | DSFlow: Dual Supervision and Step-Aware Architecture for One-Step Flow Matching Speech Synthesis | Bin Lin et.al. | 2602.09041 | null |
| 2026-02-09 | Tutti: Expressive Multi-Singer Synthesis via Structure-Level Timbre Control and Vocal Texture Modeling | Jiatao Chen et.al. | 2602.08233 | null |
| 2026-02-08 | MARTI-MARS $^2$ : Scaling Multi-Agent Self-Search via Reinforcement Learning for Code Generation | Shijie Wang et.al. | 2602.07848 | null |
| 2026-02-08 | SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis | Jiale Qian et.al. | 2602.07803 | null |
| 2026-02-05 | Private and interpretable clinical prediction with quantum-inspired tensor train models | José Ramón Pareja Monturiol et.al. | 2602.06110 | null |
| 2026-01-14 | PersonaPlex: Voice and Role Control for Full Duplex Conversational Speech Models | Rajarshi Roy et.al. | 2602.06053 | null |
| 2026-02-05 | Zero-Shot TTS With Enhanced Audio Prompts: Bsc Submission For The 2026 Wildspoof Challenge TTS Track | Jose Giraldo et.al. | 2602.05770 | null |
| 2026-02-05 | EGSS: Entropy-guided Stepwise Scaling for Reliable Software Engineering | Chenhui Mao et.al. | 2602.05242 | null |
| 2026-02-05 | ARCHI-TTS: A flow-matching-based Text-to-Speech Model with Self-supervised Semantic Aligner and Accelerated Inference | Chunyat Wu et.al. | 2602.05207 | null |
| 2026-02-04 | HoliAntiSpoof: Audio LLM for Holistic Speech Anti-Spoofing | Xuenan Xu et.al. | 2602.04535 | null |
| 2026-02-04 | SCALE: Self-uncertainty Conditioned Adaptive Looking and Execution for Vision-Language-Action Models | Hyeonbeom Choi et.al. | 2602.04208 | null |
| 2026-02-04 | PFluxTTS: Hybrid Flow-Matching TTS with Robust Cross-Lingual Voice Cloning and Inference-Time Model Fusion | Vikentii Pankov et.al. | 2602.04160 | null |
| 2026-02-01 | Decoding Ambiguous Emotions with Test-Time Scaling in Audio-Language Models | Hong Jia et.al. | 2602.03873 | null |
| 2026-02-03 | CoCoEmo: Composable and Controllable Human-Like Emotional TTS via Activation Steering | Siyi Wang et.al. | 2602.03420 | null |
| 2026-02-03 | SWE-World: Building Software Engineering Agents in Docker-Free Environments | Shuang Sun et.al. | 2602.03419 | null |
| 2026-02-24 | SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training | Huatong Song et.al. | 2602.03411 | null |
| 2026-02-01 | VividVoice: A Unified Framework for Scene-Aware Visually-Driven Speech Synthesis | Chengyuan Ma et.al. | 2602.02591 | null |
| 2026-02-02 | LipSody: Lip-to-Speech Synthesis with Enhanced Prosody Consistency | Jaejun Lee et.al. | 2602.01908 | null |
| 2026-02-02 | Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models | Jinbin Bai et.al. | 2602.01842 | null |
| 2026-02-03 | ARTIS: Agentic Risk-Aware Test-Time Scaling via Iterative Simulation | Xingshan Zeng et.al. | 2602.01709 | null |
| 2026-02-01 | Chronos: Learning Temporal Dynamics of Reasoning Chains for Test-Time Scaling | Kai Zhang et.al. | 2602.01208 | null |
| 2026-02-01 | HierCon: Hierarchical Contrastive Attention for Audio Deepfake Detection | Zhili Nicholas Liang et.al. | 2602.01032 | null |
| 2026-02-09 | APR: Penalizing Structural Redundancy in Large Reasoning Models via Anchor-based Process Rewards | Kaiyan Chang et.al. | 2602.00760 | null |
| 2026-01-30 | Multi-Speaker Conversational Audio Deepfake: Taxonomy, Dataset and Pilot Study | Alabi Ahmed et.al. | 2602.00295 | null |
| 2026-01-30 | Now You Hear Me: Audio Narrative Attacks Against Large Audio-Language Models | Ye Yu et.al. | 2601.23255 | null |
| 2026-01-30 | Hearing is Believing? Evaluating and Analyzing Audio Language Model Sycophancy with SYAUDIO | Junchi Yao et.al. | 2601.23149 | null |
| 2026-01-30 | DiffuSpeech: Silent Thought, Spoken Answer via Unified Speech-Text Diffusion | Yuxuan Lou et.al. | 2601.22889 | null |
| 2026-01-30 | EmoShift: Lightweight Activation Steering for Enhanced Emotion-Aware Speech Synthesis | Li Zhou et.al. | 2601.22873 | null |
| 2026-01-30 | Evaluating and Rewarding LALMs for Expressive Role-Play TTS via Mean Continuation Log-Probability | Yong Ren et.al. | 2601.22661 | null |
| 2026-01-29 | Speech Quality-Based Localization of Low-Quality Speech and Text-to-Speech Synthesis Artefacts | Michael Kuhlmann et.al. | 2601.21886 | null |
| 2026-01-28 | Audio Deepfake Detection in the Age of Advanced Text-to-Speech models | Robin Singh et.al. | 2601.20510 | null |
| 2026-01-28 | Erasing Your Voice Before It’s Heard: Training-free Speaker Unlearning for Zero-shot Text-to-Speech | Myungjin Lee et.al. | 2601.20481 | null |
| 2026-01-29 | Unit-Based Agent for Semi-Cascaded Full-Duplex Dialogue Systems | Haoyuan Yu et.al. | 2601.20230 | null |
| 2026-01-27 | T-Mimi: A Transformer-based Mimi Decoder for Real-Time On-Phone TTS | Haibin Wu et.al. | 2601.20094 | null |
| 2026-01-26 | Neural Multi-Speaker Voice Cloning for Nepali in Low-Resource Settings | Aayush M. Shrestha et.al. | 2601.18694 | null |
| 2026-01-26 | UrgentMOS: Unified Multi-Metric and Preference Learning for Robust Speech Quality Assessment | Wei Wang et.al. | 2601.18438 | null |
| 2026-01-26 | GAIA: A Data Flywheel System for Training GUI Test-Time Scaling Critic Models | Shaokang Wang et.al. | 2601.18197 | null |
| 2026-01-23 | SonoEdit: Null-Space Constrained Knowledge Editing for Pronunciation Correction in LLM-Based TTS | Ayush Pratap Singh et.al. | 2601.17086 | null |
| 2026-01-22 | Timbre-Aware LLM-based Direct Speech-to-Speech Translation Extendable to Multiple Language Pairs | Lalaram Arya et.al. | 2601.16023 | null |
| 2026-01-22 | Qwen3-TTS Technical Report | Hangrui Hu et.al. | 2601.15621 | null |
| 2026-01-22 | DeepASMR: LLM-Based Zero-Shot ASMR Speech Generation for Anyone of Any Voice | Leying Zhang et.al. | 2601.15596 | null |
| 2026-01-20 | Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum | Mohammed Salah Al-Radhi et.al. | 2601.14472 | null |
| 2026-01-28 | Quantifying Speaker Embedding Phonological Rule Interactions in Accented Speech Synthesis | Thanathai Lertpetchpun et.al. | 2601.14417 | null |
| 2026-01-20 | Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches | Changhao Pan et.al. | 2601.13910 | null |
| 2026-01-19 | Lombard Speech Synthesis for Any Voice with Controllable Style Embeddings | Seymanur Akti et.al. | 2601.12966 | null |
| 2026-01-18 | A Unified Neural Codec Language Model for Selective Editable Text to Speech Generation | Hanchen Pei et.al. | 2601.12480 | null |
| 2026-01-18 | ParaMETA: Towards Learning Disentangled Paralinguistic Speaking Styles Representations from Speech | Haowei Lou et.al. | 2601.12289 | null |
| 2026-01-18 | Confidence-based Filtering for Speech Dataset Curation with Generative Speech Enhancement Using Discrete Tokens | Kazuki Yamauchi et.al. | 2601.12254 | null |
| 2026-01-17 | Examining possible doubly topped baryon configurations | M. Shekari Tousi et.al. | 2601.11985 | null |
| 2026-01-16 | FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning | Tanyu Chen et.al. | 2601.11141 | null |
| 2026-01-16 | Redefining Machine Simultaneous Interpretation: From Incremental Translation to Human-Like Strategies | Qianen Zhang et.al. | 2601.11002 | null |
| 2026-01-20 | VoiceSculptor: Your Voice, Designed By You | Jingbin Hu et.al. | 2601.10629 | null |
| 2026-01-15 | Memo-SQL: Structured Decomposition and Experience-Driven Self-Correction for Training-Free NL2SQL | Zerui Yang et.al. | 2601.10011 | null |
| 2026-01-13 | Decoding Order Matters in Autoregressive Speech Synthesis | Minghui Zhao et.al. | 2601.08450 | null |
| 2026-01-12 | LJ-Spoof: A Generatively Varied Corpus for Audio Anti-Spoofing and Synthesis Source Tracing | Surya Subramani et.al. | 2601.07958 | null |
| 2026-01-11 | Bridging Attribution and Open-Set Detection using Graph-Augmented Instance Learning in Synthetic Speech | Mohd Mujtaba Akhtar et.al. | 2601.07064 | null |
| 2026-01-10 | Lightweight Resolution-Aware Audio Deepfake Detection via Cross-Scale Attention and Consistency Learning | K. A. Shahriar et.al. | 2601.06560 | null |
| 2026-01-10 | 3D CoCa v2: Contrastive Learners with Test-Time Search for Generalizable Spatial Intelligence | Hao Tang et.al. | 2601.06496 | null |
| 2026-01-09 | SPAM: Style Prompt Adherence Metric for Prompt-based TTS | Chanhee Cho et.al. | 2601.05554 | null |
| 2026-01-08 | Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization | Yuxiang Ji et.al. | 2601.05432 | null |
| 2026-01-08 | CosyEdit: Unlocking End-to-End Speech Editing Capability from Zero-Shot Text-to-Speech Models | Junyang Chen et.al. | 2601.05329 | null |
| 2026-01-08 | FlexiVoice: Enabling Flexible Style Control in Zero-Shot TTS with Natural Language Instructions | Dekun Chen et.al. | 2601.04656 | null |
| 2026-01-04 | LEMAS: Large A 150K-Hour Large-scale Extensible Multilingual Audio Suite with Generative Speech Models | Zhiyuan Zhao et.al. | 2601.04233 | null |
| 2026-01-07 | Agentic Rubrics as Contextual Verifiers for SWE Agents | Mohit Raghavendra et.al. | 2601.04171 | null |
| 2026-01-09 | IndexTTS 2.5 Technical Report | Yunpei Li et.al. | 2601.03888 | null |
| 2026-01-07 | ReStyle-TTS: Relative and Continuous Style Control for Zero-Shot Speech Synthesis | Haitao Li et.al. | 2601.03632 | null |
| 2026-01-06 | Tigrinya Number Verbalization: Rules, Algorithm, and Implementation | Fitsum Gaim et.al. | 2601.03403 | null |
| 2026-01-06 | Segment-Aware Conditioning for Training-Free Intra-Utterance Emotion and Duration Control in Text-to-Speech | Qifan Liang et.al. | 2601.03170 | null |
| 2026-01-24 | XLSR-MamBo: Scaling the Hybrid Mamba-Attention Backbone for Audio Deepfake Detection | Kwok-Ho Ng et.al. | 2601.02944 | null |
| 2026-01-06 | Vulnerabilities of Audio-Based Biometric Authentication Systems Against Deepfake Speech Synthesis | Mengze Hong et.al. | 2601.02914 | null |
| 2026-01-06 | Vclip: Face-based Speaker Generation by Face-voice Association Learning | Yao Shi et.al. | 2601.02753 | null |
| 2026-01-05 | Towards Prosodically Informed Mizo TTS without Explicit Tone Markings | Abhijit Mohanta et.al. | 2601.02073 | null |
| 2026-01-05 | A Training-Free Large Reasoning Model-based Knowledge Tracing Framework for Unified Prediction and Prescription | Unggi Lee et.al. | 2601.01708 | null |
| 2026-01-08 | MM-Sonate: Multimodal Controllable Audio-Video Generation with Zero-Shot Voice Cloning | Chunyu Qiang et.al. | 2601.01568 | null |
| 2026-01-04 | OV-InstructTTS: Towards Open-Vocabulary Instruct Text-to-Speech | Yong Ren et.al. | 2601.01459 | null |
| 2026-01-07 | SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving | Chaofan Tao et.al. | 2601.01426 | null |
| 2026-01-01 | DepFlow: Disentangled Speech Generation to Mitigate Semantic Bias in Depression Detection | Yuxin Li et.al. | 2601.00303 | null |
| 2026-01-01 | Latent Flow Matching for Expressive Singing Voice Synthesis | Minhyeok Yun et.al. | 2601.00217 | null |
| 2025-12-30 | A closer look at the young stellar group around Sh 2-295 | João Victor Corrêa-Rodrigues et.al. | 2512.24388 | null |
| 2025-12-29 | MiMo-Audio: Audio Language Models are Few-Shot Learners | Xiaomi LLM-Core Team et.al. | 2512.23808 | link |
| 2025-12-29 | AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration | Minjiang Huang et.al. | 2512.23300 | link |
| 2025-12-31 | Task-oriented Learnable Diffusion Timesteps for Universal Few-shot Learning of Dense Tasks | Changgyoon Oh et.al. | 2512.23210 | null |
| 2025-12-27 | Scaling Unverifiable Rewards: A Case Study on Visual Insights | Shuyu Gan et.al. | 2512.22650 | null |
| 2025-12-27 | ManchuTTS: Towards High-Quality Manchu Speech Synthesis via Flow Matching and Hierarchical Text Representation | Suhua Wang et.al. | 2512.22491 | null |
| 2025-12-26 | SWE-RM: Execution-free Feedback For Software Engineering Agents | KaShun Shum et.al. | 2512.21919 | null |
| 2025-12-25 | Zero-Shot to Zero-Lies: Detecting Bengali Deepfake Audio through Transfer Learning | Most. Sharmin Sultana Samu et.al. | 2512.21702 | null |
| 2025-12-22 | Picosecond laser test unit for photosensor characterization at ambient and low temperatures | Matthias Raphael Stock et.al. | 2512.19667 | null |
| 2025-12-22 | dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models | Yi Xin et.al. | 2512.19433 | null |
| 2025-12-22 | JoyVoice: Long-Context Conditioning for Anthropomorphic Multi-Speaker Conversational Synthesis | Fan Yu et.al. | 2512.19090 | null |
| 2025-12-21 | Smark: A Watermark for Text-to-Speech Diffusion Models via Discrete Wavelet Transform | Yichuan Zhang et.al. | 2512.18791 | null |
| 2025-12-21 | Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis | Pengchao Feng et.al. | 2512.18699 | null |
| 2025-12-20 | The MEVIR 2 Framework: A Virtue-Informed Moral-Epistemic Model of Human Trust Decisions | Daniel Schwabe et.al. | 2512.18539 | null |
| 2025-12-19 | Training Text-to-Speech Model with Purely Synthetic Data: Feasibility, Sensitivity, and Generalization Capability | Tingxiao Zhou et.al. | 2512.17356 | null |
| 2025-12-19 | Robust TTS Training via Self-Purifying Flow Matching for the WildSpoof 2026 TTS Track | June Young Yi et.al. | 2512.17293 | null |
| 2025-12-19 | Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience | Jiangjie Chen et.al. | 2512.17260 | null |
| 2025-12-17 | Rotatable IRS-Assisted 6DMA Communications: A Two-timescale Design | Chao Zhou et.al. | 2512.15092 | null |
| 2025-12-16 | Robust Training of Singing Voice Synthesis Using Prior and Posterior Uncertainty | Yiwen Zhao et.al. | 2512.14653 | null |
| 2025-12-16 | GLM-TTS Technical Report | Jiayan Cui et.al. | 2512.14291 | null |
| 2026-01-04 | DisCo-Speech: Controllable Zero-Shot Speech Generation with A Disentangled Speech Codec | Tao Li et.al. | 2512.13251 | null |
| 2025-12-13 | F5-TTS-RO: Extending F5-TTS to Romanian TTS via Lightweight Input Adaptation | Radu-Gabriel Chivereanu et.al. | 2512.12297 | null |
| 2025-12-11 | Limits and Gains of Test-Time Scaling in Vision-Language Reasoning | Mohammadjavad Ahmadpour et.al. | 2512.11109 | null |
| 2025-12-11 | CompanionCast: A Multi-Agent Conversational AI Framework with Spatial Audio for Social Co-Viewing Experiences | Yiyang Wang et.al. | 2512.10918 | null |
| 2025-12-10 | DMP-TTS: Disentangled multi-modal Prompting for Controllable Text-to-Speech with Chained Guidance | Kang Yin et.al. | 2512.09504 | null |
| 2025-12-09 | LG Uplus System with Multi-Speaker IDs and Discriminator-based Sub-Judges for the WildSpoof Challenge | Jinyoung Park et.al. | 2512.09000 | null |
| 2025-12-08 | Beyond Unified Models: A Service-Oriented Approach to Low Latency, Context Aware Phonemization for Real Time TTS | Mahta Fetrat et.al. | 2512.08006 | null |
| 2025-12-09 | Performance Benchmarking of Tensor Trains for accelerated Quantum-Inspired Homogenization on TPU, GPU and CPU architectures | Sascha H. Hauck et.al. | 2512.07811 | null |
| 2025-12-05 | Simulating Life Paths with Digital Twins: AI-Generated Future Selves Influence Decision-Making and Expand Human Choice | Rachel Poonsiriwong et.al. | 2512.05397 | null |
| 2025-11-23 | SyncVoice: Towards Video Dubbing with Vision-Augmented Pretrained TTS Model | Kaidi Wang et.al. | 2512.05126 | null |
| 2025-12-04 | YingMusic-Singer: Zero-shot Singing Voice Synthesis and Editing with Annotation-free Melody Guidance | Junjie Zheng et.al. | 2512.04779 | null |
| 2025-12-04 | M3-TTS: Multi-modal DiT Alignment & Mel-latent for Zero-shot High-fidelity Speech Synthesis | Xiaopeng Wang et.al. | 2512.04720 | null |
| 2025-12-04 | RRPO: Robust Reward Policy Optimization for LLM-based Emotional TTS | Cong Wang et.al. | 2512.04552 | null |
| 2025-12-03 | Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation | Hang Xu et.al. | 2512.03996 | null |
| 2025-12-02 | Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach | Siyuan Yang et.al. | 2512.02834 | null |
| 2025-12-02 | Generative Multi-modal Feedback for Singing Voice Synthesis Evaluation | Xueyan Li et.al. | 2512.02523 | null |
| 2025-12-01 | The Art of Scaling Test-Time Compute for Large Language Models | Aradhye Agarwal et.al. | 2512.02008 | null |
| 2025-11-30 | Arabic TTS with FastPitch: Reproducible Baselines, Adversarial Training, and Oversmoothing Analysis | Lars Nippert et.al. | 2512.00937 | null |
| 2025-11-29 | FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal | Hang Xu et.al. | 2512.00438 | null |
| 2025-11-27 | GLA-Grad++: An Improved Griffin-Lim Guided Diffusion Model for Speech Synthesis | Teysir Baoueb et.al. | 2511.22293 | null |
| 2025-11-27 | VSpeechLM: A Visual Speech Language Model for Visual Text-to-Speech Task | Yuyue Wang et.al. | 2511.22229 | null |
| 2025-11-21 | Asking LLMs to Verify First is Almost Free Lunch | Shiguang Wu et.al. | 2511.21734 | null |
| 2025-11-26 | TSGM: Regular and Irregular Time-series Generation using Score-based Generative Models | Haksoo Lim et.al. | 2511.21335 | null |
| 2025-11-26 | Multi-Reward GRPO for Stable and Prosodic Single-Codebook TTS LLMs at Scale | Yicheng Zhong et.al. | 2511.21270 | null |
| 2025-11-26 | MUSE: Manipulating Unified Framework for Synthesizing Emotions in Images via Test-Time Optimization | Yingjie Xia et.al. | 2511.21051 | null |
| 2025-11-26 | CartoonSing: Unifying Human and Nonhuman Timbres in Singing Generation | Jionghao Han et.al. | 2511.21045 | null |
| 2025-10-30 | Transforming Higher Education with AI-Powered Video Lectures | Dengsheng Zhang et.al. | 2511.20660 | null |
| 2025-11-25 | Continual Audio Deepfake Detection via Universal Adversarial Perturbation | Wangjie Li et.al. | 2511.19974 | null |
| 2025-11-26 | Scale Where It Matters: Training-Free Localized Scaling for Diffusion Models | Qin Ren et.al. | 2511.19917 | null |
| 2025-11-23 | InstructAudio: Unified speech and music generation with natural language instruction | Chunyu Qiang et.al. | 2511.18487 | null |
| 2025-11-22 | A superpersuasive autonomous policy debating system | Allen Roush et.al. | 2511.17854 | null |
| 2025-11-21 | AI in Music and Sound: Pedagogical Reflections, Post-Structuralist Approaches and Creative Outcomes in Seminar Practice | Guilherme Coelho et.al. | 2511.17425 | null |
| 2025-11-20 | Codec2Vec: Self-Supervised Speech Representation Learning Using Neural Speech Codecs | Wei-Cheng Tseng et.al. | 2511.16639 | null |
| 2025-11-20 | SceneGuard: Training-Time Voice Protection with Scene-Consistent Audible Background Noise | Rui Sang et.al. | 2511.16114 | null |
| 2025-11-24 | PresentCoach: Dual-Agent Presentation Coaching through Exemplars and Interactive Feedback | Sirui Chen et.al. | 2511.15253 | null |
| 2025-11-18 | Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech | Nam-Gyu Kim et.al. | 2511.14824 | null |
| 2025-11-06 | The Impact of Prosodic Segmentation on Speech Synthesis of Spontaneous Speech | Julio Cesar Galdino et.al. | 2511.14779 | null |
| 2025-11-16 | Hi-Reco: High-Fidelity Real-Time Conversational Digital Humans | Hongbin Huang et.al. | 2511.12662 | null |
| 2025-11-15 | VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing | Zhisheng Zheng et.al. | 2511.12347 | null |
| 2025-11-14 | CLARITY: Contextual Linguistic Adaptation and Accent Retrieval for Dual-Bias Mitigation in Text-to-Speech Generation | Crystal Min Hui Poon et.al. | 2511.11104 | null |
| 2025-11-14 | Synthetic Voices, Real Threats: Evaluating Large Text-to-Speech Models in Generating Harmful Audio | Guangke Chen et.al. | 2511.10913 | null |
| 2025-11-13 | Curved Worlds, Clear Boundaries: Generalizing Speech Deepfake Detection using Hyperbolic and Spherical Geometry Spaces | Farhan Sheth et.al. | 2511.10793 | null |
| 2025-11-13 | VocalNet-M2: Advancing Low-Latency Spoken Language Modeling via Integrated Multi-Codebook Tokenization and Multi-Token Prediction | Yuhao Wang et.al. | 2511.10232 | null |
| 2025-11-13 | Time-Layer Adaptive Alignment for Speaker Similarity in Flow-Matching Based Zero-Shot TTS | Haoyu Li et.al. | 2511.09995 | null |
| 2025-11-30 | SpeechJudge: Towards Human-Level Judgment for Speech Naturalness | Xueyao Zhang et.al. | 2511.07931 | link |
| 2025-11-24 | SynTTS-Commands: A Public Dataset for On-Device KWS via TTS-Synthesized Multilingual Speech | Lu Gan et.al. | 2511.07821 | null |
| 2025-11-10 | Generating Novel and Realistic Speakers for Voice Conversion | Meiying Melissa Chen et.al. | 2511.07135 | null |
| 2025-10-26 | Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation | Canxiang Yan et.al. | 2511.05516 | null |
| 2025-11-07 | Shared Latent Representation for Joint Text-to-Audio-Visual Synthesis | Dogucan Yaman et.al. | 2511.05432 | null |
| 2025-11-07 | Synthesizing speech with selected perceptual voice qualities - A case study with creaky voice | Frederik Rautenberg et.al. | 2511.05143 | null |
| 2025-11-19 | Step-Audio-EditX Technical Report | Chao Yan et.al. | 2511.03601 | null |
| 2025-11-05 | Seeing What You Say: Expressive Image Generation from Speech | Jiyoung Lee et.al. | 2511.03423 | null |
| 2025-11-05 | PolyNorm: Few-Shot LLM-Based Text Normalization for Text-to-Speech | Michel Wong et.al. | 2511.03080 | null |
| 2025-11-04 | Augmenting Open-Vocabulary Dysarthric Speech Assessment with Human Perceptual Supervision | Kaimeng Jia et.al. | 2511.02270 | null |
| 2025-11-03 | Toward Objective and Interpretable Prosody Evaluation in Text-to-Speech: A Linguistically Motivated Approach | Cedric Chan et.al. | 2511.02104 | null |
| 2025-10-29 | Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph | Fali Wang et.al. | 2511.00086 | null |
| 2025-10-31 | Reconstructing Unseen Sentences from Speech-related Biosignals for Open-vocabulary Neural Communication | Deok-Seon Kim et.al. | 2510.27247 | null |
| 2025-10-30 | Two-Timescale Optimization Framework for IAB-Enabled Heterogeneous UAV Networks | Jikang Deng et.al. | 2510.26578 | null |
| 2025-10-30 | SP-MCQA: Evaluating Intelligibility of TTS Beyond the Word Level | Hitomi Jin Ling Tee et.al. | 2510.26190 | null |
| 2025-10-30 | Reasoning Path Divergence: A New Metric and Curation Strategy to Unlock LLM Diverse Thinking | Feng Ju et.al. | 2510.26122 | null |
| 2025-10-30 | Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks | Davide Romano et.al. | 2510.25623 | null |
| 2025-10-27 | SFMS-ALR: Script-First Multilingual Speech Synthesis with Adaptive Locale Resolution | Dharma Teja Donepudi et.al. | 2510.25178 | null |
| 2025-10-28 | Can Aha Moments Be Fake? Identifying True and Decorative Thinking Steps in Chain-of-Thought | Jiachen Zhao et.al. | 2510.24941 | null |
| 2025-10-28 | Bayesian Speech synthesizers Can Learn from Multiple Teachers | Ziyang Zhang et.al. | 2510.24372 | null |
| 2025-10-28 | SoulX-Podcast: Towards Realistic Long-form Podcasts with Dialectal and Paralinguistic Diversity | Hanke Xie et.al. | 2510.23541 | null |
| 2025-10-28 | BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents | Litu Ou et.al. | 2510.23458 | null |
| 2025-10-26 | UltraVoice: Scaling Fine-Grained Style-Controlled Speech Conversations for Spoken Dialogue Models | Wenming Tu et.al. | 2510.22588 | null |
| 2025-10-25 | T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models | Jindong Yang et.al. | 2510.22366 | null |
| 2025-10-23 | GuitarFlow: Realistic Electric Guitar Synthesis From Tablatures via Flow Matching and Style Transfer | Jackson Loth et.al. | 2510.21872 | null |
| 2025-10-24 | StylePitcher: Generating Style-Following and Expressive Pitch Curves for Versatile Singing Tasks | Jingyue Huang et.al. | 2510.21685 | null |
| 2025-10-24 | SHAP Meets Tensor Networks: Provably Tractable Explanations with Parallelism | Reda Marzouk et.al. | 2510.21599 | null |
| 2025-10-23 | Vox-Evaluator: Enhancing Stability and Fidelity for Zero-shot TTS with A Multi-Level Evaluator | Hualei Wang et.al. | 2510.20210 | null |
| 2025-10-22 | EchoFake: A Replay-Aware Dataset for Practical Speech Deepfake Detection | Tong Zhang et.al. | 2510.19414 | null |
| 2025-10-21 | ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive Text-to-Speech Generation | Haowei Lou et.al. | 2510.18308 | null |
| 2025-10-19 | U-Codec: Ultra Low Frame-rate Neural Speech Codec for Fast High-fidelity Speech Generation | Xusheng Yang et.al. | 2510.16718 | null |
| 2025-10-18 | TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model | Bin Yu et.al. | 2510.16449 | null |
| 2025-10-22 | VoiceMorph: How AI Voice Morphing Reveals the Boundaries of Auditory Self-Recognition | Kye Shimizu et.al. | 2510.16192 | null |
| 2025-10-15 | Optimal Aggregation of LLM and PRM Signals for Efficient Test-Time Scaling | Peng Kuang et.al. | 2510.13918 | null |
| 2025-10-15 | Generative Universal Verifier as Multimodal Meta-Reasoner | Xinchen Zhang et.al. | 2510.13804 | null |
| 2025-10-15 | Closing the Gap Between Text and Speech Understanding in LLMs | Santiago Cuervo et.al. | 2510.13632 | null |
| 2025-10-15 | Mismatch Aware Guidance for Robust Emotion Control in Auto-Regressive TTS Models | Yizhou Peng et.al. | 2510.13293 | null |
| 2025-10-15 | StressTransfer: Stress-Aware Speech-to-Speech Translation with Emphasis Preservation | Xi Chen et.al. | 2510.13194 | null |
| 2025-10-23 | Continuous-Token Diffusion for Speaker-Referenced TTS in Multimodal LLMs | Xinlu He et.al. | 2510.12995 | null |
| 2025-10-15 | DiSTAR: Diffusion over a Scalable Token Autoregressive Representation for Speech Generation | Yakun Song et.al. | 2510.12210 | null |
| 2025-10-13 | BridgeCode: A Dual Speech Representation Paradigm for Autoregressive Zero-Shot Text-to-Speech Synthesis | Jingyuan Xing et.al. | 2510.11646 | null |
| 2025-10-13 | Perturbation Self-Supervised Representations for Cross-Lingual Emotion TTS: Stage-Wise Modeling of Emotion and Speaker | Cheng Gong et.al. | 2510.11124 | null |
| 2025-10-14 | ParsVoice: A Large-Scale Multi-Speaker Persian Speech Corpus for Text-to-Speech Synthesis | Mohammad Javad Ranjbar Kalahroodi et.al. | 2510.10774 | null |
| 2025-10-17 | MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations | Wenxiang Guo et.al. | 2510.10396 | null |
| 2025-10-11 | Unifying Tree Search Algorithm and Reward Design for LLM Reasoning: A Survey | Jiaqi Wei et.al. | 2510.09988 | null |
| 2025-10-10 | O_O-VC: Synthetic Data-Driven One-to-One Alignment for Any-to-Any Voice Conversion | Huu Tuong Tu et.al. | 2510.09061 | null |
| 2025-10-10 | DiTSinger: Scaling Singing Voice Synthesis with Diffusion Transformer and Implicit Alignment | Zongcai Du et.al. | 2510.09016 | null |
| 2025-10-04 | Less Diverse, Less Safe: The Indirect But Pervasive Risk of Test-Time Scaling in Large Language Models | Shahriar Kabir Nahin et.al. | 2510.08592 | null |
| 2025-10-09 | DialoSpeech: Dual-Speaker Dialogue Generation with LLM and Flow Matching | Hanke Xie et.al. | 2510.08373 | null |
| 2025-10-09 | IntMeanFlow: Few-step Speech Generation with Integral Velocity Distillation | Wei Wang et.al. | 2510.07979 | null |
| 2025-11-05 | VoiceAgentBench: Are Voice Assistants ready for agentic tasks? | Dhruv Jain et.al. | 2510.07978 | null |
| 2025-10-09 | Parallel Test-Time Scaling for Latent Reasoning Models | Runyang You et.al. | 2510.07745 | null |
| 2025-10-08 | AsyncSpade: Efficient Test-Time Scaling with Asynchronous Sparse Decoding | Shuqing Luo et.al. | 2510.07486 | null |
| 2025-10-08 | Making Machines Sound Sarcastic: LLM-Enhanced and Retrieval-Guided Sarcastic Speech Synthesis | Zhu Li et.al. | 2510.07096 | null |
| 2025-10-08 | Towards Responsible Evaluation for Text-to-Speech | Yifan Yang et.al. | 2510.06927 | null |
| 2025-10-08 | XLSR-Kanformer: A KAN-Intergrated model for Synthetic Speech Detection | Phuong Tuan Dat et.al. | 2510.06706 | null |
| 2025-10-07 | Test-Time Scaling of Reasoning Models for Machine Translation | Zihao Li et.al. | 2510.06471 | null |
| 2025-10-07 | TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning | Jiaru Zou et.al. | 2510.06217 | null |
| 2025-10-07 | Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification | Weihao Zeng et.al. | 2510.06135 | null |
| 2025-10-07 | ECTSpeech: Enhancing Efficient Speech Synthesis via Easy Consistency Tuning | Tao Zhu et.al. | 2510.05984 | null |
| 2025-10-07 | Data-efficient Targeted Token-level Preference Optimization for LLM-based Text-to-Speech | Rikuto Kotoge et.al. | 2510.05799 | null |
| 2025-10-07 | EMORL-TTS: Reinforcement Learning for Fine-Grained Emotion Control in LLM-based TTS | Haoxun Li et.al. | 2510.05758 | null |
| 2025-10-07 | Sparse deepfake detection promotes better disentanglement | Antoine Teissier et.al. | 2510.05696 | null |
| 2025-10-09 | Paper2Video: Automatic Video Generation from Scientific Papers | Zeyu Zhu et.al. | 2510.05096 | null |
| 2025-10-28 | Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models | Yolo Yunlong Tang et.al. | 2510.05034 | null |
| 2025-10-06 | Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba | Baher Mohammad et.al. | 2510.04738 | null |
| 2025-10-07 | Synthetic Audio Forensics Evaluation (SAFE) Challenge | Kirill Trapeznikov et.al. | 2510.03387 | null |
| 2025-10-03 | Evaluation of preprocessing pipelines in the creation of in-the-wild TTS datasets | Matías Di Bernardo et.al. | 2510.03111 | null |
| 2025-10-03 | Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech | Hieu-Nghia Huynh-Nguyen et.al. | 2510.02848 | null |
| 2025-10-02 | On the Role of Temperature Sampling in Test-Time Scaling | Yuheng Wu et.al. | 2510.02611 | null |
| 2025-10-02 | Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement | Jianing Yang et.al. | 2510.01722 | null |
| 2025-09-30 | BatonVoice: An Operationalist Framework for Enhancing Controllable Speech Synthesis with Linguistic Intelligence from LLMs | Yue Wang et.al. | 2509.26514 | null |
| 2025-09-30 | Go with Your Gut: Scaling Confidence for Autoregressive Image Generation | Harold Haodong Chen et.al. | 2509.26376 | null |
| 2025-09-30 | HiStyle: Hierarchical Style Embedding Predictor for Text-Prompt-Guided Controllable Speech Synthesis | Ziyu Zhang et.al. | 2509.25842 | null |
| 2025-09-29 | Emotion-Aligned Generation in Diffusion Text to Speech Models via Preference-Guided Optimization | Jiacheng Shi et.al. | 2509.25416 | null |
| 2025-09-29 | Incentive-Aligned Multi-Source LLM Summaries | Yanchen Jiang et.al. | 2509.25184 | null |
| 2025-09-29 | MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech | Chengyao Wang et.al. | 2509.25131 | null |
| 2025-09-29 | LatentEvolve: Self-Evolving Test-Time Scaling in Latent Space | Guibin Zhang et.al. | 2509.24771 | null |
| 2025-09-29 | VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning | Yixuan Zhou et.al. | 2509.24650 | null |
| 2025-09-29 | Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis | Tianrui Wang et.al. | 2509.24629 | null |
| 2025-09-29 | ContextPRM: Leveraging Contextual Coherence for multi-domain Test-Time Scaling | Haotian Zhang et.al. | 2509.24460 | null |
| 2025-09-29 | UniFlow-Audio: Unified Flow Matching for Audio Generation from Omni-Modalities | Xuenan Xu et.al. | 2509.24391 | null |
| 2025-09-28 | Generalizable Speech Deepfake Detection via Information Bottleneck Enhanced Adversarial Alignment | Pu Huang et.al. | 2509.23618 | null |
| 2025-10-07 | Training Vision-Language Process Reward Models for Test-Time Scaling in Multimodal Reasoning: Key Insights and Lessons Learned | Brandon Ong et.al. | 2509.23250 | null |
| 2025-09-27 | BFA: Real-time Multilingual Text-to-speech Forced Alignment | Abdul Rehman et.al. | 2509.23147 | null |
| 2025-09-25 | DiaMoE-TTS: A Unified IPA-Based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation | Ziqi Chen et.al. | 2509.22727 | null |
| 2025-09-24 | PerformSinger: Multimodal Singing Voice Synthesis Leveraging Synchronized Lip Cues from Singing Performance Videos | Ke Gu et.al. | 2509.22718 | null |
| 2025-09-26 | Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time | Yixuan Han et.al. | 2509.22572 | null |
| 2025-09-26 | Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis | Zhikang Niu et.al. | 2509.22167 | null |
| 2025-09-26 | Speaker Anonymisation for Speech-based Suicide Risk Detection | Ziyun Cui et.al. | 2509.22148 | null |
| 2025-09-26 | Think Right, Not More: Test-Time Scaling for Numerical Claim Verification | Primakov Chungkham et.al. | 2509.22101 | null |
| 2025-09-26 | Redefining Machine Simultaneous Interpretation: From Incremental Translation to Human-Like Strategies | Qianen Zhang et.al. | 2509.21801 | null |
| 2025-09-26 | SPADE: Structured Pruning and Adaptive Distillation for Efficient LLM-TTS | Tan Dat Nguyen et.al. | 2509.20802 | null |
| 2025-09-24 | Reconstruction-Based Adaptive Scheduling Using AI Inferences in Safety-Critical Systems | Samer Alshaer et.al. | 2509.20513 | null |
| 2025-09-24 | Objective Evaluation of Prosody and Intelligibility in Speech Synthesis via Conditional Prediction of Discrete Tokens | Ismail Rasim Ulgen et.al. | 2509.20485 | null |
| 2025-09-20 | Beyond Global Emotion: Fine-Grained Emotional Speech Synthesis with Dynamic Word-Level Modulation | Sirui Wang et.al. | 2509.20378 | null |
| 2025-09-25 | Measuring Prosody Diversity in Zero-Shot TTS: A New Metric, Benchmark, and Exploration | Yifan Yang et.al. | 2509.19928 | null |
| 2025-09-24 | CoMelSinger: Discrete Token-Based Zero-Shot Singing Synthesis With Structured Melody Control and Guidance | Junchuan Zhao et.al. | 2509.19883 | null |
| 2025-09-24 | Eliminating stability hallucinations in llm-based tts models via attention guidance | ShiMing Wang et.al. | 2509.19852 | null |
| 2025-09-24 | Efficient Speech Watermarking for Speech Synthesis via Progressive Knowledge Distillation | Yang Cui et.al. | 2509.19812 | null |
| 2025-09-24 | PART: Progressive Alignment Representation Training for Multilingual Speech-To-Text with LLMs | Pei Zhang et.al. | 2509.19745 | null |
| 2025-09-24 | Selective Classifier-free Guidance for Zero-shot Text-to-speech | John Zheng et.al. | 2509.19668 | null |
| 2025-09-23 | Are We Scaling the Right Thing? A System Perspective on Test-Time Scaling | Youpeng Zhao et.al. | 2509.19645 | null |
| 2025-09-23 | Finding My Voice: Generative Reconstruction of Disordered Speech for Automated Clinical Evaluation | Karen Rosero et.al. | 2509.19231 | null |
| 2025-09-23 | Investigating Test-Time Scaling with Reranking for Machine Translation | Shaomu Tan et.al. | 2509.19020 | null |
| 2025-09-23 | No Verifiable Reward for Prosody: Toward Preference-Guided Prosody Learning in TTS | Seungyoun Shin et.al. | 2509.18531 | null |
| 2025-09-22 | Discrete-time diffusion-like models for speech synthesis | Xiaozhou Tan et.al. | 2509.18470 | null |
| 2025-09-22 | TMD-TTS: A Unified Tibetan Multi-Dialect Text-to-Speech Synthesis for Ü-Tsang, Amdo and Kham Speech Dataset Generation | Yutong Liu et.al. | 2509.18060 | null |
| 2025-09-22 | Variation in Verification: Understanding Verification Dynamics in Large Language Models | Yefan Zhou et.al. | 2509.17995 | null |
| 2025-09-22 | Nord-Parl-TTS: Finnish and Swedish TTS Dataset from Parliament Speech | Zirui Li et.al. | 2509.17988 | null |
| 2025-09-23 | Mitigating Strategy-Selection Bias in Reasoning for More Effective Test-Time Scaling | Zongqian Wu et.al. | 2509.17905 | null |
| 2025-09-22 | Audiobook-CC: Controllable Long-context Speech Generation for Multicast Audiobook | Min Liu et.al. | 2509.17516 | null |
| 2025-09-21 | Bridging the gap between training and inference in LM-based TTS models | Ruonan Zhang et.al. | 2509.17021 | null |
| 2025-09-21 | MBCodec:Thorough disentangle for high-fidelity audio compression | Ruonan Zhang et.al. | 2509.17006 | null |
| 2025-09-19 | Fed-PISA: Federated Voice Cloning via Personalized Identity-Style Adaptation | Qi Wang et.al. | 2509.16010 | null |
| 2025-09-19 | VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency | Nikita Torgashov et.al. | 2509.15969 | null |
| 2025-09-19 | Deep Dubbing: End-to-End Auto-Audiobook System with Text-to-Timbre and Context-Aware Instruct-TTS | Ziqi Dai et.al. | 2509.15845 | null |
| 2025-09-19 | Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech | Xinlei Niu et.al. | 2509.15492 | null |
| 2025-09-18 | Real-Time Streaming Mel Vocoding with Generative Flow Matching | Simon Welker et.al. | 2509.15085 | null |
| 2025-09-19 | DAIEN-TTS: Disentangled Audio Infilling for Environment-Aware Text-to-Speech Synthesis | Ye-Xin Lu et.al. | 2509.14684 | null |
| 2025-09-23 | Cross-Lingual F5-TTS: Towards Language-Agnostic Voice Cloning and Speech Synthesis | Qingyu Liu et.al. | 2509.14579 | null |
| 2025-10-01 | SpeechWeave: Diverse Multilingual Synthetic Text & Audio Data Generation Pipeline for Training Text to Speech Models | Karan Dua et.al. | 2509.14270 | null |
| 2025-09-17 | Slim-SC: Thought Pruning for Efficient Scaling with Self-Consistency | Colin Hong et.al. | 2509.13990 | null |
| 2025-09-22 | Do You Hear What I Mean? Quantifying the Instruction-Perception Gap in Instruction-Guided Expressive Text-To-Speech Systems | Yi-Cheng Lin et.al. | 2509.13989 | null |
| 2025-09-16 | MSR-Codec: A Low-Bitrate Multi-Stream Residual Codec for High-Fidelity Speech Generation with Information Disentanglement | Jingyu Li et.al. | 2509.13068 | null |
| 2025-09-21 | LTA-thinker: Latent Thought-Augmented Training Framework for Large Language Models on Complex Reasoning | Jiaqi Wang et.al. | 2509.12875 | null |
| 2025-09-16 | Towards personalized, precise and survey-free environment recognition: AI-enhanced sensor fusion without pre-deployment | Ruichen Wang et.al. | 2509.12870 | null |
| 2025-09-16 | A Lightweight Pipeline for Noisy Speech Voice Cloning and Accurate Lip Sync Synthesis | Javeria Amir et.al. | 2509.12831 | null |
| 2025-09-21 | Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization | Jiahao Yu et.al. | 2509.12434 | null |
| 2025-09-15 | Preservation of Language Understanding Capabilities in Speech-aware Large Language Models | Marek Kubis et.al. | 2509.12171 | null |
| 2025-09-29 | FuseCodec: Semantic-Contextual Fusion and Supervision for Neural Codecs | Md Mubtasim Ahasan et.al. | 2509.11425 | null |
| 2025-09-14 | Length-Aware Rotary Position Embedding for Text-Speech Alignment | Hyeongju Kim et.al. | 2509.11084 | null |
| 2025-09-12 | Towards Data Drift Monitoring for Speech Deepfake Detection in the context of MLOps | Xin Wang et.al. | 2509.10086 | null |
| 2025-09-11 | DiTReducio: A Training-Free Acceleration for DiT-Based TTS via Progressive Calibration | Yanru Huo et.al. | 2509.09748 | null |
| 2025-09-12 | DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-To-Speech | Ngoc-Son Nguyen et.al. | 2509.09631 | null |
| 2025-09-11 | HISPASpoof: A New Dataset For Spanish Speech Forensics | Maria Risques et.al. | 2509.09155 | null |
| 2025-09-10 | Accelerating Diffusion Transformer-Based Text-to-Speech with Transformer Layer Caching | Siratish Sakpiboonchit et.al. | 2509.08696 | null |
| 2025-09-14 | Progressive Facial Granularity Aggregation with Bilateral Attribute-based Enhancement for Face-to-Speech Synthesis | Yejin Jeon et.al. | 2509.07376 | null |
| 2025-09-09 | When Fine-Tuning is Not Enough: Lessons from HSAD on Hybrid and Adversarial Audio Spoof Detection | Bin Hu et.al. | 2509.07323 | null |
| 2025-09-08 | Controllable Singing Voice Synthesis using Phoneme-Level Energy Sequence | Yerin Ryu et.al. | 2509.07038 | null |
| 2025-09-07 | Multimodal Fine-grained Context Interaction Graph Modeling for Conversational Speech Synthesis | Zhenqi Jia et.al. | 2509.06074 | null |
| 2025-09-06 | LatinX: Aligning a Multilingual TTS Model with Direct Preference Optimization | Luis Felipe Chary et.al. | 2509.05863 | null |
| 2025-09-08 | Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework | Jie Chen et.al. | 2509.05007 | null |
| 2025-09-04 | Say More with Less: Variable-Frame-Rate Speech Tokenization via Adaptive Clustering and Implicit Duration Coding | Rui-Chen Zheng et.al. | 2509.04685 | null |
| 2025-09-04 | DarkStream: real-time speech anonymization with low latency | Waris Quamer et.al. | 2509.04667 | null |
| 2025-09-04 | AUDETER: A Large-scale Dataset for Deepfake Audio Detection in Open Worlds | Qizhou Wang et.al. | 2509.04345 | null |
| 2025-09-04 | Open-Source Full-Duplex Conversational Datasets for Natural and Interactive Speech Synthesis | Zhitong Zhou et.al. | 2509.04093 | null |
| 2025-09-04 | LibriQuote: A Speech Dataset of Fictional Character Utterances for Expressive Zero-Shot Speech Synthesis | Gaspard Michel et.al. | 2509.04072 | null |
| 2025-09-16 | SwinSRGAN: Swin Transformer-based Generative Adversarial Network for High-Fidelity Speech Super-Resolution | Jiajun Yuan et.al. | 2509.03913 | null |
| 2025-09-03 | Multi-level SSL Feature Gating for Audio Deepfake Detection | Hoan My Tran et.al. | 2509.03409 | null |
| 2025-09-03 | Improving Perceptual Audio Aesthetic Assessment via Triplet Loss and Self-Supervised Embeddings | Dyah A. M. G. Wisnu et.al. | 2509.03292 | null |
| 2025-09-03 | AIVA: An AI-based Virtual Companion for Emotion-aware Interaction | Chenxi Li et.al. | 2509.03212 | null |
| 2025-09-02 | Scale, Don’t Fine-tune: Guiding Multimodal LLMs for Efficient Visual Place Recognition at Test-Time | Jintao Cheng et.al. | 2509.02129 | null |
| 2025-09-04 | FireRedTTS-2: Towards Long Conversational Speech Generation for Podcast and Chatbot | Kun Xie et.al. | 2509.02020 | null |
| 2025-09-03 | MixedG2P-T5: G2P-free Speech Synthesis for Mixed-script texts using Speech Self-Supervised Learning and Language Model | Joonyong Park et.al. | 2509.01391 | null |
| 2025-08-31 | MPO: Multidimensional Preference Optimization for Language Model-based Text-to-Speech | Kangxiang Xia et.al. | 2509.00685 | null |
| 2025-08-31 | Speaker-Conditioned Phrase Break Prediction for Text-to-Speech with Phoneme-Level Pre-trained Language Model | Dong Yang et.al. | 2509.00675 | null |
| 2025-08-29 | Democratizing Agentic AI with Fast Test-Time Scaling on the Edge | Hao Mark Chen et.al. | 2509.00195 | null |
| 2025-08-27 | Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs | Qibin Wang et.al. | 2509.00084 | null |
| 2025-08-28 | Multilingual Dataset Integration Strategies for Robust Audio Deepfake Detection: A SAFE Challenge System | Hashim Ali et.al. | 2508.20983 | null |
| 2025-08-26 | Predicting the optimal noise strength for solving optimization problems with analog Ising machines | Leen Mys et.al. | 2508.19107 | null |
| 2025-08-26 | CLEAR: Continuous Latent Autoregressive Modeling for High-quality and Low-latency Speech Synthesis | Chun Yat Wu et.al. | 2508.19098 | null |
| 2025-08-25 | SwiftF0: Fast and Accurate Monophonic Pitch Detection | Lars Nieradzik et.al. | 2508.18440 | null |
| 2025-08-25 | Unseen Speaker and Language Adaptation for Lightweight Text-To-Speech with Adapters | Alessio Falai et.al. | 2508.18006 | null |
| 2025-08-27 | Vocoder-Projected Feature Discriminator | Takuhiro Kaneko et.al. | 2508.17874 | null |
| 2025-08-25 | ClearMask: Noise-Free and Naturalness-Preserving Protection Against Voice Deepfake Attacks | Yuanda Wang et.al. | 2508.17660 | null |
| 2025-08-24 | Improving French Synthetic Speech Quality via SSML Prosody Control | Nassima Ould Ouali et.al. | 2508.17494 | null |
| 2025-08-23 | WildSpoof Challenge Evaluation Plan | Yihan Wu et.al. | 2508.16858 | null |
| 2025-09-09 | Trust but Verify! A Survey on Verification Design for Test-time Scaling | V Venktesh et.al. | 2508.16665 | null |
| 2025-09-05 | Mitigating Hallucinations in LM-Based TTS Models via Distribution Alignment Using GFlowNets | Chenlin Liu et.al. | 2508.15442 | null |
| 2025-08-25 | Linear Preference Optimization: Decoupled Gradient Control via Absolute Regularization | Rui Wang et.al. | 2508.14947 | null |
| 2025-08-20 | Long-Context Speech Synthesis with Context-Aware Memory | Zhipeng Li et.al. | 2508.14713 | null |
| 2025-08-20 | Improving Resource-Efficient Speech Enhancement via Neural Differentiable DSP Vocoder Refinement | Heitor R. Guimarães et.al. | 2508.14709 | null |
| 2025-08-22 | Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS | Can Jin et.al. | 2508.14313 | null |
| 2025-08-19 | Who Gets the Mic? Investigating Gender Bias in the Speaker Assignment of a Speech-LLM | Dariia Puhach et.al. | 2508.13603 | null |
| 2025-08-18 | Integrating Feedback Loss from Bi-modal Sarcasm Detector for Sarcastic Speech Synthesis | Zhu Li et.al. | 2508.13028 | null |
| 2025-08-18 | Cooperative Sensing-Assisted Predictive Beam Tracking for MIMO-OFDM Networked ISAC Systems | Xiaoyu Yang et.al. | 2508.12723 | null |
| 2025-08-18 | Real-Time Sign Language Gestures to Speech Transcription using Deep Learning | Brandone Fonya et.al. | 2508.12713 | null |
| 2025-08-19 | FNH-TTS: A Fast, Natural, and Human-Like Speech Synthesis System with advanced prosodic modeling based on Mixture of Experts | Qingliang Meng et.al. | 2508.12001 | null |
| 2025-08-15 | MoE-TTS: Enhancing Out-of-Domain Text Understanding for Description-based TTS via Mixture-of-Experts | Heyang Xue et.al. | 2508.11326 | null |
| 2025-10-07 | EmoSSLSphere: Multilingual Emotional Speech Synthesis with Spherical Vectors and Discrete Speech Tokens | Joonyong Park et.al. | 2508.11273 | null |
| 2025-08-14 | Facilitating Personalized TTS for Dysarthric Speakers Using Knowledge Anchoring and Curriculum Learning | Yejin Jeon et.al. | 2508.10412 | null |
| 2025-08-14 | Towards Frame-level Quality Predictions of Synthetic Speech | Michael Kuhlmann et.al. | 2508.10374 | null |
| 2025-08-15 | Training-Free Multimodal Large Language Model Orchestration | Tianyu Xie et.al. | 2508.10016 | null |
| 2025-09-16 | UtterTune: LoRA-Based Target-Language Pronunciation Edit and Control in Multilingual Text-to-Speech | Shuhei Kato et.al. | 2508.09767 | null |
| 2025-08-12 | ProMode: A Speech Prosody Model Conditioned on Acoustic and Textual Inputs | Eray Eren et.al. | 2508.09389 | null |
| 2025-08-12 | Fake-Mamba: Real-Time Speech Deepfake Detection Using Bidirectional Mamba as Self-Attention’s Alternative | Xi Xuan et.al. | 2508.09294 | null |
| 2025-08-12 | HumanOLAT: A Large-Scale Dataset for Full-Body Human Relighting and Novel-View Synthesis | Timo Teufel et.al. | 2508.09137 | null |
| 2025-08-12 | QAMRO: Quality-aware Adaptive Margin Ranking Optimization for Human-aligned Assessment of Audio Generation Systems | Chien-Chun Wang et.al. | 2508.08957 | null |
| 2025-08-10 | Scalable Controllable Accented TTS | Henry Li Xinyuan et.al. | 2508.07426 | null |
| 2025-08-10 | KLASSify to Verify: Audio-Visual Deepfake Detection Using SSL-based Audio and Handcrafted Visual Features | Ivan Kukanov et.al. | 2508.07337 | null |
| 2025-08-12 | XEmoRAG: Cross-Lingual Emotion Transfer with Controllable Intensity Using Retrieval-Augmented Generation | Tianlun Zuo et.al. | 2508.07302 | null |
| 2025-08-09 | Maestro-EVC: Controllable Emotional Voice Conversion Guided by References and Explicit Prosody | Jinsung Yoon et.al. | 2508.06890 | null |
| 2025-08-09 | Text to Speech System for Meitei Mayek Script | Gangular Singh Irengbam et.al. | 2508.06870 | null |
| 2025-08-08 | Llasa+: Free Lunch for Accelerated and Streaming Llama-Based Speech Synthesis | Wenjie Tian et.al. | 2508.06262 | null |
| 2025-08-08 | NEP: Autoregressive Image Editing via Next Editing Token Prediction | Huimin Wu et.al. | 2508.06044 | null |
| 2025-08-07 | A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understanding | Runchuan Ye et.al. | 2508.05385 | null |
| 2025-08-15 | Fairness in Dysarthric Speech Synthesis: Understanding Intrinsic Bias in Dysarthric Speech Cloning using F5-TTS | M Anuprabha et.al. | 2508.05102 | null |
| 2025-08-07 | UniTalker: Conversational Speech-Visual Synthesis | Yifan Hu et.al. | 2508.04585 | null |
| 2025-08-06 | The State Of TTS: A Case Study with Human Fooling Rates | Praveen Srinivasa Varadhan et.al. | 2508.04179 | null |
| 2025-08-29 | Parallel GPT: Harmonizing the Independence and Interdependence of Acoustic and Semantic Information for Zero-Shot Text-to-Speech | Jingyuan Xing et.al. | 2508.04141 | null |
| 2025-07-04 | Analyzing and Improving Speaker Similarity Assessment for Speech Synthesis | Marc-André Carbonneau et.al. | 2507.02176 | null |
| 2025-07-08 | Pronunciation Editing for Finnish Speech using Phonetic Posteriorgrams | Zirui Li et.al. | 2507.02115 | null |
| 2025-07-03 | Multi-interaction TTS toward professional recording reproduction | Hiroki Kanagawa et.al. | 2507.00808 | null |
| 2025-05-27 | Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis | Minsu Kim et.al. | 2505.18972 | null |
| 2025-05-13 | Lightweight End-to-end Text-to-speech Synthesis for low resource on-device applications | Biel Tura Vecino et.al. | 2505.07701 | null |
| 2025-01-16 | Speech Synthesis along Perceptual Voice Quality Dimensions | Frederik Rautenberg et.al. | 2501.08791 | null |
| 2025-06-03 | Low-Resource Text-to-Speech Synthesis Using Noise-Augmented Training of ForwardTacotron | Kishor Kayyar Lakshminarayana et.al. | 2501.05976 | null |
| 2024-12-31 | Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting | Wooseok Han et.al. | 2412.20155 | null |
| 2024-11-12 | Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis | Shijia Liao et.al. | 2411.01156 | null |
| 2024-11-01 | Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis | Théodor Lemerle et.al. | 2410.23320 | null |
| 2024-10-29 | Mitigating Unauthorized Speech Synthesis for Voice Protection | Zhisheng Zhang et.al. | 2410.20742 | null |
| 2025-01-13 | MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion | Sho Inoue et.al. | 2409.09352 | null |
| 2024-09-10 | AS-Speech: Adaptive Style For Speech Synthesis | Zhipeng Li et.al. | 2409.05730 | null |
| 2024-07-02 | FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis | Yinlin Guo et.al. | 2407.00753 | null |
| 2024-06-13 | Text-aware and Context-aware Expressive Audiobook Speech Synthesis | Dake Guo et.al. | 2406.05672 | null |
| 2024-10-25 | FlashSpeech: Efficient Zero-Shot Speech Synthesis | Zhen Ye et.al. | 2404.14700 | null |
| 2024-04-03 | Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation | Rohan Chaudhury et.al. | 2404.01339 | link |
| 2024-04-02 | CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models | Xiang Li et.al. | 2404.00569 | null |
| 2024-03-21 | Building speech corpus with diverse voice characteristics for its prompt-based representation | Aya Watanabe et.al. | 2403.13353 | null |
| 2024-03-19 | EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech | Ziqi Liang et.al. | 2403.08164 | null |
| 2024-02-05 | Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations | Panos Kakoulidis et.al. | 2402.01520 | null |
| 2024-02-19 | Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis | Vinotha R et.al. | 2401.11771 | null |
| 2024-08-28 | ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations | Cheng Gong et.al. | 2312.14398 | null |
| 2024-02-01 | MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis | Wenhao Guan et.al. | 2312.10687 | null |
| 2023-11-28 | HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis | Sang-Hoon Lee et.al. | 2311.12454 | null |
| 2023-12-19 | High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models | Chunyu Qiang et.al. | 2309.15512 | null |
| 2024-10-28 | Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition | Ziyang Ma et.al. | 2309.10294 | null |
| 2023-08-01 | MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis | Shun Lei et.al. | 2307.16012 | null |
| 2023-07-17 | Controllable Emphasis with zero data for text-to-speech | Arnaud Joly et.al. | 2307.07062 | null |
| 2023-07-12 | On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis | Siyang Wang et.al. | 2307.05132 | null |
| 2024-01-26 | Disentanglement in a GAN for Unconditional Speech Synthesis | Matthew Baas et.al. | 2307.01673 | null |
| 2023-06-29 | UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data | Heeseung Kim et.al. | 2306.16083 | null |
| 2023-06-22 | Visual-Aware Text-to-Speech | Mohan Zhou et.al. | 2306.12020 | null |
| 2023-06-21 | CML-TTS A Multilingual Dataset for Speech Synthesis in Low-Resource Languages | Frederico S. Oliveira et.al. | 2306.10097 | null |
| 2023-06-02 | EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis | Haobin Tang et.al. | 2306.00648 | null |
| 2023-05-23 | MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting | Neil Shah et.al. | 2305.11926 | null |
| 2023-10-31 | CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model | Zhen Ye et.al. | 2305.06908 | null |
| 2023-12-19 | Zero-shot text-to-speech synthesis conditioned using self-supervised speech representation model | Kenichi Fujita et.al. | 2304.11976 | null |
| 2023-05-31 | NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers | Kai Shen et.al. | 2304.09116 | null |
| 2023-12-19 | ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations | Neil Shah et.al. | 2303.01261 | null |
| 2023-02-20 | Lip-to-Speech Synthesis in the Wild with Multi-task Learning | Minsu Kim et.al. | 2302.08841 | null |
| 2022-12-07 | UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis | Yi Lei et.al. | 2212.01546 | null |
| 2022-11-30 | Controllable speech synthesis by learning discrete phoneme-level prosodic representations | Nikolaos Ellinas et.al. | 2211.16307 | null |
| 2023-03-15 | Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models | Minki Kang et.al. | 2211.09383 | null |
| 2024-10-01 | Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder | Jan Melechovsky et.al. | 2211.03316 | null |
| 2022-10-03 | Detection of Prosodic Boundaries in Speech Using Wav2Vec 2.0 | Marie Kunešová et.al. | 2209.15032 | null |
| 2022-05-25 | TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS | Xulong Zhang et.al. | 2205.11824 | null |
| 2024-06-06 | Parallel Synthesis for Autoregressive Speech Generation | Po-chun Hsu et.al. | 2204.11806 | null |
| 2023-02-07 | The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance | Lin Zhang et.al. | 2204.05177 | null |
| 2022-03-30 | Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise | Tuomo Raitio et.al. | 2203.10637 | null |
| 2022-01-27 | J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis | Shinnosuke Takamichi et.al. | 2201.10896 | null |
| 2021-11-18 | Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control | Konstantinos Markopoulos et.al. | 2111.09146 | null |
| 2022-08-01 | Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech | Sung-Feng Huang et.al. | 2111.04040 | null |
| 2021-07-13 | Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging | Tamás Gábor Csapó et.al. | 2107.05550 | null |
| 2021-07-08 | VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis | Hui Lu et.al. | 2107.03298 | null |
| 2021-07-07 | Location, Location: Enhancing the Evaluation of Text-to-Speech Synthesis Using the Rapid Prosody Transcription Paradigm | Elijah Gutierrez et.al. | 2107.02527 | null |
| 2021-07-06 | Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input | Tamás Gábor Csapó et.al. | 2107.02003 | null |
| 2021-07-26 | A Survey on Neural Speech Synthesis | Xu Tan et.al. | 2106.15561 | null |
| 2021-06-29 | Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech | Raahil Shah et.al. | 2106.12896 | null |
| 2021-06-22 | Non-native English lexicon creation for bilingual speech synthesis | Arun Baby et.al. | 2106.10870 | null |
| 2021-06-22 | Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters | Mohammed Salah Al-Radhi et.al. | 2106.10481 | null |
| 2021-05-11 | MASS: Multi-task Anthropomorphic Speech Synthesis Framework | Jinyin Chen et.al. | 2105.04124 | null |
| 2021-07-01 | How do Voices from Past Speech Synthesis Challenges Compare Today? | Erica Cooper et.al. | 2105.02373 | null |
| 2022-02-25 | Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis | Erica Cooper et.al. | 2104.12292 | null |
| 2021-04-06 | Diff-TTS: A Denoising Diffusion Model for Text-to-Speech | Myeonghun Jeong et.al. | 2104.01409 | null |
| 2021-06-15 | Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability | Rui Liu et.al. | 2104.01408 | null |
| 2021-03-09 | AudioVisual Speech Synthesis: A brief literature review | Efthymios Georgiou et.al. | 2103.03927 | null |
| 2021-03-29 | GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis | Rui Liu et.al. | 2010.12423 | null |
| 2020-10-19 | Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion | Shengkui Zhao et.al. | 2010.08136 | null |
| 2021-01-07 | Transfer Learning from Speech Synthesis to Voice Conversion with Non-Parallel Training Data | Mingyang Zhang et.al. | 2009.14399 | null |
| 2020-10-26 | Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search | Jaehyeon Kim et.al. | 2005.11129 | null |
| 2020-05-22 | Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario | Zexin Cai et.al. | 2005.10441 | null |
| 2020-02-18 | Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech | Vatsal Aggarwal et.al. | 1911.12760 | null |
| 2019-09-26 | Sequence to Sequence Neural Speech Synthesis with Prosody Modification Capabilities | Slava Shechtman et.al. | 1909.10302 | null |
| 2019-09-10 | Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs | Rob Clark et.al. | 1909.03965 | null |
| 2019-08-28 | Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis | Xin Wang et.al. | 1908.10256 | null |
| 2020-11-04 | Using generative modelling to produce varied intonation for speech synthesis | Zack Hodari et.al. | 1906.04233 | null |
| 2019-09-24 | Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with SampleRNN | David Álvarez et.al. | 1906.00733 | null |
| 2019-05-22 | Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems | Ohsung Kwon et.al. | 1905.08486 | null |
| 2019-02-12 | Generative Moment Matching Network-based Random Modulation Post-filter for DNN-based Singing Voice Synthesis and Neural Double-tracking | Hiroki Tamaru et.al. | 1902.03389 | null |
| 2018-08-21 | Multimodal speech synthesis architecture for unsupervised speaker adaptation | Hieu-Thi Luong et.al. | 1808.06288 | null |
| 2019-01-04 | Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis | Ye Jia et.al. | 1806.04558 | null |
| 2018-02-23 | Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning | Wei Ping et.al. | 1710.07654 | null |
| 2017-09-26 | Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks | Yuki Saito et.al. | 1709.08041 | null |
| 2017-09-25 | Techniques and Challenges in Speech Synthesis | David Ferris et.al. | 1709.07552 | null |
| 2016-08-19 | DNN-based Speech Synthesis for Indian Languages from ASCII text | Srikanth Ronanki et.al. | 1608.05374 | null |
| 2016-06-30 | Penambahan emosi menggunakan metode manipulasi prosodi untuk sistem text to speech bahasa Indonesia | Salita Ulitia Prini et.al. | 1606.09222 | null |
📊 712 papers
| 📅 Publish Date | 📝 Title | 👥 Authors | 💻 Code | |
|---|---|---|---|---|
| 2026-04-01 | Translating With Feeling: Centering Translator Perspectives within Translation Technologies | Daniel Chechelnitsky et.al. | 2604.00758 | null |
| 2026-04-01 | AfrIFact: Cultural Information Retrieval, Evidence Extraction and Fact Checking for African Languages | Israel Abebe Azime et.al. | 2604.00706 | null |
| 2026-03-11 | Multi-lingual Multi-institutional Electronic Health Record based Predictive Model | Kyunghoon Hur et.al. | 2604.00027 | null |
| 2026-03-10 | ASCAT: An Arabic Scientific Corpus and Benchmark for Advanced Translation Evaluation | Serry Sibaee et.al. | 2604.00015 | null |
| 2026-03-31 | Rewrite the News: Tracing Editorial Reuse Across News Agencies | Soveatin Kuntur et.al. | 2603.29937 | null |
| 2026-03-31 | Bringing Up a Bilingual BabyLM: Investigating Multilingual Language Acquisition Using Small-Scale Models | Linda Zeng et.al. | 2603.29552 | null |
| 2026-03-31 | L-ReLF: A Framework for Lexical Dataset Creation | Anass Sedrati et.al. | 2603.29346 | null |
| 2026-03-31 | Open Machine Translation for Esperanto | Ona de Gibert et.al. | 2603.29345 | null |
| 2026-03-31 | Advancing LLM-based phoneme-to-grapheme for multilingual speech recognition | Lukuang Dong et.al. | 2603.29217 | null |
| 2026-03-30 | On the limited utility of parallel data for learning shared multilingual representations | Julius Leino et.al. | 2603.29026 | null |
| 2026-03-30 | Graphilosophy: Graph-Based Digital Humanities Computing with The Four Books | Minh-Thu Do et.al. | 2603.28755 | null |
| 2026-03-30 | Top-down string-to-dependency Neural Machine Translation | Shuhei Kondo et.al. | 2603.27938 | null |
| 2026-03-29 | Budget-Xfer: Budget-Constrained Source Language Selection for Cross-Lingual Transfer to African Languages | Tewodros Kederalah Idris et.al. | 2603.27651 | null |
| 2026-03-28 | EuraGovExam: A Multilingual Multimodal Benchmark from Real-World Civil Service Exams | JaeSeong Kim et.al. | 2603.27223 | null |
| 2026-03-23 | Do Multilingual VLMs Reason Equally? A Cross-Lingual Visual Reasoning Audit for Indian Languages | Swastik R et.al. | 2603.26742 | null |
| 2026-03-27 | Toward Culturally Grounded Natural Language Processing | Sina Bagheri Nezhad et.al. | 2603.26013 | null |
| 2026-03-26 | Translation Asymmetry in LLMs as a Data Augmentation Factor: A Case Study for 6 Romansh Language Varieties | Jannis Vamvas et.al. | 2603.25489 | null |
| 2026-03-26 | Translation or Recitation? Calibrating Evaluation Scores for Machine Translation of Extremely Low-Resource Languages | Danlu Chen et.al. | 2603.25222 | null |
| 2026-03-26 | Cross-Preference Learning for Sentence-Level and Context-Aware Machine Translation | Ying Li et.al. | 2603.25183 | null |
| 2026-03-26 | Bilingual Text-to-Motion Generation: A New Benchmark and Baselines | Wanjiang Weng et.al. | 2603.25178 | null |
| 2026-03-26 | Toward domain-specific machine translation and quality estimation systems | Javad Pourmostafa Roshan Sharami et.al. | 2603.24955 | null |
| 2026-03-29 | POLY-SIM: Polyglot Speaker Identification with Missing Modality Grand Challenge 2026 Evaluation Plan | Marta Moscati et.al. | 2603.24569 | null |
| 2026-03-25 | Samasāmayik: A Parallel Dataset for Hindi-Sanskrit Machine Translation | N J Karthika et.al. | 2603.24307 | null |
| 2026-03-25 | MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition-Perception-Reasoning Guided Text-Image Machine Translation | Gengluo Li et.al. | 2603.23896 | null |
| 2026-03-07 | Konkani LLM: Multi-Script Instruction Tuning and Evaluation for a Low-Resource Indian Language | Reuben Chagas Fernandes et.al. | 2603.23529 | null |
| 2026-03-24 | From Synthetic to Native: Benchmarking Multilingual Intent Classification in Logistics Customer Service | Haoyu He et.al. | 2603.23172 | null |
| 2026-03-23 | Rashid: A Cipher-Based Framework for Exploring In-Context Language Learning | Niyati Bafna et.al. | 2603.22497 | null |
| 2026-03-24 | Adapting Self-Supervised Speech Representations for Cross-lingual Dysarthria Detection in Parkinson’s Disease | Abner Hernandez et.al. | 2603.22225 | null |
| 2026-03-23 | Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation | Ireh Kim et.al. | 2603.22186 | null |
| 2026-03-23 | DATASHI: A Parallel English-Tashlhiyt Corpus for Orthography Normalization and Low-Resource Language Processing | Nasser-Eddine Monir et.al. | 2603.21571 | null |
| 2026-03-22 | Graph Fusion Across Languages using Large Language Models | Kaung Myat Kyaw et.al. | 2603.21248 | null |
| 2026-03-22 | Left Behind: Cross-Lingual Transfer as a Bridge for Low-Resource Languages in Large Language Models | Abdul-Salem Beibitkhan et.al. | 2603.21036 | null |
| 2026-03-20 | Span-Level Machine Translation Meta-Evaluation | Stefano Perrella et.al. | 2603.19921 | null |
| 2026-03-20 | Neither Here Nor There: Cross-Lingual Representation Dynamics of Code-Mixed Text in Multilingual Encoders | Debajyoti Mazumder et.al. | 2603.19771 | null |
| 2026-03-19 | Vocabulary shapes cross-lingual variation of word-order learnability in language models | Jonas Mayer Martins et.al. | 2603.19427 | null |
| 2026-02-26 | HATL: Hierarchical Adaptive-Transfer Learning Framework for Sign Language Machine Translation | Nada Shahin et.al. | 2603.19260 | null |
| 2026-03-19 | Why Better Cross-Lingual Alignment Fails for Better Cross-Lingual Transfer: Case of Encoders | Yana Veitsman et.al. | 2603.18863 | null |
| 2026-03-19 | Cross-Lingual LLM-Judge Transfer via Evaluation Decomposition | Ivaxi Sheth et.al. | 2603.18557 | null |
| 2026-03-18 | ConGA: Guidelines for Contextual Gender Annotation. A Framework for Annotating Gender in Machine Translation | Argentina Anna Rescigno et.al. | 2603.17962 | null |
| 2026-03-18 | Gender Disambiguation in Machine Translation: Diagnostic Evaluation in Decoder-Only Architectures | Chiara Manna et.al. | 2603.17952 | null |
| 2026-03-18 | ShapleyLaw: A Game-Theoretic Approach to Multilingual Scaling Laws | Xuyang Cao et.al. | 2603.17945 | null |
| 2026-03-18 | Pretrained Multilingual Transformers Reveal Quantitative Distance Between Human Languages | Yue Zhao et.al. | 2603.17912 | null |
| 2026-03-19 | Zipper-LoRA: Dynamic Parameter Decoupling for Speech-LLM based Multilingual Speech Recognition | Yuxiang Mei et.al. | 2603.17558 | null |
| 2026-03-31 | Language on Demand, Knowledge at Core: Composing LLMs with Encoder-Decoder Translation Models for Extensible Multilinguality | Mengyu Bu et.al. | 2603.17512 | null |
| 2026-03-18 | From Words to Worlds: Benchmarking Cross-Cultural Cultural Understanding in Machine Translation | Bangju Han et.al. | 2603.17303 | null |
| 2026-03-17 | Knowledge Localization in Mixture-of-Experts LLMs Using Cross-Lingual Inconsistency | Lucas Bandarkar et.al. | 2603.17102 | null |
| 2026-03-17 | Ensemble Self-Training for Unsupervised Machine Translation | Ido Aharon et.al. | 2603.17087 | null |
| 2026-03-17 | Can Linguistically Related Languages Guide LLM Translation in Low-Resource Settings? | Aishwarya Ramasethu et.al. | 2603.16660 | null |
| 2026-03-18 | Omnilingual SONAR: Cross-Lingual and Cross-Modal Sentence Embeddings Bridging Massively Multilingual Text and Speech | Omnilingual SONAR Team et.al. | 2603.16606 | null |
| 2026-03-17 | Who Benchmarks the Benchmarks? A Case Study of LLM Evaluation in Icelandic | Finnur Ágúst Ingimundarson et.al. | 2603.16406 | null |
| 2026-03-18 | Omnilingual MT: Machine Translation for 1,600 Languages | Omnilingual MT Team et.al. | 2603.16309 | null |
| 2026-03-16 | Robust Language Identification for Romansh Varieties | Charlotte Model et.al. | 2603.15969 | null |
| 2026-03-16 | Machine Translation in the Wild: User Reaction to Xiaohongshu’s Built-In Translation Feature | Sui He et.al. | 2603.15922 | null |
| 2026-03-16 | Bidirectional Chinese and English Passive Sentences Dataset for Machine Translation | Xinyue Ma et.al. | 2603.15227 | null |
| 2026-03-16 | Pretraining and Benchmarking Modern Encoders for Latvian | Arturs Znotins et.al. | 2603.15005 | null |
| 2026-03-29 | ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation | Yuzhe Shang et.al. | 2603.14903 | null |
| 2026-03-16 | Developing an English-Efik Corpus and Machine Translation System for Digitization Inclusion | Offiong Bassey Edet et.al. | 2603.14873 | null |
| 2026-03-16 | Towards Privacy-Preserving Machine Translation at the Inference Stage: A New Task and Benchmark | Wei Shao et.al. | 2603.14756 | null |
| 2026-03-15 | Multilingual TinyStories: A Synthetic Combinatorial Corpus of Indic Children’s Stories for Training Small Language Models | Deepon Halder et.al. | 2603.14563 | null |
| 2026-03-14 | NepTam: A Nepali-Tamang Parallel Corpus and Baseline Machine Translation Experiments | Rupak Raj Ghimire et.al. | 2603.14053 | null |
| 2026-03-30 | GhanaNLP Parallel Corpora: Comprehensive Multilingual Resources for Low-Resource Ghanaian Languages | Lawrence Adu Gyamfi et.al. | 2603.13793 | null |
| 2026-03-13 | Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation | Yifeng Liu et.al. | 2603.13045 | null |
| 2026-03-16 | Is Human Annotation Necessary? Iterative MBR Distillation for Error Span Detection in Machine Translation | Boxuan Lyu et.al. | 2603.12983 | null |
| 2026-03-13 | HMS-BERT: Hybrid Multi-Task Self-Training for Multilingual and Multi-Label Cyberbullying Detection | Zixin Feng et.al. | 2603.12920 | null |
| 2026-03-13 | Learning from Child-Directed Speech in Two-Language Scenarios: A French-English Case Study | Liel Binyamin et.al. | 2603.12906 | null |
| 2026-03-12 | Translationese as a Rational Response to Translation Task Difficulty | Maria Kunilovskaya et.al. | 2603.12050 | null |
| 2026-03-12 | Just Use XML: Revisiting Joint Translation and Label Projection | Thennal D K et.al. | 2603.12021 | null |
| 2026-03-12 | Semi-Synthetic Parallel Data for Translation Quality Estimation: A Case Study of Dataset Building for an Under-Resourced Language Pair | Assaf Siani et.al. | 2603.11743 | null |
| 2026-03-12 | Streaming Translation and Transcription Through Speech-to-Text Causal Alignment | Roman Koshkin et.al. | 2603.11578 | null |
| 2026-03-11 | Evaluating Explainable AI Attribution Methods in Neural Machine Translation via Attention-Guided Knowledge Distillation | Aria Nourbakhsh et.al. | 2603.11342 | null |
| 2026-03-11 | Large Language Models as Annotators for Machine Translation Quality Estimation | Sidi Wang et.al. | 2603.10775 | null |
| 2026-04-01 | IMTBench: A Multi-Scenario Cross-Modal Collaborative Evaluation Benchmark for In-Image Machine Translation | Jiahao Lyu et.al. | 2603.10495 | null |
| 2026-03-11 | Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck | Hongbin Zhang et.al. | 2603.10351 | null |
| 2026-02-15 | Automated evaluation of LLMs for effective machine translation of Mandarin Chinese to English | Yue Zhang et.al. | 2603.09998 | null |
| 2026-03-10 | Do What I Say: A Spoken Prompt Dataset for Instruction-Following | Maike Züfle et.al. | 2603.09881 | null |
| 2026-03-13 | EPIC-EuroParl-UdS: Information-Theoretic Perspectives on Translation and Interpreting | Maria Kunilovskaya et.al. | 2603.09785 | null |
| 2026-03-11 | AutoViVQA: A Large-Scale Automatically Constructed Dataset for Vietnamese Visual Question Answering | Nguyen Anh Tuong et.al. | 2603.09689 | null |
| 2026-03-10 | LLM as a Meta-Judge: Synthetic Data for NLP Evaluation Metric Validation | Lukáš Eigler et.al. | 2603.09403 | null |
| 2026-03-10 | ICDAR 2025 Competition on End-to-End Document Image Machine Translation Towards Complex Layouts | Yaping Zhang et.al. | 2603.09392 | null |
| 2026-03-10 | Geometry-Aware Metric Learning for Cross-Lingual Few-Shot Sign Language Recognition on Static Hand Keypoints | Chayanin Chamachot et.al. | 2603.09213 | null |
| 2026-03-14 | MultiGraSCCo: A Multilingual Anonymization Benchmark with Annotations of Personal Identifiers | Ibrahim Baroud et.al. | 2603.08879 | null |
| 2026-03-09 | Using Multimodal and Language-Agnostic Sentence Embeddings for Abstractive Summarization | Chaimae Chellaf et.al. | 2603.08282 | null |
| 2026-03-09 | Quantifying Cross-Lingual Transfer in Paralinguistic Speech Tasks | Pol Buitrago et.al. | 2603.08231 | null |
| 2026-03-09 | Is continuous CoT better suited for multi-lingual reasoning? | Ali Hamza Bashir et.al. | 2603.08177 | null |
| 2026-03-09 | Gender Bias in MT for a Genderless Language: New Benchmarks for Basque | Amaia Murillo et.al. | 2603.08153 | null |
| 2026-03-30 | Nwāchā Munā: A Devanagari Speech Corpus and Proximal Transfer Benchmark for Nepal Bhasha ASR | Rishikesh Kumar Sharma et.al. | 2603.07554 | null |
| 2026-03-07 | Domain-Specific Quality Estimation for Machine Translation in Low-Resource Scenarios | Namrata Patil Gurav et.al. | 2603.07372 | null |
| 2026-03-07 | How Much Noise Can BERT Handle? Insights from Multilingual Sentence Difficulty Detection | Nouran Khallaf et.al. | 2603.07346 | null |
| 2026-03-10 | Governance Architecture for Autonomous Agent Systems: Threats, Framework, and Engineering Practice | Yuxu Ge et.al. | 2603.07191 | null |
| 2026-03-06 | LIT-RAGBench: Benchmarking Generator Capabilities of Large Language Models in Retrieval-Augmented Generation | Koki Itai et.al. | 2603.06198 | null |
| 2026-03-05 | NeuronMoE: Neuron-Guided Mixture-of-Experts for Efficient Multilingual LLM Extension | Rongzhi Li et.al. | 2603.05046 | null |
| 2026-03-04 | Hindsight Quality Prediction Experiments in Multi-Candidate Human-Post-Edited Machine Translation | Malik Marmonier et.al. | 2603.04083 | null |
| 2026-02-08 | The Logovista English-Japanese Machine Translation System | Barton D. Wright et.al. | 2603.03311 | null |
| 2026-02-27 | Universal Conceptual Structure in Neural Translation: Probing NLLB-200’s Multilingual Geometry | Kyle Elliott Mathewson et.al. | 2603.02258 | null |
| 2026-02-28 | BLUFF: Benchmarking the Detection of False and Synthetic Content across 58 Low-Resource Languages | Jason Lucas et.al. | 2603.00634 | null |
| 2026-02-23 | Distance Learning and Multilingual Education: A Case Study of Challenges and Pedagogical Perspectives in the Greek Border Region | Ariadni Mandala et.al. | 2603.00128 | null |
| 2026-02-27 | Terminology Rarity Predicts Catastrophic Failure in LLM Translation of Low-Resource Ancient Languages: Evidence from Ancient Greek | James L. Zainaldin et.al. | 2602.24119 | null |
| 2026-03-04 | Extending Czech Aspect-Based Sentiment Analysis with Opinion Terms: Dataset and LLM Benchmarks | Jakub Šmíd et.al. | 2602.22730 | null |
| 2026-02-26 | Layer-Targeted Multilingual Knowledge Erasure in Large Language Models | Taoran Li et.al. | 2602.22562 | null |
| 2026-02-26 | Multilingual Safety Alignment Via Sparse Weight Editing | Jiaming Liang et.al. | 2602.22554 | null |
| 2026-02-27 | Bridging Latent Reasoning and Target-Language Generation via Retrieval-Transition Heads | Shaswat Patel et.al. | 2602.22453 | null |
| 2026-02-25 | IndicIFEval: A Benchmark for Verifiable Instruction-Following Evaluation in 14 Indic Languages | Thanmay Jayakumar et.al. | 2602.22125 | null |
| 2026-02-25 | TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition | Cheng-Yeh Yang et.al. | 2602.22039 | null |
| 2026-02-25 | Global-Local Dual Perception for MLLMs in High-Resolution Text-Rich Image Translation | Junxin Lu et.al. | 2602.21956 | null |
| 2026-03-02 | Mitigating Structural Noise in Low-Resource S2TT: An Optimized Cascaded Nepali-English Pipeline with Punctuation Restoration | Tangsang Chongbang et.al. | 2602.21647 | null |
| 2026-02-25 | Enhancing Multilingual Embeddings via Multi-Way Parallel Text Alignment | Barah Fazili et.al. | 2602.21543 | null |
| 2026-02-24 | Small Language Models for Privacy-Preserving Clinical Information Extraction in Low-Resource Languages | Mohammadreza Ghaffarzadeh-Esfahani et.al. | 2602.21374 | null |
| 2026-02-24 | **Naver Labs Europe @ WSDM CUP | Multilingual Retrieval** | Thibault Formal et.al. | 2602.20986 |
| 2026-02-23 | Cross-lingual Matryoshka Representation Learning across Speech and Text | Yaya Sy et.al. | 2602.19991 | null |
| 2026-02-23 | DEEP: Docker-based Execution and Evaluation Platform | Sergio Gómez González et.al. | 2602.19583 | null |
| 2026-03-16 | TurkicNLP: An NLP Toolkit for Turkic Languages | Sherzod Hakimov et.al. | 2602.19174 | null |
| 2026-02-21 | Why Agent Caching Fails and How to Fix It: Structured Intent Canonicalization with Few-Shot Learning | Abhinaba Basu et.al. | 2602.18922 | null |
| 2026-02-25 | BURMESE-SAN: Burmese NLP Benchmark for Evaluating Large Language Models | Thura Aung et.al. | 2602.18788 | null |
| 2026-02-20 | Tower of Babel in Cross-Cultural Communication: A Case Study of #Give Me a Chinese Name# Dialogues During the “TikTok Refugees’’ Event | Jielin Feng et.al. | 2602.18549 | null |
| 2026-02-05 | Synthetic Media in Multilingual MOOCs: Deepfake Tutors, Pedagogical Effects, and Ethical-Policy Challenges | Alexandros Gazis et.al. | 2602.18457 | null |
| 2026-02-20 | Learning Long-Range Dependencies with Temporal Predictive Coding | Tom Potter et.al. | 2602.18131 | null |
| 2026-02-19 | What Language is This? Ask Your Tokenizer | Clara Meister et.al. | 2602.17655 | null |
| 2026-02-19 | Evaluating Extremely Low-Resource Machine Translation: A Comparative Study of ChrF++ and BLEU Metrics | Sanjeev Kumar et.al. | 2602.17425 | null |
| 2026-02-19 | WebFAQ 2.0: A Multilingual QA Dataset with Mined Hard Negatives for Dense Retrieval | Michael Dinzinger et.al. | 2602.17327 | null |
| 2026-02-19 | Representation Collapse in Machine Translation Through the Lens of Angular Dispersion | Evgeniia Tokarchuk et.al. | 2602.17287 | null |
| 2026-02-19 | Towards Cross-lingual Values Assessment: A Consensus-Pluralism Perspective | Yukun Chen et.al. | 2602.17283 | null |
| 2026-02-19 | Evaluating Cross-Lingual Classification Approaches Enabling Topic Discovery for Multilingual Social Media Data | Deepak Uniyal et.al. | 2602.17051 | null |
| 2026-02-18 | When Semantic Overlap Is Not Enough: Cross-Lingual Euphemism Transfer Between Turkish and English | Hasan Can Biyik et.al. | 2602.16957 | null |
| 2026-02-18 | Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment | Yuyan Bu et.al. | 2602.16660 | null |
| 2026-02-18 | Training Models on Dialects of Translationese Shows How Lexical Diversity and Source-Target Syntactic Similarity Shape Learning | Jenny Kunz et.al. | 2602.16469 | null |
| 2026-02-17 | A Curious Class of Adpositional Multiword Expressions in Korean | Junghyun Min et.al. | 2602.16023 | null |
| 2026-01-22 | KD4MT: A Survey of Knowledge Distillation for Machine Translation | Ona de Gibert et.al. | 2602.15845 | null |
| 2026-02-17 | Operationalising the Superficial Alignment Hypothesis via Task Complexity | Tomás Vergara-Browne et.al. | 2602.15829 | null |
| 2026-02-17 | LuxMT Technical Report | Nils Rehlinger et.al. | 2602.15506 | null |
| 2026-02-17 | Bridging Day and Night: Target-Class Hallucination Suppression in Unpaired Image Translation | Shuwei Li et.al. | 2602.15383 | null |
| 2026-02-18 | Indic-TunedLens: Interpreting Multilingual Models in Indian Languages | Mihir Panchal et.al. | 2602.15038 | null |
| 2026-02-16 | Unlocking Reasoning Capability on Machine Translation in Large Language Models | Sara Rajaee et.al. | 2602.14763 | null |
| 2026-02-16 | Crowdsourcing Piedmontese to Test LLMs on Non-Standard Orthography | Gianluca Vico et.al. | 2602.14675 | null |
| 2026-02-22 | BETA-Labeling for Multilingual Dataset Construction in Low-Resource IR | Md. Najib Hasan et.al. | 2602.14488 | null |
| 2026-02-15 | GRRM: Group Relative Reward Modeling for Machine Translation | Sen Yang et.al. | 2602.14028 | null |
| 2026-02-13 | LLM-Powered Automatic Translation and Urgency in Crisis Scenarios | Belu Ticona et.al. | 2602.13452 | null |
| 2026-02-13 | $\mathcal{X}$ -KD: General Experiential Knowledge Distillation for Large Language Models | Yuang Cai et.al. | 2602.12674 | null |
| 2026-02-25 | Scaling Model and Data for Multilingual Machine Translation with Open Large Language Models | Yuzhe Shang et.al. | 2602.11961 | null |
| 2026-02-12 | Cross-Modal Robustness Transfer (CMRT): Training Robust Speech Translation Models Using Adversarial Text | Abderrahmane Issam et.al. | 2602.11933 | null |
| 2026-02-11 | Towards Reliable Machine Translation: Scaling LLMs for Critical Error Detection and Safety | Muskaan Chopra et.al. | 2602.11444 | null |
| 2026-02-09 | SinFoS: A Parallel Dataset for Translating Sinhala Figures of Speech | Johan Sofalas et.al. | 2602.09866 | null |
| 2026-02-10 | From FusHa to Folk: Exploring Cross-Lingual Transfer in Arabic Language Models | Abdulmuizz Khalak et.al. | 2602.09826 | null |
| 2026-02-10 | Life Cycle-Aware Evaluation of Knowledge Distillation for Machine Translation: Environmental Impact and Translation Quality Trade-offs | Joseph Attieh et.al. | 2602.09691 | null |
| 2026-02-10 | LEMUR: A Corpus for Robust Fine-Tuning of Multilingual Law Embedding Models for Retrieval | Narges Baba Ahmadi et.al. | 2602.09570 | null |
| 2026-02-10 | AfriNLLB: Efficient Translation Models for African Languages | Yasmin Moslem et.al. | 2602.09373 | null |
| 2026-02-10 | Unsupervised Cross-Lingual Part-of-Speech Tagging with Monolingual Corpora Only | Jianyu Zheng et.al. | 2602.09366 | null |
| 2026-02-10 | Positive-Unlabelled Active Learning to Curate a Dataset for Orca Resident Interpretation | Bret Nestor et.al. | 2602.09295 | null |
| 2026-02-09 | Generalizing Sports Feedback Generation by Watching Competitions and Reading Books: A Rock Climbing Case Study | Arushi Rai et.al. | 2602.08996 | null |
| 2026-02-09 | Challenges in Translating Technical Lectures: Insights from the NPTEL | Basudha Raje et.al. | 2602.08698 | null |
| 2026-02-09 | Do Multilingual LLMs have specialized language heads? | Muhammad Naufil et.al. | 2602.08625 | null |
| 2026-02-09 | Beyond Scalar Scores: Reinforcement Learning for Error-Aware Quality Estimation of Machine Translation | Archchana Sindhujan et.al. | 2602.08600 | null |
| 2026-02-08 | Lost in Translation? A Comparative Study on the Cross-Lingual Transfer of Composite Harms | Vaibhav Shukla et.al. | 2602.07963 | null |
| 2026-01-31 | Vectra: A New Metric, Dataset, and Model for Visual Quality Assessment in E-Commerce In-Image Machine Translation | Qingyu Wu et.al. | 2602.07014 | null |
| 2026-02-06 | MTQE.en-he: Machine Translation Quality Estimation for English-Hebrew | Andy Rosenbaum et.al. | 2602.06546 | null |
| 2026-02-05 | Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training | Junxiao Liu et.al. | 2602.05940 | null |
| 2026-02-05 | Polyglots or Multitudes? Multilingual LLM Answers to Value-laden Multiple-Choice Questions | Léo Labat et.al. | 2602.05932 | null |
| 2026-02-05 | Consensus-Aligned Neuron Efficient Fine-Tuning Large Language Models for Multi-Domain Machine Translation | Shuting Jiang et.al. | 2602.05694 | null |
| 2026-02-05 | BhashaSetu: Cross-Lingual Knowledge Transfer from High-Resource to Extreme Low-Resource Languages | Subhadip Maji et.al. | 2602.05599 | null |
| 2026-02-05 | Cross-Lingual Empirical Evaluation of Large Language Models for Arabic Medical Tasks | Chaimae Abouzahir et.al. | 2602.05374 | null |
| 2026-02-04 | Multilingual Extraction and Recognition of Implicit Discourse Relations in Speech and Text | Ahmed Ruby et.al. | 2602.05107 | null |
| 2026-02-04 | Data Kernel Perspective Space Performance Guarantees for Synthetic Data from Transformer Models | Michael Browder et.al. | 2602.05106 | null |
| 2026-02-04 | Beyond Many-Shot Translation: Scaling In-Context Demonstrations For Low-Resource Machine Translation | Luis Frentzen Salim et.al. | 2602.04764 | null |
| 2026-02-04 | “Be My Cheese?”: Cultural Nuance Benchmarking for Machine Translation in Multilingual LLMs | Madison Van Doren et.al. | 2602.04729 | null |
| 2026-02-04 | Disentangling meaning from language in LLM-based machine translation | Théo Lasnier et.al. | 2602.04613 | null |
| 2026-02-04 | No One-Size-Fits-All: Building Systems For Translation to Bashkir, Kazakh, Kyrgyz, Tatar and Chuvash Using Synthetic And Original Data | Dmitry Karpov et.al. | 2602.04442 | null |
| 2026-02-14 | Tokenization and Morphological Fidelity in Uralic NLP: A Cross-Lingual Evaluation | Nuo Xu et.al. | 2602.04241 | null |
| 2026-02-03 | BIRDTurk: Adaptation of the BIRD Text-to-SQL Dataset to Turkish | Burak Aktaş et.al. | 2602.03633 | null |
| 2026-02-03 | Assessing the Impact of Typological Features on Multilingual Machine Translation in the Age of Large Language Models | Vitalii Hirak et.al. | 2602.03551 | null |
| 2026-02-03 | PEGRL: Improving Machine Translation by Post-Editing Guided Reinforcement Learning | Yunzhi Shen et.al. | 2602.03352 | null |
| 2026-02-03 | Consensus Group Relative Policy Optimization for Text Generation | Yuki Ichihara et.al. | 2602.03102 | null |
| 2026-02-02 | Controlled disagreement improves generalization in decentralized training | Zesen Wang et.al. | 2602.02899 | null |
| 2026-02-02 | Large Language Models for Mental Health: A Multilingual Evaluation | Nishat Raihan et.al. | 2602.02440 | null |
| 2026-02-02 | Cross-Lingual Stability of LLM Judges Under Controlled Generation: Evidence from Finno-Ugric Languages | Isaac Chung et.al. | 2602.02287 | null |
| 2026-02-02 | BBPE16: UTF-16-based byte-level byte-pair encoding for improved multilingual speech recognition | Hyunsik Kim et.al. | 2602.01717 | null |
| 2026-02-02 | SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia | Panuthep Tasawong et.al. | 2602.01618 | null |
| 2026-02-01 | Who Transfers Safety? Identifying and Targeting Cross-Lingual Shared Safety Neurons | Xianhui Zhang et.al. | 2602.01283 | null |
| 2026-02-01 | From Utterance to Vividity: Training Expressive Subtitle Translation LLM via Adaptive Local Preference Optimization | Chaoqun Cui et.al. | 2602.01068 | null |
| 2026-01-19 | Extending Beacon to Hindi: Cultural Adaptation Drives Cross-Lingual Sycophancy | Sarthak Sattigeri et.al. | 2602.00046 | null |
| 2026-02-11 | Bias Beyond Borders: Political Ideology Evaluation and Steering in Multilingual LLMs | Afrozah Nadeem et.al. | 2601.23001 | null |
| 2026-01-30 | Benchmarking Machine Translation on Chinese Social Media Texts | Kaiyan Zhao et.al. | 2601.22931 | null |
| 2026-01-30 | When Meanings Meet: Investigating the Emergence and Quality of Shared Concept Spaces during Multilingual Language Model Training | Felicia Körner et.al. | 2601.22851 | null |
| 2026-01-30 | RASST: Fast Cross-modal Retrieval-Augmented Simultaneous Speech Translation | Jiaxuan Luo et.al. | 2601.22777 | null |
| 2026-01-29 | TidyVoice 2026 Challenge Evaluation Plan | Aref Farhadipour et.al. | 2601.21960 | null |
| 2026-02-06 | DimStance: Multilingual Datasets for Dimensional Stance Analysis | Jonas Becker et.al. | 2601.21483 | null |
| 2026-01-28 | UrduBench: An Urdu Reasoning Benchmark using Contextually Ensembled Translations with Human-in-the-Loop | Muhammad Ali Shafique et.al. | 2601.21000 | null |
| 2026-01-28 | When Flores Bloomz Wrong: Cross-Direction Contamination in Machine Translation Evaluation | David Tan et.al. | 2601.20858 | null |
| 2026-01-28 | MiLorE-SSL: Scaling Multilingual Capabilities in Self-Supervised Models without Forgetting | Jing Xu et.al. | 2601.20300 | null |
| 2026-01-27 | FFE-Hallu:Hallucinations in Fixed Figurative Expressions:Benchmark of Idioms and Proverbs in the Persian Language | Faezeh Hosseini et.al. | 2601.20105 | null |
| 2026-01-27 | LinguaMap: Which Layers of LLMs Speak Your Language and How to Tune Them? | J. Ben Tamo et.al. | 2601.20009 | null |
| 2026-01-27 | Reflective Translation: Improving Low-Resource Machine Translation via Structured Self-Reflection | Nicholas Cheng et.al. | 2601.19871 | null |
| 2026-01-27 | Dynamic Multi-Expert Projectors with Stabilized Routing for Multilingual Speech Recognition | Isha Pandey et.al. | 2601.19451 | null |
| 2026-02-14 | Do LLMs Truly Benefit from Longer Context in Automatic Post-Editing? | Ahrii Kim et.al. | 2601.19410 | null |
| 2026-01-27 | Leveraging Sentence-oriented Augmentation and Transformer-Based Architecture for Vietnamese-Bahnaric Translation | Tan Sang Nguyen et.al. | 2601.19124 | null |
| 2026-01-26 | XProvence: Zero-Cost Multilingual Context Pruning for Retrieval-Augmented Generation | Youssef Mohamed et.al. | 2601.18886 | null |
| 2026-01-26 | Mitigating the OWASP Top 10 For Large Language Models Applications using Intelligent Agents | Mohammad Fasha et.al. | 2601.18105 | null |
| 2026-01-25 | PEAR: Pairwise Evaluation for Automatic Relative Scoring in Machine Translation | Lorenzo Proietti et.al. | 2601.18006 | null |
| 2026-01-25 | DIETA: A Decoder-only transformer-based model for Italian-English machine TrAnslation | Pranav Kasela et.al. | 2601.17823 | null |
| 2026-01-25 | Cross-Lingual Probing and Community-Grounded Analysis of Gender Bias in Low-Resource Bengali | Md Asgor Hossain Reaj et.al. | 2601.17764 | null |
| 2026-01-25 | Align to the Pivot: Dual Alignment with Self-Feedback for Multilingual Math Reasoning | Chunxu Zhao et.al. | 2601.17671 | link |
| 2026-01-24 | CLM-Bench: Benchmarking and Analyzing Cross-lingual Misalignment of LLMs in Knowledge Editing | Yucheng Hu et.al. | 2601.17397 | null |
| 2026-01-23 | Do LLM hallucination detectors suffer from low-resource effect? | Debtanu Datta et.al. | 2601.16766 | null |
| 2026-01-23 | Typologically Informed Parameter Aggregation | Stef Accou et.al. | 2601.16629 | null |
| 2026-01-23 | Cross-Lingual Activation Steering for Multilingual Language Models | Rhitabrat Pokharel et.al. | 2601.16390 | null |
| 2026-01-21 | Large-Scale Multidimensional Knowledge Profiling of Scientific Literature | Zhucun Xue et.al. | 2601.15170 | null |
| 2026-01-21 | Obscuring Data Contamination Through Translation: Evidence from Arabic Corpora | Chaymaa Abbas et.al. | 2601.14994 | null |
| 2026-01-20 | PRiSM: Benchmarking Phone Realization in Speech Models | Shikhar Bharadwaj et.al. | 2601.14046 | null |
| 2026-01-20 | On Temperature-Constrained Non-Deterministic Machine Translation: Potential and Evaluation | Weichuan Wang et.al. | 2601.13729 | null |
| 2026-01-19 | Alexandria: A Multi-Domain Dialectal Arabic Machine Translation Dataset for Culturally Inclusive and Linguistically Diverse LLMs | Abdellah El Mekki et.al. | 2601.13099 | null |
| 2026-01-19 | A Shared Geometry of Difficulty in Multilingual Language Models | Stefano Civelli et.al. | 2601.12731 | null |
| 2026-01-19 | UbuntuGuard: A Culturally-Grounded Policy Benchmark for Equitable AI Safety in African Languages | Tassallah Abdullahi et.al. | 2601.12696 | null |
| 2026-01-18 | Benchmarking Concept-Spilling Across Languages in LLMs | Ilia Badanin et.al. | 2601.12549 | null |
| 2026-02-04 | Improving Low-Resource Machine Translation via Round-Trip Reinforcement Learning | Ahmed Attia et.al. | 2601.12535 | null |
| 2026-02-02 | The Language You Ask In: Language-Conditioned Ideological Divergence in LLM Analysis of Contested Political Documents | Oleg Smirnov et.al. | 2601.12164 | null |
| 2026-01-17 | GloCTM: Cross-Lingual Topic Modeling via a Global Context Space | Nguyen Tien Phat et.al. | 2601.11872 | null |
| 2026-01-16 | Translation as a Scalable Proxy for Multilingual Evaluation | Sheriff Issaka et.al. | 2601.11778 | null |
| 2026-01-14 | Semantic Differentiation for Tackling Challenges in Watermarking Low-Entropy Constrained Generation Outputs | Nghia T. Le et.al. | 2601.11629 | null |
| 2025-12-25 | Compass-Embedding v4: Robust Contrastive Learning for Multilingual E-commerce Embeddings | Pakorn Ueareeworakul et.al. | 2601.11565 | null |
| 2026-01-16 | MultiCaption: Detecting disinformation using multilingual visual claims | Rafael Martins Frade et.al. | 2601.11220 | null |
| 2026-01-15 | BYOL: Bring Your Own Language Into LLMs | Syed Waqas Zamir et.al. | 2601.10804 | null |
| 2026-01-15 | INDIC DIALECT: A Multi Task Benchmark to Evaluate and Translate in Indian Language Dialects | Tarun Sharma et.al. | 2601.10388 | null |
| 2026-01-15 | Untangling Input Language from Reasoning Language: A Diagnostic Framework for Cross-Lingual Moral Alignment in LLMs | Nan Li et.al. | 2601.10257 | null |
| 2026-01-15 | One Instruction Does Not Fit All: How Well Do Embeddings Align Personas and Instructions in Low-Resource Indian Languages? | Arya Shah et.al. | 2601.10205 | null |
| 2026-01-28 | HOMURA: Taming the Sand-Glass for Time-Constrained LLM Translation via Reinforcement Learning | Ziang Cui et.al. | 2601.10187 | null |
| 2026-01-20 | Multilingual-To-Multimodal (M2M): Unlocking New Languages with Monolingual Text | Piyush Singh Pasi et.al. | 2601.10096 | null |
| 2026-01-15 | Context Volume Drives Performance: Tackling Domain Shift in Extremely Low-Resource Translation via RAG | David Samuel Setiawan et.al. | 2601.09982 | null |
| 2025-12-29 | Benchmarking Cross-Lingual Semantic Alignment in Multilingual Embeddings | Wen G. Gong et.al. | 2601.09732 | null |
| 2026-01-16 | Assessing and Improving Punctuation Robustness in English-Marathi Machine Translation | Kaustubh Shivshankar Shejole et.al. | 2601.09725 | null |
| 2025-12-24 | Opportunities and Challenges of Natural Language Processing for Low-Resource Senegalese Languages in Social Science Research | Derguene Mbaye et.al. | 2601.09716 | null |
| 2026-01-14 | Creating a Hybrid Rule and Neural Network Based Semantic Tagger using Silver Standard Data: the PyMUSAS framework for Multilingual Semantic Annotation | Andrew Moore et.al. | 2601.09648 | null |
| 2026-01-24 | Layer-Parallel Training for Transformers | Shuai Jiang et.al. | 2601.09026 | null |
| 2026-01-19 | TranslateGemma Technical Report | Mara Finkelstein et.al. | 2601.09012 | null |
| 2026-01-13 | A Parallel Cross-Lingual Benchmark for Multimodal Idiomaticity Understanding | Dilara Torunoğlu-Selamet et.al. | 2601.08645 | null |
| 2026-01-13 | Get away with less: Need of source side data curation to build parallel corpus for low resource Machine Translation | Saumitra Yadav et.al. | 2601.08629 | null |
| 2026-01-13 | CLaS-Bench: A Cross-Lingual Alignment and Steering Benchmark | Daniil Gurgurov et.al. | 2601.08331 | null |
| 2026-01-12 | Order in the Evaluation Court: A Critical Analysis of NLG Evaluation Trends | Jing Yang et.al. | 2601.07648 | null |
| 2026-01-12 | Beyond Literal Mapping: Benchmarking and Improving Non-Literal Translation Evaluation | Yanzhi Tian et.al. | 2601.07338 | null |
| 2026-01-12 | Mitrasamgraha: A Comprehensive Classical Sanskrit Machine Translation Dataset | Sebastian Nehrdich et.al. | 2601.07314 | null |
| 2026-01-11 | When Abundance Conceals Weakness: Knowledge Conflict in Multilingual Models | Jiaqi Zhao et.al. | 2601.07041 | null |
| 2026-01-11 | BiasLab: A Multilingual, Dual-Framing Framework for Robust Measurement of Output-Level Bias in Large Language Models | William Guey et.al. | 2601.06861 | null |
| 2026-01-10 | Evaluating Cross-Lingual Unlearning in Multilingual Language Models | Tyler Lizzo et.al. | 2601.06675 | null |
| 2026-01-10 | MITRA: A Large-Scale Parallel Corpus and Multilingual Pretrained Language Model for Machine Translation and Semantic Retrieval for Pāli, Sanskrit, Buddhist Chinese, and Tibetan | Sebastian Nehrdich et.al. | 2601.06400 | null |
| 2026-01-10 | AfriqueLLM: How Data Mixing and Model Architecture Impact Continued Pre-training for African Languages | Hao Yu et.al. | 2601.06395 | null |
| 2026-01-09 | Evaluating Robustness of Large Language Models in Enterprise Applications: Benchmarks for Perturbation Consistency Across Formats and Languages | Tara Bogavelli et.al. | 2601.06341 | null |
| 2026-01-09 | A Rising Tide Lifts All Boats: MTQE Rewards for Idioms Improve General Translation Quality | Ishika Agarwal et.al. | 2601.06307 | null |
| 2026-01-09 | AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs | Chengming Cui et.al. | 2601.06022 | null |
| 2026-01-09 | CLewR: Curriculum Learning with Restarts for Machine Translation Preference Learning | Alexandra Dragomir et.al. | 2601.05858 | null |
| 2026-01-09 | One Script Instead of Hundreds? On Pretraining Romanized Encoder Language Models | Benedikt Ebing et.al. | 2601.05776 | null |
| 2026-01-14 | Afri-MCQA: Multimodal Cultural Question Answering for African Languages | Atnafu Lambebo Tonja et.al. | 2601.05699 | null |
| 2026-01-09 | Multilingual Amnesia: On the Transferability of Unlearning in Multilingual LLMs | Alireza Dehghanpour Farashah et.al. | 2601.05641 | null |
| 2026-01-09 | Text Detoxification in isiXhosa and Yorùbá: A Cross-Lingual Machine Learning Approach for Low-Resource African Languages | Abayomi O. Agbeyangi et.al. | 2601.05624 | null |
| 2026-01-09 | Enabling Stroke-Level Structural Analysis of Hieroglyphic Scripts without Language-Specific Priors | Fuwen Luo et.al. | 2601.05508 | null |
| 2026-01-08 | BanglaLorica: Design and Evaluation of a Robust Watermarking Algorithm for Large Language Models in Bangla Text Generation | Amit Bin Tariqul et.al. | 2601.04534 | null |
| 2026-01-07 | The Overlooked Role of Graded Relevance Thresholds in Multilingual Dense Retrieval | Tomer Wullach et.al. | 2601.04395 | null |
| 2026-01-07 | Dialect Matters: Cross-Lingual ASR Transfer for Low-Resource Indic Language Varieties | Akriti Dhasmana et.al. | 2601.04373 | null |
| 2026-01-07 | Analyzing and Improving Cross-lingual Knowledge Transfer for Machine Translation | David Stap et.al. | 2601.04036 | null |
| 2026-01-12 | NeoAMT: Neologism-Aware Agentic Machine Translation with Reinforcement Learning | Zhongtao Miao et.al. | 2601.03790 | null |
| 2026-01-07 | Bootstrapping Code Translation with Weighted Multilanguage Exploration | Yuhan Wu et.al. | 2601.03512 | null |
| 2026-01-06 | Eye-Q: A Multilingual Benchmark for Visual Word Puzzle Solving and Image-to-Phrase Reasoning | Ali Najar et.al. | 2601.03400 | null |
| 2026-01-06 | Can Embedding Similarity Predict Cross-Lingual Transfer? A Systematic Study on African Languages | Tewodros Kederalah Idris et.al. | 2601.03168 | null |
| 2026-01-10 | Improving Indigenous Language Machine Translation with Synthetic Data and Language-Specific Preprocessing | Aashish Dhawan et.al. | 2601.03135 | null |
| 2026-01-06 | Enhancing Multilingual RAG Systems with Debiased Language Preference-Guided Query Fusion | Jeonghyun Park et.al. | 2601.02956 | null |
| 2026-01-10 | Pearmut: Human Evaluation of Translation Made Trivial | Vilém Zouhar et.al. | 2601.02933 | null |
| 2026-01-05 | Cost-Efficient Cross-Lingual Retrieval-Augmented Generation for Low-Resource Languages: A Case Study in Bengali Agricultural Advisory | Md. Asif Hossain et.al. | 2601.02065 | null |
| 2026-01-20 | Semantic Alignment of Multilingual Knowledge Graphs via Contextualized Vector Projections | Abhishek Kumar et.al. | 2601.00814 | null |
| 2026-01-23 | The Role of Mixed-Language Documents for Multilingual Large Language Model Pretraining | Jiandong Shao et.al. | 2601.00364 | null |
| 2026-01-01 | Parallel Universes, Parallel Languages: A Comprehensive Study on LLM-based Multilingual Counterfactual Example Generation | Qianli Wang et.al. | 2601.00263 | null |
| 2025-12-31 | Triangulation as an Acceptance Rule for Multilingual Mechanistic Interpretability | Yanan Long et.al. | 2512.24842 | null |
| 2025-12-30 | HY-MT1.5 Technical Report | Mao Zheng et.al. | 2512.24092 | null |
| 2025-12-29 | A Stepwise-Enhanced Reasoning Framework for Large Language Models Based on External Subgraph Generation | Xin Zhang et.al. | 2512.23356 | null |
| 2026-01-01 | AlignAR: Generative Sentence Alignment for Arabic-English Parallel Corpora of Legal and Literary Texts | Baorong Huang et.al. | 2512.21842 | null |
| 2025-12-25 | Ara-HOPE: Human-Centric Post-Editing Evaluation for Dialectal Arabic to Modern Standard Arabic Translation | Abdullah Alabdullah et.al. | 2512.21787 | null |
| 2025-12-29 | Gamayun’s Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM | Alexander Podolskiy et.al. | 2512.21580 | null |
| 2025-12-23 | SweRank+: Multilingual, Multi-Turn Code Ranking for Software Issue Localization | Revanth Gangi Reddy et.al. | 2512.20482 | null |
| 2025-12-23 | Corpus of Cross-lingual Dialogues with Minutes and Detection of Misunderstandings | Marko Čechovič et.al. | 2512.20204 | null |
| 2025-12-23 | Well Begun is Half Done: Location-Aware and Trace-Guided Iterative Automated Vulnerability Repair | Zhenlei Ye et.al. | 2512.20203 | null |
| 2025-12-22 | MauBERT: Universal Phonetic Inductive Biases for Few-Shot Acoustic Units Discovery | Angelo Ortiz Tandazo et.al. | 2512.19612 | null |
| 2025-12-21 | Remedy-R: Generative Reasoning for Machine Translation Evaluation without Error Annotations | Shaomu Tan et.al. | 2512.18906 | null |
| 2025-12-21 | From Scratch to Fine-Tuned: A Comparative Study of Transformer Training Strategies for Legal Machine Translation | Amit Barman et.al. | 2512.18593 | null |
| 2025-12-19 | Seeing Justice Clearly: Handwritten Legal Document Translation with OCR and Vision-Language Models | Shubham Kumar Nigam et.al. | 2512.18004 | null |
| 2025-12-17 | Cross-Language Bias Examination in Large Language Models | Yuxuan Liang et.al. | 2512.16029 | null |
| 2025-12-17 | An Empirical Study on Chinese Character Decomposition in Multiword Expression-Aware Neural Machine Translation | Lifeng Han et.al. | 2512.15556 | null |
| 2025-12-17 | Yes-MT’s Submission to the Low-Resource Indic Language Translation Shared Task in WMT 2024 | Yash Bhaskar et.al. | 2512.15226 | null |
| 2025-12-16 | Low-Resource, High-Impact: Building Corpora for Inclusive Language Technologies | Ekaterina Artemova et.al. | 2512.14576 | link |
| 2025-12-16 | A Comparative Analysis of Retrieval-Augmented Generation Techniques for Bengali Standard-to-Dialect Machine Translation Using LLMs | K. M. Jubair Sami et.al. | 2512.14179 | null |
| 2025-12-16 | Multilingual and Continuous Backchannel Prediction: A Cross-lingual Study | Koji Inoue et.al. | 2512.14085 | null |
| 2025-12-15 | PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation | Hour Kaing et.al. | 2512.13552 | null |
| 2025-12-15 | Advancing Bangla Machine Translation Through Informal Datasets | Ayon Roy et.al. | 2512.13487 | null |
| 2025-12-15 | Scaling Laws for Code: Every Programming Language Matters | Jian Yang et.al. | 2512.13472 | null |
| 2025-12-12 | Improving Translation Quality by Selecting Better Data for LLM Fine-Tuning: A Comparative Analysis | Felipe Ribeiro Fujita de Mello et.al. | 2512.11388 | null |
| 2025-12-11 | MultiScript30k: Leveraging Multilingual Embeddings to Extend Cross Script Parallel Data | Christopher Driggers-Ellis et.al. | 2512.11074 | null |
| 2025-12-10 | Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach | Salvador Carrión et.al. | 2512.09910 | null |
| 2025-12-10 | Mitigating Social Bias in English and Urdu Language Models Using PRM-Guided Candidate Selection and Sequential Refinement | Muneeb Ur Raheem Khan et.al. | 2512.09854 | null |
| 2025-12-09 | What Triggers my Model? Contrastive Explanations Inform Gender Choices by Translation Models | Janiça Hackenbuchner et.al. | 2512.08440 | null |
| 2025-12-30 | Minimum Bayes Risk Decoding for Error Span Detection in Reference-Free Automatic Machine Translation Evaluation | Boxuan Lyu et.al. | 2512.07540 | null |
| 2025-12-08 | SwissGov-RSD: A Human-annotated, Cross-lingual Benchmark for Token-level Recognition of Semantic Differences Between Related Documents | Michelle Wastl et.al. | 2512.07538 | null |
| 2025-12-08 | Persian-Phi: Efficient Cross-Lingual Adaptation of Compact LLMs via Curriculum Learning | Amir Mohammad Akhlaghi et.al. | 2512.07454 | null |
| 2025-12-08 | Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data | Srihari Bandarupalli et.al. | 2512.07277 | null |
| 2025-12-08 | MASim: Multilingual Agent-Based Simulation for Social Science | Xuan Zhang et.al. | 2512.07195 | null |
| 2025-12-05 | Grounded Multilingual Medical Reasoning for Question Answering with Large Language Models | Pietro Ferrazzi et.al. | 2512.05658 | null |
| 2025-12-04 | Structured Document Translation via Format Reinforcement Learning | Haiyue Song et.al. | 2512.05100 | null |
| 2025-12-04 | AdiBhashaa: A Community-Curated Benchmark for Machine Translation into Indian Tribal Languages | Pooja Singh et.al. | 2512.04765 | null |
| 2025-12-03 | Adapting Large Language Models to Low-Resource Tibetan: A Two-Stage Continual and Supervised Fine-Tuning Study | Lifeng Chen et.al. | 2512.03976 | null |
| 2025-12-03 | Zero-Shot Video Translation and Editing with Frame Spatial-Temporal Correspondence | Shuai Yang et.al. | 2512.03905 | null |
| 2025-12-03 | HieroGlyphTranslator: Automatic Recognition and Translation of Egyptian Hieroglyphs to English | Ahmed Nasser et.al. | 2512.03817 | null |
| 2025-12-03 | M3DR: Towards Universal Multilingual Multimodal Document Retrieval | Adithya S Kolavi et.al. | 2512.03514 | null |
| 2025-12-03 | From Hypothesis to Premises: LLM-based Backward Logical Reasoning with Selective Symbolic Translation | Qingchuan Li et.al. | 2512.03360 | null |
| 2025-11-29 | Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation | Le Chen et.al. | 2512.03086 | null |
| 2025-12-02 | Fine-Tuned Large Language Models for Logical Translation: Reducing Hallucinations with Lang2Logic | Muyu Pan et.al. | 2512.02987 | null |
| 2025-12-02 | Cross-Lingual Prompt Steerability: Towards Accurate and Robust LLM Behavior across Languages | Lechen Zhang et.al. | 2512.02841 | null |
| 2025-12-02 | BOOM: Beyond Only One Modality KIT’s Multimodal Multilingual Lecture Companion | Sai Koneru et.al. | 2512.02817 | null |
| 2025-12-02 | TriLex: A Framework for Multilingual Sentiment Analysis in Low-Resource South African Languages | Mike Nkongolo et.al. | 2512.02799 | null |
| 2025-12-02 | Towards Language-Independent Face-Voice Association with Multimodal Foundation Models | Aref Farhadipour et.al. | 2512.02759 | null |
| 2025-12-03 | Invariance under Structure Translation as the Origin of Host Immune Capacity Conservation from Noether’s Theorem | Yexing Chen et.al. | 2512.02730 | null |
| 2025-12-02 | CREST: Universal Safety Guardrails Through Cluster-Guided Cross-Lingual Transfer | Lavish Bansal et.al. | 2512.02711 | null |
| 2025-12-02 | Feedback Loops and Code Perturbations in LLM-based Software Engineering: A Case Study on a C-to-Rust Translation System | Martin Weiss et.al. | 2512.02567 | null |
| 2025-12-01 | Cross-Lingual Interleaving for Speech Language Models | Adel Moumen et.al. | 2512.01865 | null |
| 2025-12-01 | BHRAM-IL: A Benchmark for Hallucination Recognition and Assessment in Multiple Indian Languages | Hrishikesh Terdalkar et.al. | 2512.01852 | null |
| 2025-12-01 | MCAT: Scaling Many-to-Many Speech-to-Text Translation with MLLMs to 70 Languages | Yexing Du et.al. | 2512.01512 | null |
| 2025-12-01 | LAURA: Enhancing Code Review Generation with Context-Enriched Retrieval-Augmented LLM | Yuxin Zhang et.al. | 2512.01356 | null |
| 2025-12-15 | Conveying Imagistic Thinking in Traditional Chinese Medicine Translation: A Prompt Engineering and LLM-Based Evaluation Framework | Jiatong Han et.al. | 2512.01198 | null |
| 2025-12-02 | Multilingual Training-Free Remote Sensing Image Captioning | Carlos Rebelo et.al. | 2512.00887 | null |
| 2025-11-30 | SHRAG: AFrameworkfor Combining Human-Inspired Search with RAG | Hyunseok Ryu et.al. | 2512.00772 | null |
| 2025-11-30 | MPR-GUI: Benchmarking and Enhancing Multilingual Perception and Reasoning in GUI Agents | Ruihan Chen et.al. | 2512.00756 | null |
| 2025-11-29 | Partial Cross-Compilation and Mixed Execution for Accelerating Dynamic Binary Translation | Yuhao Gu et.al. | 2512.00487 | null |
| 2025-11-29 | IndicParam: Benchmark to evaluate LLMs on low-resource Indic Languages | Ayush Maheshwari et.al. | 2512.00333 | null |
| 2025-11-29 | Lost without translation – Can transformer (language models) understand mood states? | Prakrithi Shivaprakash et.al. | 2512.00274 | null |
| 2025-11-28 | OmniFusion: Simultaneous Multilingual Multimodal Translations via Modular Fusion | Sai Koneru et.al. | 2512.00234 | null |
| 2025-11-28 | Asm2SrcEval: Evaluating Large Language Models for Assembly-to-Source Code Translation | Parisa Hamedi et.al. | 2512.00134 | null |
| 2025-11-28 | Unlocking Multilingual Reasoning Capability of LLMs and LVLMs through Representation Engineering | Qiming Li et.al. | 2511.23231 | null |
| 2025-12-09 | Conveying Imagistic Thinking in Traditional Chinese Medicine Translation: A Prompt Engineering and LLM-Based Evaluation Framework | Jiatong Han et.al. | 2511.23059 | null |
| 2025-11-26 | Advancing Automated In-Isolation Validation in Repository-Level Code Translation | Kaiyao Ke et.al. | 2511.21878 | null |
| 2025-11-24 | LLMs for Low-Resource Dialect Translation Using Context-Aware Prompting: A Case Study on Sylheti | Tabia Tanzin Prama et.al. | 2511.21761 | null |
| 2025-11-16 | On the Cross-lingual Transferability of Pre-trained wav2vec2-based Models | Jonatas Grosman et.al. | 2511.21704 | null |
| 2025-11-26 | Rigidity of Solitons to the Mean Curvature Flow in $\mathbb{H}^3$ as Translation Surfaces | Tarcios Andrey Ferreira et.al. | 2511.21545 | null |
| 2025-11-26 | Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation | Lina Conti et.al. | 2511.21517 | null |
| 2025-11-26 | RosettaSpeech: Zero-Shot Speech-to-Speech Translation from Monolingual Data | Zhisheng Zheng et.al. | 2511.20974 | null |
| 2025-11-25 | Winning with Less for Low Resource Languages: Advantage of Cross-Lingual English_Persian Argument Mining Model over LLM Augmentation | Ali Jahan et.al. | 2511.20872 | null |
| 2025-11-25 | TReFT: Taming Rectified Flow Models For One-Step Image Translation | Shengqian Li et.al. | 2511.20307 | null |
| 2025-11-24 | Generative Query Expansion with Multilingual LLMs for Cross-Lingual Information Retrieval | Olivia Macmillan-Scott et.al. | 2511.19325 | null |
| 2025-11-24 | What Drives Cross-lingual Ranking? Retrieval Approaches with Multilingual Language Models | Roksana Goworek et.al. | 2511.19324 | null |
| 2025-11-24 | Large Language Models for the Summarization of Czech Documents: From History to the Present | Václav Tran et.al. | 2511.18848 | null |
| 2025-11-23 | DocPTBench: Benchmarking End-to-End Photographed Document Parsing and Translation | Yongkun Du et.al. | 2511.18434 | null |
| 2025-11-23 | SmolKalam: Ensemble Quality-Filtered Translation at Scale for High Quality Arabic Post-Training Data | Sultan Alrashed et.al. | 2511.18411 | null |
| 2025-11-21 | Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation | Marii Ojastu et.al. | 2511.17290 | null |
| 2025-11-21 | Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation | Chuancheng Shi et.al. | 2511.17282 | null |
| 2025-11-21 | Lost in Translation and Noise: A Deep Dive into the Failure Modes of VLMs on Real-World Tables | Anshul Singh et.al. | 2511.17238 | null |
| 2025-11-21 | LangMark: A Multilingual Dataset for Automatic Post-Editing | Diego Velazquez et.al. | 2511.17153 | null |
| 2025-11-19 | HinTel-AlignBench: A Framework and Benchmark for Hindi-Telugu with English-Aligned Samples | Rishikant Chigrupaatii et.al. | 2511.15183 | null |
| 2025-11-21 | LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs | Pei-Fu Guo et.al. | 2511.14774 | null |
| 2025-11-18 | NeuCLIRBench: A Modern Evaluation Collection for Monolingual, Cross-Language, and Multilingual Information Retrieval | Dawn Lawrie et.al. | 2511.14758 | null |
| 2025-11-18 | TTA: Transcribe, Translate and Alignment for Cross-lingual Speech Representation | Wei Liu et.al. | 2511.14410 | null |
| 2025-11-17 | Can QE-informed (Re)Translation lead to Error Correction? | Govardhan Padmanabhan et.al. | 2511.13884 | null |
| 2025-11-18 | Crossing Borders: A Multimodal Challenge for Indian Poetry Translation and Image Generation | Sofia Jamil et.al. | 2511.13689 | null |
| 2025-11-23 | Non-Linear Scoring Model for Translation Quality Evaluation | Serge Gladkoff et.al. | 2511.13467 | null |
| 2025-11-15 | Exploring Parameter-Efficient Fine-Tuning and Backtranslation for the WMT 25 General Translation Task | Felipe Fujita et.al. | 2511.12109 | null |
| 2025-11-14 | Do LLMs Really Struggle at NL-FOL Translation? Revealing their Strengths via a Novel Benchmarking Strategy | Andrea Brunello et.al. | 2511.11816 | null |
| 2025-11-14 | Beyond Exascale: Dataflow Domain Translation on a Cerebras Cluster | Tomas Oppelstrup et.al. | 2511.11542 | null |
| 2025-11-14 | Translation-Symmetric Market: Enabling Incentive Compatibility For DER Aggregation | Ruike Lyu et.al. | 2511.11453 | null |
| 2025-11-14 | Comprehension of Multilingual Expressions Referring to Target Objects in Visual Inputs | Francisco Nogueira et.al. | 2511.11427 | null |
| 2025-11-14 | OT-ALD: Aligning Latent Distributions with Optimal Transport for Accelerated Image-to-Image Translation | Zhanpeng Wang et.al. | 2511.11162 | null |
| 2025-12-17 | DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains | Xiying Zhao et.al. | 2511.10984 | null |
| 2025-11-13 | Towards Attribution of Generators and Emotional Manipulation in Cross-Lingual Synthetic Speech using Geometric Learning | Girish et.al. | 2511.10790 | null |
| 2025-11-13 | TEDxTN: A Three-way Speech Translation Corpus for Code-Switched Tunisian Arabic - English | Fethi Bougares et.al. | 2511.10780 | null |
| 2025-11-13 | Faithful Summarization of Consumer Health Queries: A Cross-Lingual Framework with LLMs | Ajwad Abrar et.al. | 2511.10768 | null |
| 2025-11-09 | Towards Fine-Grained Code-Switch Speech Translation with Semantic Space Alignment | Yan Gao et.al. | 2511.10670 | null |
| 2025-11-05 | Evaluating Modern Large Language Models on Low-Resource and Morphologically Rich Languages:A Cross-Lingual Benchmark Across Cantonese, Japanese, and Turkish | Chengxuan Xia et.al. | 2511.10664 | null |
| 2025-11-13 | LangGPS: Language Separability Guided Data Pre-Selection for Joint Multilingual Instruction Tuning | Yangfan Ye et.al. | 2511.10229 | null |
| 2025-11-13 | Fractional neural attention for efficient multiscale sequence processing | Cheng Kevin Qu et.al. | 2511.10208 | null |
| 2025-11-13 | Scalable data-driven modeling of microstructure evolution by learning local dependency and spatiotemporal translation invariance rules in phase field simulation | Zishuo Lan et.al. | 2511.10171 | null |
| 2025-11-13 | Language Drift in Multilingual Retrieval-Augmented Generation: Characterization and Decoding-Time Mitigation | Bo Li et.al. | 2511.09984 | null |
| 2025-11-14 | HI-TransPA: Hearing Impairments Translation Personal Assistant | Zhiming Ma et.al. | 2511.09915 | null |
| 2025-11-12 | Predicate-Argument Structure Divergences in Chinese and English Parallel Sentences and their Impact on Language Transfer | Rocco Tripodi et.al. | 2511.09796 | null |
| 2025-11-12 | How Small Can You Go? Compact Language Models for On-Device Critical Error Detection in Machine Translation | Muskaan Chopra et.al. | 2511.09748 | null |
| 2025-11-12 | NSL-MT: Linguistically Informed Negative Samples for Efficient Machine Translation in Low-Resource Languages | Mamadou K. Keita et.al. | 2511.09537 | null |
| 2025-11-12 | Spatial Audio Rendering for Real-Time Speech Translation in Virtual Meetings | Margarita Geleta et.al. | 2511.09525 | null |
| 2025-11-12 | POTSA: A Cross-Lingual Speech Alignment Framework for Low Resource Speech-to-Text Translation | Xuanchen Li et.al. | 2511.09232 | null |
| 2025-11-07 | The LLM Pro Finance Suite: Multilingual Large Language Models for Financial Applications | Gaëtan Caillaut et.al. | 2511.08621 | null |
| 2025-11-11 | Large Sign Language Models: Toward 3D American Sign Language Translation | Sen Zhang et.al. | 2511.08535 | null |
| 2025-11-11 | Introducing A Bangla Sentence - Gloss Pair Dataset for Bangla Sign Language Translation and Research | Neelavro Saha et.al. | 2511.08507 | null |
| 2025-12-03 | Focusing on Language: Revealing and Exploiting Language Attention Heads in Multilingual Large Language Models | Xin Liu et.al. | 2511.07498 | null |
| 2025-11-07 | It Takes Two: A Dual Stage Approach for Terminology-Aware Translation | Akshat Singh Jaswal et.al. | 2511.07461 | null |
| 2025-11-10 | Discourse Graph Guided Document Translation with Large Language Models | Viet-Thanh Pham et.al. | 2511.07230 | null |
| 2025-11-10 | Llama-Embed-Nemotron-8B: A Universal Text Embedding Model for Multilingual and Cross-Lingual Tasks | Yauhen Babakhin et.al. | 2511.07025 | null |
| 2025-11-10 | A Picture is Worth a Thousand (Correct) Captions: A Vision-Guided Judge-Corrector System for Multimodal Machine Translation | Siddharth Betala et.al. | 2511.07010 | null |
| 2025-11-10 | Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs | Yingfeng Luo et.al. | 2511.07003 | null |
| 2025-11-10 | CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition | Hung-Yang Sung et.al. | 2511.06860 | null |
| 2025-11-10 | Steering LLMs toward Korean Local Speech: Iterative Refinement Framework for Faithful Dialect Translation | Keunhyeung Park et.al. | 2511.06680 | null |
| 2025-11-09 | Ibom NLP: A Step Toward Inclusive Natural Language Processing for Nigeria’s Minority Languages | Oluwadara Kalejaiye et.al. | 2511.06531 | null |
| 2025-11-09 | Rethinking what Matters: Effective and Robust Multilingual Realignment for Low-Resource Languages | Quang Phuoc Nguyen et.al. | 2511.06497 | null |
| 2025-11-07 | A multimodal multiplex of the mental lexicon for multilingual individuals | Maria Huynh et.al. | 2511.05361 | null |
| 2025-11-07 | Translation via Annotation: A Computational Study of Translating Classical Chinese into Japanese | Zilong Li et.al. | 2511.05239 | null |
| 2025-11-07 | Mind the Gap… or Not? How Translation Errors and Evaluation Details Skew Multilingual Results | Jan-Thorsten Peter et.al. | 2511.05162 | null |
| 2025-11-07 | Reasoning-Guided Claim Normalization for Noisy Multilingual Social Media Posts | Manan Sharma et.al. | 2511.05078 | null |
| 2025-11-12 | MERaLiON-SER: Robust Speech Emotion Recognition Model for English and SEA Languages | Hardik B. Sailor et.al. | 2511.04914 | null |
| 2025-11-06 | IndicVisionBench: Benchmarking Cultural and Multilingual Understanding in VLMs | Ali Faraz et.al. | 2511.04727 | null |
| 2025-11-01 | Cross-Lingual SynthDocs: A Large-Scale Synthetic Corpus for Any to Arabic OCR and Document Understanding | Haneen Al-Homoud et.al. | 2511.04699 | null |
| 2025-11-06 | Dynamic Jointly Batch Selection for Data Efficient Machine Translation Fine-Tuning | Mohammad Amin Ghanizadeh et.al. | 2511.04406 | null |
| 2025-11-06 | Direct Semantic Communication Between Large Language Models via Vector Translation | Fu-Chun Yang et.al. | 2511.03945 | null |
| 2025-11-05 | Evaluating Machine Translation Datasets for Low-Web Data Languages: A Gendered Lens | Hellina Hailu Nigatu et.al. | 2511.03880 | null |
| 2025-11-05 | BanglaSTEM: A Parallel Corpus for Technical Domain Bangla-English Translation | Kazi Reyazul Hasan et.al. | 2511.03498 | null |
| 2025-11-18 | Segmentation Beyond Defaults: Asymmetrical Byte Pair Encoding for Optimal Machine Translation Performance | Saumitra Yadav et.al. | 2511.03383 | null |
| 2025-11-11 | How to Evaluate Speech Translation with Source-Aware Neural MT Metrics | Mauro Cettolo et.al. | 2511.03295 | null |
| 2025-11-05 | Beyond Ranked Lists: The SARAL Framework for Cross-Lingual Document Set Retrieval | Shantanu Agarwal et.al. | 2511.03228 | null |
| 2025-11-04 | Automatic Machine Translation Detection Using a Surrogate Multilingual Translation Model | Cristian García-Romero et.al. | 2511.02958 | null |
| 2025-11-04 | PragExTra: A Multilingual Corpus of Pragmatic Explicitation in Translation | Doreen Osmelak et.al. | 2511.02721 | null |
| 2025-11-04 | The Analysis of Lexical Errors in Machine Translation from English into Romanian | Angela Stamatie et.al. | 2511.02587 | null |
| 2025-11-05 | HPLT 3.0: Very Large-Scale Multilingual Resources for LLM and MT. Mono- and Bi-lingual Data, Multilingual Evaluation, and Pre-Trained Models | Stephan Oepen et.al. | 2511.01066 | null |
| 2025-11-04 | Do Methods to Jailbreak and Defend LLMs Generalize Across Languages? | Berk Atil et.al. | 2511.00689 | null |
| 2025-11-01 | Leveraging the Cross-Domain & Cross-Linguistic Corpus for Low Resource NMT: A Case Study On Bhili-Hindi-English Parallel Corpus | Pooja Singh et.al. | 2511.00486 | null |
| 2025-10-31 | POSESTITCH-SLT: Linguistically Inspired Pose-Stitching for End-to-End Sign Language Translation | Abhinav Joshi et.al. | 2511.00270 | null |
| 2025-10-31 | Multilingual BERT language model for medical tasks: Evaluation on domain-specific adaptation and cross-linguality | Yinghao Luo et.al. | 2510.27552 | null |
| 2025-10-31 | TransAlign: Machine Translation Encoders are Strong Word Aligners, Too | Benedikt Ebing et.al. | 2510.27337 | null |
| 2025-10-31 | Languages are Modalities: Cross-Lingual Alignment via Encoder Injection | Rajan Agarwal et.al. | 2510.27254 | null |
| 2025-10-31 | Simple Additions, Substantial Gains: Expanding Scripts, Languages, and Lineage Coverage in URIEL+ | Mason Shipton et.al. | 2510.27183 | null |
| 2025-10-30 | Distilling Multilingual Vision-Language Models: When Smaller Models Stay Multilingual | Sukrit Sriratanawilai et.al. | 2510.26271 | null |
| 2025-10-29 | Rethinking Cross-lingual Alignment: Balancing Transfer and Cultural Erasure in Multilingual LLMs | HyoJung Han et.al. | 2510.26024 | null |
| 2025-10-29 | Semantic Label Drift in Cross-Cultural Translation | Mohsinul Kabir et.al. | 2510.25967 | null |
| 2025-11-04 | Hybrid Quantum-Classical Recurrent Neural Networks | Wenduan Xu et.al. | 2510.25557 | null |
| 2025-10-29 | Testing Cross-Lingual Text Comprehension In LLMs Using Next Sentence Prediction | Ritesh Sunil Chavan et.al. | 2510.25187 | null |
| 2025-10-29 | Pretraining Strategies using Monolingual and Parallel Data for Low-Resource Machine Translation | Idriss Nguepi Nguefack et.al. | 2510.25116 | null |
| 2025-10-27 | Cross-Lingual Summarization as a Black-Box Watermark Removal Attack | Gokul Ganesan et.al. | 2510.24789 | null |
| 2025-10-28 | MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation | Parker Riley et.al. | 2510.24664 | null |
| 2025-10-28 | Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation | Snegha A et.al. | 2510.24619 | null |
| 2025-10-28 | Ko-MuSR: A Multistep Soft Reasoning Benchmark for LLMs Capable of Understanding Korean | Chanwoo Park et.al. | 2510.24150 | null |
| 2025-10-28 | Challenging Multilingual LLMs: A New Taxonomy and Benchmark for Unraveling Hallucination in Translation | Xinwei Wu et.al. | 2510.24073 | null |
| 2025-10-27 | AfriMTEB and AfriE5: Benchmarking and Adapting Text Embedding Models for African Languages | Kosei Uemura et.al. | 2510.23896 | null |
| 2025-10-27 | A U-Net and Transformer Pipeline for Multilingual Image Translation | Siddharth Sahay et.al. | 2510.23554 | null |
| 2025-10-27 | Quality-Aware Translation Tagging in Multilingual RAG system | Hoyeon Moon et.al. | 2510.23070 | null |
| 2025-10-27 | Cross-Lingual Sponsored Search via Dual-Encoder and Graph Neural Networks for Context-Aware Query Translation in Advertising Platforms | Ziyang Gao et.al. | 2510.22957 | null |
| 2025-10-26 | Iterative Layer Pruning for Efficient Translation Inference | Yasmin Moslem et.al. | 2510.22763 | null |
| 2025-11-05 | TraceTrans: Translation and Spatial Tracing for Surgical Prediction | Xiyu Luo et.al. | 2510.22379 | null |
| 2025-10-24 | Penalizing Length: Uncovering Systematic Bias in Quality Estimation Metrics | Yilin Zhang et.al. | 2510.22028 | null |
| 2025-10-24 | Estonian Native Large Language Model Benchmark | Helena Grete Lillepalu et.al. | 2510.21193 | null |
| 2025-10-24 | Bridging Language Gaps with Adaptive RAG: Improving Indonesian Language Question Answering | William Christian et.al. | 2510.21068 | null |
| 2025-10-23 | Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost | Runzhe Zhan et.al. | 2510.20780 | null |
| 2025-10-23 | Structure-Conditional Minimum Bayes Risk Decoding | Bryan Eikema et.al. | 2510.20700 | null |
| 2025-10-23 | Assessing the Political Fairness of Multilingual LLMs: A Case Study based on a 21-way Multiparallel EuroParl Dataset | Paul Lerner et.al. | 2510.20508 | null |
| 2025-10-22 | Conditions for Catastrophic Forgetting in Multilingual Translation | Danni Liu et.al. | 2510.19546 | null |
| 2025-10-22 | Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition | Yuu Jinnai et.al. | 2510.19471 | null |
| 2025-10-22 | Spatio-temporal Sign Language Representation and Translation | Yasser Hamidullah et.al. | 2510.19413 | null |
| 2025-10-22 | SONAR-SLT: Multilingual Sign Language Translation via Language-Agnostic Sentence Embedding Supervision | Yasser Hamidullah et.al. | 2510.19398 | null |
| 2025-10-22 | Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges | Cheng Huang et.al. | 2510.19144 | null |
| 2025-10-20 | Transformer-Based Low-Resource Language Translation: A Study on Standard Bengali to Sylheti | Mangsura Kabir Oni et.al. | 2510.18898 | null |
| 2025-10-21 | SemiAdapt and SemiLoRA: Efficient Domain Adaptation for Transformer-based Low-Resource Language Translation with a Case Study on Irish | Josh McGiff et.al. | 2510.18725 | null |
| 2025-10-20 | Lingua Custodi’s participation at the WMT 2025 Terminology shared task | Jingshu Liu et.al. | 2510.17504 | null |
| 2025-10-20 | Evaluating Large Language Models on Urdu Idiom Translation | Muhammad Farmal Khan et.al. | 2510.17460 | null |
| 2025-10-19 | Zero-Shot Performance Prediction for Probabilistic Scaling Laws | Viktoria Schram et.al. | 2510.16743 | null |
| 2025-10-17 | On Non-interactive Evaluation of Animal Communication Translators | Orr Paradise et.al. | 2510.15768 | null |
| 2025-10-16 | Predicting Task Performance with Context-aware Scaling Laws | Kyle Montgomery et.al. | 2510.14919 | null |
| 2025-10-16 | Semantic Prosody in Machine Translation: the English-Chinese Case of Passive Structures | Xinyue Ma et.al. | 2510.14662 | null |
| 2025-10-16 | LiRA: Linguistic Robust Anchoring for Cross-lingual Large Language Models | Haolin Li et.al. | 2510.14466 | null |
| 2025-10-16 | From Binary to Bilingual: How the National Weather Service is Using Artificial Intelligence to Develop a Comprehensive Translation Program | Joseph E. Trujillo-Falcon et.al. | 2510.14369 | null |
| 2025-10-15 | Beyond Single-Reward: Multi-Pair, Multi-Perspective Preference Optimization for Machine Translation | Hao Wang et.al. | 2510.13434 | null |
| 2025-10-15 | A fully automated and scalable Parallel Data Augmentation for Low Resource Languages using Image and Text Analytics | Prawaal Sharma et.al. | 2510.13211 | null |
| 2025-10-15 | ACADATA: Parallel Dataset of Academic Data for Machine Translation | Iñaki Lacunza et.al. | 2510.12621 | null |
| 2025-10-14 | Uncertainty Quantification for Hallucination Detection in Large Language Models: Foundations, Methodology, and Future Directions | Sungmin Kang et.al. | 2510.12040 | null |
| 2025-10-13 | LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens | Armel Zebaze et.al. | 2510.11919 | null |
| 2025-10-12 | Bhasha-Rupantarika: Algorithm-Hardware Co-design approach for Multilingual Neural Machine Translation | Mukul Lokhande et.al. | 2510.10676 | null |
| 2025-10-11 | End-to-end Automatic Speech Recognition and Speech Translation: Integration of Speech Foundational Models and LLMs | Nam Luu et.al. | 2510.10329 | null |
| 2025-10-11 | Toward Machine Translation Literacy: How Lay Users Perceive and Rely on Imperfect Translations | Yimin Xiao et.al. | 2510.09994 | null |
| 2025-10-10 | Evaluating Robustness of Large Language Models Against Multilingual Typographical Errors | Yihong Liu et.al. | 2510.09536 | null |
| 2025-10-13 | DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation | Enze Zhang et.al. | 2510.09116 | null |
| 2025-10-10 | Quality Estimation Reranking for Document-Level Translation | Krzysztof Mrozinski et.al. | 2510.08870 | null |
| 2025-10-31 | Ready to Translate, Not to Represent? Bias and Performance Gaps in Multilingual LLMs Across Language Families and Domains | Md. Faiyaz Abdullah Sayeedi et.al. | 2510.07877 | null |
| 2025-10-08 | LuxInstruct: A Cross-Lingual Instruction Tuning Dataset For Luxembourgish | Fred Philippy et.al. | 2510.07074 | null |
| 2025-10-08 | Revisiting Metric Reliability for Fine-grained Evaluation of Machine Translation and Summarization in Indian Languages | Amir Hossein Yari et.al. | 2510.07061 | null |
| 2025-10-08 | GAMBIT+: A Challenge Set for Evaluating Gender Bias in Machine Translation Quality Estimation Metrics | Giorgos Filandrianos et.al. | 2510.06841 | null |
| 2025-10-08 | Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks | Qinhao Zhou et.al. | 2510.06695 | null |
| 2025-10-11 | TRepLiNa: Layer-wise CKA+REPINA Alignment Improves Low-Resource Machine Translation in Aya-23 8B | Toshiki Nakai et.al. | 2510.06249 | null |
| 2025-10-01 | SynCED-EnDe 2025: A Synthetic and Curated English - German Dataset for Critical Error Detection in Machine Translation | Muskaan Chopra et.al. | 2510.05144 | null |
| 2025-09-27 | Trainable Reference-Based Evaluation Metric for Identifying Quality of English-Gujarati Machine Translation System | Nisheeth Joshi et.al. | 2510.05113 | null |
| 2025-10-05 | Enhancing OCR for Sino-Vietnamese Language Processing via Fine-tuned PaddleOCRv5 | Minh Hoang Nguyen et.al. | 2510.04003 | null |
| 2025-10-04 | Rezwan: Leveraging Large Language Models for Comprehensive Hadith Text Processing: A 1.2M Corpus Development | Majid Asgari-Bidhendi et.al. | 2510.03781 | null |
| 2025-10-04 | TreePrompt: Leveraging Hierarchical Few-Shot Example Selection for Improved English-Persian and English-German Translation | Ramtin Kakavand et.al. | 2510.03748 | null |
| 2025-09-30 | Scaling Laws Revisited: Modeling the Role of Data Quality in Language Model Pretraining | Anirudh Subramanyam et.al. | 2510.03313 | null |
| 2025-09-24 | GemDetox at TextDetox CLEF 2025: Enhancing a Massively Multilingual Model for Text Detoxification on Low-resource Languages | Trung Duc Anh Dang et.al. | 2510.01250 | null |
| 2025-10-01 | Exposing the Cracks: Vulnerabilities of Retrieval-Augmented LLM-based Machine Translation | Yanming Sun et.al. | 2510.00829 | null |
| 2025-10-02 | Tenyidie Syllabification corpus creation and deep learning applications | Teisovi Angami et.al. | 2510.00629 | null |
| 2025-09-30 | Searching for Difficult-to-Translate Test Examples at Scale | Wenda Xu et.al. | 2509.26619 | null |
| 2025-10-02 | Generating Difficult-to-Translate Texts | Vilém Zouhar et.al. | 2509.26592 | null |
| 2025-09-29 | Don’t Sweat the Small Stuff: Segment-Level Meta-Evaluation Based on Pairwise Difference Correlation | Colten DiIanni et.al. | 2509.25546 | null |
| 2025-09-29 | Aligning Multilingual Reasoning with Verifiable Semantics from a High-Resource Expert Model | Fahim Faisal et.al. | 2509.25543 | null |
| 2025-09-29 | ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation | Jiuhong Xiao et.al. | 2509.24878 | null |
| 2025-10-02 | The Hidden Costs of Translation Accuracy: Distillation, Quantization, and Environmental Impact | Dhaathri Vijay et.al. | 2509.23990 | null |
| 2025-09-27 | Liaozhai through the Looking-Glass: On Paratextual Explicitation of Culture-Bound Terms in Machine Translation | Sherrie Shen et.al. | 2509.23395 | null |
| 2025-09-26 | From tests to effect sizes: Quantifying uncertainty and statistical variability in multilingual and multitask NLP evaluation benchmarks | Jonne Sälevä et.al. | 2509.22612 | null |
| 2025-09-26 | JGU Mainz’s Submission to the WMT25 Shared Task on LLMs with Limited Resources for Slavic Languages: MT and QA | Hossain Shaikh Saadi et.al. | 2509.22490 | null |
| 2025-09-26 | MO-GRPO: Mitigating Reward Hacking of Group Relative Policy Optimization on Multi-Objective Problems | Yuki Ichihara et.al. | 2509.22047 | null |
| 2025-09-25 | “Be My Cheese?”: Assessing Cultural Nuance in Multilingual LLM Translations | Madison Van Doren et.al. | 2509.21577 | null |
| 2025-09-24 | SiniticMTError: A Machine Translation Dataset with Error Annotations for Sinitic Languages | Hannah Liu et.al. | 2509.20557 | null |
| 2025-09-24 | Feeding Two Birds or Favoring One? Adequacy-Fluency Tradeoffs in Evaluation and Meta-Evaluation of Machine Translation | Behzad Shayegh et.al. | 2509.20287 | null |
| 2025-09-24 | Low-Resource English-Tigrinya MT: Leveraging Multilingual Models, Custom Tokenizers, and Clean Evaluation Benchmarks | Hailay Kidu Teklehaymanot et.al. | 2509.20209 | null |
| 2025-09-24 | CorIL: Towards Enriching Indian Language to Indian Language Parallel Corpora and Machine Translation Systems | Soham Bhattacharjee et.al. | 2509.19941 | null |
| 2025-09-24 | EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation | Sen Yang et.al. | 2509.19770 | null |
| 2025-09-23 | Evaluating Language Translation Models by Playing Telephone | Syeda Jannatus Saba et.al. | 2509.19611 | null |
| 2025-09-22 | Transformer-Encoder Trees for Efficient Multilingual Machine Translation and Speech Translation | Yiwen Guan et.al. | 2509.17930 | null |
| 2025-09-22 | Specification-Aware Machine Translation and Evaluation for Purpose Alignment | Yoko Kayano et.al. | 2509.17559 | null |
| 2025-09-22 | Enhancing Cross-Lingual Transfer through Reversible Transliteration: A Huffman-Based Approach for Low-Resource Languages | Wenhao Zhuang et.al. | 2509.17493 | null |
| 2025-10-10 | Filling in the Clinical Gaps in Benchmark: Case for HealthBench for the Japanese medical system | Shohei Hisada et.al. | 2509.17444 | null |
| 2025-09-22 | Scaling, Simplification, and Adaptation: Lessons from Pretraining on Machine-Translated Text | Dan John Velasco et.al. | 2509.17317 | null |
| 2025-09-22 | JPResUnet: A Joint Probability Density Function Translation Model in Partially Premixed Flames | Hanying Yang et.al. | 2509.17297 | null |
| 2025-09-21 | Extending Automatic Machine Translation Evaluation to Book-Length Documents | Kuang-Da Wang et.al. | 2509.17249 | null |
| 2025-09-21 | CUTE: A Multilingual Dataset for Enhancing Cross-Lingual Knowledge Transfer in Low-Resource Languages | Wenhao Zhuang et.al. | 2509.16914 | null |
| 2025-09-20 | Angular Dispersion Accelerates $k$ -Nearest Neighbors Machine Translation | Evgeniia Tokarchuk et.al. | 2509.16729 | null |
| 2025-09-19 | Whisper-UT: A Unified Translation Framework for Speech and Text | Cihan Xiao et.al. | 2509.16375 | null |
| 2025-09-19 | UPRPRC: Unified Pipeline for Reproducing Parallel Resources – Corpus from the United Nations | Qiuyang Lu et.al. | 2509.15789 | null |
| 2025-10-23 | Multilingual LLM Prompting Strategies for Medical English-Vietnamese Machine Translation | Nhu Vo et.al. | 2509.15640 | null |
| 2025-09-18 | RulER: Automated Rule-Based Semantic Error Localization and Repair for Code Translation | Shuo Jin et.al. | 2509.14829 | null |
| 2025-09-18 | Evaluating Large Language Models for Cross-Lingual Retrieval | Longfei Zuo et.al. | 2509.14749 | null |
| 2025-09-17 | Translate, then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification | Samuel J. Bell et.al. | 2509.14493 | null |
| 2025-09-17 | You Are What You Train: Effects of Data Composition on Training Context-aware Machine Translation Models | Paweł Mąka et.al. | 2509.14031 | null |
| 2025-09-17 | Audio-Based Crowd-Sourced Evaluation of Machine Translation Quality | Sami Ul Haq et.al. | 2509.14023 | null |
| 2025-09-17 | Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale | Hasan Abed Al Kader Hammoud et.al. | 2509.14008 | null |
| 2025-09-17 | Long-context Reference-based MT Quality Estimation | Sami Ul Haq et.al. | 2509.13980 | null |
| 2025-09-20 | Data Augmentation for Maltese NLP using Transliterated and Machine Translated Arabic Data | Kurt Micallef et.al. | 2509.12853 | null |
| 2025-10-06 | Human + AI for Accelerating Ad Localization Evaluation | Harshit Rajgarhia et.al. | 2509.12543 | null |
| 2025-09-15 | A comparison of pipelines for the translation of a low resource language based on transformers | Chiara Bonfanti et.al. | 2509.12514 | null |
| 2025-09-14 | PATIMT-Bench: A Multi-Scenario Benchmark for Position-Aware Text Image Machine Translation in Large Vision-Language Models | Wanru Zhuang et.al. | 2509.12278 | null |
| 2025-09-15 | XplaiNLP at CheckThat! 2025: Multilingual Subjectivity Detection with Finetuned Transformers and Prompt-Based Inference with Large Language Models | Ariana Sahitaj et.al. | 2509.12130 | null |
| 2025-09-04 | Optimal Multi-Task Learning at Regularization Horizon for Speech Translation Task | JungHo Jung et.al. | 2509.09701 | null |
| 2025-09-11 | Mitigating Language Barriers in Education: Developing Multilingual Digital Learning Materials with Machine Translation | Lucie Poláková et.al. | 2509.09473 | null |
| 2025-09-09 | Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost | Mihai Nadas et.al. | 2509.07829 | null |
| 2025-10-18 | From Scarcity to Efficiency: Investigating the Effects of Data Augmentation on African Machine Translation | Mardiyyah Oduwole et.al. | 2509.07471 | null |
| 2025-09-09 | Hunyuan-MT Technical Report | Mao Zheng et.al. | 2509.05209 | null |
| 2025-09-05 | PRIM: Towards Practical In-Image Multilingual Machine Translation | Yanzhi Tian et.al. | 2509.05146 | null |
| 2025-09-28 | Artificially Fluent: Swahili AI Performance Benchmarks Between English-Trained and Natively-Trained Datasets | Sophie Jaffer et.al. | 2509.04516 | null |
| 2025-09-04 | Exploring NLP Benchmarks in an Extremely Low-Resource Setting | Ulin Nuha et.al. | 2509.03962 | null |
| 2025-09-04 | Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation | Jiaxin Guo et.al. | 2509.03809 | null |
| 2025-09-24 | Expanding the WMT24++ Benchmark with Rumantsch Grischun, Sursilvan, Sutsilvan, Surmiran, Puter, and Vallader | Jannis Vamvas et.al. | 2509.03148 | null |
| 2025-09-02 | The Forgotten Code: Validating a Century-Old Translation System with AI | Jean-Marie Le Ray et.al. | 2509.02506 | null |
| 2025-09-18 | CSRM-LLM: Embracing Multilingual LLMs for Cold-Start Relevance Matching in Emerging E-commerce Markets | Yujing Wang et.al. | 2509.01566 | null |
| 2025-08-28 | The Uneven Impact of Post-Training Quantization in Machine Translation | Benjamin Marie et.al. | 2508.20893 | null |
| 2025-08-28 | Languages Still Left Behind: Toward a Better Multilingual Machine Translation Benchmark | Chihiro Taguchi et.al. | 2508.20511 | null |
| 2025-09-06 | FlowMalTrans: Unsupervised Binary Code Translation for Malware Detection Using Flow-Adapter Architecture | Minghao Hu et.al. | 2508.20212 | null |
| 2025-08-26 | Improving Low-Resource Translation with Dictionary-Guided Fine-Tuning and RL: A Spanish-to-Wayuunaiki Study | Manuel Mosquera et.al. | 2508.19481 | null |
| 2025-09-03 | The Ramon Llull’s Thinking Machine for Automated Ideation | Xinran Zhao et.al. | 2508.19200 | null |
| 2025-10-10 | LaTeXTrans: Structured LaTeX Translation with Multi-Agent Coordination | Ziming Zhu et.al. | 2508.18791 | null |
| 2025-08-26 | A New NMT Model for Translating Clinical Texts from English to Spanish | Rumeng Li et.al. | 2508.18607 | null |
| 2025-08-25 | COMET-poly: Machine Translation Metric Grounded in Other Candidates | Maike Züfle et.al. | 2508.18549 | null |
| 2025-08-24 | Evaluating the Impact of Verbal Multiword Expressions on Machine Translation | Linfeng Liu et.al. | 2508.17458 | null |
| 2025-08-22 | Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish | Yakup Abrek Er et.al. | 2508.16431 | null |
| 2025-08-22 | The Mediomatix Corpus: Parallel Data for Romansh Idioms via Comparable Schoolbooks | Zachary Hopton et.al. | 2508.16371 | null |
| 2025-10-06 | OpenWHO: A Document-Level Parallel Corpus for Health Translation in Low-Resource Languages | Raphaël Merx et.al. | 2508.16048 | null |
| 2025-08-21 | Confidence-Modulated Speculative Decoding for Large Language Models | Jaydip Sen et.al. | 2508.15371 | null |
| 2025-08-20 | Improving LLMs for Machine Translation Using Synthetic Preference Data | Dario Vajda et.al. | 2508.14951 | null |
| 2025-08-24 | Preliminary Ranking of WMT25 General Machine Translation Systems | Tom Kocmi et.al. | 2508.14909 | null |
| 2025-08-20 | Filling the Gap for Uzbek: Creating Translation Resources for Southern Uzbek | Mukhammadsaid Mamasaidov et.al. | 2508.14586 | null |
| 2025-08-20 | In2x at WMT25 Translation Task | Lei Pang et.al. | 2508.14472 | null |
| 2025-08-18 | Overcoming Latency Bottlenecks in On-Device Speech Translation: A Cascaded Approach with Alignment-Based Streaming MT | Zeeshan Ahmed et.al. | 2508.13358 | null |
| 2025-09-29 | DocHPLT: A Massively Multilingual Document-Level Translation Dataset | Dayyán O’Brien et.al. | 2508.13079 | null |
| 2025-08-18 | From SALAMANDRA to SALAMANDRATA: BSC Submission for WMT25 General Machine Translation Shared Task | Javier Garcia Gilabert et.al. | 2508.12774 | null |
| 2025-08-25 | SEA-BED: Southeast Asia Embedding Benchmark | Wuttikorn Ponwitayarat et.al. | 2508.12243 | null |
| 2025-08-14 | Neural Machine Translation for Coptic-French: Strategies for Low-Resource Ancient Languages | Nasma Chaoui et.al. | 2508.10683 | null |
| 2025-08-14 | Evaluating LLMs on Chinese Idiom Translation | Cai Yang et.al. | 2508.10421 | null |
| 2025-08-28 | Estimating Machine Translation Difficulty | Lorenzo Proietti et.al. | 2508.10175 | null |
| 2025-08-12 | TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation | Armel Zebaze et.al. | 2508.08680 | null |
| 2025-08-12 | UWB at WASSA-2024 Shared Task 2: Cross-lingual Emotion Detection | Jakub Šmíd et.al. | 2508.08650 | null |
| 2025-08-11 | Toward Machine Interpreting: Lessons from Human Interpreting Studies | Matthias Sperber et.al. | 2508.07964 | null |
| 2025-08-10 | ALOPE: Adaptive Layer Optimization for Translation Quality Estimation using Large Language Models | Archchana Sindhujan et.al. | 2508.07484 | null |
| 2025-08-08 | Testing the Limits of Machine Translation from One Book | Jonathan Shaw et.al. | 2508.06665 | null |
| 2025-08-08 | Train It and Forget It: Merge Lists are Unnecessary for BPE Inference in Language Models | Tomohiro Sawada et.al. | 2508.06621 | null |
| 2025-08-07 | PEACH: A sentence-aligned Parallel English-Arabic Corpus for Healthcare | Rania Al-Sabbagh et.al. | 2508.05722 | null |
| 2025-08-07 | MELLA: Bridging Linguistic Capability and Cultural Groundedness for Low-Resource Language MLLMs | Yufei Gao et.al. | 2508.05502 | null |
| 2025-08-07 | Optimal Corpus Aware Training for Neural Machine Translation | Yi-Hsiu Liao et.al. | 2508.05364 | null |
| 2025-08-11 | REINA: Regularized Entropy Information-Based Loss for Efficient Simultaneous Speech Translation | Nameer Hirschkind et.al. | 2508.04946 | null |
| 2025-08-05 | Marito: Structuring and Building Open Multilingual Terminologies for South African NLP | Vukosi Marivate et.al. | 2508.03529 | null |
| 2025-08-05 | Investigation on deep learning-based galaxy image translation models | Hengxin Ruan et.al. | 2508.03291 | null |
| 2025-08-05 | Cross-lingual Opinions and Emotions Mining in Comparable Documents | Motaz Saad et.al. | 2508.03112 | null |
| 2025-08-04 | A Survey on Data Security in Large Language Models | Kang Chen et.al. | 2508.02312 | null |
| 2025-08-04 | A French Version of the OLDI Seed Corpus | Malik Marmonier et.al. | 2508.02290 | null |
| 2025-08-04 | SHAMI-MT: A Syrian Arabic Dialect to Modern Standard Arabic Bidirectional Machine Translation System | Serry Sibaee et.al. | 2508.02268 | null |
| 2025-08-25 | CultureGuard: Towards Culturally-Aware Dataset and Guard Model for Multilingual Safety Applications | Raviraj Joshi et.al. | 2508.01710 | null |
| 2025-08-02 | ArzEn-MultiGenre: An aligned parallel dataset of Egyptian Arabic song lyrics, novels, and subtitles, with English translations | Rania Al-Sabbagh et.al. | 2508.01411 | null |
| 2025-09-16 | Sample-Aware Test-Time Adaptation for Medical Image-to-Image Translation | Irene Iele et.al. | 2508.00766 | null |
| 2025-07-31 | Arabic Hate Speech Identification and Masking in Social Media using Deep Learning Models and Pre-trained Models Fine-tuning | Salam Thabet Doghmash et.al. | 2507.23661 | null |
| 2025-07-31 | Beyond the Cloud: Assessing the Benefits and Drawbacks of Local LLM Deployment for Translators | Peter Sandrini et.al. | 2507.23399 | null |
| 2025-07-29 | RL from Teacher-Model Refinement: Gradual Imitation Learning for Machine Translation | Dongyub Jude Lee et.al. | 2507.22219 | null |
| 2025-07-31 | Multi-Hypothesis Distillation of Multilingual Neural Translation Models for Low-Resource Languages | Aarón Galiano-Jiménez et.al. | 2507.21568 | null |
| 2025-07-07 | iLSU-T: an Open Dataset for Uruguayan Sign Language Translation | Ariel E. Stassi et.al. | 2507.21104 | null |
| 2025-07-28 | Multilingual Self-Taught Faithfulness Evaluators | Carlo Alfano et.al. | 2507.20752 | null |
| 2025-09-02 | Advancing Dialectal Arabic to Modern Standard Arabic Machine Translation | Abdullah Alabdullah et.al. | 2507.20301 | null |
| 2025-07-29 | Mind the Language Gap in Digital Humanities: LLM-Aided Translation of SKOS Thesauri | Felix Kraus et.al. | 2507.19537 | null |
| 2025-07-25 | LLaVA-NeuMT: Selective Layer-Neuron Modulation for Efficient Multilingual Multimodal Translation | Jingxuan Wei et.al. | 2507.18940 | null |
| 2025-07-24 | GIIFT: Graph-guided Inductive Image-free Multimodal Machine Translation | Jiafeng Xiong et.al. | 2507.18562 | null |
| 2025-07-24 | Uncertainty Quantification for Evaluating Machine Translation Bias | Ieva Raminta Staliūnaitė et.al. | 2507.18338 | null |
| 2025-07-25 | Natural Language Processing for Tigrinya: Current State and Future Directions | Fitsum Gaim et.al. | 2507.17974 | null |
| 2025-07-23 | Dual-branch Prompting for Multimodal Machine Translation | Jie Wang et.al. | 2507.17588 | null |
| 2025-07-22 | Introducing Quality Estimation to Machine Translation Post-editing Workflow: An Empirical Study on Its Usefulness | Siqi Liu et.al. | 2507.16515 | null |
| 2025-07-22 | GG-BBQ: German Gender Bias Benchmark for Question Answering | Shalaka Satheesh et.al. | 2507.16410 | null |
| 2025-07-21 | Evaluating Text Style Transfer: A Nine-Language Benchmark for Text Detoxification | Vitaly Protasov et.al. | 2507.15557 | null |
| 2025-07-20 | A Case Against Implicit Standards: Homophone Normalization in Machine Translation for Languages that use the Ge’ez Script | Hellina Hailu Nigatu et.al. | 2507.15142 | null |
| 2025-08-21 | Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters | Shanbo Cheng et.al. | 2507.13618 | null |
| 2025-07-16 | Mitigating Stylistic Biases of Machine Translation Systems via Monolingual Corpora Only | Xuanqi Gao et.al. | 2507.13395 | null |
| 2025-07-16 | The first open machine translation system for the Chechen language | Abu-Viskhan A. Umishov et.al. | 2507.12672 | null |
| 2025-09-19 | Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese | Yikang Liu et.al. | 2507.12260 | null |
| 2025-07-16 | Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models | Bo Zeng et.al. | 2507.11882 | null |
| 2025-07-31 | ILID: Native Script Language Identification for Indian Languages | Yash Ingle et.al. | 2507.11832 | null |
| 2025-08-30 | How Important is `Perfect’ English for Machine Translation Prompts? | Patrícia Schmidtová et.al. | 2507.09509 | null |
| 2025-07-11 | Improving MLLM’s Document Image Machine Translation via Synchronously Self-reviewing Its OCR Proficiency | Yupu Liang et.al. | 2507.08309 | null |
| 2025-07-10 | Conditional Unigram Tokenization with Parallel Data | Gianluca Vico et.al. | 2507.07824 | null |
| 2025-07-10 | Single-to-mix Modality Alignment with Multimodal Large Language Model for Document Image Machine Translation | Yupu Liang et.al. | 2507.07572 | null |
| 2025-07-09 | Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation | Kazi Mahathir Rahman et.al. | 2507.06530 | null |
| 2025-07-09 | Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings | Russell Taylor et.al. | 2507.06506 | null |
| 2025-07-07 | A Tale of Two Scripts: Transliteration and Post-Correction for Judeo-Arabic | Juan Moreno Gonzalez et.al. | 2507.04746 | null |
| 2025-07-09 | Losing our Tail – Again: On (Un)Natural Selection And Multilingual Large Language Models | Eva Vanmassenhove et.al. | 2507.03933 | null |
| 2025-07-17 | Learning to Translate Ambiguous Terminology by Preference Optimization on Post-Edits | Nathaniel Berger et.al. | 2507.03580 | null |
| 2025-07-04 | GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation | Himanshu Dutta et.al. | 2507.03311 | null |
| 2025-07-01 | TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation | Xi Xuan et.al. | 2507.00875 | null |
| 2025-07-01 | Neural translation for Stokes inversion and synthesis | A. Asensio Ramos et.al. | 2507.00594 | null |
| 2025-06-30 | Natural language processing for African languages | David Ifeoluwa Adelani et.al. | 2507.00297 | link |
| 2025-06-30 | Bridging the Gap with Retrieval-Augmented Generation: Making Prosthetic Device User Manuals Available in Marginalised Languages | Ikechukwu Ogbonna et.al. | 2506.23958 | null |
| 2025-07-07 | CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation | Yi Liu et.al. | 2506.23347 | null |
| 2025-05-12 | Do Not Change Me: On Transferring Entities Without Modification in Neural Machine Translation – a Multilingual Perspective | Dawid Wisniewski et.al. | 2505.06010 | null |
| 2024-12-30 | Advancing Explainability in Neural Machine Translation: Analytical Metrics for Attention and Alignment Consistency | Anurag Mishra et.al. | 2412.18669 | null |
| 2025-07-31 | Instruction-tuned Large Language Models for Machine Translation in the Medical Domain | Miguel Rios et.al. | 2408.16440 | null |
| 2024-08-07 | Conditioning LLMs with Emotion in Neural Machine Translation | Charles Brazier et.al. | 2408.03150 | null |
| 2024-06-11 | CantonMT: Cantonese to English NMT Platform with Fine-Tuned Models Using Synthetic Back-Translation Data | Kung Yin Hong et.al. | 2403.11346 | null |
| 2023-11-21 | Context-aware Neural Machine Translation for English-Japanese Business Scene Dialogues | Sumire Honda et.al. | 2311.11976 | null |
| 2023-11-01 | Is Robustness Transferable across Languages in Multilingual Neural Machine Translation? | Leiyu Pan et.al. | 2310.20162 | null |
| 2023-08-28 | Ngambay-French Neural Machine Translation (sba-Fr) | Sakayo Toadoum Sari et.al. | 2308.13497 | null |
| 2023-07-18 | Enhancing Supervised Learning with Contrastive Markings in Neural Machine Translation Training | Nathaniel Berger et.al. | 2307.08416 | null |
| 2023-05-29 | Gender Lost In Translation: How Bridging The Gap Between Languages Affects Gender Bias in Zero-Shot Multilingual Translation | Lena Cabrera et.al. | 2305.16935 | null |
| 2023-05-15 | Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization | Pengzhi Gao et.al. | 2305.07310 | null |
| 2022-09-07 | Informative Language Representation Learning for Massively Multilingual Neural Machine Translation | Renren Jin et.al. | 2209.01530 | null |
| 2022-08-16 | Fast Vocabulary Projection Method via Clustering for Multilingual Machine Translation on GPU | Hossam Amer et.al. | 2208.06874 | null |
| 2022-08-25 | UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation | Jian Yang et.al. | 2207.04900 | null |
| 2022-04-12 | MMTAfrica: Multilingual Machine Translation for African Languages | Chris C. Emezue et.al. | 2204.04306 | null |
| 2022-03-09 | ViNMT: Neural Machine Translation Toolkit | Nguyen Hoang Quan et.al. | 2112.15272 | null |
| 2021-12-23 | English2Gbe: A multilingual machine translation model for {Fon/Ewe}Gbe | Gilles Hacheme et.al. | 2112.11482 | null |
| 2022-04-14 | Towards Making the Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation | Guanhua Chen et.al. | 2110.08547 | null |
| 2021-09-10 | Generalised Unsupervised Domain Adaptation of Neural Machine Translation with Cross-Lingual Data Selection | Thuy-Trang Vu et.al. | 2109.04292 | null |
| 2021-11-08 | Zero-shot Cross-lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders | Guanhua Chen et.al. | 2104.08757 | null |
| 2020-11-04 | Cross-lingual Word Embeddings beyond Zero-shot Machine Translation | Shifei Chen et.al. | 2011.01682 | null |
| 2020-10-21 | Complete Multilingual Neural Machine Translation | Markus Freitag et.al. | 2010.10239 | null |
| 2020-10-20 | Diving Deep into Context-Aware Neural Machine Translation | Jingjing Huo et.al. | 2010.09482 | null |
| 2022-03-15 | Rethinking Document-level Neural Machine Translation | Zewei Sun et.al. | 2010.08961 | null |
| 2020-10-07 | Multi-task Learning for Multilingual Neural Machine Translation | Yiren Wang et.al. | 2010.02523 | null |
| 2020-10-16 | Very Deep Transformers for Neural Machine Translation | Xiaodong Liu et.al. | 2008.07772 | null |
| 2020-08-10 | A Multilingual Neural Machine Translation Model for Biomedical Data | Alexandre Bérard et.al. | 2008.02878 | null |
| 2020-05-27 | The Unreasonable Volatility of Neural Machine Translation Models | Marzieh Fadaee et.al. | 2005.12398 | null |
| 2020-05-12 | Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation | Aditya Siddhant et.al. | 2005.04816 | null |
| 2020-04-27 | Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation | Biao Zhang et.al. | 2004.11867 | null |
| 2020-12-10 | Transfer learning and subword sampling for asymmetric-resource one-to-many neural translation | Stig-Arne Grönroos et.al. | 2004.04002 | null |
| 2021-04-02 | Cross-lingual Supervision Improves Unsupervised Neural Machine Translation | Mingxuan Wang et.al. | 2004.03137 | null |
| 2020-02-21 | Compositional Neural Machine Translation by Removing the Lexicon from Syntax | Tristan Thrush et.al. | 2002.08899 | null |
| 2020-01-08 | A Comprehensive Survey of Multilingual Neural Machine Translation | Raj Dabre et.al. | 2001.01115 | null |
| 2019-12-30 | A Study of Multilingual Neural Machine Translation | Xu Tan et.al. | 1912.11625 | null |
| 2019-12-09 | Pairwise Neural Machine Translation Evaluation | Francisco Guzman et.al. | 1912.03135 | null |
| 2019-12-09 | Machine Translation Evaluation Meets Community Question Answering | Francisco Guzmán et.al. | 1912.02998 | null |
| 2020-10-01 | Neural Machine Translation: A Review and Survey | Felix Stahlberg et.al. | 1912.02047 | null |
| 2019-12-04 | Cross-lingual Pre-training Based Transfer for Zero-shot Neural Machine Translation | Baijun Ji et.al. | 1912.01214 | null |
| 2019-10-31 | Adapting Multilingual Neural Machine Translation to Unseen Languages | Surafel M. Lakew et.al. | 1910.13998 | null |
| 2019-10-29 | Multitask Learning For Different Subword Segmentations In Neural Machine Translation | Tejas Srinivasan et.al. | 1910.12368 | null |
| 2019-10-22 | On the Importance of Word Boundaries in Character-level Neural Machine Translation | Duygu Ataman et.al. | 1910.06753 | null |
| 2019-10-02 | Interrogating the Explanatory Power of Attention in Neural Machine Translation | Pooya Moradi et.al. | 1910.00139 | null |
| 2019-09-25 | Data Ordering Patterns for Neural Machine Translation: An Empirical Study | Siddhant Garg et.al. | 1909.10642 | null |
| 2019-09-17 | Multilingual Neural Machine Translation for Zero-Resource Languages | Surafel M. Lakew et.al. | 1909.07342 | link |
| 2019-11-13 | Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation | Aditya Siddhant et.al. | 1909.00437 | null |
| 2019-10-09 | Transductive Data-Selection Algorithms for Fine-Tuning Neural Machine Translation | Alberto Poncelas et.al. | 1908.09532 | null |
| 2019-08-27 | Multilingual Neural Machine Translation with Language Clustering | Xu Tan et.al. | 1908.09324 | null |
| 2019-07-16 | Simple Automatic Post-editing for Arabic-Japanese Machine Translation | Ella Noll et.al. | 1907.06210 | null |
| 2019-07-12 | Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges | Naveen Arivazhagan et.al. | 1907.05019 | null |
| 2019-07-10 | An Intrinsic Nearest Neighbor Analysis of Neural Machine Translation Architectures | Hamidreza Ghader et.al. | 1907.03885 | null |
| 2019-07-09 | Exploiting Out-of-Domain Parallel Data through Multilingual Transfer Learning for Low-Resource Neural Machine Translation | Aizhan Imankulova et.al. | 1907.03060 | null |
| 2019-07-08 | Interactive-Predictive Neural Machine Translation through Reinforcement and Imitation | Tsz Kin Lam et.al. | 1907.02326 | null |
| 2019-07-03 | Improving Robustness in Real-World Neural Machine Translation Engines | Rohit Gupta et.al. | 1907.01279 | null |
| 2019-06-19 | Generalizing Back-Translation in Neural Machine Translation | Miguel Graça et.al. | 1906.07286 | null |
| 2019-06-10 | Word-based Domain Adaptation for Neural Machine Translation | Shen Yan et.al. | 1906.03129 | null |
| 2019-06-06 | Effective Cross-lingual Transfer of Neural Machine Translation Models without Shared Vocabularies | Yunsu Kim et.al. | 1905.05475 | null |
| 2019-03-28 | Using Monolingual Data in Neural Machine Translation: a Systematic Study | Franck Burlot et.al. | 1903.11437 | null |
| 2019-07-03 | Massively Multilingual Neural Machine Translation | Roee Aharoni et.al. | 1903.00089 | null |
| 2018-12-04 | The RGNLP Machine Translation Systems for WAT 2018 | Atul Kr. Ojha et.al. | 1812.00798 | null |
| 2018-11-06 | Improving Zero-Shot Translation of Low-Resource Languages | Surafel M. Lakew et.al. | 1811.01389 | null |
| 2018-11-06 | Transfer Learning in Multilingual Neural Machine Translation with Dynamic Vocabulary | Surafel M. Lakew et.al. | 1811.01137 | null |
| 2018-11-06 | Neural Machine Translation into Language Varieties | Surafel M. Lakew et.al. | 1811.01064 | null |
| 2018-09-14 | Zero-Shot Cross-lingual Classification Using Multilingual Neural Machine Translation | Akiko Eriguchi et.al. | 1809.04686 | null |
| 2018-09-11 | Towards one-shot learning for rare-word translation with external experts | Ngoc-Quan Pham et.al. | 1809.03182 | null |
| 2020-07-09 | Trivial Transfer Learning for Low-Resource Neural Machine Translation | Tom Kocmi et.al. | 1809.00357 | null |
| 2018-09-14 | Parameter Sharing Methods for Multilingual Self-Attentional Translation Models | Devendra Singh Sachan et.al. | 1809.00252 | null |
| 2018-09-05 | Denoising Neural Machine Translation Training with Trusted Data and Online Data Selection | Wei Wang et.al. | 1809.00068 | null |
| 2018-06-22 | A Comparison of Transformer and Recurrent Neural Networks on Multilingual Neural Machine Translation | Surafel M. Lakew et.al. | 1806.06957 | null |
| 2018-06-14 | Generative Neural Machine Translation | Harshil Shah et.al. | 1806.05138 | null |
| 2018-06-11 | Multilingual Neural Machine Translation with Task-Specific Attention | Graeme Blackwood et.al. | 1806.03280 | null |
| 2018-06-11 | Multi-Source Neural Machine Translation with Missing Data | Yuta Nishimura et.al. | 1806.02525 | null |
| 2020-09-14 | On the Impact of Various Types of Noise on Neural Machine Translation | Huda Khayrallah et.al. | 1805.12282 | null |
| 2018-05-31 | Bi-Directional Neural Machine Translation with Synthetic Parallel Data | Xing Niu et.al. | 1805.11213 | null |
| 2018-05-14 | Bootstrapping Multilingual Intent Models via Machine Translation for Dialog Automation | Nicholas Ruiz et.al. | 1805.04453 | null |
| 2018-05-14 | Deep Neural Machine Translation with Weakly-Recurrent Units | Mattia Antonino Di Gangi et.al. | 1805.04185 | null |
| 2018-05-08 | Multi-Domain Neural Machine Translation | Sander Tars et.al. | 1805.02282 | null |
| 2021-09-15 | Exploring Hyper-Parameter Optimization for Neural Machine Translation on GPU Architectures | Robert Lim et.al. | 1805.02094 | null |
| 2018-10-17 | A neural interlingua for multilingual machine translation | Yichao Lu et.al. | 1804.08198 | null |
| 2021-05-20 | Massively Parallel Cross-Lingual Learning in Low-Resource Target Language Translation | Zhong Zhou et.al. | 1804.07878 | null |
| 2018-02-13 | Quantitative Fine-Grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian | Filip Klubička et.al. | 1802.01451 | null |
| 2018-09-19 | A User-Study on Online Adaptation of Neural Machine Translation to Human Post-Edits | Sariya Karimova et.al. | 1712.04853 | null |
| 2017-10-06 | Machine Translation Evaluation with Neural Networks | Francisco Guzmán et.al. | 1710.02095 | null |
| 2017-08-22 | Neural Machine Translation with Extended Context | Jörg Tiedemann et.al. | 1708.05943 | null |
| 2017-08-22 | The Helsinki Neural Machine Translation System | Robert Östling et.al. | 1708.05942 | null |
| 2017-08-04 | Exploiting Linguistic Resources for Neural Machine Translation Using Multi-task Learning | Jan Niehues et.al. | 1708.00993 | null |
| 2017-08-01 | Linguistically Motivated Vocabulary Reduction for Neural Machine Translation from Turkish to English | Duygu Ataman et.al. | 1707.09879 | null |
| 2017-06-30 | Stronger Baselines for Trustable Results in Neural Machine Translation | Michael Denkowski et.al. | 1706.09733 | null |
| 2017-06-20 | An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation | Makoto Morishita et.al. | 1706.05765 | null |
| 2017-06-14 | Six Challenges for Neural Machine Translation | Philipp Koehn et.al. | 1706.03872 | null |
| 2018-12-19 | Beam Search Strategies for Neural Machine Translation | Markus Freitag et.al. | 1702.01806 | null |
| 2017-07-19 | Predicting Target Language CCG Supertags Improves Neural Machine Translation | Maria Nadejde et.al. | 1702.01147 | null |
| 2017-08-23 | Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation | Melvin Johnson et.al. | 1611.04558 | null |
| 2016-10-21 | Lexicons and Minimum Risk Training for Neural Machine Translation: NAIST-CMU at WAT2016 | Graham Neubig et.al. | 1610.06542 | null |
| 2016-01-07 | Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism | Orhan Firat et.al. | 1601.01073 | null |
| 2016-06-06 | Improving Neural Machine Translation Models with Monolingual Data | Rico Sennrich et.al. | 1511.06709 | null |
| 2015-09-30 | Neural-based machine translation for medical text domain. Based on European Medicines Agency leaflet texts | Krzysztof Wołk et.al. | 1509.08644 | null |
| 2014-09-26 | Semantically-Informed Syntactic Machine Translation: A Tree-Grafting Approach | Kathryn Baker et.al. | 1409.7085 | null |
| 2014-10-08 | On the Properties of Neural Machine Translation: Encoder-Decoder Approaches | Kyunghyun Cho et.al. | 1409.1259 | null |
| 2014-10-08 | Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation | Jean Pouget-Abadie et.al. | 1409.1257 | null |
📊 3333 papers
| 📅 Publish Date | 📝 Title | 👥 Authors | 💻 Code | |
|---|---|---|---|---|
| 2026-04-01 | Universal YOCO for Efficient Depth Scaling | Yutao Sun et.al. | 2604.01220 | null |
| 2026-04-01 | LLM REgression with a Latent Iterative State Head | Yiheng Su et.al. | 2604.01206 | null |
| 2026-04-01 | AdaLoRA-QAT: Adaptive Low-Rank and Quantization-Aware Segmentation | Prantik Deb et.al. | 2604.01167 | null |
| 2026-04-01 | Lightweight Prompt-Guided CLIP Adaptation for Monocular Depth Estimation | Reyhaneh Ahani Manghotay et.al. | 2604.01118 | null |
| 2026-04-01 | A Hierarchical Importance-Guided Multi-objective Evolutionary Framework for Deep Neural Network Pruning | Zak Khan et.al. | 2604.01076 | null |
| 2026-04-01 | ONE-SHOT: Compositional Human-Environment Video Synthesis via Spatial-Decoupled Motion Injection and Hybrid Context Integration | Fengyuan Yang et.al. | 2604.01043 | null |
| 2026-04-01 | Integer-State Dynamics of Quantized Spiking Neural Networks for Efficient Hardware Acceleration | Lei Zhang et.al. | 2604.01042 | null |
| 2026-04-01 | Fast and Accurate Probing of In-Training LLMs’ Downstream Performances | Zhichen Liu et.al. | 2604.01025 | null |
| 2026-04-01 | Parameter-Efficient Fine-Tuning of Machine-Learning Interatomic Potentials for Phonon and Thermal Properties | Jonas Grandel et.al. | 2604.01017 | null |
| 2026-04-01 | Toral Chern-Simons TQFT via Geometric Quantization in Real Polarization | Daniel Galviz et.al. | 2604.01016 | null |
| 2026-04-01 | PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding | Nan Wang et.al. | 2604.00886 | null |
| 2026-04-01 | LinguDistill: Recovering Linguistic Ability in Vision- Language Models via Selective Cross-Modal Distillation | Patrick Amadeus Irawan et.al. | 2604.00829 | null |
| 2026-04-01 | Video Patch Pruning: Efficient Video Instance Segmentation via Early Token Reduction | Patrick Glandorf et.al. | 2604.00827 | null |
| 2026-04-01 | Compact Keyframe-Optimized Multi-Agent Gaussian Splatting SLAM | Monica M. Q. Li et.al. | 2604.00804 | null |
| 2026-04-01 | From Baselines to Preferences: A Comparative Study of LoRA/QLoRA and Preference Optimization for Mental Health Text Classification | Mihael Arcan et.al. | 2604.00773 | null |
| 2026-04-01 | IWP: Token Pruning as Implicit Weight Pruning in Large Vision Language Models | Dong-Jae Lee et.al. | 2604.00757 | null |
| 2026-04-01 | Andreev-enhanced conductance quantization and gate-tunable induced superconducting gap in germanium | Elyjah Kiyooka et.al. | 2604.00755 | null |
| 2026-04-01 | Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and Stiefel QR Retraction | Björn Roman Kohlberger et.al. | 2604.00733 | null |
| 2026-04-01 | A Survey of On-Policy Distillation for Large Language Models | Mingyang Song et.al. | 2604.00626 | null |
| 2026-04-01 | A Physical Imitation Learning Pipeline for Energy-Efficient Quadruped Locomotion Assisted by Parallel Elastic Joint | Huyue Ma et.al. | 2604.00611 | null |
| 2026-04-01 | TALENT: Target-aware Efficient Tuning for Referring Image Segmentation | Shuo Jin et.al. | 2604.00609 | null |
| 2026-04-01 | More Human, More Efficient: Aligning Annotations with Quantized SLMs | Jiayu Wang et.al. | 2604.00586 | null |
| 2026-04-01 | Learning from Many and Adapting to the Unknown in Open-set Test Streams | Xiao Zhang et.al. | 2604.00533 | null |
| 2026-04-01 | Formal Deformation quantization as a Fréchet algebra | Qin Li et.al. | 2604.00532 | null |
| 2026-04-01 | MF-QAT: Multi-Format Quantization-Aware Training for Elastic Inference | Zifei Xu et.al. | 2604.00529 | null |
| 2026-04-01 | Adaptive Parallel Monte Carlo Tree Search for Efficient Test-time Compute Scaling | Hongbeen Kim et.al. | 2604.00510 | null |
| 2026-04-01 | VADMamba++: Efficient Video Anomaly Detection via Hybrid Modeling in Grayscale Space | Jihao Lyu et.al. | 2604.00360 | null |
| 2026-03-31 | UCell: rethinking generalizability and scaling of bio-medical vision models | Nicholas Kuang et.al. | 2604.00243 | null |
| 2026-03-31 | The Kormendy Relation in the First Billion Years: Evidence from $JWST$ | Anshuman Borgohain et.al. | 2604.00104 | null |
| 2026-03-31 | Meteorology-Driven GPT4AP: A Multi-Task Forecasting LLM for Atmospheric Air Pollution in Data-Scarce Settings | Prasanjit Dey et.al. | 2603.29974 | null |
| 2026-03-31 | Curvature-Guided LoRA: Steering in the pretrained NTK subspace | Frédéric Zheng et.al. | 2603.29824 | null |
| 2026-03-31 | Compiling Code LLMs into Lightweight Executables | Jieke Shi et.al. | 2603.29813 | null |
| 2026-03-31 | Big2Small: A Unifying Neural Network Framework for Model Compression | Jing-Xiao Liao et.al. | 2603.29768 | null |
| 2026-03-31 | One-for-All: A Lightweight Stabilized and Parameter-Efficient Pre-trained LLM for Time Series Forecasting | Prasanjit Dey et.al. | 2603.29756 | null |
| 2026-03-31 | Client-Verifiable and Efficient Federated Unlearning in Low-Altitude Wireless Networks | Yuhua Xu et.al. | 2603.29688 | null |
| 2026-03-31 | Quantization with Unified Adaptive Distillation to enable multi-LoRA based one-for-all Generative Vision Models on edge | Sowmya Vajrala et.al. | 2603.29535 | null |
| 2026-03-31 | Distilling Human-Aligned Privacy Sensitivity Assessment from Large Language Models | Gabriel Loiseau et.al. | 2603.29497 | null |
| 2026-03-31 | SeGPruner: Semantic-Geometric Visual Token Pruner for 3D Question Answering | Wenli Li et.al. | 2603.29437 | null |
| 2026-03-31 | AP-DRL: A Synergistic Algorithm-Hardware Framework for Automatic Task Partitioning of Deep Reinforcement Learning on Versal ACAP | Enlai Li et.al. | 2603.29369 | null |
| 2026-03-31 | Long-Document QA with Chain-of-Structured-Thought and Fine-Tuned SLMs | Zhuowen Liang et.al. | 2603.29232 | null |
| 2026-03-31 | Dual-Imbalance Continual Learning for Real-World Food Recognition | Xiaoyan Zhang et.al. | 2603.29133 | null |
| 2026-03-31 | A Multi-Sensor Fusion Parking Barrier System with Lightweight Vision on Edge | Yuwen Zhu et.al. | 2603.29126 | null |
| 2026-03-30 | PolarQuant: Optimal Gaussian Weight Quantization via Hadamard Rotation for LLM Compression | Caio Vicentino et.al. | 2603.29078 | null |
| 2026-03-30 | A Unified Algebraic Framework for Subspace Pruning in Koopman Operator Approximation via Principal Vectors | Dhruv Shah et.al. | 2603.29001 | null |
| 2026-03-30 | Zero-shot Cross-domain Knowledge Distillation: A Case study on YouTube Music | Srivaths Ranganathan et.al. | 2603.28994 | null |
| 2026-03-30 | Linear Regression from 1-bit Quantized Data | Daniel Hill et.al. | 2603.28989 | null |
| 2026-03-30 | Privacy Guard & Token Parsimony by Prompt and Context Handling and LLM Routing | Alessio Langiu et.al. | 2603.28972 | null |
| 2026-03-30 | OneComp: One-Line Revolution for Generative AI Model Compression | Yuma Ichikawa et.al. | 2603.28845 | null |
| 2026-03-30 | DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing | Kailai Feng et.al. | 2603.28713 | null |
| 2026-03-30 | Trust-Aware Routing for Distributed Generative AI Inference at the Edge | Chanh Nguyen et.al. | 2603.28622 | null |
| 2026-03-30 | Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aerial Systems | Iman Sharifi et.al. | 2603.28561 | null |
| 2026-03-30 | Compressing Transformer Language Models via Matrix Product Operator Decomposition: A Case Study on PicoGPT | Younes Javanmard et.al. | 2603.28534 | link |
| 2026-03-30 | HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention | Yufei Xu et.al. | 2603.28458 | null |
| 2026-03-31 | LG-HCC: Local Geometry-Aware Hierarchical Context Compression for 3D Gaussian Splatting | Xuan Deng et.al. | 2603.28431 | null |
| 2026-03-30 | IsoQuant: Hardware-Aligned SO(4) Isoclinic Rotations for LLM KV Cache Compression | Zhongping Ji et.al. | 2603.28430 | null |
| 2026-03-31 | Resource-efficient quantum approximate optimization algorithm via Bayesian optimization and maximum-probability evaluation | Siran Zhang et.al. | 2603.28413 | null |
| 2026-03-30 | EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation | Sravanth Kodavanti et.al. | 2603.28405 | null |
| 2026-03-30 | DinoDental: Benchmarking DINOv3 as a Unified Vision Encoder for Dental Image Analysis | Kun Tang et.al. | 2603.28297 | null |
| 2026-03-30 | Cost-Matching Model Predictive Control for Efficient Reinforcement Learning in Humanoid Locomotion | Wenqi Cai et.al. | 2603.28243 | null |
| 2026-03-30 | TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation | Minh-Khoi Do et.al. | 2603.28233 | null |
| 2026-03-30 | Spinning Particles around Einstein-Geometric Proca AdS Compact Objects | Gulzoda Rakhimova et.al. | 2603.28181 | null |
| 2026-03-30 | CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning | Siyuan Ma et.al. | 2603.28135 | null |
| 2026-03-30 | Q-DIVER: Integrated Quantum Transfer Learning and Differentiable Quantum Architecture Search with EEG Data | Junghoon Justin Park et.al. | 2603.28122 | null |
| 2026-03-30 | DELTA: A DAG-aware Efficient OCS Logical Topology Optimization Framework for AIDCs | Niangen Ye et.al. | 2603.28096 | null |
| 2026-03-30 | Octree-based Learned Point Cloud Geometry Compression: A Lossy Perspective | Kaiyu Zheng et.al. | 2603.28095 | null |
| 2026-03-30 | Reducing Oracle Feedback with Vision-Language Embeddings for Preference-Based RL | Udita Ghosh et.al. | 2603.28053 | null |
| 2026-03-30 | Adapting SAM to Nuclei Instance Segmentation and Classification via Cooperative Fine-Grained Refinement | Jingze Su et.al. | 2603.28027 | null |
| 2026-03-30 | ExFusion: Efficient Transformer Training via Multi-Experts Fusion | Jiacheng Ruan et.al. | 2603.27965 | null |
| 2026-03-30 | ITQ3_S: High-Fidelity 3-bit LLM Inference via Interleaved Ternary Quantization with Rotation-Domain Smoothing | Edward J. Yoon et.al. | 2603.27914 | null |
| 2026-03-29 | Rényi Entropy: A New Token Pruning Metric for Vision Transformers | Wei-Yuan Su et.al. | 2603.27900 | null |
| 2026-03-29 | Energy Efficient Orchestration in Multiple-Access Vehicular Aerial-Terrestrial 6G Networks | Mohammad Farhoudi et.al. | 2603.27870 | null |
| 2026-03-29 | A Resource-Aligned Hybrid Quantum-Classical Framework for Multimodal Face Anti-Spoofing | Wanqi Sun et.al. | 2603.27852 | null |
| 2026-03-29 | KVSculpt: KV Cache Compression as Distillation | Bo Jiang et.al. | 2603.27819 | null |
| 2026-03-29 | Synergizing Discriminative Exemplars and Self-Refined Experience for MLLM-based In-Context Learning in Medical Diagnosis | Wenkai Zhao et.al. | 2603.27737 | null |
| 2026-03-29 | Low-Rank Adaptation Reduces Catastrophic Forgetting in Sequential Transformer Encoder Fine-Tuning: Controlled Empirical Evidence and Frozen-Backbone Representation Probes | Ashish Pandey et.al. | 2603.27707 | null |
| 2026-03-29 | Customized Visual Storytelling with Unified Multimodal LLMs | Wei-Hua Li et.al. | 2603.27690 | null |
| 2026-03-29 | CrossHGL: A Text-Free Foundation Model for Cross-Domain Heterogeneous Graph Learning | Xuanze Chen et.al. | 2603.27685 | null |
| 2026-03-29 | Prototype-Aligned Federated Soft-Prompts for Continual Web Personalization | Canran Xiao et.al. | 2603.27678 | null |
| 2026-03-29 | Amped: Adaptive Multi-stage Non-edge Pruning for Edge Detection | Yuhan Gao et.al. | 2603.27661 | null |
| 2026-03-29 | V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video Large Language Models | Xinying Lin et.al. | 2603.27650 | null |
| 2026-03-29 | OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation | Sanghyeon Lee et.al. | 2603.27637 | null |
| 2026-03-29 | KV Cache Quantization for Self-Forcing Video Generation: A 33-Method Empirical Study | Suraj Ranganath et.al. | 2603.27469 | null |
| 2026-03-29 | TurboAngle: Near-Lossless KV Cache Compression via Uniform Angle Quantization | Dipkumar Patel et.al. | 2603.27467 | null |
| 2026-03-29 | RSR-core: A High-Performance Engine for Low-Bit Matrix-Vector Multiplication | Mohsen Dehghankar et.al. | 2603.27462 | link |
| 2026-03-28 | Decompose, Mix, Adapt: A Unified Framework for Parameter-Efficient Neural Network Recombination and Compression | Nazia Tasnim et.al. | 2603.27383 | null |
| 2026-03-28 | TokenDance: Token-to-Token Music-to-Dance Generation with Bidirectional Mamba | Ziyue Yang et.al. | 2603.27314 | null |
| 2026-03-28 | HiFlow: Tokenization-Free Scale-Wise Autoregressive Policy Learning via Flow Matching | Daichi Yashima et.al. | 2603.27281 | null |
| 2026-03-28 | From Foundation ECG Models to NISQ Learners: Distilling ECGFounder into a VQC Student | Giovanni dos Santos Franco et.al. | 2603.27269 | null |
| 2026-03-27 | PQuantML: A Tool for End-to-End Hardware-aware Model Compression | Roope Niemi et.al. | 2603.26595 | null |
| 2026-03-27 | When Perplexity Lies: Generation-Focused Distillation of Hybrid Sequence Models | Juan Gabriel Kostelec et.al. | 2603.26556 | null |
| 2026-03-27 | Learnable Quantum Efficiency Filters for Urban Hyperspectral Segmentation | Imad Ali Shah et.al. | 2603.26528 | null |
| 2026-03-27 | SPECTRA: An Efficient Spectral-Informed Neural Network for Sensor-Based Activity Recognition | Deepika Gurung et.al. | 2603.26482 | null |
| 2026-03-27 | Domain decomposition of large neural network surrogate models | Timm Gödde et.al. | 2603.26396 | null |
| 2026-03-27 | From Pen to Pixel: Translating Hand-Drawn Plots into Graphical APIs via a Novel Benchmark and Efficient Adapter | Zhenghao Xu et.al. | 2603.26356 | null |
| 2026-03-27 | From Pixels to Privacy: Temporally Consistent Video Anonymization via Token Pruning for Privacy Preserving Action Recognition | Nazia Aslam et.al. | 2603.26336 | null |
| 2026-03-27 | Mitigating the Reasoning Tax in Vision-Language Fine-Tuning with Input-Adaptive Depth Aggregation | Yiming Ren et.al. | 2603.26330 | null |
| 2026-03-27 | Query-Specific Pruning of RML Mappings (Extended Version) | Sitt Min Oo et.al. | 2603.26269 | null |
| 2026-03-27 | ARTA: Adaptive Mixed-Resolution Token Allocation for Efficient Dense Feature Extraction | David Hagerman et.al. | 2603.26258 | null |
| 2026-03-27 | Real-Time Branch-to-Tool Distance Estimation for Autonomous UAV Pruning: Benchmarking Five DEFOM-Stereo Variants from Simulation to Jetson Deployment | Yida Lin et.al. | 2603.26250 | null |
| 2026-03-27 | Knowledge Distillation for Efficient Transformer-Based Reinforcement Learning in Hardware-Constrained Energy Management Systems | Pascal Henrich et.al. | 2603.26249 | null |
| 2026-03-27 | EPDQ: Efficient and Privacy-Preserving Exact Distance Query on Encrypted Graphs | Xuemei Fu et.al. | 2603.26219 | null |
| 2026-03-27 | 4DRaL: Bridging 4D Radar with LiDAR for Place Recognition using Knowledge Distillation | Ningyuan Huang et.al. | 2603.26206 | null |
| 2026-03-27 | Efficient Few-Shot Learning for Edge AI via Knowledge Distillation on MobileViT | Shuhei Tsuyuki et.al. | 2603.26145 | null |
| 2026-03-27 | PruneFuse: Efficient Data Selection via Weight Pruning and Network Fusion | Humaira Kousar et.al. | 2603.26138 | null |
| 2026-03-27 | InstaVSR: Taming Diffusion for Efficient and Temporally Consistent Video Super-Resolution | Jintong Hu et.al. | 2603.26134 | null |
| 2026-03-27 | TurboESM: Ultra-Efficient 3-Bit KV Cache Quantization for Protein Language Models with Orthogonal Rotation and QJL Correction | Yue Hu et.al. | 2603.26110 | null |
| 2026-03-27 | Learnable Instance Attention Filtering for Adaptive Detector Distillation | Chen Liu et.al. | 2603.26088 | null |
| 2026-03-27 | Rethinking Token Pruning for Historical Screenshots in GUI Visual Agents: Semantic, Spatial, and Temporal Perspectives | Daiqiang Li et.al. | 2603.26041 | null |
| 2026-03-27 | Learning to Trim: End-to-End Causal Graph Pruning with Dynamic Anatomical Feature Banks for Medical VQA | Zibo Xu et.al. | 2603.26028 | null |
| 2026-03-27 | VeRA+: Vector-Based Lightweight Digital Compensation for Drift-Resilient RRAM In-Memory Computing | Weirong Dong et.al. | 2603.26016 | null |
| 2026-03-27 | FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants | Mahesh Bhosale et.al. | 2603.26008 | null |
| 2026-03-26 | Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio | Yijiong Yu et.al. | 2603.25926 | null |
| 2026-03-26 | GazeQwen: Lightweight Gaze-Conditioned LLM Modulation for Streaming Video Understanding | Trong Thang Pham et.al. | 2603.25841 | null |
| 2026-03-26 | ETA-VLA: Efficient Token Adaptation via Temporal Fusion and Intra-LLM Sparsification for Vision-Language-Action Models | Yiru Wang et.al. | 2603.25766 | null |
| 2026-03-26 | Transverse force tomography inside a proton from Basis Light-front Quantization | Ziqi Zhang et.al. | 2603.25548 | null |
| 2026-03-26 | Investigating the Fundamental Limit: A Feasibility Study of Hybrid-Neural Archival | Marcus Armstrong et.al. | 2603.25526 | null |
| 2026-03-27 | CLIP-RD: Relational Distillation for Efficient CLIP Knowledge Distillation | Jeannie Chung et.al. | 2603.25383 | null |
| 2026-03-26 | Optimizing Entanglement Distribution Protocols: Maximizing Classical Information in Quantum Networks | Ethan Sanchez Hidalgo et.al. | 2603.25360 | null |
| 2026-03-26 | How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models | Hector Borobia et.al. | 2603.25325 | null |
| 2026-03-26 | Towards Controllable Low-Light Image Enhancement: A Continuous Multi-illumination Dataset and Efficient State Space Framework | Hongru Han et.al. | 2603.25296 | null |
| 2026-03-26 | Non-Minimally Coupled Scalar Field, Area Quantization and Black Hole Entropy | Sahil Devdutt et.al. | 2603.25292 | null |
| 2026-03-26 | SliderQuant: Accurate Post-Training Quantization for LLMs | Shigeng Wang et.al. | 2603.25284 | null |
| 2026-03-26 | Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model | Jiahao Wu et.al. | 2603.25184 | null |
| 2026-03-26 | SIGMA: Structure-Invariant Generative Molecular Alignment for Chemical Language Models via Autoregressive Contrastive Learning | Xinyu Wang et.al. | 2603.25062 | null |
| 2026-03-26 | Mechanistically Interpreting Compression in Vision-Language Models | Veeraraju Elluru et.al. | 2603.25035 | null |
| 2026-03-26 | A Public Theory of Distillation Resistance via Constraint-Coupled Reasoning Architectures | Peng Wei et.al. | 2603.25022 | null |
| 2026-03-26 | Topological Quantization of Complex Velocity in Stochastic Spacetimes | Jorge Meza-Domíguez et.al. | 2603.25016 | null |
| 2026-03-26 | LiteGuard: Efficient Task-Agnostic Model Fingerprinting with Enhanced Generalization | Guang Yang et.al. | 2603.24982 | null |
| 2026-03-26 | Toward domain-specific machine translation and quality estimation systems | Javad Pourmostafa Roshan Sharami et.al. | 2603.24955 | null |
| 2026-03-26 | Surrogates, Spikes, and Sparsity: Performance Analysis and Characterization of SNN Hyperparameters on Hardware | Ilkin Aliyev et.al. | 2603.24891 | null |
| 2026-03-25 | Prune as You Generate: Online Rollout Pruning for Faster and Better RLVR | Haobo Xu et.al. | 2603.24840 | null |
| 2026-03-25 | Coefficient-Decoupled Matrix Product Operators as an Interface to Linear-Combination-of-Unitaries Circuits | Younes Javanmard et.al. | 2603.24822 | null |
| 2026-03-25 | Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration | Danil Tokhchukov et.al. | 2603.24800 | null |
| 2026-03-25 | Quantization of Beta Functions in Self-Dual Backgrounds and Emergent Non-Commutative EFT | Mithat Ünsal et.al. | 2603.24799 | null |
| 2026-03-25 | Rafture: Erasure-coded Raft with Post-Dissemination Pruning | Rithwik Kerur et.al. | 2603.24761 | null |
| 2026-03-25 | Bound states of anyons: a geometric quantization approach | Qingchen Li et.al. | 2603.24701 | null |
| 2026-03-25 | ReDiPrune: Relevance-Diversity Pre-Projection Token Pruning for Efficient Multimodal LLMs | An Yu et.al. | 2603.24680 | null |
| 2026-03-25 | Demystifying When Pruning Works via Representation Hierarchies | Shwai He et.al. | 2603.24652 | null |
| 2026-03-25 | From friction scaling to an efficient method for estimating bubble wall velocity | Tomasz Krajewski et.al. | 2603.24583 | null |
| 2026-03-25 | Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving | Linbo Wang et.al. | 2603.24581 | null |
| 2026-03-25 | TuneShift-KD: Knowledge Distillation and Transfer for Fine-tuned Models | Yushi Guan et.al. | 2603.24518 | null |
| 2026-03-25 | JSSAnet: Theory-Guided Subchannel Partitioning and Joint Spatial Attention for Near-Field Channel Estimation | Zhiming Zhu et.al. | 2603.24505 | null |
| 2026-03-25 | Marchuk: Efficient Global Weather Forecasting from Mid-Range to Sub-Seasonal Scales via Flow Matching | Arsen Kuzhamuratov et.al. | 2603.24428 | null |
| 2026-03-25 | PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks | Cheng Cui et.al. | 2603.24373 | null |
| 2026-03-25 | LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control | Yifeng Zhang et.al. | 2603.24361 | null |
| 2026-03-25 | Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing | Cheng Cui et.al. | 2603.24326 | null |
| 2026-03-25 | Powerful Teachers Matter: Text-Guided Multi-view Knowledge Distillation with Visual Prior Enhancement | Xin Zhang et.al. | 2603.24208 | null |
| 2026-03-25 | Linear-Nonlinear Fusion Neural Operator for Partial Differential Equations | Heng Wu et.al. | 2603.24143 | null |
| 2026-03-25 | MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare | Shubham Kumar Nigam et.al. | 2603.24132 | null |
| 2026-03-25 | UW-VOS: A Large-Scale Dataset for Underwater Video Object Segmentation | Hongshen Zhao et.al. | 2603.24006 | null |
| 2026-03-25 | Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score | Jimyung Hong et.al. | 2603.23985 | null |
| 2026-03-25 | Towards Energy-aware Requirements Dependency Classification: Knowledge-Graph vs. Vector-Retrieval Augmented Inference with SLMs | Shreyas Patil et.al. | 2603.23954 | null |
| 2026-03-25 | Attention-aware Inference Optimizations for Large Vision-Language Models with Memory-efficient Decoding | Fatih Ilhan et.al. | 2603.23914 | null |
| 2026-03-25 | PowerFlow-DNN: Compiler-Directed Fine-Grained Power Orchestration for End-to-End Edge AI Inference | Paul Chen et.al. | 2603.23882 | null |
| 2026-03-25 | Self-Evolving Multi-Agent Framework for Efficient Decision Making in Real-Time Strategy Scenarios | Li Ma et.al. | 2603.23875 | null |
| 2026-03-25 | How Vulnerable Are Edge LLMs? | Ao Ding et.al. | 2603.23822 | null |
| 2026-03-24 | An Adapter-free Fine-tuning Approach for Tuning 3D Foundation Models | Sneha Paul et.al. | 2603.23730 | null |
| 2026-03-24 | Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language Models | Mohammad Saleh Vahdatpour et.al. | 2603.23668 | null |
| 2026-03-24 | QuickQudits: A Framework for Efficient Simulation of Noisy Qudit Clifford Circuits via an Extended Stabilizer Tableau Formalism | Nina Brandl et.al. | 2603.23641 | null |
| 2026-03-24 | APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs | Meriem Bouzouad et.al. | 2603.23575 | null |
| 2026-03-24 | Deformation quantization for systems with second-class constraints in deformed fermionic phase space | Bing-Sheng Lin et.al. | 2603.23411 | null |
| 2026-03-24 | GeoSANE: Learning Geospatial Representations from Models, Not Data | Joelle Hanna et.al. | 2603.23408 | null |
| 2026-03-24 | Harnessing Lightweight Transformer with Contextual Synergic Enhancement for Efficient 3D Medical Image Segmentation | Xinyu Liu et.al. | 2603.23390 | null |
| 2026-03-24 | Pruning for efficient deterministic global optimization over trained ReLU neural networks | Giacomo Lastrucci et.al. | 2603.23299 | null |
| 2026-03-24 | Block Coordinate Descent for Dynamic Portfolio Optimization on Finite-Precision Coherent Ising Machines | Keming He et.al. | 2603.23200 | null |
| 2026-03-24 | LiZIP: An Auto-Regressive Compression Framework for LiDAR Point Clouds | Aditya Shibu et.al. | 2603.23162 | null |
| 2026-03-24 | Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair | Aditya Kakade et.al. | 2603.23129 | null |
| 2026-03-24 | High-Resolution Tensor-Network Fourier Methods for Exponentially Compressed Non-Gaussian Aggregate Distributions | Juan José Rodríguez-Aldavero et.al. | 2603.23106 | null |
| 2026-03-24 | Good for the Planet, Bad for Me? Intended and Unintended Consequences of AI Energy Consumption Disclosure | Michael Klesel et.al. | 2603.23075 | null |
| 2026-03-24 | VLA-IAP: Training-Free Visual Token Pruning via Interaction Alignment for Vision-Language-Action Models | Jintao Cheng et.al. | 2603.22991 | null |
| 2026-03-24 | Markov-Enforced Discrete Diffusion Model for Digital Semantic Symbol Error Correction | Yoon Huh et.al. | 2603.22983 | null |
| 2026-03-24 | PersonalQ: Select, Quantize, and Serve Personalized Diffusion Models for Efficient Inference | Qirui Wang et.al. | 2603.22943 | null |
| 2026-03-24 | Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning | Anshul Solanki et.al. | 2603.22942 | null |
| 2026-03-24 | ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling | Shaobo Ju et.al. | 2603.22911 | null |
| 2026-03-24 | Balancing Safety and Efficiency in Aircraft Health Diagnosis: A Task Decomposition Framework with Heterogeneous Long-Micro Scale Cascading and Knowledge Distillation-based Interpretability | Xinhang Chen et.al. | 2603.22885 | null |
| 2026-03-24 | TRINE: A Token-Aware, Runtime-Adaptive FPGA Inference Engine for Multimodal AI | Hyunwoo Oh et.al. | 2603.22867 | null |
| 2026-03-24 | Aerial Agentic AI: Synergizing LLM and SLM for Low-Altitude Wireless Networks | Li Dong et.al. | 2603.22866 | null |
| 2026-03-25 | Two-dimensional bound excitons in the real space and Landau quantization space: a comparative study | Kunxiang Li et.al. | 2603.22715 | null |
| 2026-03-23 | Communication-Efficient Approximate Gradient Coding | Sifat Munim et.al. | 2603.22514 | null |
| 2026-03-23 | A Theoretical Framework for Energy-Aware Gradient Pruning in Federated Learning | Emmanouil M. Athanasakos et.al. | 2603.22465 | null |
| 2026-03-23 | A Brief Comparison of Training-Free Multi-Vector Sequence Compression Methods | Rohan Jha et.al. | 2603.22434 | null |
| 2026-03-23 | An Exact Conjugation Identity for the Many-Body Wilson-Loop Beyond Quantization | Kai Watanabe et.al. | 2603.22217 | null |
| 2026-03-23 | Mixture of Mini Experts: Overcoming the Linear Layer Bottleneck in Multiple Instance Learning | Daniel Shao et.al. | 2603.22198 | null |
| 2026-03-23 | Dual-Space Knowledge Distillation with Key-Query Matching for Large Language Models with Vocabulary Mismatch | Stella Eva Tsiapali et.al. | 2603.22056 | null |
| 2026-03-23 | SegMaFormer: A Hybrid State-Space and Transformer Model for Efficient Segmentation | Duy D. Nguyen et.al. | 2603.22002 | null |
| 2026-03-23 | Parameter-Efficient Fine-Tuning for Medical Text Summarization: A Comparative Study of Lora, Prompt Tuning, and Full Fine-Tuning | Ulugbek Shernazarov et.al. | 2603.21970 | null |
| 2026-03-23 | Suiren-1.0 Technical Report: A Family of Molecular Foundation Models | Junyi An et.al. | 2603.21942 | null |
| 2026-03-23 | Camera-Agnostic Pruning of 3D Gaussian Splats via Descriptor-Based Beta Evidence | Peter Fasogbon et.al. | 2603.21933 | null |
| 2026-03-23 | The Golden Subspace: Where Efficiency Meets Generalization in Continual Test-Time Adaptation | Guannan Lai et.al. | 2603.21928 | null |
| 2026-03-23 | olLOSC: Unified and efficient density functional approximation to correct delocalization error in molecules and periodic materials | Yichen Fan et.al. | 2603.21906 | null |
| 2026-03-23 | SmaAT-QMix-UNet: A Parameter-Efficient Vector-Quantized UNet for Precipitation Nowcasting | Nikolas Stavrou et.al. | 2603.21879 | null |
| 2026-03-23 | Many-body mobility edges in one dimension revealed by efficient and interpretable feature-based learning with Kolmogorov-Arnold Networks | Siqi Dai et.al. | 2603.21807 | null |
| 2026-03-23 | CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning | Shuo Wang et.al. | 2603.21725 | null |
| 2026-03-23 | Rethinking Token Reduction for Large Vision-Language Models | Yi Wang et.al. | 2603.21701 | null |
| 2026-03-23 | Distilling the knowledge with quantum neural networks | Yuxuan Yan et.al. | 2603.21586 | null |
| 2026-03-23 | Rethinking SAR ATR: A Target-Aware Frequency-Spatial Enhancement Framework with Noise-Resilient Knowledge Guidance | Yansong Lin et.al. | 2603.21565 | null |
| 2026-03-23 | Parameter-efficient Prompt Tuning and Hierarchical Textual Guidance for Few-shot Whole Slide Image Classification | Jayanie Bogahawatte et.al. | 2603.21504 | null |
| 2026-03-22 | KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning | Shuai Wang et.al. | 2603.21440 | null |
| 2026-03-22 | Uncertainty-Aware Knowledge Distillation for Multimodal Large Language Models | Jingchen Sun et.al. | 2603.21426 | null |
| 2026-03-22 | Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs | Mariela M. Nina et.al. | 2603.21418 | null |
| 2026-03-22 | Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models | Jinghan Cao et.al. | 2603.21389 | null |
| 2026-03-22 | FluidWorld: Reaction-Diffusion Dynamics as a Predictive Substrate for World Models | Fabien Polly et.al. | 2603.21315 | null |
| 2026-03-22 | DepthTCM: High Efficient Depth Compression via Physics-aware Transformer-CNN Mixed Architecture | Young-Seo Chang et.al. | 2603.21233 | null |
| 2026-03-22 | QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression | Zhongyang Li et.al. | 2603.21232 | null |
| 2026-03-22 | Emotion-Aware Quantization for Discrete Speech Representations: An Analysis of Emotion Preservation | Haoguang Zhou et.al. | 2603.21224 | null |
| 2026-03-22 | Frequency Switching Mechanism for Parameter-E!cient Multi-Task Learning | Shih-Wen Liu et.al. | 2603.21111 | null |
| 2026-03-22 | A lightweight Outlier Detection for Characterizing Radio- and Environment-Specific Link Quality Fluctuation in Low-Power Wireless Networks | Zegeye Mekasha Kidane et.al. | 2603.21107 | null |
| 2026-03-22 | ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models | Xu Li et.al. | 2603.21105 | null |
| 2026-03-22 | Learning Progressive Adaptation for Multi-Modal Tracking | He Wang et.al. | 2603.21100 | null |
| 2026-03-22 | SkinCLIP-VL: Consistency-Aware Vision-Language Learning for Multimodal Skin Cancer Diagnosis | Zhixiang Lu et.al. | 2603.21010 | null |
| 2026-03-22 | Structural Sensitivity in Compressed Transformers: Error Propagation, Lyapunov Stability, and Formally Verified Bounds | Abhinaba Basu et.al. | 2603.20991 | null |
| 2026-03-22 | Joint Surrogate Learning of Objectives, Constraints, and Sensitivities for Efficient Multi-objective Optimization of Neural Dynamical Systems | Frithjof Gressmann et.al. | 2603.20984 | null |
| 2026-03-21 | SozKZ: Training Efficient Small Language Models for Kazakh from Scratch | Saken Tukenov et.al. | 2603.20854 | null |
| 2026-03-21 | HiCI: Hierarchical Construction-Integration for Long-Context Attention | Xiangyu Zeng et.al. | 2603.20843 | null |
| 2026-03-21 | Lean Learning Beyond Clouds: Efficient Discrepancy-Conditioned Optical-SAR Fusion for Semantic Segmentation | Chenxing Meng et.al. | 2603.20811 | null |
| 2026-03-21 | Less is More in Semantic Space: Intrinsic Decoupling via Clifford-M for Fundus Image Classification | Yifeng Zheng et.al. | 2603.20806 | null |
| 2026-03-21 | VSD-MOT: End-to-End Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Distillation | Jun Du et.al. | 2603.20731 | null |
| 2026-03-21 | Centrality-Based Pruning for Efficient Echo State Networks | Sudip Laudari et.al. | 2603.20684 | null |
| 2026-03-21 | Enhancing Vision-Based Policies with Omni-View and Cross-Modality Knowledge Distillation for Mobile Robots | Kai Li et.al. | 2603.20679 | null |
| 2026-03-20 | Understanding Behavior Cloning with Action Quantization | Haoqun Cao et.al. | 2603.20538 | null |
| 2026-03-20 | AE-LLM: Adaptive Efficiency Optimization for Large Language Models | Kaito Tanaka et.al. | 2603.20492 | null |
| 2026-03-20 | Developing an ESG-Oriented Large Language Model through ESG Practices | Gabriel Assis et.al. | 2603.20480 | null |
| 2026-03-20 | Diffutron: A Masked Diffusion Language Model for Turkish Language | Şuayp Talha Kocabay et.al. | 2603.20466 | null |
| 2026-03-20 | Accurate and efficient simulation-based inference for massive black-hole binaries with LISA | Alice Spadaro et.al. | 2603.20431 | link |
| 2026-03-20 | TinyML Enhances CubeSat Mission Capabilities | Luigi Capogrosso et.al. | 2603.20174 | null |
| 2026-03-20 | An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models | Yuming Feng et.al. | 2603.20100 | null |
| 2026-03-20 | TAPAS: Efficient Two-Server Asymmetric Private Aggregation Beyond Prio(+) | Harish Karthikeyan et.al. | 2603.19949 | null |
| 2026-03-20 | Timestep-Aware Block Masking for Efficient Diffusion Model Inference | Haodong He et.al. | 2603.19939 | null |
| 2026-03-20 | SIMPLER: Efficient Foundation Model Adaptation via Similarity-Guided Layer Pruning for Earth Observation | Víctor Barreiro et.al. | 2603.19873 | null |
| 2026-03-20 | Generalized Task-Driven Design of Soft Robots via Reduced-Order FEM-based Surrogate Modeling | Yao Yao et.al. | 2603.19794 | null |
| 2026-03-20 | Growing Networks with Autonomous Pruning | Charles De Lambilly et.al. | 2603.19759 | null |
| 2026-03-20 | FedPDPO: Federated Personalized Direct Preference Optimization for Large Language Model Alignment | Kewen Zhu et.al. | 2603.19741 | null |
| 2026-03-20 | A two-step sequential approach for hyperparameter selection in finite context models | José Contente et.al. | 2603.19736 | null |
| 2026-03-20 | Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification | Baoding He et.al. | 2603.19715 | null |
| 2026-03-20 | RiboSphere: Learning Unified and Efficient Representations of RNA Structures | Zhou Zhang et.al. | 2603.19636 | null |
| 2026-03-20 | BEAVER: A Training-Free Hierarchical Prompt Compression Method via Structure-Aware Page Selection | Zhengpei Hu et.al. | 2603.19635 | null |
| 2026-03-20 | Dual-Domain Representation Alignment: Bridging 2D and 3D Vision via Geometry-Aware Architecture Search | Haoyu Zhang et.al. | 2603.19563 | null |
| 2026-03-20 | Optimal Scalar Quantization for Matrix Multiplication: Closed-Form Density and Phase Transition | Calvin Ang et.al. | 2603.19559 | null |
| 2026-03-19 | Vision Tiny Recursion Model (ViTRM): Parameter-Efficient Image Classification via Recursive State Refinement | Ange-Clément Akazan et.al. | 2603.19503 | null |
| 2026-03-19 | VeloxNet: Efficient Spatial Gating for Lightweight Embedded Image Classification | Md Meftahul Ferdaus et.al. | 2603.19496 | null |
| 2026-03-19 | F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World | Ziyin Zhang et.al. | 2603.19223 | null |
| 2026-03-19 | Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation | Zhuolin Yang et.al. | 2603.19220 | null |
| 2026-03-19 | DyMoE: Dynamic Expert Orchestration with Mixed-Precision Quantization for Efficient MoE Inference on Edge | Yuegui Huang et.al. | 2603.19172 | null |
| 2026-03-19 | Quasinormal Modes of Extremal Reissner-Nordstrom Black Holes via Seiberg-Witten Quantization | Yi-Rong Wang et.al. | 2603.19168 | null |
| 2026-03-19 | A Pipelined Collaborative Speculative Decoding Framework for Efficient Edge-Cloud LLM Inference | Yida Zhang et.al. | 2603.19133 | null |
| 2026-03-19 | LuMamba: Latent Unified Mamba for Electrode Topology-Invariant and Efficient EEG Modeling | Danaé Broustail et.al. | 2603.19100 | null |
| 2026-03-19 | Towards Verifiable AI with Lightweight Cryptographic Proofs of Inference | Pranay Anchuri et.al. | 2603.19025 | null |
| 2026-03-19 | End-to-End Simulation of Chemical Dynamics on a Quantum Computer | Elliot C. Eklund et.al. | 2603.19007 | null |
| 2026-03-19 | Functional Subspace Watermarking for Large Language Models | Zikang Ding et.al. | 2603.18793 | null |
| 2026-03-19 | 6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models | Rundong Su et.al. | 2603.18742 | null |
| 2026-03-19 | EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation | Longfei Liu et.al. | 2603.18739 | null |
| 2026-03-19 | Multimodal Model for Computational Pathology:Representation Learning and Image Compression | Peihang Wu et.al. | 2603.18660 | null |
| 2026-03-19 | AIMER: Calibration-Free Task-Agnostic MoE Pruning | Zongfang Liu et.al. | 2603.18492 | null |
| 2026-03-19 | Prune-then-Quantize or Quantize-then-Prune? Understanding the Impact of Compression Order in Joint Model Compression | Minjun Kim et.al. | 2603.18426 | null |
| 2026-03-19 | SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning | Minjun Kim et.al. | 2603.18423 | null |
| 2026-03-18 | Energy-Aware Frame Rate Selection for Video Coding | Geetha Ramasubbu et.al. | 2603.18305 | null |
| 2026-03-18 | LRConv-NeRV: Low Rank Convolution for Efficient Neural Video Compression | Tamer Shanableh et.al. | 2603.18261 | null |
| 2026-03-18 | A Computationally Efficient Learning of Artificial Intelligence System Reliability Considering Error Propagation | Fenglian Pan et.al. | 2603.18201 | null |
| 2026-03-18 | Q-Drift: Quantization-Aware Drift Correction for Diffusion Model Sampling | Sooyoung Ryu et.al. | 2603.18095 | null |
| 2026-03-18 | Unified Spatio-Temporal Token Scoring for Efficient Video VLMs | Jianrui Zhang et.al. | 2603.18004 | null |
| 2026-03-18 | Universal Skeleton Understanding via Differentiable Rendering and MLLMs | Ziyi Wang et.al. | 2603.18003 | null |
| 2026-03-18 | AdaRadar: Rate Adaptive Spectral Compression for Radar-based Perception | Jinho Park et.al. | 2603.17979 | null |
| 2026-03-18 | Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing | Raghavv Goel et.al. | 2603.17942 | null |
| 2026-03-18 | Energy extraction from a rotating Buchdahl star via magnetic reconnection | Ikhtiyor Eshtursunov et.al. | 2603.17928 | null |
| 2026-03-18 | RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference | Arpit Singh Gautam et.al. | 2603.17891 | null |
| 2026-03-18 | Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients | Ziwei Xiang et.al. | 2603.17809 | null |
| 2026-03-18 | Exploring parameter-efficient fine-tuning (PEFT) of billion-parameter vision models with QLoRA and DoRA: insights into generalization for limited-data image classification under a 98:1 test-to-train regime | Haiyu Yang et.al. | 2603.17782 | null |
| 2026-03-18 | Parameter-Efficient Modality-Balanced Symmetric Fusion for Multimodal Remote Sensing Semantic Segmentation | Haocheng Li et.al. | 2603.17705 | null |
| 2026-03-18 | Halo: Domain-Aware Query Optimization for Long-Context Question Answering | Pramod Chunduri et.al. | 2603.17668 | null |
| 2026-03-18 | ReLaGS: Relational Language Gaussian Splatting | Yaxu Xie et.al. | 2603.17605 | null |
| 2026-03-18 | LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation | Mohammad Robaitul Islam Bhuiyan et.al. | 2603.17576 | null |
| 2026-03-18 | EI: Early Intervention for Multimodal Imaging based Disease Recognition | Qijie Wei et.al. | 2603.17514 | null |
| 2026-03-18 | ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression | Ruibo Fan et.al. | 2603.17435 | null |
| 2026-03-18 | The Phasor Transformer: Resolving Attention Bottlenecks on the Unit Circle | Dibakar Sigdel et.al. | 2603.17433 | null |
| 2026-03-18 | Motion-Adaptive Temporal Attention for Lightweight Video Generation with Stable Diffusion | Rui Hong et.al. | 2603.17398 | null |
| 2026-03-18 | Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach Driven by Numerical and Structural Dual-Sensitivity | Hengyuan Zhang et.al. | 2603.17354 | null |
| 2026-03-18 | DANCE: Dynamic 3D CNN Pruning: Joint Frame, Channel, and Feature Adaptation for Energy Efficiency on the Edge | Mohamed Mejri et.al. | 2603.17275 | null |
| 2026-03-18 | Efficient and flexible preparation of photonic NOON states in a superconducting system | Dong-Sheng Li et.al. | 2603.17253 | null |
| 2026-03-18 | KANtize: Exploring Low-bit Quantization of Kolmogorov-Arnold Networks for Efficient Inference | Sohaib Errabii et.al. | 2603.17230 | null |
| 2026-03-17 | OPERA: Online Data Pruning for Efficient Retrieval Model Adaptation | Haoyang Fang et.al. | 2603.17205 | null |
| 2026-03-17 | On quantization and the classical variational principle for the metric mean dimension | Maria Carvalho et.al. | 2603.17091 | null |
| 2026-03-17 | ACE-LoRA: Graph-Attentive Context Enhancement for Parameter-Efficient Adaptation of Medical Vision-Language Models | M. Arda Aydın et.al. | 2603.17079 | null |
| 2026-03-17 | Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization | Wenhao Zhao et.al. | 2603.17052 | null |
| 2026-03-17 | Empirical Recipes for Efficient and Compact Vision-Language Models | Jiabo Huang et.al. | 2603.16987 | null |
| 2026-03-17 | Integrating Inductive Biases in Transformers via Distillation for Financial Time Series Forecasting | Yu-Chen Den et.al. | 2603.16985 | null |
| 2026-03-17 | Understanding Quantization of Optimizer States in LLM Pre-training: Dynamics of State Staleness and Effectiveness of State Resets | Kristi Topollai et.al. | 2603.16731 | null |
| 2026-03-17 | Efficient generation of entangled photons in the telecommunications range using nonlinear metasurfaces integrated with ScAlN/GaN heterostructures | Jaeyeon Yu et.al. | 2603.16699 | null |
| 2026-03-17 | Can Linguistically Related Languages Guide LLM Translation in Low-Resource Settings? | Aishwarya Ramasethu et.al. | 2603.16660 | null |
| 2026-03-17 | FlowComposer: Composable Flows for Compositional Zero-Shot Learning | Zhenqi He et.al. | 2603.16641 | null |
| 2026-03-17 | BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization | Ji-Fu Li et.al. | 2603.16590 | null |
| 2026-03-17 | Exploring different approaches to customize language models for domain-specific text-to-code generation | Luís Freire et.al. | 2603.16526 | null |
| 2026-03-17 | TinyGLASS: Real-Time Self-Supervised In-Sensor Anomaly Detection | Pietro Bonazzi et.al. | 2603.16451 | null |
| 2026-03-17 | Fast-HaMeR: Boosting Hand Mesh Reconstruction using Knowledge Distillation | Hunain Ahmed Jillani et.al. | 2603.16444 | null |
| 2026-03-17 | Capability-Guided Compression: Toward Interpretability-Aware Budget Allocation for Large Language Models | Rishaank Gupta et.al. | 2603.16440 | null |
| 2026-03-17 | CD-FKD: Cross-Domain Feature Knowledge Distillation for Robust Single-Domain Generalization in Object Detection | Junseok Lee et.al. | 2603.16439 | null |
| 2026-03-17 | VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization | Yixuan Wang et.al. | 2603.16435 | null |
| 2026-03-18 | EngGPT2: Sovereign, Efficient and Open Intelligence | G. Ciarfaglia et.al. | 2603.16430 | null |
| 2026-03-17 | PlotTwist: A Creative Plot Generation Framework with Small Language Models | Abhinav Thorat et.al. | 2603.16410 | null |
| 2026-03-17 | DermaFlux: Synthetic Skin Lesion Generation with Rectified Flows for Enhanced Image Classification | Stathis Galanakis et.al. | 2603.16392 | null |
| 2026-03-17 | RASLF: Representation-Aware State Space Model for Light Field Super-Resolution | Zeqiang Wei et.al. | 2603.16243 | null |
| 2026-03-17 | SpecSteer: Synergizing Local Context and Global Reasoning for Efficient Personalized Generation | Hang Lv et.al. | 2603.16219 | null |
| 2026-03-17 | SIA: A Synthesize-Inject-Align Framework for Knowledge-Grounded and Secure E-commerce Search LLMs with Industrial Deployment | Zhouwei Zhai et.al. | 2603.16137 | null |
| 2026-03-17 | Knowledge Distillation for Collaborative Learning in Distributed Communications and Sensing | Nhan Thanh Nguyen et.al. | 2603.16116 | null |
| 2026-03-17 | Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization | Francesco Pio Monaco et.al. | 2603.16105 | null |
| 2026-03-17 | POaaS: Minimal-Edit Prompt Optimization as a Service to Lift Accuracy and Cut Hallucinations on On-Device sLLMs | Jungwoo Shim et.al. | 2603.16045 | null |
| 2026-03-17 | Enhancing Linguistic Generalization of VLA: Fine-Tuning OpenVLA via Synthetic Instruction Augmentation | Dongik Shin et.al. | 2603.16044 | null |
| 2026-03-16 | Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models | Sijie Li et.al. | 2603.16001 | null |
| 2026-03-16 | Sparse but not Simpler: A Multi-Level Interpretability Analysis of Vision Transformers | Siyu Zhang et.al. | 2603.15919 | null |
| 2026-03-16 | Agent-based imitation dynamics can yield efficiently compressed population-level vocabularies | Nathaniel Imel et.al. | 2603.15903 | null |
| 2026-03-16 | Domain Adaptation Without the Compute Burden for Efficient Whole Slide Image Analysis | Umar Marikkar et.al. | 2603.15774 | null |
| 2026-03-16 | S2Act: Simple Spiking Actor | Ugur Akcal et.al. | 2603.15725 | null |
| 2026-03-16 | Mastering the Minority: An Uncertainty-guided Multi-Expert Framework for Challenging-tailed Sequence Learning | Ye Wang et.al. | 2603.15708 | null |
| 2026-03-16 | TabKD: Tabular Knowledge Distillation through Interaction Diversity of Learned Feature Bins | Shovon Niverd Pereira et.al. | 2603.15481 | null |
| 2026-03-16 | CLAG: Adaptive Memory Organization via Agent-Driven Clustering for Small Language Model Agents | Taeyun Roh et.al. | 2603.15421 | null |
| 2026-03-16 | RESQ: A Unified Framework for REliability- and Security Enhancement of Quantized Deep Neural Networks | Ali Soltan Mohammadi et.al. | 2603.15413 | null |
| 2026-03-16 | Spectral Rectification for Parameter-Efficient Adaptation of Foundation Models in Colonoscopy Depth Estimation | Xiaoxian Zhang et.al. | 2603.15374 | null |
| 2026-03-16 | Physically Motivated Knowledge Distillation for Blind Geometric Correction of Side-Scan Sonar Imagery | Can Lei et.al. | 2603.15200 | null |
| 2026-03-16 | Joint Routing and Model Pruning for Decentralized Federated Learning in Bandwidth-Constrained Multi-Hop Wireless Networks | Xiaoyu He et.al. | 2603.15188 | null |
| 2026-03-16 | DAIT: Distillation from Vision-Language Models to Lightweight Classifiers with Adaptive Intermediate Teacher Transfer | Zhengxu He et.al. | 2603.15166 | null |
| 2026-03-16 | An Efficient Cumulative Edge-Detection Method for Image Reconstruction | Toluwani Okunola et.al. | 2603.15151 | null |
| 2026-03-16 | Accelerating Byzantine-Robust Distributed Learning with Compressed Communication via Double Momentum and Variance Reduction | Yanghao Li et.al. | 2603.15144 | null |
| 2026-03-16 | PrototypeNAS: Rapid Design of Deep Neural Networks for Microcontroller Units | Mark Deutel et.al. | 2603.15106 | null |
| 2026-03-16 | Affordable Precision Agriculture: A Deployment-Oriented Review of Low-Cost, Low-Power Edge AI and TinyML for Resource-Constrained Farming Systems | Riya Samanta et.al. | 2603.15085 | null |
| 2026-03-16 | Edit2Interp: Adapting Image Foundation Models from Spatial Editing to Video Frame Interpolation with Few-Shot Learning | Nasrin Rahimi et.al. | 2603.15003 | null |
| 2026-03-16 | Smooth finite time singularity formation without quantization | Istvan Kadar et.al. | 2603.14985 | null |
| 2026-03-16 | Lightweight User-Personalization Method for Closed Split Computing | Yuya Okada et.al. | 2603.14958 | null |
| 2026-03-16 | GT-PCQA: Geometry-Texture Decoupled Point Cloud Quality Assessment with MLLM | Guohua Zhang et.al. | 2603.14951 | null |
| 2026-03-16 | Spiking Layer-Adaptive Magnitude-based Pruning | Junqiao Wang et.al. | 2603.14946 | null |
| 2026-03-16 | Directional Routing in Transformers | Kevin Taylor et.al. | 2603.14923 | null |
| 2026-03-16 | Photonic Quantum-Enhanced Knowledge Distillation | Kuan-Cheng Chen et.al. | 2603.14898 | null |
| 2026-03-16 | RAZOR: Ratio-Aware Layer Editing for Targeted Unlearning in Vision Transformers and Diffusion Models | Ravi Ranjan et.al. | 2603.14819 | null |
| 2026-03-16 | SimCert: Probabilistic Certification for Behavioral Similarity in Deep Neural Network Compression | Jingyang Li et.al. | 2603.14818 | null |
| 2026-03-16 | Efficient Event Camera Volume System | Juan Camilo Soto et.al. | 2603.14738 | null |
| 2026-03-15 | Parameter-Efficient Quality Estimation via Frozen Recursive Models | Umar Abubacar et.al. | 2603.14593 | null |
| 2026-03-15 | FlashHead: Efficient Drop-In Replacement for the Classification Head in Language Model Inference | Wilhelm Tranheden et.al. | 2603.14591 | null |
| 2026-03-15 | Multilingual TinyStories: A Synthetic Combinatorial Corpus of Indic Children’s Stories for Training Small Language Models | Deepon Halder et.al. | 2603.14563 | null |
| 2026-03-15 | ASAP: Attention-Shift-Aware Pruning for Efficient LVLM Inference | Surendra Pathak et.al. | 2603.14549 | null |
| 2026-03-15 | Distilling Latent Manifolds: Resolution Extrapolation by Variational Autoencoders | Jiaming Chu et.al. | 2603.14536 | null |
| 2026-03-15 | LatSearch: Latent Reward-Guided Search for Faster Inference-Time Scaling in Video Diffusion | Zengqun Zhao et.al. | 2603.14526 | null |
| 2026-03-15 | Uni-MDTrack: Learning Decoupled Memory and Dynamic States for Parameter-Efficient Visual Tracking in All Modality | Wenrui Cai et.al. | 2603.14452 | null |
| 2026-03-15 | Flux Quantization on M-Strings | Pinak Banerjee et.al. | 2603.14440 | null |
| 2026-03-15 | SPARQ: Spiking Early-Exit Neural Networks for Energy-Efficient Edge AI | Parth Patne et.al. | 2603.14380 | null |
| 2026-03-15 | Direct Object-Level Reconstruction via Probabilistic Gaussian Splatting | Shuai Guo et.al. | 2603.14316 | null |
| 2026-03-15 | All-day Multi-scenes Lifelong Vision-and-Language Navigation with Tucker Adaptation | Xudong Wang et.al. | 2603.14276 | null |
| 2026-03-15 | On aggregation-quantization permutability problem for discrete-time Markov chains | Adam Doliwa et.al. | 2603.14269 | null |
| 2026-03-15 | Not All Directions Matter: Toward Structured and Task-Aware Low-Rank Adaptation | Xi Xiao et.al. | 2603.14228 | null |
| 2026-03-15 | Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys | Xu Yang et.al. | 2603.14224 | null |
| 2026-03-15 | Safety-Potential Pruning for Enhancing Safety Prompts Against VLM Jailbreaking Without Retraining | Chongxin Li et.al. | 2603.14219 | null |
| 2026-03-15 | Relationship-Aware Safety Unlearning for Multimodal LLMs | Vishnu Narayanan Anilkumar et.al. | 2603.14185 | null |
| 2026-03-15 | Selective Fine-Tuning of GPT Architectures for Parameter-Efficient Clinical Text Classification | Fariba Afrin Irany et.al. | 2603.14183 | null |
| 2026-03-14 | Universal method of selective detection of a wide range of pollutants in liquids using conductance quantization | O. Pospelov et.al. | 2603.14140 | null |
| 2026-03-13 | MoEKD: Mixture-of-Experts Knowledge Distillation for Robust and High-Performing Compressed Code Models | Md. Abdul Awal et.al. | 2603.13213 | null |
| 2026-03-13 | Resource-efficient Quantum Algorithms for Selected Hamiltonian Subspace Diagonalization | Vincent Graves et.al. | 2603.13160 | null |
| 2026-03-13 | Steve-Evolving: Open-World Embodied Self-Evolution via Fine-Grained Diagnosis and Dual-Track Knowledge Distillation | Zhengwei Xie et.al. | 2603.13131 | null |
| 2026-03-13 | ZO-SAM: Zero-Order Sharpness-Aware Minimization for Efficient Sparse Training | Jie Ji et.al. | 2603.13115 | null |
| 2026-03-13 | Efficient and Interpretable Multi-Agent LLM Routing via Ant Colony Optimization | Xudong Wang et.al. | 2603.12933 | null |
| 2026-03-13 | Consistent and Efficient MSCKF-based LiDAR-Inertial Odometry with Inferred Cluster-to-Plane Constraints for UAVs | Jinwen Zhu et.al. | 2603.12904 | null |
| 2026-03-13 | Surrogates for Physics-based and Data-driven Modelling of Parametric Systems: Review and New Perspectives | Matteo Giacomini et.al. | 2603.12870 | null |
| 2026-03-13 | HIFICL: High-Fidelity In-Context Learning for Multimodal Tasks | Xiaoyu Li et.al. | 2603.12760 | null |
| 2026-03-13 | ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning | Shuo Yang et.al. | 2603.12740 | null |
| 2026-03-13 | Vision Verification Enhanced Fusion of VLMs for Efficient Visual Reasoning | Selim Furkan Tekin et.al. | 2603.12669 | null |
| 2026-03-13 | AVION: Aerial Vision-Language Instruction from Offline Teacher to Prompt-Tuned Network | Yu Hu et.al. | 2603.12659 | null |
| 2026-03-13 | VGGT-World: Transforming VGGT into an Autoregressive Geometry World Model | Xiangyu Sun et.al. | 2603.12655 | null |
| 2026-03-13 | Sobolev–Ricci Curvature | Kyoichi Iwasaki et.al. | 2603.12652 | null |
| 2026-03-13 | LightMoE: Reducing Mixture-of-Experts Redundancy through Expert Replacing | Jiawei Hao et.al. | 2603.12645 | null |
| 2026-03-13 | Early Pruning for Public Transport Routing | Andrii Rohovyi et.al. | 2603.12592 | null |
| 2026-03-13 | CA-HFP: Curvature-Aware Heterogeneous Federated Pruning with Model Reconstruction | Gang Hu et.al. | 2603.12591 | null |
| 2026-03-13 | Expert Pyramid Tuning: Efficient Parameter Fine-Tuning for Expertise-Driven Task Allocation | Jia-Chen Zhang et.al. | 2603.12577 | null |
| 2026-03-13 | Spatio-Semantic Expert Routing Architecture with Mixture-of-Experts for Referring Image Segmentation | Alaa Dalaq et.al. | 2603.12538 | null |
| 2026-03-12 | Efficient Quantum Simulation for Nonlinear Stochastic Differential Equations | Xiangyu Li et.al. | 2603.12398 | null |
| 2026-03-12 | NeuroLoRA: Context-Aware Neuromodulation for Parameter-Efficient Multi-Task Adaptation | Yuxin Yang et.al. | 2603.12378 | null |
| 2026-03-12 | Efficient Reasoning with Balanced Thinking | Yulin Li et.al. | 2603.12372 | null |
| 2026-03-12 | Alternating Gradient Flow Utility: A Unified Metric for Structural Pruning and Dynamic Routing in Deep Networks | Tianhao Qian et.al. | 2603.12354 | null |
| 2026-03-12 | Pruning-induced phases in fully-connected neural networks: the eumentia, the dementia, and the amentia | Haining Pan et.al. | 2603.12316 | null |
| 2026-03-12 | HiAP: A Multi-Granular Stochastic Auto-Pruning Framework for Vision Transformers | Andy Li et.al. | 2603.12222 | null |
| 2026-03-12 | ForensicZip: More Tokens are Better but Not Necessary in Forensic Vision-Language Models | Yingxin Lai et.al. | 2603.12208 | null |
| 2026-03-12 | Long-Context Encoder Models for Polish Language Understanding | Sławomir Dadas et.al. | 2603.12191 | null |
| 2026-03-12 | Space-Efficient Approximate Spherical Range Counting in High Dimensions | Andreas Kalavas et.al. | 2603.12106 | null |
| 2026-03-12 | Resource-Efficient Iterative LLM-Based NAS with Feedback Memory | Xiaojie Gu et.al. | 2603.12091 | null |
| 2026-03-12 | EmbTracker: Traceable Black-box Watermarking for Federated Language Models | Haodong Zhao et.al. | 2603.12089 | null |
| 2026-03-12 | Intelligent 6G Edge Connectivity: A Knowledge Driven Optimization Framework for Small Cell Selection | Tuğçe Bilen et.al. | 2603.12086 | null |
| 2026-03-12 | A Joint JSCC-Resource Allocation Framework for QoS-Aware Semantic Communication in LEO Satellite-based EO Missions | Hung Nguyen-Kha et.al. | 2603.12027 | null |
| 2026-03-12 | Efficient Generative Modeling with Unitary Matrix Product States Using Riemannian Optimization | Haotong Duan et.al. | 2603.12026 | null |
| 2026-03-12 | Asymptotically Efficient Recursive Identification Under One-Bit Communications Achieving Original CRLB | Xingrui Liu et.al. | 2603.11964 | null |
| 2026-03-12 | PicoSAM3: Real-Time In-Sensor Region-of-Interest Segmentation | Pietro Bonazzi et.al. | 2603.11917 | null |
| 2026-03-12 | Bielik-Minitron-7B: Compressing Large Language Models via Structured Pruning and Knowledge Distillation for the Polish Language | Remigiusz Kinas et.al. | 2603.11881 | null |
| 2026-03-12 | AdaFuse: Accelerating Dynamic Adapter Inference via Token-Level Pre-Gating and Fused Kernel Optimization | Qiyang Li et.al. | 2603.11873 | null |
| 2026-03-12 | A Further Efficient Algorithm with Best-of-Both-Worlds Guarantees for $m$ -Set Semi-Bandit Problem | Botao Chen et.al. | 2603.11764 | null |
| 2026-03-12 | UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution | Cao Thien Tan et.al. | 2603.11680 | null |
| 2026-03-12 | Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning | Jiaheng Hu et.al. | 2603.11653 | null |
| 2026-03-12 | MedPruner: Training-Free Hierarchical Token Pruning for Efficient 3D Medical Image Understanding in Vision-Language Models | Shengyuan Liu et.al. | 2603.11625 | null |
| 2026-03-12 | DyWeight: Dynamic Gradient Weighting for Few-Step Diffusion Sampling | Tong Zhao et.al. | 2603.11607 | null |
| 2026-03-12 | Quantum mechanical framework for quantization-based optimization: from Gradient flow to Schroedinger equation | Jinwuk Seok et.al. | 2603.11536 | null |
| 2026-03-12 | Mobile-GS: Real-time Gaussian Splatting for Mobile Devices | Xiaobiao Du et.al. | 2603.11531 | null |
| 2026-03-12 | Can Small Language Models Use What They Retrieve? An Empirical Study of Retrieval Utilization Across Model Scale | Sanchit Pandey et.al. | 2603.11513 | null |
| 2026-03-12 | AutoVeriFix+: High-Correctness RTL Generation via Trace-Aware Causal Fix and Semantic Redundancy Pruning | Yan Tan et.al. | 2603.11489 | null |
| 2026-03-11 | Evaluating Explainable AI Attribution Methods in Neural Machine Translation via Attention-Guided Knowledge Distillation | Aria Nourbakhsh et.al. | 2603.11342 | null |
| 2026-03-11 | Unified Flavor: Lattice Quantization, Chain Locality, and a Dynamical Origin of Hierarchical Yukawas | Vernon Barger et.al. | 2603.11341 | null |
| 2026-03-11 | Reversible Lifelong Model Editing via Semantic Routing-Based LoRA | Haihua Luo et.al. | 2603.11239 | null |
| 2026-03-11 | Representation Finetuning for Continual Learning | Haihua Luo et.al. | 2603.11201 | null |
| 2026-03-11 | Efficient Approximation to Analytic and $L^p$ functions by Height-Augmented ReLU Networks | ZeYu Li et.al. | 2603.11128 | null |
| 2026-03-11 | Leech Lattice Vector Quantization for Efficient LLM Compression | Tycho F. A. van der Ouderaa et.al. | 2603.11021 | null |
| 2026-03-11 | Med-DualLoRA: Local Adaptation of Foundation Models for 3D Cardiac MRI | Joan Perramon-Llussà et.al. | 2603.10967 | null |
| 2026-03-11 | GLM-OCR Technical Report | Shuaiqi Duan et.al. | 2603.10910 | null |
| 2026-03-11 | LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation | Jinwoo Ahn et.al. | 2603.10899 | null |
| 2026-03-11 | Continuous Diffusion Transformers for Designing Synthetic Regulatory Elements | Jonathan Liu et.al. | 2603.10885 | null |
| 2026-03-11 | From Images to Words: Efficient Cross-Modal Knowledge Distillation to Language Models from Black-box Teachers | Ayan Sengupta et.al. | 2603.10877 | null |
| 2026-03-11 | Denoising diffusion and latent diffusion models for physics field simulations | Yuan Jia et.al. | 2603.10799 | null |
| 2026-03-11 | From path integral quantization to stochastic quantization: a pedestrian’s journey | Dario Benedetti et.al. | 2603.10761 | null |
| 2026-03-11 | Double-Precision Matrix Multiplication Emulation via Ozaki-II Scheme with FP8 Quantization | Yuki Uchino et.al. | 2603.10634 | null |
| 2026-03-11 | TacLoc: Global Tactile Localization on Objects from a Registration Perspective | Zirui Zhang et.al. | 2603.10565 | null |
| 2026-03-11 | Quantization Robustness of Monotone Operator Equilibrium Networks | James Li et.al. | 2603.10562 | null |
| 2026-03-11 | PET-F2I: A Comprehensive Benchmark and Parameter-Efficient Fine-Tuning of LLMs for PET/CT Report Impression Generation | Yuchen Liu et.al. | 2603.10560 | null |
| 2026-03-11 | SCORE: Replacing Layer Stacking with Contractive Recurrent Depth | Guillaume Godin et.al. | 2603.10544 | null |
| 2026-03-11 | In-Memory ADC-Based Nonlinear Activation Quantization for Efficient In-Memory Computing | Shuai Dong et.al. | 2603.10540 | null |
| 2026-03-11 | DepthCache: Depth-Guided Training-Free Visual Token Merging for Vision-Language-Action Model Inference | Yuquan Li et.al. | 2603.10469 | null |
| 2026-03-11 | The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training | Hengjie Cao et.al. | 2603.10444 | null |
| 2026-03-11 | AgentServe: Algorithm-System Co-Design for Efficient Agentic AI Serving on a Consumer-Grade GPU | Yuning Zhang et.al. | 2603.10342 | null |
| 2026-03-11 | GaLoRA: Parameter-Efficient Graph-Aware LLMs for Node Classification | Mayur Choudhary et.al. | 2603.10298 | null |
| 2026-03-10 | WME: Extending CDCL-based Model Enumeration with Weights | Giuseppe Spallitta et.al. | 2603.10236 | null |
| 2026-03-10 | ARCHE: Autoregressive Residual Compression with Hyperprior and Excitation | Sofia Iliopoulou et.al. | 2603.10188 | null |
| 2026-03-10 | ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning | Ruizhong Qiu et.al. | 2603.10160 | null |
| 2026-03-10 | Batalin-Fradkin-Vilkovisky quantization of Einstein gravity with off-diagonal solutions encoding Hořava type generating functions | Elşen Veli Veliev et.al. | 2603.10082 | null |
| 2026-03-10 | When Learning Rates Go Wrong: Early Structural Signals in PPO Actor-Critic | Alberto Fernández-Hernández et.al. | 2603.09950 | null |
| 2026-03-11 | A Voronoi Cell Formulation for Principled Token Pruning in Late-Interaction Retrieval Models | Yash Kankanampati et.al. | 2603.09933 | null |
| 2026-03-10 | GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection | Kai Yao et.al. | 2603.09865 | null |
| 2026-03-10 | Multi-spacecraft constraints on relativistic solar energetic particle transport in the widespread 28 October 2021 event | E. Lavasa et.al. | 2603.09839 | null |
| 2026-03-10 | Exploiting Label-Aware Channel Scoring for Adaptive Channel Pruning in Split Learning | Jialei Tan et.al. | 2603.09792 | null |
| 2026-03-10 | A Multi-Prototype-Guided Federated Knowledge Distillation Approach in AI-RAN Enabled Multi-Access Edge Computing System | Luyao Zou et.al. | 2603.09727 | null |
| 2026-03-10 | TemporalDoRA: Temporal PEFT for Robust Surgical Video Question Answering | Luca Carlini et.al. | 2603.09696 | null |
| 2026-03-10 | On Catastrophic Forgetting in Low-Rank Decomposition-Based Parameter-Efficient Fine-Tuning | Muhammad Ahmad et.al. | 2603.09684 | null |
| 2026-03-10 | X-GS: An Extensible Open Framework Unifying 3DGS Architectures with Downstream Multimodal Models | Yueen Ma et.al. | 2603.09632 | null |
| 2026-03-10 | Decoder-Free Distillation for Quantized Image Restoration | S. M. A. Sharif et.al. | 2603.09624 | null |
| 2026-03-10 | BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers | Chaodong Xiao et.al. | 2603.09582 | null |
| 2026-03-10 | Randomized Distributed Function Computation (RDFC): Ultra-Efficient Semantic Communication Applications to Privacy | Onur Günlü et.al. | 2603.09577 | null |
| 2026-03-10 | Routing without Forgetting | Alessio Masano et.al. | 2603.09576 | null |
| 2026-03-10 | Efficiently Aligning Draft Models via Parameter- and Data-Efficient Adaptation | Luxi Lin et.al. | 2603.09527 | null |
| 2026-03-10 | Beyond Short-Horizon: VQ-Memory for Robust Long-Horizon Manipulation in Non-Markovian Simulation Benchmarks | Wang Honghui et.al. | 2603.09513 | null |
| 2026-03-10 | TrainDeeploy: Hardware-Accelerated Parameter-Efficient Fine-Tuning of Small Transformer Models at the Extreme Edge | Run Wang et.al. | 2603.09511 | null |
| 2026-03-10 | Evolving Prompt Adaptation for Vision-Language Models | Enming Zhang et.al. | 2603.09493 | null |
| 2026-03-11 | Prune Redundancy, Preserve Essence: Vision Token Compression in VLMs via Synergistic Importance-Diversity | Zhengyao Fang et.al. | 2603.09480 | null |
| 2026-03-10 | Reviving ConvNeXt for Efficient Convolutional Diffusion Models | Taesung Kwon et.al. | 2603.09408 | null |
| 2026-03-10 | Deep Learning Search for Gravitational Waves from Compact Binary Coalescence | Lorenzo Mobilia et.al. | 2603.09386 | null |
| 2026-03-10 | MIL-PF: Multiple Instance Learning on Precomputed Features for Mammography Classification | Nikola Jovišić et.al. | 2603.09374 | null |
| 2026-03-10 | Efficient Reasoning at Fixed Test-Time Cost via Length-Aware Attention Priors and Gain-Aware Training | Rian Atri et.al. | 2603.09253 | null |
| 2026-03-10 | LooComp: Leverage Leave-One-Out Strategy to Encoder-only Transformer for Efficient Query-aware Context Compression | Thao Do et.al. | 2603.09222 | null |
| 2026-03-10 | Explainable Innovation Engine: Dual-Tree Agent-RAG with Methods-as-Nodes and Verifiable Write-Back | Renwei Meng et.al. | 2603.09192 | null |
| 2026-03-10 | Point Cloud as a Foreign Language for Multi-modal Large Language Model | Sneha Paul et.al. | 2603.09173 | null |
| 2026-03-10 | RTFDNet: Fusion-Decoupling for Robust RGB-T Segmentation | Kunyu Tan et.al. | 2603.09149 | null |
| 2026-03-09 | Predictive first-principles simulations for co-designing next-generation energy-efficient AI systems | Denis Mamaluy et.al. | 2603.08995 | null |
| 2026-03-09 | The $qs$ Inequality: Quantifying the Double Penalty of Mixture-of-Experts at Inference | Vignesh Adhinarayanan et.al. | 2603.08960 | null |
| 2026-03-09 | An implicit restriction in the Dirac quantization | Han Geurdes et.al. | 2603.08516 | null |
| 2026-03-09 | Oracle-Guided Soft Shielding for Safe Move Prediction in Chess | Prajit T Rajendran et.al. | 2603.08506 | null |
| 2026-03-09 | Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck | Fabio Valerio Massoli et.al. | 2603.08462 | null |
| 2026-03-09 | LycheeCluster: Efficient Long-Context Inference with Structure-Aware Chunking and Hierarchical KV Indexing | Dongfang Li et.al. | 2603.08453 | null |
| 2026-03-09 | Alfa: Attentive Low-Rank Filter Adaptation for Structure-Aware Cross-Domain Personalized Gaze Estimation | He-Yen Hsieh et.al. | 2603.08445 | null |
| 2026-03-09 | $Δ$ VLA: Prior-Guided Vision-Language-Action Models via World Knowledge Variation | Yijie Zhu et.al. | 2603.08361 | null |
| 2026-03-09 | Rethinking Attention Output Projection: Structured Hadamard Transforms for Efficient Transformers | Shubham Aggarwal et.al. | 2603.08343 | null |
| 2026-03-09 | PRIME: Efficient Algorithm for Token Graph Routing Problem | Haotian Xu et.al. | 2603.08337 | null |
| 2026-03-09 | WaDi: Weight Direction-aware Distillation for One-step Image Synthesis | Lei Wang et.al. | 2603.08258 | null |
| 2026-03-09 | NCL-UoR at SemEval-2026 Task 5: Embedding-Based Methods, Fine-Tuning, and LLMs for Word Sense Plausibility Rating | Tong Wu et.al. | 2603.08256 | null |
| 2026-03-09 | SRNeRV: A Scale-wise Recursive Framework for Neural Video Representation | Jia Wang et.al. | 2603.08227 | null |
| 2026-03-09 | SERQ: Saliency-Aware Low-Rank Error Reconstruction for LLM Quantization | Yeonsik Park et.al. | 2603.08185 | null |
| 2026-03-09 | Adaptive MLP Pruning for Large Vision Transformers | Chengchao Shen et.al. | 2603.08100 | null |
| 2026-03-09 | High-Fidelity Pruning for Large Language Models | Yijun Zhu et.al. | 2603.08083 | null |
| 2026-03-09 | Deterministic Differentiable Structured Pruning for Large Language Models | Weiyu Huang et.al. | 2603.08065 | null |
| 2026-03-09 | Stabilized Fine-Tuning with LoRA in Federated Learning: Mitigating the Side Effect of Client Size and Rank via the Scaling Factor | Jiayu Huang et.al. | 2603.08058 | null |
| 2026-03-09 | Distributed Coordination Algorithms with Efficient Communication for Open Multi-Agent Systems with Dynamic Communication Links and Processing Delays | Jiaqi Hu et.al. | 2603.08038 | null |
| 2026-03-09 | Capacity-Aware Mixture Law Enables Efficient LLM Data Optimization | Jingwei Li et.al. | 2603.08022 | null |
| 2026-03-09 | Model-Free DRL Control for Power Inverters: From Policy Learning to Real-Time Implementation via Knowledge Distillation | Yang Yang et.al. | 2603.07964 | null |
| 2026-03-09 | PSTNet: Physically-Structured Turbulence Network | Boris Kriuk et.al. | 2603.07957 | null |
| 2026-03-09 | Geometric Transformation-Embedded Mamba for Learned Video Compression | Hao Wei et.al. | 2603.07912 | null |
| 2026-03-09 | DyQ-VLA: Temporal-Dynamic-Aware Quantization for Embodied Vision-Language-Action Models | Zihao Zheng et.al. | 2603.07904 | null |
| 2026-03-08 | DistillGuard: Evaluating Defenses Against LLM Knowledge Distillation | Bo Jiang et.al. | 2603.07835 | null |
| 2026-03-08 | GazeShift: Unsupervised Gaze Estimation and Dataset for VR | Gil Shapira et.al. | 2603.07832 | null |
| 2026-03-08 | SGI: Structured 2D Gaussians for Efficient and Compact Large Image Representation | Zixuan Pan et.al. | 2603.07789 | null |
| 2026-03-08 | Geometric Knowledge-Assisted Federated Dual Knowledge Distillation Approach Towards Remote Sensing Satellite Imagery | Luyao Zou et.al. | 2603.07774 | null |
| 2026-03-08 | Compressed Proximal Federated Learning for Non-Convex Composite Optimization on Heterogeneous Data | Pu Qiu et.al. | 2603.07654 | null |
| 2026-03-08 | Efficient RGB-D Scene Understanding via Multi-task Adaptive Learning and Cross-dimensional Feature Guidance | Guodong Sun et.al. | 2603.07570 | null |
| 2026-03-08 | CONSTANT: Towards High-Quality One-Shot Handwriting Generation with Patch Contrastive Enhancement and Style-Aware Quantization | Anh-Duy Le et.al. | 2603.07543 | null |
| 2026-03-08 | TableMind++: An Uncertainty-Aware Programmatic Agent for Tool-Augmented Table Reasoning | Mingyue Cheng et.al. | 2603.07528 | null |
| 2026-03-08 | GP-Tree: An in-memory spatial index combining adaptive grid cells with a prefix tree for efficient spatial querying | Xiangyang Yang et.al. | 2603.07517 | null |
| 2026-03-08 | FedEU: Evidential Uncertainty-Driven Federated Fine-Tuning of Vision Foundation Models for Remote Sensing Image Segmentation | Xiaokang Zhang et.al. | 2603.07468 | null |
| 2026-03-08 | Selective Transfer Learning of Cross-Modality Distillation for Monocular 3D Object Detection | Rui Ding et.al. | 2603.07464 | null |
| 2026-03-08 | SLNet: A Super-Lightweight Geometry-Adaptive Network for 3D Point Cloud Recognition | Mohammad Saeid et.al. | 2603.07454 | null |
| 2026-03-08 | Adaptive Capacity Allocation for Vision Language Action Fine-tuning | Donghoon Kim et.al. | 2603.07404 | null |
| 2026-03-07 | Explainable and Hardware-Efficient Jamming Detection for 5G Networks Using the Convolutional Tsetlin Machine | Vojtech Halenka et.al. | 2603.07336 | null |
| 2026-03-07 | Faster-HEAL: An Efficient and Privacy-Preserving Collaborative Perception Framework for Heterogeneous Autonomous Vehicles | Armin Maleki et.al. | 2603.07314 | null |
| 2026-03-07 | LightMedSeg: Lightweight 3D Medical Image Segmentation with Learned Spatial Anchors | Kavyansh Tyagi et.al. | 2603.07228 | null |
| 2026-03-07 | FastSTAR: Spatiotemporal Token Pruning for Efficient Autoregressive Video Synthesis | Sungwoong Yune et.al. | 2603.07192 | null |
| 2026-03-07 | The Model Knows Which Tokens Matter: Automatic Token Selection via Noise Gating | Landi He et.al. | 2603.07135 | null |
| 2026-03-07 | Enhancing User Fairness in Two-Layer RSMA: A Movable Antenna Approach | Ji Luo et.al. | 2603.07127 | null |
| 2026-03-07 | Efficient Personalized Reranking with Semi-Autoregressive Generation and Online Knowledge Distillation | Kai Cheng et.al. | 2603.07107 | null |
| 2026-03-07 | Exploring the Reasoning Depth of Small Language Models in Software Architecture: A Multidimensional Evaluation Framework Towards Software Engineering 2.0 | Ha Vo et.al. | 2603.07091 | null |
| 2026-03-07 | Can Safety Emerge from Weak Supervision? A Systematic Analysis of Small Language Models | Punyajoy Saha et.al. | 2603.07017 | null |
| 2026-03-07 | Two-Stage Path Following for Mobile Manipulators via Dimensionality-Reduced Graph Search and Numerical Optimization | Fuyu Guo et.al. | 2603.07003 | null |
| 2026-03-06 | NOBLE: Accelerating Transformers with Nonlinear Low-Rank Branches | Ethan Smith et.al. | 2603.06492 | null |
| 2026-03-06 | History-Conditioned Spatio-Temporal Visual Token Pruning for Efficient Vision-Language Navigation | Qitong Wang et.al. | 2603.06480 | null |
| 2026-03-06 | GreenRFM: Toward a resource-efficient radiology foundation model | Yingtai Li et.al. | 2603.06467 | null |
| 2026-03-06 | Spinor moving frame, type II superparticle quantization, hidden $SU(8)$ symmetry of linearized 10D supergravity, and superamplitudes | Igor Bandos et.al. | 2603.06404 | null |
| 2026-03-06 | HiPP-Prune: Hierarchical Preference-Conditioned Structured Pruning for Vision-Language Models | Lincen Bai et.al. | 2603.06270 | null |
| 2026-03-06 | TaPD: Temporal-adaptive Progressive Distillation for Observation-Adaptive Trajectory Forecasting in Autonomous Driving | Mingyu Fan et.al. | 2603.06231 | null |
| 2026-03-06 | SPOT: Span-level Pause-of-Thought for Efficient and Interpretable Latent Reasoning in Large Language Models | Yunlong Chu et.al. | 2603.06222 | null |
| 2026-03-06 | Multimodal Behavior Tree Generation: A Small Vision-Language Model for Robot Task Planning | Cristiano Battistini et.al. | 2603.06084 | null |
| 2026-03-06 | EvoESAP: Non-Uniform Expert Pruning for Sparse MoE | Zongfang Liu et.al. | 2603.06003 | null |
| 2026-03-06 | Balancing Latency and Accuracy of Code Completion via Local-Cloud Model Cascading | Hanzhen Lu et.al. | 2603.05974 | null |
| 2026-03-06 | CR-QAT: Curriculum Relational Quantization-Aware Training for Open-Vocabulary Object Detection | Jinyeong Park et.al. | 2603.05964 | null |
| 2026-03-06 | Omni-Masked Gradient Descent: Memory-Efficient Optimization via Mask Traversal with Improved Convergence | Hui Yang et.al. | 2603.05960 | null |
| 2026-03-06 | Energy-Driven Adaptive Visual Token Pruning for Efficient Vision-Language Models | Jialuo He et.al. | 2603.05950 | null |
| 2026-03-06 | Implicit Style Conditioning: A Structured Style-Rewrite Framework for Low-Resource Character Modeling | Chanhui Zhu et.al. | 2603.05933 | null |
| 2026-03-06 | ROSE: Reordered SparseGPT for More Accurate One-Shot Large Language Models Pruning | Mingluo Su et.al. | 2603.05878 | link |
| 2026-03-06 | Shifting Adaptation from Weight Space to Memory Space: A Memory-Augmented Agent for Medical Image Segmentation | Bowen Chen et.al. | 2603.05873 | null |
| 2026-03-06 | Chiral Terahertz Amplification and Lasing using Two-Dimensional Materials with Berry Curvature Dipole | Amin Hakimi et.al. | 2603.05825 | null |
| 2026-03-06 | Self-Auditing Parameter-Efficient Fine-Tuning for Few-Shot 3D Medical Image Segmentation | Son Thai Ly et.al. | 2603.05822 | null |
| 2026-03-06 | Training-free Latent Inter-Frame Pruning with Attention Recovery | Dennis Menn et.al. | 2603.05811 | null |
| 2026-03-06 | MoE Lens – An Expert Is All You Need | Marmik Chaudhari et.al. | 2603.05806 | null |
| 2026-03-06 | Sparse Crosscoders for diffing MoEs and Dense models | Marmik Chaudhari et.al. | 2603.05805 | null |
| 2026-03-06 | A Quantization-Aware Training Based Lightweight Method for Neural Distinguishers | Guangwei Xiong et.al. | 2603.05791 | null |
| 2026-03-05 | LTLGuard: Formalizing LTL Specifications with Compact Language Models and Lightweight Symbolic Reasoning | Medina Andresel et.al. | 2603.05728 | null |
| 2026-03-05 | Interpretable Motion Artificat Detection in structural Brain MRI | Naveetha Nithianandam et.al. | 2603.05726 | null |
| 2026-03-05 | Gabor Primitives for Accelerated Cardiac Cine MRI Reconstruction | Wenqi Huang et.al. | 2603.05681 | null |
| 2026-03-05 | Keeping the Evidence Chain: Semantic Evidence Allocation for Training-Free Token Pruning in Video Temporal Grounding | Jiaqi Li et.al. | 2603.05663 | null |
| 2026-03-05 | Bias In, Bias Out? Finding Unbiased Subnetworks in Vanilla Models | Ivan Luiz De Moura Matos et.al. | 2603.05582 | null |
| 2026-03-05 | POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation | Zeju Qiu et.al. | 2603.05500 | null |
| 2026-03-05 | Efficient simulation of Bose-Einstein condensates in nontrivial topologies | Abel Beregi et.al. | 2603.05447 | null |
| 2026-03-05 | MobileFetalCLIP: Selective Repulsive Knowledge Distillation for Mobile Fetal Ultrasound Analysis | Numan Saeed et.al. | 2603.05421 | null |
| 2026-03-05 | Preserving Continuous Symmetry in Discrete Spaces: Geometric-Aware Quantization for SO(3)-Equivariant GNNs | Haoyu Zhou et.al. | 2603.05343 | null |
| 2026-03-05 | Med-V1: Small Language Models for Zero-shot and Scalable Biomedical Evidence Attribution | Qiao Jin et.al. | 2603.05308 | null |
| 2026-03-05 | WavSLM: Single-Stream Speech Language Modeling via WavLM Distillation | Luca Della Libera et.al. | 2603.05299 | null |
| 2026-03-05 | SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity | Hanyong Shao et.al. | 2603.05232 | null |
| 2026-03-05 | Stable-LoRA: Stabilizing Feature Learning of Low-Rank Adaptation | Yize Wu et.al. | 2603.05204 | link |
| 2026-03-05 | An efficient and accurate numerical method for computing the ground states of three-dimensional rotating dipolar Bose-Einstein condensates under strongly anisotropic trap | Qinglin Tang et.al. | 2603.05194 | null |
| 2026-03-05 | CRISP: Correlation-Resilient Indexing via Subspace Partitioning | Dimitris Dimitropoulos et.al. | 2603.05180 | null |
| 2026-03-05 | Trainable Bitwise Soft Quantization for Input Feature Compression | Karsten Schrödter et.al. | 2603.05172 | null |
| 2026-03-05 | Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity | Di Zhang et.al. | 2603.05168 | null |
| 2026-03-05 | FedBCD:Communication-Efficient Accelerated Block Coordinate Gradient Descent for Federated Learning | Junkang Liu et.al. | 2603.05116 | null |
| 2026-03-05 | Diff-ES: Stage-wise Structural Diffusion Pruning via Evolutionary Search | Zongfang Liu et.al. | 2603.05105 | null |
| 2026-03-05 | Beyond Positional Encoding: A 5D Spatio-Directional Hash Encoding | Philippe Weier et.al. | 2603.05079 | null |
| 2026-03-05 | Constrained Symplectic Quantization: Disclosing the Deterministic Framework Behind Quantum Mechanics | Martina Giachello et.al. | 2603.05072 | null |
| 2026-03-05 | MCEL: Margin-Based Cross-Entropy Loss for Error-Tolerant Quantized Neural Networks | Mikail Yayla et.al. | 2603.05048 | null |
| 2026-03-05 | A loop quantization of the marginally bound Lemaître-Tolman-Bondi dust model | Luca Cafaro et.al. | 2603.04995 | null |
| 2026-03-05 | Programmable superconducting neuron with intrinsic in-memory computation and dual-timescale plasticity for ultra-efficient neuromorphic computing | Muen Wang et.al. | 2603.04966 | null |
| 2026-03-05 | VisionPangu: A Compact and Fine-Grained Multimodal Assistant with 1.7B Parameters | Jiaxin Fan et.al. | 2603.04957 | null |
| 2026-03-05 | WaterSIC: information-theoretically (near) optimal linear layer quantization | Egor Lifar et.al. | 2603.04956 | null |
| 2026-03-05 | AILS-NTUA at SemEval-2026 Task 3: Efficient Dimensional Aspect-Based Sentiment Analysis | Stavros Gazetas et.al. | 2603.04933 | null |
| 2026-03-05 | MASQuant: Modality-Aware Smoothing Quantization for Multimodal Large Language Models | Lulu Hu et.al. | 2603.04800 | null |
| 2026-03-05 | Stacked from One: Multi-Scale Self-Injection for Context Window Extension | Wei Han et.al. | 2603.04759 | null |
| 2026-03-05 | A Benchmark Study of Neural Network Compression Methods for Hyperspectral Image Classification | Sai Shi et.al. | 2603.04720 | null |
| 2026-03-05 | Detection of Illicit Content on Online Marketplaces using Large Language Models | Quoc Khoa Tran et.al. | 2603.04707 | null |
| 2026-03-04 | Unified Integer and Fractional Quantum Hall Effects from Boundary-Induced Edge-State Quantization | Pedro Pereyra et.al. | 2603.04652 | null |
| 2026-03-04 | An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs | Waleed Afandi et.al. | 2603.04545 | null |
| 2026-03-04 | Dissecting Quantization Error: A Concentration-Alignment Perspective | Marco Federici et.al. | 2603.04359 | null |
| 2026-03-04 | Efficient Refusal Ablation in LLM through Optimal Transport | Geraldin Nanfack et.al. | 2603.04355 | null |
| 2026-03-04 | Direct derivation of the modified Langevin noise formalism from the canonical quantization of macroscopic electromagnetism | Alessandro Ciattoni et.al. | 2603.04336 | null |
| 2026-03-04 | Activation Outliers in Transformer Quantization: Reproduction, Statistical Analysis, and Deployment Tradeoffs | Pranav Kumar Kaliaperumal et.al. | 2603.04308 | null |
| 2026-03-04 | Constraint-Aware Generative Re-ranking for Multi-Objective Optimization in Advertising Feeds | Chenfei Li et.al. | 2603.04227 | null |
| 2026-03-05 | Bielik-Q2-Sharp: A Comparative Study of Extreme 2-bit Quantization Methods for a Polish 11B Language Model | Jakub Prejzner et.al. | 2603.04162 | null |
| 2026-03-04 | Unbiased Dynamic Pruning for Efficient Group-Based Policy Optimization | Haodong Zhu et.al. | 2603.04135 | null |
| 2026-03-04 | BeamPERL: Parameter-Efficient RL with Verifiable Rewards Specializes Compact LLMs for Structured Beam Mechanics Reasoning | Tarjei Paule Hage et.al. | 2603.04124 | null |
| 2026-03-04 | Wasserstein Gradient Flows of semi-discret energies: evolution of urban areas anduniform quantization | Joao Miguel Machado et.al. | 2603.04088 | null |
| 2026-03-04 | Tuning Just Enough: Lightweight Backdoor Attacks on Multi-Encoder Diffusion Models | Ziyuan Chen et.al. | 2603.04064 | null |
| 2026-03-05 | LoRA-MME: Multi-Model Ensemble of LoRA-Tuned Encoders for Code Comment Classification | Md Akib Haider et.al. | 2603.03959 | null |
| 2026-03-04 | Vector-Quantized Soft Label Compression for Dataset Distillation | Ali Abbasi et.al. | 2603.03808 | null |
| 2026-03-04 | Adaptive Enhancement and Dual-Pooling Sequential Attention for Lightweight Underwater Object Detection with YOLOv10 | Md. Mushibur Rahman et.al. | 2603.03807 | null |
| 2026-03-04 | LEA: Label Enumeration Attack in Vertical Federated Learning | Wenhao Jiang et.al. | 2603.03777 | null |
| 2026-03-04 | Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning | Chuang Zhang et.al. | 2603.03752 | null |
| 2026-03-04 | EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs | Yuhao Chen et.al. | 2603.03681 | null |
| 2026-03-04 | ARMOR: Robust and Efficient CNN-Based SAR ATR through Model-Hardware Co-Design | Sachini Wickramasinghe et.al. | 2603.03598 | null |
| 2026-03-03 | Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts | Sanae Lotfi et.al. | 2603.03535 | null |
| 2026-03-03 | Raising Bars, Not Parameters: LilMoo Compact Language Model for Hindi | Shiza Fatimah et.al. | 2603.03508 | null |
| 2026-03-03 | DKD-KAN: A Lightweight knowledge-distilled KAN intrusion detection framework, based on MLP and KAN | Mohammad Alikhani et.al. | 2603.03486 | null |
| 2026-03-03 | Towards Improved Sentence Representations using Token Graphs | Krishna Sri Ipsit Mantri et.al. | 2603.03389 | null |
| 2026-03-03 | LiteVLA-Edge: Quantized On-Device Multimodal Control for Embedded Robotics | Justin Williams et.al. | 2603.03380 | null |
| 2026-03-03 | No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models | Omer Sela et.al. | 2603.03203 | null |
| 2026-03-03 | Channel-Adaptive Edge AI: Maximizing Inference Throughput by Adapting Computational Complexity to Channel States | Jierui Zhang et.al. | 2603.03146 | null |
| 2026-03-03 | TinyIceNet: Low-Power SAR Sea Ice Segmentation for On-Board FPGA Inference | Mhd Rashed Al Koutayni et.al. | 2603.03075 | null |
| 2026-03-03 | Stability properties of Minimal Gated Unit neural networks | Stefano De Carli et.al. | 2603.03017 | null |
| 2026-03-03 | Reproducing and Comparing Distillation Techniques for Cross-Encoders | Victor Morand et.al. | 2603.03010 | null |
| 2026-03-03 | QAOA-Predictor: Forecasting Success Probabilities and Minimal Depths for Efficient Fixed-Parameter Optimization | Rodrigo Coelho et.al. | 2603.02990 | null |
| 2026-03-03 | ProGIC: Progressive and Lightweight Generative Image Compression with Residual Vector Quantization | Hao Cao et.al. | 2603.02897 | null |
| 2026-03-03 | MuxTune: Efficient Multi-Task LLM Fine-Tuning in Multi-Tenant Datacenters via Spatial-Temporal Backbone Multiplexing | Chunyu Xue et.al. | 2603.02885 | null |
| 2026-03-03 | SemanticDialect: Semantic-Aware Mixed-Format Quantization for Video Diffusion Transformers | Wonsuk Jang et.al. | 2603.02883 | null |
| 2026-03-03 | Fast and memory-efficient classical simulation of quantum machine learning via forward and backward gate fusion | Yoshiaki Kawase et.al. | 2603.02804 | null |
| 2026-03-03 | Hardware Implementation of Photonic Spiking Hash Retrieval | Shangxuan Shi et.al. | 2603.02738 | null |
| 2026-03-03 | Gated Differential Linear Attention: A Linear-Time Decoder for High-Fidelity Medical Segmentation | Hongbo Zheng et.al. | 2603.02727 | null |
| 2026-03-03 | SUN: Shared Use of Next-token Prediction for Efficient Multi-LLM Disaggregated Serving | Sunghyeon Woo et.al. | 2603.02599 | null |
| 2026-03-03 | Synthetic-Child: An AIGC-Based Synthetic Data Pipeline for Privacy-Preserving Child Posture Estimation | Taowen Zeng et.al. | 2603.02598 | null |
| 2026-03-03 | Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation | Chonghua Lv et.al. | 2603.02554 | null |
| 2026-03-03 | Semantic Forwarding and Codebook-Enhanced Model Division Multiple Access for Satellite-Terrestrial Networks | Jinghong Huang et.al. | 2603.02536 | null |
| 2026-03-03 | Learning Object-Centric Spatial Reasoning for Sequential Manipulation in Cluttered Environments | Chrisantus Eze et.al. | 2603.02511 | null |
| 2026-03-02 | A Unified Revisit of Temperature in Classification-Based Knowledge Distillation | Logan Frank et.al. | 2603.02430 | null |
| 2026-03-02 | From Fewer Samples to Fewer Bits: Reframing Dataset Distillation as Joint Optimization of Precision and Compactness | My H. Dinh et.al. | 2603.02411 | null |
| 2026-03-02 | Fast and Versatile RNA Design via Motif-level Divide-and-Conquer and Structure-level Rival Search | Tianshuo Zhou et.al. | 2603.02283 | null |
| 2026-03-02 | Deep Unfolding for SIM-Assisted Multiband MU-MISO Downlink Systems | Muhammad Ibrahim et.al. | 2603.02122 | null |
| 2026-03-02 | MetaRCA: A Generalizable Root Cause Analysis Framework for Cloud-Native Systems Powered by Meta Causal Knowledge | Shuai Liang et.al. | 2603.02032 | null |
| 2026-03-02 | Rich Insights from Cheap Signals: Efficient Evaluations via Tensor Factorization | Felipe Maia Polo et.al. | 2603.02029 | null |
| 2026-03-02 | KDFlow: A User-Friendly and Efficient Knowledge Distillation Framework for Large Language Models | Songming Zhang et.al. | 2603.01875 | null |
| 2026-03-02 | FreeAct: Freeing Activations for LLM Quantization | Xiaohao Liu et.al. | 2603.01776 | null |
| 2026-03-02 | Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning | Zichen Tian et.al. | 2603.01759 | null |
| 2026-03-02 | StepVAR: Structure-Texture Guided Pruning for Visual Autoregressive Models | Keli Liu et.al. | 2603.01757 | null |
| 2026-03-02 | CA-AFP: Cluster-Aware Adaptive Federated Pruning | Om Govind Jha et.al. | 2603.01739 | null |
| 2026-03-02 | FastLightGen: Fast and Light Video Generation with Fewer Steps and Parameters | Shao Shitong et.al. | 2603.01685 | null |
| 2026-03-02 | Beyond the Grid: Layout-Informed Multi-Vector Retrieval with Parsed Visual Document Representations | Yibo Yan et.al. | 2603.01666 | null |
| 2026-03-02 | Boosting Entropy with Bell Box Quantization | Ningfeng Yang et.al. | 2603.01599 | null |
| 2026-03-02 | Keyword-based Community Search in Bipartite Spatial-Social Networks (Technical Report) | Kovan A. Bavi et.al. | 2603.01500 | null |
| 2026-03-02 | Token Reduction via Local and Global Contexts Optimization for Efficient Video Large Language Models | Jinlong Li et.al. | 2603.01400 | null |
| 2026-03-02 | Quasar: Quantized Self-Speculative Acceleration for Rapid Inference via Memory-Efficient Verification | Guang Huang et.al. | 2603.01399 | null |
| 2026-03-02 | 3BASiL: An Algorithmic Framework for Sparse plus Low-Rank Compression of LLMs | Mehdi Makni et.al. | 2603.01376 | null |
| 2026-03-02 | MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention | Zilong Zhao et.al. | 2603.01361 | null |
| 2026-03-01 | AgilePruner: An Empirical Study of Attention and Diversity for Adaptive Visual Token Pruning in Large Vision-Language Models | Changwoo Baek et.al. | 2603.01236 | link |
| 2026-03-01 | VP-Hype: A Hybrid Mamba-Transformer Framework with Visual-Textual Prompting for Hyperspectral Image Classification | Abdellah Zakaria Sellam et.al. | 2603.01174 | null |
| 2026-03-01 | GuiDINO: Rethinking Vision Foundation Model in Medical Image Segmentation | Zhuonan Liang et.al. | 2603.01115 | null |
| 2026-03-01 | \textsc{Mobile-VTON}: High-Fidelity On-Device Virtual Try-On | Zhenchen Wan et.al. | 2603.00947 | null |
| 2026-03-01 | On the Exact Algorithmic Extraction of Finite Tesselations Through Prime Extraction of Minimal Representative Forms | Sushish Baral et.al. | 2603.00911 | null |
| 2026-03-01 | Curvature-Weighted Capacity Allocation: A Minimum Description Length Framework for Layer-Adaptive Large Language Model Optimization | Theophilus Amaefuna et.al. | 2603.00910 | null |
| 2026-03-01 | Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models | Yichao Wu et.al. | 2603.00846 | null |
| 2026-03-01 | MedGPT-oss: Training a General-Purpose Vision-Language Model for Biomedicine | Kai Zhang et.al. | 2603.00842 | null |
| 2026-02-28 | BornoViT: A Novel Efficient Vision Transformer for Bengali Handwritten Basic Characters Classification | Rafi Hassan Chowdhury et.al. | 2603.00755 | null |
| 2026-02-28 | MARS: Harmonizing Multimodal Convergence via Adaptive Rank Search | Minkyoung Cho et.al. | 2603.00720 | null |
| 2026-02-28 | Preliminary study of the $H$ dibaryon in $N_{\rm f}=2+1$ lattice QCD | André Baião Raposo et.al. | 2603.00698 | null |
| 2026-02-28 | Specializing Foundation Models via Mixture of Low-Rank Experts for Comprehensive Head CT Analysis | Youngjin Yoo et.al. | 2603.00675 | null |
| 2026-02-28 | Exploring 3D Dataset Pruning | Xiaohan Zhao et.al. | 2603.00651 | null |
| 2026-02-28 | Linking Modality Isolation in Heterogeneous Collaborative Perception | Changxing Liu et.al. | 2603.00609 | null |
| 2026-02-28 | CoMoL: Efficient Mixture of LoRA Experts via Dynamic Core Space Merging | Jie Cao et.al. | 2603.00573 | null |
| 2026-02-28 | TP-Spikformer: Token Pruned Spiking Transformer | Wenjie Wei et.al. | 2603.00527 | null |
| 2026-02-28 | What Do Visual Tokens Really Encode? Uncovering Sparsity and Redundancy in Multimodal Large Language Models | Yingqi Fan et.al. | 2603.00510 | null |
| 2026-02-28 | COLE $^+$ : Towards Practical Column-based Learned Storage for Blockchain Systems | Ce Zhang et.al. | 2603.00509 | null |
| 2026-02-28 | Improved Adversarial Diffusion Compression for Real-World Video Super-Resolution | Bin Chen et.al. | 2603.00458 | null |
| 2026-02-28 | TAP-SLF: Parameter-Efficient Adaptation of Vision Foundation Models for Multi-Task Ultrasound Image Analysis | Hui Wan et.al. | 2603.00433 | null |
| 2026-02-28 | Efficient Decoder Scaling Strategy for Neural Routing Solvers | Qing Luo et.al. | 2603.00430 | null |
| 2026-02-28 | Weight Updates as Activation Shifts: A Principled Framework for Steering | Dyah Adila et.al. | 2603.00425 | null |
| 2026-02-27 | Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation | Zhengbo Wang et.al. | 2602.24283 | null |
| 2026-02-27 | Efficient Discovery of Approximate Causal Abstractions via Neural Mechanism Sparsification | Amir Asiaee et.al. | 2602.24266 | null |
| 2026-02-27 | Joint Geometric and Trajectory Consistency Learning for One-Step Real-World Super-Resolution | Chengyan Deng et.al. | 2602.24240 | null |
| 2026-02-27 | Task-Centric Acceleration of Small-Language Models | Dor Tsur et.al. | 2602.24174 | null |
| 2026-02-27 | Prune Wisely, Reconstruct Sharply: Compact 3D Gaussian Splatting via Adaptive Pruning and Difference-of-Gaussian Primitives | Haoran Wang et.al. | 2602.24136 | null |
| 2026-02-27 | Quant Experts: Token-aware Adaptive Error Reconstruction with Mixture of Experts for Large Vision-Language Models Quantization | Chenwei Jia et.al. | 2602.24059 | null |
| 2026-02-27 | GPU-Native Approximate Nearest Neighbor Search with IVF-RaBitQ: Fast Index Build and Search | Jifan Shi et.al. | 2602.23999 | null |
| 2026-02-27 | Towards Efficient and Generalizable Retrieval: Adaptive Semantic Quantization and Residual Knowledge Transfer | Huimu Wang et.al. | 2602.23978 | null |
| 2026-02-27 | Bandwidth-adaptive Cloud-Assisted 360-Degree 3D Perception for Autonomous Vehicles | Faisal Hawladera et.al. | 2602.23871 | null |
| 2026-02-27 | ULW-SleepNet: An Ultra-Lightweight Network for Multimodal Sleep Stage Scoring | Zhaowen Wang et.al. | 2602.23852 | null |
| 2026-02-27 | GRAIL: Post-hoc Compensation by Linear Reconstruction for Compressed Networks | Wenwu Tang et.al. | 2602.23795 | null |
| 2026-02-27 | UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking | Hao Wu et.al. | 2602.23734 | null |
| 2026-02-27 | HiDrop: Hierarchical Vision Token Reduction in MLLMs via Late Injection, Concave Pyramid Pruning, and Early Exit | Hao Wu et.al. | 2602.23699 | null |
| 2026-02-27 | ProtoDCS: Towards Robust and Efficient Open-Set Test-Time Adaptation for Vision-Language Models | Wei Luo et.al. | 2602.23653 | null |
| 2026-02-27 | From quantum time to manifestly covariant QFT: on the need for a quantum-action-based quantization | N. L. Diaz et.al. | 2602.23625 | null |
| 2026-02-27 | PDF: PUF-based DNN Fingerprinting for Knowledge Distillation Traceability | Ning Lyu et.al. | 2602.23587 | null |
| 2026-02-27 | Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem | Guillem Rodríguez-Corominas et.al. | 2602.23579 | null |
| 2026-02-27 | Hybrid Quantum Temporal Convolutional Networks | Junghoon Justin Park et.al. | 2602.23578 | null |
| 2026-02-26 | BiKA: Kolmogorov-Arnold-Network-inspired Ultra Lightweight Neural Network Hardware Accelerator | Yuhao Liu et.al. | 2602.23455 | null |
| 2026-02-26 | U-CAN: Utility-Aware Contrastive Attenuation for Efficient Unlearning in Generative Recommendation | Zezheng Wu et.al. | 2602.23400 | null |
| 2026-02-26 | A Dataset is Worth 1 MB | Elad Kimchi Shoshani et.al. | 2602.23358 | null |
| 2026-02-26 | FlashOptim: Optimizers for Memory Efficient Training | Jose Javier Gonzalez Ortiz et.al. | 2602.23349 | null |
| 2026-02-26 | Bitwise Systolic Array Architecture for Runtime-Reconfigurable Multi-precision Quantized Multiplication on Hardware Accelerators | Yuhao Liu et.al. | 2602.23334 | null |
| 2026-02-26 | Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction | Rafael R. Baptista et.al. | 2602.23312 | null |
| 2026-02-26 | Data-Efficient Generative Modeling of Non-Gaussian Global Climate Fields via Scalable Composite Transformations | Johannes Brachem et.al. | 2602.23311 | null |
| 2026-02-26 | Efficient evaluation of fundamental sensitivity limits and full counting statistics for continuously monitored Gaussian quantum systems | Francesco Albarelli et.al. | 2602.23304 | null |
| 2026-02-26 | Workload-Aware Incremental Reclustering in Cloud Data Warehouses | Yipeng Liu et.al. | 2602.23289 | null |
| 2026-02-26 | Real-Time Stream Compaction for Sparse Machine Learning on FPGAs | Marc Neu et.al. | 2602.23281 | null |
| 2026-02-26 | AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning | Yutong Wang et.al. | 2602.23258 | null |
| 2026-02-26 | A Scaling Law for Bandwidth Under Quantization | Maximilian Kalcher et.al. | 2602.23252 | null |
| 2026-02-26 | Spatio-Temporal Token Pruning for Efficient High-Resolution GUI Agents | Zhou Xu et.al. | 2602.23235 | null |
| 2026-02-27 | Motion-aware Event Suppression for Event Cameras | Roberto Pellerito et.al. | 2602.23204 | null |
| 2026-02-26 | InnerQ: Hardware-aware Tuning-free Quantization of KV Cache for Large Language Models | Sayed Mohammadreza Tayaranian Hosseini et.al. | 2602.23200 | null |
| 2026-02-26 | FairQuant: Fairness-Aware Mixed-Precision Quantization for Medical Image Classification | Thomas Woergaard et.al. | 2602.23192 | null |
| 2026-02-26 | Efficient Real-Time Adaptation of ROMs for Unsteady Flows Using Data Assimilation | Ismaël Zighed et.al. | 2602.23188 | null |
| 2026-02-26 | Efficient Encoder-Free Fourier-based 3D Large Multimodal Model | Guofeng Mei et.al. | 2602.23153 | null |
| 2026-02-26 | TriLite: Efficient Weakly Supervised Object Localization with Universal Visual Features and Tri-Region Disentanglement | Arian Sabaghi et.al. | 2602.23120 | null |
| 2026-02-26 | Learning Physical Operators using Neural Operators | Vignesh Gopakumar et.al. | 2602.23113 | null |
| 2026-02-26 | Align then Adapt: Rethinking Parameter-Efficient Transfer Learning in 4D Perception | Yiding Sun et.al. | 2602.23069 | null |
| 2026-02-26 | PackUV: Packed Gaussian UV Maps for 4D Volumetric Video | Aashish Rai et.al. | 2602.23040 | null |
| 2026-02-26 | Sequential Regression for Continuous Value Prediction using Residual Quantization | Runpeng Cui et.al. | 2602.23012 | null |
| 2026-02-26 | Holomorphic Quantization in Constant Curvature Backgrounds | Dmitri Bykov et.al. | 2602.22984 | null |
| 2026-02-26 | ToProVAR: Efficient Visual Autoregressive Modeling via Tri-Dimensional Entropy-Aware Semantic Analysis and Sparsity Optimization | Jiayu Chen et.al. | 2602.22948 | null |
| 2026-02-26 | pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation | Shentong Mo et.al. | 2602.22938 | null |
| 2026-02-26 | NoRA: Breaking the Linear Ceiling of Low-Rank Adaptation via Manifold Expansion | Hung-Hsuan Chen et.al. | 2602.22911 | null |
| 2026-02-27 | DySL-VLA: Efficient Vision-Language-Action Model Inference via Dynamic-Static Layer-Skipping for Robot Manipulation | Zebin Yang et.al. | 2602.22896 | null |
| 2026-02-26 | Beyond Detection: Multi-Scale Hidden-Code for Natural Image Deepfake Recovery and Factual Retrieval | Yuan-Chih Chen et.al. | 2602.22759 | null |
| 2026-02-27 | GFRRN: Explore the Gaps in Single Image Reflection Removal | Yu Chen et.al. | 2602.22695 | null |
| 2026-02-26 | LoR-LUT: Learning Compact 3D Lookup Tables via Low-Rank Residuals | Ziqi Zhao et.al. | 2602.22607 | null |
| 2026-02-26 | pQuant: Towards Effective Low-Bit Language Models via Decoupled Linear Quantization-Aware Training | Wenzheng Zhang et.al. | 2602.22592 | null |
| 2026-02-26 | Quantum corrected thermodynamics and horizon quantization of the Reissner–Nordström black hole | S. Jalalzadeh et.al. | 2602.22559 | null |
| 2026-02-26 | Autoregressive Visual Decoding from EEG Signals | Sicheng Dai et.al. | 2602.22555 | null |
| 2026-02-26 | Agentic AI for Intent-driven Optimization in Cell-free O-RAN | Mohammad Hossein Shokouhi et.al. | 2602.22539 | null |
| 2026-02-26 | Reinforcement-aware Knowledge Distillation for LLM Reasoning | Zhaoyang Zhang et.al. | 2602.22495 | null |
| 2026-02-25 | Efficient Continual Learning in Language Models via Thalamically Routed Cortical Columns | Afshin Khadangi et.al. | 2602.22479 | null |
| 2026-02-25 | MammoWise: Multi-Model Local RAG Pipeline for Mammography Report Generation | Raiyan Jahangir et.al. | 2602.22462 | null |
| 2026-02-25 | How Do Latent Reasoning Methods Perform Under Weak and Strong Supervision? | Yingqian Cui et.al. | 2602.22441 | null |
| 2026-02-25 | veScale-FSDP: Flexible and High-Performance FSDP at Scale | Zezhou Wang et.al. | 2602.22437 | null |
| 2026-02-25 | Decoder-based Sense Knowledge Distillation | Qitong Wang et.al. | 2602.22351 | null |
| 2026-02-25 | Structure and Redundancy in Large Language Models: A Spectral Study via Random Matrix Theory | Davide Ettori et.al. | 2602.22345 | null |
| 2026-02-25 | Queue occupancy and server size distribution of a queue length dependent vacation queue with an optional service | Ashish Verma et.al. | 2602.22295 | null |
| 2026-02-25 | SigmaQuant: Hardware-Aware Heterogeneous Quantization Method for Edge DNN Inference | Qunyou Liu et.al. | 2602.22136 | null |
| 2026-02-25 | SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents | Patrick Tser Jern Kon et.al. | 2602.22124 | null |
| 2026-02-25 | PatchDenoiser: Parameter-efficient multi-scale patch learning and fusion denoiser for medical images | Jitindra Fartiyal et.al. | 2602.21987 | null |
| 2026-02-25 | Compact Circulant Layers with Spectral Priors | Joseph Margaryan et.al. | 2602.21965 | null |
| 2026-02-25 | D-COT: Disciplined Chain-of-Thought Learning for Efficient Reasoning in Small Language Models | Shunsuke Ubukata et.al. | 2602.21786 | null |
| 2026-02-25 | XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression | Zunhai Su et.al. | 2602.21780 | null |
| 2026-02-25 | Learning from Yesterday’s Error: An Efficient Online Learning Method for Traffic Demand Prediction | Xiannan Huang et.al. | 2602.21757 | null |
| 2026-02-25 | DWA-KD: Dual-Space Weighting and Time-Warped Alignment for Cross-Tokenizer Knowledge Distillation | Duc Trung Vu et.al. | 2602.21669 | null |
| 2026-02-25 | HybridINR-PCGC: Hybrid Lossless Point Cloud Geometry Compression Bridging Pretrained Model and Implicit Neural Representation | Wenjie Huang et.al. | 2602.21662 | null |
| 2026-02-25 | Sparsity Induction for Accurate Post-Training Pruning of Large Language Models | Minhao Jiang et.al. | 2602.21652 | null |
| 2026-02-25 | AQR-HNSW: Accelerating Approximate Nearest Neighbor Search via Density-aware Quantization and Multi-stage Re-ranking | Ganap Ashit Tewary et.al. | 2602.21600 | null |
| 2026-02-25 | CADC: Content Adaptive Diffusion-Based Generative Image Compression | Xihua Sheng et.al. | 2602.21591 | null |
| 2026-02-24 | MINAR: Mechanistic Interpretability for Neural Algorithmic Reasoning | Jesse He et.al. | 2602.21442 | null |
| 2026-02-24 | Efficient Uncoupled Learning Dynamics with $\tilde{O}!\left(T^{-1/4}\right)$ Last-Iterate Convergence in Bilinear Saddle-Point Problems over Convex Sets under Bandit Feedback | Arnab Maiti et.al. | 2602.21436 | null |
| 2026-02-24 | MMLoP: Multi-Modal Low-Rank Prompting for Efficient Vision-Language Adaptation | Sajjad Ghiasvand et.al. | 2602.21397 | null |
| 2026-02-24 | Momentum Memory for Knowledge Distillation in Computational Pathology | Yongxin Guo et.al. | 2602.21395 | null |
| 2026-02-24 | Small Language Models for Privacy-Preserving Clinical Information Extraction in Low-Resource Languages | Mohammadreza Ghaffarzadeh-Esfahani et.al. | 2602.21374 | null |
| 2026-02-24 | OmniOCR: Generalist OCR for Ethnic Minority Languages | Bonan Liu et.al. | 2602.21042 | null |
| 2026-02-24 | HiSAC: Hierarchical Sparse Activation Compression for Ultra-long Sequence Modeling in Recommenders | Kun Yuan et.al. | 2602.21009 | null |
| 2026-02-25 | Constraints on dynamically-formed massive black holes in Little Red Dots from X-ray non-detections | M. Liempi et.al. | 2602.21002 | null |
| 2026-02-24 | ParkDiffusion++: Ego Intention Conditioned Joint Multi-Agent Trajectory Prediction for Automated Parking using Diffusion Models | Jiarong Wei et.al. | 2602.20923 | null |
| 2026-02-24 | Don’t Ignore the Tail: Decoupling top-K Probabilities for Efficient Language Model Distillation | Sayantan Dasgupta et.al. | 2602.20816 | null |
| 2026-02-24 | CHESS: Context-aware Hierarchical Efficient Semantic Selection for Long-Context LLM Inference | Chao Fei et.al. | 2602.20732 | null |
| 2026-02-24 | ID-LoRA: Efficient Low-Rank Adaptation Inspired by Matrix Interpolative Decomposition | Xindian Ma et.al. | 2602.20727 | null |
| 2026-02-24 | PRECTR-V2:Unified Relevance-CTR Framework with Cross-User Preference Mining, Exposure Bias Correction, and LLM-Distilled Encoder Optimization | Shuzhi Cao et.al. | 2602.20676 | null |
| 2026-02-24 | CAMEL: Confidence-Gated Reflection for Reward Modeling | Zirui Zhu et.al. | 2602.20670 | null |
| 2026-02-24 | TOM: A Ternary Read-only Memory Accelerator for LLM-powered Edge Intelligence | Hongyi Guan et.al. | 2602.20662 | null |
| 2026-02-24 | Dataset Color Quantization: A Training-Oriented Framework for Dataset-Level Compression | Chenyue Yu et.al. | 2602.20650 | null |
| 2026-02-24 | OptiLeak: Efficient Prompt Reconstruction via Reinforcement Learning in Multi-tenant LLM Services | Longxiang Wang et.al. | 2602.20595 | null |
| 2026-02-24 | BFA++: Hierarchical Best-Feature-Aware Token Prune for Multi-View Vision Language Action Model | Haosheng Li et.al. | 2602.20566 | null |
| 2026-02-24 | PFGNet: A Fully Convolutional Frequency-Guided Peripheral Gating Network for Efficient Spatiotemporal Predictive Learning | Xinyong Cai et.al. | 2602.20537 | null |
| 2026-02-24 | Elimination-compensation pruning for fully-connected neural networks | Enrico Ballini et.al. | 2602.20467 | null |
| 2026-02-23 | CLIPoint3D: Language-Grounded Few-Shot Unsupervised 3D Point Cloud Domain Adaptation | Mainak Singha et.al. | 2602.20409 | null |
| 2026-02-23 | Highly Efficient Selection of High-Redshift Emission-Line Galaxies for future DESI-like surveys with Deep Multi-band Imaging | Yoquelbin Salcedo Hernandez et.al. | 2602.20405 | null |
| 2026-02-25 | QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models | Jingxuan Zhang et.al. | 2602.20309 | null |
| 2026-02-23 | Mitigating Artifacts in Pre-quantization Based Scientific Data Compressors with Quantization-aware Interpolation | Pu Jiao et.al. | 2602.20097 | null |
| 2026-02-23 | CQ-CiM: Hardware-Aware Embedding Shaping for Robust CiM-Based Retrieval | Xinzhao Li et.al. | 2602.20083 | null |
| 2026-02-23 | Token-UNet: A New Case for Transformers Integration in Efficient and Interpretable 3D UNets for Brain Imaging Segmentation | Louis Fabrice Tshimanga et.al. | 2602.20008 | null |
| 2026-02-23 | A Computationally Efficient Multidimensional Vision Transformer | Alaa El Ichi et.al. | 2602.19982 | null |
| 2026-02-23 | Unlearning Noise in PINNs: A Selective Pruning Framework for PDE Inverse Problems | Yongsheng Chen et.al. | 2602.19967 | null |
| 2026-02-23 | Rethinking LoRA for Privacy-Preserving Federated Learning in Large Models | Jin Liu et.al. | 2602.19926 | null |
| 2026-02-23 | DerMAE: Improving skin lesion classification through conditioned latent diffusion and MAE distillation | Francisco Filho et.al. | 2602.19848 | null |
| 2026-02-23 | Path-conditioned training: a principled way to rescale ReLU neural networks | Arthur Lebeurrier et.al. | 2602.19799 | null |
| 2026-02-23 | Transcendental momentum quantization in semiconducting Rashba nanowires and zero energy states in their normal and superconducting phase | Nico Leumer et.al. | 2602.19796 | null |
| 2026-02-23 | Training Deep Stereo Matching Networks on Tree Branch Imagery: A Benchmark Study for Real-Time UAV Forestry Applications | Yida Lin et.al. | 2602.19763 | null |
| 2026-02-23 | Multimodal Dataset Distillation Made Simple by Prototype-Guided Data Synthesis | Junhyeok Choi et.al. | 2602.19756 | null |
| 2026-02-23 | RAP: Fast Feedforward Rendering-Free Attribute-Guided Primitive Importance Score Prediction for Efficient 3D Gaussian Splatting Processing | Kaifa Yang et.al. | 2602.19753 | null |
| 2026-02-24 | NEXUS: A compact neural architecture for high-resolution spatiotemporal air quality forecasting in Delhi National Capital Region | Rampunit Kumar et.al. | 2602.19654 | null |
| 2026-02-24 | Nacrith: Neural Lossless Compression via Ensemble Context Modeling and High-Precision CDF Coding | Roberto Tacconelli et.al. | 2602.19626 | null |
| 2026-02-23 | VecFormer: Towards Efficient and Generalizable Graph Transformer with Graph Token Attention | Jingbo Zhou et.al. | 2602.19622 | null |
| 2026-02-23 | Sculpting the Vector Space: Towards Efficient Multi-Vector Visual Document Retrieval via Prune-then-Merge Framework | Yibo Yan et.al. | 2602.19549 | null |
| 2026-02-23 | A Text-Guided Vision Model for Enhanced Recognition of Small Instances | Hyun-Ki Jung et.al. | 2602.19503 | null |
| 2026-02-23 | Decoupling Vision and Language: Codebook Anchored Visual Adaptation | Jason Wu et.al. | 2602.19449 | null |
| 2026-02-23 | FinSight-Net:A Physics-Aware Decoupled Network with Frequency-Domain Compensation for Underwater Fish Detection in Smart Aquaculture | Jinsong Yang et.al. | 2602.19437 | null |
| 2026-02-22 | Adaptive Data Augmentation with Multi-armed Bandit: Sample-Efficient Embedding Calibration for Implicit Pattern Recognition | Minxue Tang et.al. | 2602.19385 | null |
| 2026-02-22 | Prompt Tuning for CLIP on the Pretrained Manifold | Xi Yang et.al. | 2602.19198 | null |
| 2026-02-22 | PositionOCR: Augmenting Positional Awareness in Multi-Modal Models via Hybrid Specialist Integration | Chen Duan et.al. | 2602.19188 | null |
| 2026-02-22 | S $^3$ GND: An Effective Learning-Based Approach for Subgraph Similarity Search Under Generalized Neighbor Difference Semantics (Technical Report) | Qi Wen et.al. | 2602.19167 | null |
| 2026-02-22 | Flash-VAED: Plug-and-Play VAE Decoders for Efficient Video Generation | Lunjie Zhu et.al. | 2602.19161 | null |
| 2026-02-22 | Mapping Networks | Lord Sen et.al. | 2602.19134 | null |
| 2026-02-22 | Learning from Complexity: Exploring Dynamic Sample Pruning of Spatio-Temporal Training | Wei Chen et.al. | 2602.19113 | null |
| 2026-02-22 | Astra: Activation-Space Tail-Eigenvector Low-Rank Adaptation of Large Language Models | Kainan Liu et.al. | 2602.19111 | null |
| 2026-02-22 | Do LLMs and VLMs Share Neurons for Inference? Evidence and Mechanisms of Cross-Modal Transfer | Chenhang Cui et.al. | 2602.19058 | null |
| 2026-02-22 | SKYLIGHT: A Scalable Hundred-Channel 3D Photonic In-Memory Tensor Core Architecture for Real-time AI Inference | Meng Zhang et.al. | 2602.19031 | null |
| 2026-02-22 | GUIDE-US: Grade-Informed Unpaired Distillation of Encoder Knowledge from Histopathology to Micro-UltraSound | Emma Willis et.al. | 2602.19005 | null |
| 2026-02-21 | PCA-VAE: Differentiable Subspace Quantization without Codebook Collapse | Hao Lu et.al. | 2602.18904 | null |
| 2026-02-21 | Beyond Stationarity: Rethinking Codebook Collapse in Vector Quantization | Hao Lu et.al. | 2602.18896 | null |
| 2026-02-21 | Structure-Level Disentangled Diffusion for Few-Shot Chinese Font Generation | Jie Li et.al. | 2602.18874 | null |
| 2026-02-21 | Joint Post-Training Quantization of Vision Transformers with Learned Prompt-Guided Data Generation | Shile Li et.al. | 2602.18861 | null |
| 2026-02-21 | Hyperbolic Busemann Neural Networks | Ziheng Chen et.al. | 2602.18858 | null |
| 2026-02-21 | DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference | Aditya Kumar Singh et.al. | 2602.18846 | null |
| 2026-02-21 | UFO: Unlocking Ultra-Efficient Quantized Private Inference with Protocol and Algorithm Co-Optimization | Wenxuan Zeng et.al. | 2602.18758 | null |
| 2026-02-21 | Federated Reasoning Distillation Framework with Model Learnability-Aware Data Allocation | Wei Guo et.al. | 2602.18749 | null |
| 2026-02-21 | Deep LoRA-Unfolding Networks for Image Restoration | Xiangming Wang et.al. | 2602.18697 | null |
| 2026-02-21 | In-Context Planning with Latent Temporal Abstractions | Baiting Luo et.al. | 2602.18694 | null |
| 2026-02-20 | Communication-Efficient Personalized Adaptation via Federated-Local Model Merging | Yinan Zou et.al. | 2602.18658 | null |
| 2026-02-20 | Ensemble Prediction of Task Affinity for Efficient Multi-Task Learning | Afiya Ayman et.al. | 2602.18591 | null |
| 2026-02-20 | GIST: Targeted Data Selection for Instruction Tuning via Coupled Optimization Geometry | Guanghui Min et.al. | 2602.18584 | null |
| 2026-02-20 | Luna-2: Scalable Single-Token Evaluation with Small Language Models | Vatsal Goel et.al. | 2602.18583 | null |
| 2026-02-20 | SPQ: An Ensemble Technique for Large Language Model Compression | Jiamin Yao et.al. | 2602.18420 | null |
| 2026-02-20 | MD-AirComp+: Adaptive Quantization for Blind Massive Digital Over-the-Air Computation | Li Qiao et.al. | 2602.18332 | null |
| 2026-02-20 | Neural-HSS: Hierarchical Semi-Separable Neural PDE Solver | Pietro Sittoni et.al. | 2602.18248 | null |
| 2026-02-20 | Parameter-Efficient Domain Adaptation of Physics-Informed Self-Attention based GNNs for AC Power Flow Prediction | Redwanul Karim et.al. | 2602.18227 | null |
| 2026-02-20 | Cut Less, Fold More: Model Compression through the Lens of Projection Geometry | Olga Saukh et.al. | 2602.18116 | null |
| 2026-02-20 | MUOT_3M: A 3 Million Frame Multimodal Underwater Benchmark and the MUTrack Tracking Method | Ahsan Baidar Bakht et.al. | 2602.18006 | null |
| 2026-02-20 | Higher order quantization conditions for two-body scattering with spin | Lucas Chandler et.al. | 2602.17924 | null |
| 2026-02-19 | Calibrated Adaptation: Bayesian Stiefel Manifold Priors for Reliable Parameter-Efficient Fine-Tuning | Ibne Farabi Shihab et.al. | 2602.17809 | null |
| 2026-02-19 | Hardware-Aware Design of a GNN-Based Hit Filtering Algorithm for the Belle II Level-1 Trigger | Greta Heine et.al. | 2602.17761 | null |
| 2026-02-19 | Sink-Aware Pruning for Diffusion Language Models | Aidar Myrzakhan et.al. | 2602.17664 | null |
| 2026-02-19 | Reverso: Efficient Time Series Foundation Models for Zero-shot Forecasting | Xinghong Fu et.al. | 2602.17634 | null |
| 2026-02-19 | Revisiting Weight Regularization for Low-Rank Continual Learning | Yaoyue Zheng et.al. | 2602.17559 | null |
| 2026-02-19 | LORA-CRAFT: Cross-layer Rank Adaptation via Frozen Tucker Decomposition of Pre-trained Attention Weights | Kasun Dewage et.al. | 2602.17510 | null |
| 2026-02-19 | Analytical Derivation of Quantization Error in Threshold Level Quantizers Using Bipolar PFM | Ricardo Carrero et.al. | 2602.17471 | null |
| 2026-02-19 | SpectralGCD: Spectral Concept Selection and Cross-modal Representation Learning for Generalized Category Discovery | Lorenzo Caselli et.al. | 2602.17395 | null |
| 2026-02-20 | Contact-Anchored Proprioceptive Odometry for Quadruped Robots | Minxing Sun et.al. | 2602.17393 | null |
| 2026-02-19 | Efficient privacy loss accounting for subsampling and random allocation | Vitaly Feldman et.al. | 2602.17284 | null |
| 2026-02-19 | EntropyPrune: Matrix Entropy Guided Visual Token Pruning for Multimodal Large Language Models | Yahong Wang et.al. | 2602.17196 | null |
| 2026-02-19 | Bonsai: A Framework for Convolutional Neural Network Acceleration Using Criterion-Based Pruning | Joseph Bingham et.al. | 2602.17145 | null |
| 2026-02-19 | Efficient Parallel Algorithm for Decomposing Hard CircuitSAT Instances | Victor Kondratiev et.al. | 2602.17130 | null |
| 2026-02-19 | FLoRG: Federated Fine-tuning with Low-rank Gram Matrices and Procrustes Alignment | Chuiyang Meng et.al. | 2602.17095 | null |
| 2026-02-19 | Sign Lock-In: Randomly Initialized Weight Signs Persist and Bottleneck Sub-Bit Model Compression | Akira Sakai et.al. | 2602.17063 | null |
| 2026-02-19 | Amber-Image: Efficient Compression of Large-Scale Diffusion Transformers | Chaojie Yang et.al. | 2602.17047 | null |
| 2026-02-18 | BrainRVQ: A High-Fidelity EEG Foundation Model via Dual-Domain Residual Quantization and Hierarchical Autoregression | Mingzhe Cui et.al. | 2602.16951 | null |
| 2026-02-18 | Numerical study of electron acceleration by microwave-driven plasma wakefields in rectangular waveguides | Jesús E. López et.al. | 2602.16896 | null |
| 2026-02-18 | ML-driven detection and reduction of ballast information in multi-modal datasets | Yaroslav Solovko et.al. | 2602.16876 | null |
| 2026-02-18 | Training Large Reasoning Models Efficiently via Progressive Thought Encoding | Zeliang Zhang et.al. | 2602.16839 | null |
| 2026-02-18 | NeST: Neuron Selective Tuning for LLM Safety | Sasha Behrouzi et.al. | 2602.16835 | null |
| 2026-02-18 | U-FedTomAtt: Ultra-lightweight Federated Learning with Attention for Tomato Disease Recognition | Romiyal George et.al. | 2602.16749 | null |
| 2026-02-18 | One Hand to Rule Them All: Canonical Representations for Unified Dexterous Manipulation | Zhenyu Wei et.al. | 2602.16712 | null |
| 2026-02-20 | Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environments | Yangjie Xu et.al. | 2602.16653 | null |
| 2026-02-18 | Quecto-V1: Empirical Analysis of 8-bit Quantized Small Language Models for On-Device Legal Retrieval | Subrit Dikshit et.al. | 2602.16640 | null |
| 2026-02-18 | A Scalable Approach to Solving Simulation-Based Network Security Games | Michael Lanier et.al. | 2602.16564 | null |
| 2026-02-18 | Subtractive Modulative Network with Learnable Periodic Activations | Tiou Wang et.al. | 2602.16337 | null |
| 2026-02-18 | RefineFormer3D: Efficient 3D Medical Image Segmentation via Adaptive Multi-Scale Transformer with Cross Attention Fusion | Kavyansh Tyagi et.al. | 2602.16320 | null |
| 2026-02-18 | AFFMAE: Scalable and Efficient Vision Pretraining for Desktop Graphics Cards | David Smerkous et.al. | 2602.16249 | null |
| 2026-02-18 | Uncertainty-Guided Inference-Time Depth Adaptation for Transformer-Based Visual Tracking | Patrick Poggi et.al. | 2602.16160 | null |
| 2026-02-18 | Rethinking ANN-based Retrieval: Multifaceted Learnable Index for Large-scale Recommendation System | Jiang Zhang et.al. | 2602.16124 | null |
| 2026-02-18 | Collaborative Zone-Adaptive Zero-Day Intrusion Detection for IoBT | Amirmohammad Pasdar et.al. | 2602.16098 | null |
| 2026-02-17 | LGQ: Learning Discretization Geometry for Scalable and Stable Image Tokenization | Idil Bilge Altun et.al. | 2602.16086 | null |
| 2026-02-17 | ROIX-Comp: Optimizing X-ray Computed Tomography Imaging Strategy for Data Reduction and Reconstruction | Amarjit Singh et.al. | 2602.15917 | null |
| 2026-02-17 | QwaveMPS: An efficient open-source Python package for simulating non-Markovian waveguide-QED using matrix product states | Sofia Arranz Regidor et.al. | 2602.15826 | null |
| 2026-02-17 | Quantitative local recovery of Kerr-de Sitter parameters from high-frequency equatorial quasinormal modes | Ruiliang Li et.al. | 2602.15764 | null |
| 2026-02-17 | Enabling Low-Latency Machine learning on Radiation-Hard FPGAs with hls4ml | Katya Govorkova et.al. | 2602.15751 | null |
| 2026-02-17 | Learning to Retrieve Navigable Candidates for Efficient Vision-and-Language Navigation | Shutian Gu et.al. | 2602.15724 | null |
| 2026-02-18 | ToaSt: Token Channel Selection and Structured Pruning for Efficient ViT | Hyunchan Moon et.al. | 2602.15720 | null |
| 2026-02-17 | 1-Bit Wonder: Improving QAT Performance in the Low-Bit Regime through K-Means Quantization | Sohir Maskey et.al. | 2602.15563 | null |
| 2026-02-17 | Efficient Road Renovation Scheduling under Uncertainty using Lower Bound Pruning | Robbert Bosch et.al. | 2602.15554 | null |
| 2026-02-17 | jina-embeddings-v5-text: Task-Targeted Embedding Distillation | Mohammad Kalim Akram et.al. | 2602.15547 | null |
| 2026-02-17 | LEADER: Lightweight End-to-End Attention-Gated Dual Autoencoder for Robust Minutiae Extraction | Raffaele Cappelli et.al. | 2602.15493 | null |
| 2026-02-17 | The Vision Wormhole: Latent-Space Communication in Heterogeneous Multi-Agent Systems | Xiaoze Liu et.al. | 2602.15382 | null |
| 2026-02-17 | Orchestration-Free Customer Service Automation: A Privacy-Preserving and Flowchart-Guided Framework | Mengze Hong et.al. | 2602.15377 | null |
| 2026-02-17 | Sparse Additive Model Pruning for Order-Based Causal Structure Learning | Kentaro Kanamori et.al. | 2602.15306 | null |
| 2026-02-16 | Pruning distance of upset-decomposable persistence modules | Roy Nicolas Nehme et.al. | 2602.15243 | null |
| 2026-02-16 | Phase Transitions in Neural Networks Pruning | Diego Pesce et.al. | 2602.15224 | null |
| 2026-02-16 | COMPOT: Calibration-Optimized Matrix Procrustes Orthogonalization for Transformers Compression | Denis Makhov et.al. | 2602.15200 | null |
| 2026-02-16 | ScrapeGraphAI-100k: A Large-Scale Dataset for LLM-Based Web Information Extraction | William Brach et.al. | 2602.15189 | null |
| 2026-02-16 | Quantization as a Categorical Equivalence for Hilbert Bimodules and Lagrangian Relations | Benjamin H. Feintzeig et.al. | 2602.15188 | null |
| 2026-02-16 | Learning Data-Efficient and Generalizable Neural Operators via Fundamental Physics Knowledge | Siying Ma et.al. | 2602.15184 | null |
| 2026-02-16 | Synthesizing Trajectory Queries from Examples | Stephen Mell et.al. | 2602.15164 | null |
| 2026-02-16 | Protecting Language Models Against Unauthorized Distillation through Trace Rewriting | Xinhang Ma et.al. | 2602.15143 | null |
| 2026-02-16 | CGRA-DeBERTa Concept Guided Residual Augmentation Transformer for Theologically Islamic Understanding | Tahir Hussain et.al. | 2602.15139 | null |
| 2026-02-16 | Text Style Transfer with Parameter-efficient LLM Finetuning and Round-trip Translation | Ruoxi Liu et.al. | 2602.15013 | null |
| 2026-02-16 | Scaling QAOA: transferring optimal adiabatic schedules from small-scale to large-scale variational circuits | Ugo Nzongani et.al. | 2602.14986 | null |
| 2026-02-16 | DRAMA: Domain Retrieval using Adaptive Module Allocation | Pranav Kasela et.al. | 2602.14960 | null |
| 2026-02-16 | Algorithmic Simplification of Neural Networks with Mosaic-of-Motifs | Pedram Bakhtiarifard et.al. | 2602.14896 | link |
| 2026-02-16 | Depth Completion as Parameter-Efficient Test-Time Adaptation | Bingxin Ke et.al. | 2602.14751 | null |
| 2026-02-16 | D2-LoRA: A Synergistic Approach to Differential and Directional Low-Rank Adaptation | Nozomu Fujisawa et.al. | 2602.14728 | null |
| 2026-02-16 | GradMAP: Faster Layer Pruning with Gradient Metric and Projection Compensation | Hao Liu et.al. | 2602.14649 | null |
| 2026-02-16 | RNM-TD3: N:M Semi-structured Sparse Reinforcement Learning From Scratch | Isam Vrce et.al. | 2602.14578 | null |
| 2026-02-16 | Efficient Text-Guided Convolutional Adapter for the Diffusion Model | Aryan Das et.al. | 2602.14514 | null |
| 2026-02-16 | Parameter-Efficient Fine-Tuning of LLMs with Mixture of Space Experts | Buze Zhang et.al. | 2602.14490 | null |
| 2026-02-16 | S2D: Selective Spectral Decay for Quantization-Friendly Conditioning of Neural Activations | Arnav Chavan et.al. | 2602.14432 | null |
| 2026-02-16 | LLM-Guided Knowledge Distillation for Temporal Knowledge Graph Reasoning | Wang Xing et.al. | 2602.14428 | null |
| 2026-02-15 | Train Less, Learn More: Adaptive Efficient Rollout Optimization for Group-Based Reinforcement Learning | Zhi Zhang et.al. | 2602.14338 | null |
| 2026-02-15 | Floe: Federated Specialization for Real-Time LLM-SLM Inference | Chunlin Tian et.al. | 2602.14302 | null |
| 2026-02-15 | DeepFusion: Accelerating MoE Training via Federated Knowledge Distillation from Heterogeneous Edge Devices | Songyuan Li et.al. | 2602.14301 | null |
| 2026-02-15 | Energy-Efficient Over-the-Air Federated Learning via Pinching Antenna Systems | Saba Asaad et.al. | 2602.14250 | null |
| 2026-02-15 | Towards Spatial Transcriptomics-driven Pathology Foundation Models | Konstantin Hemker et.al. | 2602.14177 | null |
| 2026-02-15 | ROAST: Rollout-based On-distribution Activation Steering Technique | Xuanbo Su et.al. | 2602.14143 | null |
| 2026-02-15 | TabTracer: Monte Carlo Tree Search for Complex Table Reasoning with Large Language Models | Zhizhao Luo et.al. | 2602.14089 | null |
| 2026-02-15 | Policy Gradient with Adaptive Entropy Annealing for Continual Fine-Tuning | Yaqian Zhang et.al. | 2602.14078 | null |
| 2026-02-15 | LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts | Yang Liu et.al. | 2602.14060 | null |
| 2026-02-15 | Explainability-Inspired Layer-Wise Pruning of Deep Neural Networks for Efficient Object Detection | Abhinav Shukla et.al. | 2602.14040 | null |
| 2026-02-15 | Extended Universal Joint Source-Channel Coding for Digital Semantic Communications: Improving Channel-Adaptability | Eunsoo Kim et.al. | 2602.14018 | null |
| 2026-02-15 | A Deployment-Friendly Foundational Framework for Efficient Computational Pathology | Yu Cai et.al. | 2602.14010 | null |
| 2026-02-15 | Elastic Diffusion Transformer | Jiangshan Wang et.al. | 2602.13993 | null |
| 2026-02-15 | Efficient Off-Grid Near-Field Cascade Channel Estimation for XL-IRS Systems via Tucker Decomposition | Wenzhou Cao et.al. | 2602.13988 | null |
| 2026-02-15 | QuRL: Efficient Reinforcement Learning with Quantized Rollout | Yuhang Li et.al. | 2602.13953 | null |
| 2026-02-14 | Evaluating Prompt Engineering Techniques for RAG in Small Language Models: A Multi-Hop QA Approach | Amir Hossein Mohammadi et.al. | 2602.13890 | null |
| 2026-02-14 | Parameter-Efficient Fine-Tuning of DINOv2 for Large-Scale Font Classification | Daniel Chen et.al. | 2602.13889 | null |
| 2026-02-14 | Bridging the Multilingual Safety Divide: Efficient, Culturally-Aware Alignment for Global South Languages | Somnath Banerjee et.al. | 2602.13867 | null |
| 2026-02-14 | High-Fidelity Causal Video Diffusion Models for Real-Time Ultra-Low-Bitrate Semantic Communication | Cem Eteke et.al. | 2602.13837 | null |
| 2026-02-14 | NeuroMambaLLM: Dynamic Graph Learning of fMRI Functional Connectivity in Autistic Brains Using Mamba and Language Model Reasoning | Yasaman Torabi et.al. | 2602.13770 | null |
| 2026-02-14 | MOTIF: Learning Action Motifs for Few-shot Cross-Embodiment Transfer | Heng Zhi et.al. | 2602.13764 | null |
| 2026-02-14 | HBVLA: Pushing 1-Bit Post-Training Quantization for Vision-Language-Action Models | Xin Yan et.al. | 2602.13710 | null |
| 2026-02-14 | A WDLoRA-Based Multimodal Generative Framework for Clinically Guided Corneal Confocal Microscopy Image Synthesis in Diabetic Neuropathy | Xin Zhang et.al. | 2602.13693 | null |
| 2026-02-14 | HyFunc: Accelerating LLM-based Function Calls for Agentic AI through Hybrid-Model Cascade and Dynamic Templating | Weibin Liao et.al. | 2602.13665 | null |
| 2026-02-14 | Layer-Guided UAV Tracking: Enhancing Efficiency and Occlusion Robustness | Yang Zhou et.al. | 2602.13636 | null |
| 2026-02-14 | GEMs: Breaking the Long-Sequence Barrier in Generative Recommendation with a Multi-Stream Decoder | Yu Zhou et.al. | 2602.13631 | null |
| 2026-02-14 | Compact LLM Deployment and World Model Assisted Offloading in Mobile Edge Computing | Ruichen Zhang et.al. | 2602.13628 | null |
| 2026-02-14 | Parametric-Sensitivity Aware Retransmission for Efficient AI Downloading | You Zhou et.al. | 2602.13607 | null |
| 2026-02-14 | The Quantization Trap: Breaking Linear Scaling Laws in Multi-Hop Reasoning | Henry Han et.al. | 2602.13595 | null |
| 2026-02-14 | Unleash the Potential of Long Semantic IDs for Generative Recommendation | Ming Xia et.al. | 2602.13573 | null |
| 2026-02-14 | DistillLens: Symmetric Knowledge Distillation Through Logit Lens | Manish Dhakal et.al. | 2602.13567 | null |
| 2026-02-13 | Quantization-Robust LLM Unlearning via Low-Rank Adaptation | João Vitor Boer Abitante et.al. | 2602.13151 | null |
| 2026-02-13 | FlashSchNet: Fast and Accurate Coarse-Grained Neural Network Molecular Dynamics | Pingzhi Li et.al. | 2602.13140 | null |
| 2026-02-13 | EXCODER: EXplainable Classification Of DiscretE time series Representations | Yannik Hahn et.al. | 2602.13087 | null |
| 2026-02-13 | LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning | Juneyoung Park et.al. | 2602.13073 | null |
| 2026-02-13 | Quantization-Aware Collaborative Inference for Large Embodied AI Models | Zhonghao Lyu et.al. | 2602.13052 | null |
| 2026-02-13 | Resource-Efficient Gesture Recognition through Convexified Attention | Daniel Schwartz et.al. | 2602.13030 | null |
| 2026-02-13 | A two-step approach for speech enhancement in low-SNR scenarios using cyclostationary beamforming and DNNs | Giovanni Bologni et.al. | 2602.12986 | null |
| 2026-02-13 | Multi-Dimensional Visual Data Recovery: Scale-Aware Tensor Modeling and Accelerated Randomized Computation | Wenjin Qin et.al. | 2602.12982 | null |
| 2026-02-13 | Limits of Thermal Conductance Quantization in Chiral Topological Josephson Junctions | Daniel Gresta et.al. | 2602.12947 | null |
| 2026-02-13 | Unleashing MLLMs on the Edge: A Unified Framework for Cross-Modal ReID via Adaptive SVD Distillation | Hongbo Jiang et.al. | 2602.12936 | null |
| 2026-02-13 | WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning | Junjie Wang et.al. | 2602.12852 | null |
| 2026-02-13 | Adaptive Structured Pruning of Convolutional Neural Networks for Time Series Classification | Javidan Abdullayev et.al. | 2602.12744 | null |
| 2026-02-13 | Trust the uncertain teacher: distilling dark knowledge via calibrated uncertainty | Jeonghyun Kim et.al. | 2602.12687 | null |
| 2026-02-13 | $\mathcal{X}$ -KD: General Experiential Knowledge Distillation for Large Language Models | Yuang Cai et.al. | 2602.12674 | null |
| 2026-02-13 | PMG: Parameterized Motion Generator for Human-like Locomotion Control | Chenxi Han et.al. | 2602.12656 | null |
| 2026-02-13 | Vision Token Reduction via Attention-Driven Self-Compression for Efficient Multimodal Large Language Models | Omer Faruk Deniz et.al. | 2602.12618 | null |
| 2026-02-13 | QuEPT: Quantized Elastic Precision Transformers with One-Shot Calibration for Multi-Bit Switching | Ke Xu et.al. | 2602.12609 | null |
| 2026-02-13 | Monte Carlo Tree Search with Reasoning Path Refinement for Small Language Models in Conversational Text-to-NoSQL | Xubang Xiong et.al. | 2602.12574 | null |
| 2026-02-13 | Constraint-Rectified Training for Efficient Chain-of-Thought | Qinhang Wu et.al. | 2602.12526 | null |
| 2026-02-12 | Human-Like Coarse Object Representations in Vision Models | Andrey Gizdov et.al. | 2602.12486 | null |
| 2026-02-12 | Rational Neural Networks have Expressivity Advantages | Maosen Tang et.al. | 2602.12390 | null |
| 2026-02-12 | LLaMo: Scaling Pretrained Language Models for Unified Motion Understanding and Generation with Continuous Autoregressive Tokens | Zekun Li et.al. | 2602.12370 | null |
| 2026-02-12 | On-Policy Context Distillation for Language Models | Tianzhu Ye et.al. | 2602.12275 | null |
| 2026-02-13 | DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing | Dianyi Wang et.al. | 2602.12205 | null |
| 2026-02-12 | SAM3-LiteText: An Anatomical Study of the SAM3 Text Encoder for Efficient Vision-Language Segmentation | Chengxi Zeng et.al. | 2602.12173 | null |
| 2026-02-12 | Pedagogically-Inspired Data Synthesis for Language Model Knowledge Distillation | Bowei He et.al. | 2602.12172 | null |
| 2026-02-12 | Compress, Cross and Scale: Multi-Level Compression Cross Networks for Efficient Scaling in Recommender Systems | Heng Yu et.al. | 2602.12041 | null |
| 2026-02-12 | Improved state mixing in higher-order and block diagonal linear recurrent networks | Igor Dubinin et.al. | 2602.12021 | null |
| 2026-02-13 | LaCy: What Small Language Models Can and Should Learn is Not Just a Question of Loss | Szilvia Ujváry et.al. | 2602.12005 | null |
| 2026-02-12 | Manifold-Aware Temporal Domain Generalization for Large Language Models | Yiheng Yao et.al. | 2602.11965 | null |
| 2026-02-12 | Extending Puzzle for Mixture-of-Experts Reasoning Models with Application to GPT-OSS Acceleration | Akhiad Bercovich et.al. | 2602.11937 | null |
| 2026-02-12 | Optimal Quantization for Nonuniform Densities on Spherical Curves | Silpi Saha et.al. | 2602.11926 | null |
| 2026-02-12 | Improving Code Generation via Small Language Model-as-a-judge | Giuseppe Crupi et.al. | 2602.11911 | null |
| 2026-02-12 | Where Bits Matter in World Model Planning: A Paired Mixed-Bit Study for Efficient Spatial Reasoning | Suraj Ranganath et.al. | 2602.11882 | null |
| 2026-02-12 | MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling | MiniCPM Team et.al. | 2602.11761 | null |
| 2026-02-12 | Dopamine: Brain Modes, Not Brains | Shervin Ghasemlou et.al. | 2602.11726 | null |
| 2026-02-12 | LAER-MoE: Load-Adaptive Expert Re-layout for Efficient Mixture-of-Experts Training | Xinyi Liu et.al. | 2602.11686 | null |
| 2026-02-12 | U-Net with Hadamard Transform and DCT Latent Spaces for Next-day Wildfire Spread Prediction | Yingyi Luo et.al. | 2602.11672 | null |
| 2026-02-12 | LoRA-based Parameter-Efficient LLMs for Continuous Learning in Edge-based Malware Detection | Christian Rondanini et.al. | 2602.11655 | null |
| 2026-02-12 | Quantization Mapping on Dirac Dynamics via Voltage-Driven Charge Density in Monolayer Graphene: A Klein Paradox and Entropy-Ruled Wavevector Mechanics Study | Karuppuchamy Navamani et.al. | 2602.11604 | null |
| 2026-02-12 | Move What Matters: Parameter-Efficient Domain Adaptation via Optimal Transport Flow for Collaborative Perception | Zesheng Jia et.al. | 2602.11565 | null |
| 2026-02-12 | Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm | Jinrui Zhang et.al. | 2602.11543 | null |
| 2026-02-12 | Differentially Private and Communication Efficient Large Language Model Split Inference via Stochastic Quantization and Soft Prompt | Yujie Gu et.al. | 2602.11513 | null |
| 2026-02-11 | Investigation of Toroidal Rotation Effects on Spherical Torus Equilibria using the Fast Spectral Solver VEQ-R | Xingyu Li et.al. | 2602.11422 | null |
| 2026-02-11 | Efficient Simulation of Pre-Born-Oppenheimer Dynamics on a Quantum Computer | Matthew Pocrnic et.al. | 2602.11272 | null |
| 2026-02-11 | Reed-Muller Error-Correction Code Encoder for SFQ-to-CMOS Interface Circuits | Yerzhan Mustafa et.al. | 2602.11140 | null |
| 2026-02-11 | PuriLight: A Lightweight Shuffle and Purification Framework for Monocular Depth Estimation | Yujie Chen et.al. | 2602.11066 | null |
| 2026-02-11 | ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression | Ammar Ali et.al. | 2602.11008 | null |
| 2026-02-11 | Enhancing Predictability of Multi-Tenant DNN Inference for Autonomous Vehicles’ Perception | Liangkai Liu et.al. | 2602.11004 | null |
| 2026-02-11 | LoRA-Squeeze: Simple and Effective Post-Tuning and In-Tuning Compression of LoRA Modules | Ivan Vulić et.al. | 2602.10993 | null |
| 2026-02-11 | Deformation quantization of symplectic vector fields | Haoyuan Gao et.al. | 2602.10988 | null |
| 2026-02-11 | MoEEdit: Efficient and Routing-Stable Knowledge Editing for Mixture-of-Experts LLMs | Yupu Gu et.al. | 2602.10965 | null |
| 2026-02-11 | Agentic Knowledge Distillation: Autonomous Training of Small Language Models for SMS Threat Detection | Adel ElZemity et.al. | 2602.10869 | null |
| 2026-02-11 | Enhancing Multivariate Time Series Forecasting with Global Temporal Retrieval | Fanpu Cao et.al. | 2602.10847 | null |
| 2026-02-11 | Resource-Efficient RGB-Only Action Recognition for Edge Deployment | Dongsik Yoon et.al. | 2602.10818 | null |
| 2026-02-11 | EST: Towards Efficient Scaling Laws in Click-Through Rate Prediction via Unified Modeling | Mingyang Liu et.al. | 2602.10811 | null |
| 2026-02-11 | GoodVibe: Security-by-Vibe for LLM-Based Code Generation | Maximilian Thang et.al. | 2602.10778 | null |
| 2026-02-12 | Efficient Operator Selection and Warm-Start Strategy for Excitations in Variational Quantum Eigensolvers | Max Haas et.al. | 2602.10776 | null |
| 2026-02-11 | Kalman Linear Attention: Parallel Bayesian Filtering For Efficient Language Modelling and State Tracking | Vaisakh Shaj et.al. | 2602.10743 | null |
| 2026-02-11 | SnapMLA: Efficient Long-Context MLA Decoding via Hardware-Aware FP8 Quantized Pipelining | Yifan Zhang et.al. | 2602.10718 | null |
| 2026-02-11 | Spend Search Where It Pays: Value-Guided Structured Sampling and Optimization for Generative Recommendation | Jie Jiang et.al. | 2602.10699 | null |
| 2026-02-11 | Bridging the Compression-Precision Paradox: A Hybrid Architecture for Clinical EEG Report Generation with Guaranteed Measurement Accuracy | Wuyang Zhang et.al. | 2602.10544 | null |
| 2026-02-11 | Efficient Computation of Maximum Flexi-Clique in Networks | Song Kim et.al. | 2602.10459 | null |
| 2026-02-11 | Compute Only Once: UG-Separation for Efficient Large Recommendation Models | Hui Lu et.al. | 2602.10455 | null |
| 2026-02-11 | End-to-End Semantic ID Generation for Generative Advertisement Recommendation | Jie Jiang et.al. | 2602.10445 | null |
| 2026-02-11 | QTALE: Quantization-Robust Token-Adaptive Layer Execution for LLMs | Kanghyun Noh et.al. | 2602.10431 | null |
| 2026-02-11 | Modular Multi-Task Learning for Chemical Reaction Prediction | Jiayun Pang et.al. | 2602.10404 | null |
| 2026-02-10 | Theoretical Analysis of Contrastive Learning under Imbalanced Data: From Training Dynamics to a Pruning Solution | Haixu Liao et.al. | 2602.10357 | null |
| 2026-02-10 | Efficient Policy Adaptation for Voltage Control Under Unknown Topology Changes | Jie Feng et.al. | 2602.10355 | null |
| 2026-02-10 | Efficient reduction of stellar contamination and noise in planetary transmission spectra using neural networks | David S. Duque-Castaño et.al. | 2602.10330 | null |
| 2026-02-10 | R2RAG-Flood: A reasoning-reinforced training-free retrieval augmentation generation framework for flood damage nowcasting | Lipai Huang et.al. | 2602.10312 | null |
| 2026-02-10 | Optimal Bounds-Only Pruning for Spatial AkNN Joins | Dominik Winecki et.al. | 2602.10027 | null |
| 2026-02-10 | Answer First, Reason Later: Aligning Search Relevance via Mode-Balanced Reinforcement Learning | Shijie Zhang et.al. | 2602.10006 | null |
| 2026-02-10 | AdaTSQ: Pushing the Pareto Frontier of Diffusion Transformers via Temporal-Sensitivity Quantization | Shaoqiu Zhang et.al. | 2602.09883 | null |
| 2026-02-10 | BabyMamba-HAR: Lightweight Selective State Space Models for Efficient Human Activity Recognition on Resource Constrained Devices | Mridankan Mandal et.al. | 2602.09872 | null |
| 2026-02-11 | Text summarization via global structure awareness | Jiaquan Zhang et.al. | 2602.09821 | null |
| 2026-02-10 | CompSplat: Compression-aware 3D Gaussian Splatting for Real-world Video | Hojun Song et.al. | 2602.09816 | null |
| 2026-02-10 | From Lightweight CNNs to SpikeNets: Benchmarking Accuracy-Energy Tradeoffs with Pruned Spiking SqueezeNet | Radib Bin Kabir et.al. | 2602.09717 | null |
| 2026-02-10 | Stellar-mass black holes in young massive and open stellar clusters – VII. Comparisons with gravitational-wave events until LVK-O4a and Gaia compact binaries | Sambaran Banerjee et.al. | 2602.09694 | null |
| 2026-02-10 | Life Cycle-Aware Evaluation of Knowledge Distillation for Machine Translation: Environmental Impact and Translation Quality Trade-offs | Joseph Attieh et.al. | 2602.09691 | null |
| 2026-02-10 | Talking with the Latents – how to convert your LLM into an astronomer | Ilay Kamai et.al. | 2602.09670 | null |
| 2026-02-10 | MATA: Multi-Agent Framework for Reliable and Flexible Table Question Answering | Sieun Hyeon et.al. | 2602.09642 | null |
| 2026-02-10 | TeleGate: Whole-Body Humanoid Teleoperation via Gated Expert Selection with Motion Prior | Jie Li et.al. | 2602.09628 | null |
| 2026-02-10 | Multimode fiber laser cavities as nonlinear optical processors | Dilem Eşlik et.al. | 2602.09519 | null |
| 2026-02-11 | Beyond Student: An Asymmetric Network for Neural Network Inheritance | Yiyun Zhou et.al. | 2602.09509 | null |
| 2026-02-10 | Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions | Lin Chen et.al. | 2602.09483 | null |
| 2026-02-10 | Personalized Parameter-Efficient Fine-Tuning of Foundation Models for Multimodal Recommendation | Sunwoo Kim et.al. | 2602.09445 | null |
| 2026-02-10 | Sparse Layer Sharpness-Aware Minimization for Efficient Fine-Tuning | Yifei Cheng et.al. | 2602.09395 | null |
| 2026-02-10 | AfriNLLB: Efficient Translation Models for African Languages | Yasmin Moslem et.al. | 2602.09373 | null |
| 2026-02-10 | LLM-CoOpt: A Co-Design and Optimization Framework for Efficient LLM Inference on Heterogeneous Platforms | Jie Kong et.al. | 2602.09323 | null |
| 2026-02-10 | Effective MoE-based LLM Compression by Exploiting Heterogeneous Inter-Group Experts Routing Frequency and Information Density | Zhendong Mi et.al. | 2602.09316 | null |
| 2026-02-09 | A Lightweight Multi-View Approach to Short-Term Load Forecasting | Julien Guité-Vinet et.al. | 2602.09220 | null |
| 2026-02-09 | Train Less, Infer Faster: Efficient Model Finetuning and Compression via Structured Sparsity | Jonathan Svirsky et.al. | 2602.09169 | null |
| 2026-02-09 | UniComp: A Unified Evaluation of Large Language Model Compression via Pruning, Quantization and Distillation | Jonathan von Rad et.al. | 2602.09130 | null |
| 2026-02-09 | Looping Back to Move Forward: Recursive Transformers for Efficient and Flexible Large Multimodal Models | Ruihan Xu et.al. | 2602.09080 | null |
| 2026-02-09 | CLUE: Crossmodal disambiguation via Language-vision Understanding with attEntion | Mouad Abrini et.al. | 2602.08999 | null |
| 2026-02-09 | AMS-HD: Hyperdimensional Computing for Real-Time and Energy-Efficient Acute Mountain Sickness Detection | Abu Masum et.al. | 2602.08916 | null |
| 2026-02-09 | Efficient and Stable Reinforcement Learning for Diffusion Language Models | Jiawei Liu et.al. | 2602.08905 | null |
| 2026-02-09 | FlattenGPT: Depth Compression for Transformer with Layer Flattening | Ruihan Xu et.al. | 2602.08858 | null |
| 2026-02-09 | Omni-Video 2: Scaling MLLM-Conditioned Diffusion for Unified Video Generation and Editing | Hao Yang et.al. | 2602.08820 | null |
| 2026-02-09 | FlexMoRE: A Flexible Mixture of Rank-heterogeneous Experts for Efficient Federatedly-trained Large Language Models | Annemette Brok Pirchert et.al. | 2602.08818 | null |
| 2026-02-09 | Reliable one-bit quantization of bandlimited graph data via single-shot noise shaping | Johannes Maly et.al. | 2602.08669 | null |
| 2026-02-09 | OneLive: Dynamically Unified Generative Framework for Live-Streaming Recommendation | Shen Wang et.al. | 2602.08612 | null |
| 2026-02-09 | Beyond Scalar Scores: Reinforcement Learning for Error-Aware Quality Estimation of Machine Translation | Archchana Sindhujan et.al. | 2602.08600 | null |
| 2026-02-09 | SDFed: Bridging Local Global Discrepancy via Subspace Refinement and Divergence Control in Federated Prompt Learning | Yicheng Di et.al. | 2602.08590 | null |
| 2026-02-09 | M-Loss: Quantifying Model Merging Compatibility with Limited Unlabeled Data | Tiantong Wang et.al. | 2602.08564 | null |
| 2026-02-09 | Are Vision Foundation Models Foundational for Electron Microscopy Image Segmentation? | Caterina Fuster-Barceló et.al. | 2602.08505 | null |
| 2026-02-09 | RIFLE: Robust Distillation-based FL for Deep Model Deployment on Resource-Constrained IoT Networks | Pouria Arefijamal et.al. | 2602.08446 | null |
| 2026-02-09 | OJBKQ: Objective-Joint Babai-Klein Quantization | Xinyu Wang et.al. | 2602.08376 | null |
| 2026-02-09 | Quantization-aware Photonic Homodyne computing for Accelerated Artificial Intelligence and Scientific Simulation | Lian Zhou et.al. | 2602.08269 | null |
| 2026-02-09 | PTS-SNN: A Prompt-Tuned Temporal Shift Spiking Neural Networks for Efficient Speech Emotion Recognition | Xun Su et.al. | 2602.08240 | null |
| 2026-02-09 | Linearization Explains Fine-Tuning in Large Language Models | Zahra Rahimi Afzal et.al. | 2602.08239 | null |
| 2026-02-10 | Efficient-SAM2: Accelerating SAM2 with Object-Aware Visual Encoding and Memory Retrieval | Jing Zhang et.al. | 2602.08224 | null |
| 2026-02-09 | CADO: From Imitation to Cost Minimization for Heatmap-based Solvers in Combinatorial Optimization | Hyungseok Song et.al. | 2602.08210 | null |
| 2026-02-09 | DAS-SK: An Adaptive Model Integrating Dual Atrous Separable and Selective Kernel CNN for Agriculture Semantic Segmentation | Mei Ling Chee et.al. | 2602.08168 | null |
| 2026-02-10 | AFDM: Evolving OFDM Towards 6G+ | Hyeon Seok Rou et.al. | 2602.08163 | null |
| 2026-02-08 | Robustness of Vision Language Models Against Split-Image Harmful Input Attacks | Md Rafi Ur Rashid et.al. | 2602.08136 | null |
| 2026-02-08 | Prune, Don’t Rebuild: Efficiently Tuning $α$ -Reachable Graphs for Nearest Neighbor Search | Tian Zhang et.al. | 2602.08097 | null |
| 2026-02-08 | Efficient and Adaptable Detection of Malicious LLM Prompts via Bootstrap Aggregation | Shayan Ali Hassan et.al. | 2602.08062 | null |
| 2026-02-08 | Bielik Guard: Efficient Polish Language Safety Classifiers for LLM Content Moderation | Krzysztof Wróbel et.al. | 2602.07954 | null |
| 2026-02-08 | Rethinking Practical and Efficient Quantization Calibration for Vision-Language Models | Zhenhao Shang et.al. | 2602.07899 | null |
| 2026-02-08 | Efficient Anti-exploration via VQVAE and Fuzzy Clustering in Offline Reinforcement Learning | Long Chen et.al. | 2602.07889 | null |
| 2026-02-08 | LQA: A Lightweight Quantized-Adaptive Framework for Vision-Language Models on the Edge | Xin Wang et.al. | 2602.07849 | null |
| 2026-02-08 | Pruning as a Cooperative Game: Surrogate-Assisted Layer Contribution Estimation for Large Language Models | Xuan Ding et.al. | 2602.07804 | null |
| 2026-02-08 | Accelerating Black Hole Image Generation via Latent Space Diffusion Models | Ao Liu et.al. | 2602.07786 | null |
| 2026-02-07 | Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs | Sagnik Mukherjee et.al. | 2602.07729 | null |
| 2026-02-07 | High-Resolution Solvers for 3D Helmholtz Scattering Problems Using PFFT and Eigenvector-Based Preconditioning | Yury Gryazin et.al. | 2602.07711 | null |
| 2026-02-07 | SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models | Juntong Wu et.al. | 2602.07616 | null |
| 2026-02-07 | Astro: Activation-guided Structured Regularization for Outlier-Robust LLM Post-Training Quantization | Xi Chen et.al. | 2602.07596 | null |
| 2026-02-07 | ViCA: Efficient Multimodal LLMs with Vision-Only Cross-Attention | Wenjie Liu et.al. | 2602.07574 | null |
| 2026-02-07 | VISOR: VIsual Spatial Object Reasoning for Language-driven Object Navigation | Francesco Taioli et.al. | 2602.07555 | null |
| 2026-02-07 | Linguistic properties and model scale in brain encoding: from small to compressed language models | Subba Reddy Oota et.al. | 2602.07547 | null |
| 2026-02-07 | Physical Analog Kolmogorov-Arnold Networks based on Reconfigurable Nonlinear-Processing Units | Manuel Escudero et.al. | 2602.07518 | null |
| 2026-02-07 | ODELoRA: Training Low-Rank Adaptation by Solving Ordinary Differential Equations | Yihang Gao et.al. | 2602.07479 | null |
| 2026-02-07 | On the Importance of a Multi-Scale Calibration for Quantization | Seungwoo Son et.al. | 2602.07465 | null |
| 2026-02-07 | Efficient Post-Training Pruning of Large Language Models with Statistical Correction | Peiqi Yu et.al. | 2602.07375 | null |
| 2026-02-07 | TernaryLM: Memory-Efficient Language Modeling via Native 1-Bit Quantization with Adaptive Layer-wise Scaling | Nisharg Nargund et.al. | 2602.07374 | null |
| 2026-02-07 | Semantic Search At LinkedIn | Fedor Borisyuk et.al. | 2602.07309 | null |
| 2026-02-05 | Shared LoRA Subspaces for almost Strict Continual Learning | Prakhar Kaushik et.al. | 2602.06043 | null |
| 2026-02-05 | Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering | Miranda Muqing Miao et.al. | 2602.06022 | null |
| 2026-02-05 | MambaVF: State Space Model for Efficient Video Fusion | Zixiang Zhao et.al. | 2602.06017 | null |
| 2026-02-05 | Layer-wise LoRA fine-tuning: a similarity metric approach | Keith Ando Ogawa et.al. | 2602.05988 | null |
| 2026-02-05 | CLIP-Map: Structured Matrix Mapping for Parameter-Efficient CLIP Compression | Kangjie Zhang et.al. | 2602.05909 | null |
| 2026-02-05 | Regularized Calibration with Successive Rounding for Post-Training Quantization | Seohyeon Cha et.al. | 2602.05902 | null |
| 2026-02-05 | Learning Compact Boolean Networks | Shengpu Wang et.al. | 2602.05830 | null |
| 2026-02-05 | Focus-Scan-Refine: From Human Visual Perception to Efficient Visual Token Pruning | Enwei Tong et.al. | 2602.05809 | null |
| 2026-02-05 | Price of universality in vector quantization is at most 0.11 bit | Alina Harbuzova et.al. | 2602.05790 | null |
| 2026-02-05 | OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale | Jingze Shi et.al. | 2602.05711 | null |
| 2026-02-05 | Cost-Efficient RAG for Entity Matching with LLMs: A Blocking-based Exploration | Chuangtao Ma et.al. | 2602.05708 | null |
| 2026-02-05 | Consensus-Aligned Neuron Efficient Fine-Tuning Large Language Models for Multi-Domain Machine Translation | Shuting Jiang et.al. | 2602.05694 | null |
| 2026-02-05 | Time-Complexity Characterization of NIST Lightweight Cryptography Finalists | Najmul Hasan et.al. | 2602.05641 | null |
| 2026-02-05 | Shiva-DiT: Residual-Based Differentiable Top- $k$ Selection for Efficient Diffusion Transformers | Jiaji Zhang et.al. | 2602.05605 | null |
| 2026-02-05 | MAGPrompt: Message-Adaptive Graph Prompt Tuning for Graph Neural Networks | Long D. Nguyen et.al. | 2602.05567 | null |
| 2026-02-05 | Mapper-GIN: Lightweight Structural Graph Abstraction for Corrupted 3D Point Cloud Classification | Jeongbin You et.al. | 2602.05522 | null |
| 2026-02-05 | VGGT-Motion: Motion-Aware Calibration-Free Monocular SLAM for Long-Range Consistency | Zhuang Xiong et.al. | 2602.05508 | null |
| 2026-02-05 | SDFP: Speculative Decoding with FIT-Pruned Models for Training-Free and Plug-and-Play LLM Acceleration | Hanyu Wei et.al. | 2602.05499 | null |
| 2026-02-05 | DistillER: Knowledge Distillation in Entity Resolution with Large Language Models | Alexandros Zeakis et.al. | 2602.05452 | null |
| 2026-02-05 | RaBiT: Residual-Aware Binarization Training for Accurate and Efficient LLMs | Youngcheon You et.al. | 2602.05367 | null |
| 2026-02-05 | AgentXRay: White-Boxing Agentic Systems via Workflow Reconstruction | Ruijie Shi et.al. | 2602.05353 | null |
| 2026-02-05 | Consistency-Preserving Concept Erasure via Unsafe-Safe Pairing and Directional Fisher-weighted Adaptation | Yongwoo Kim et.al. | 2602.05339 | null |
| 2026-02-05 | MentorCollab: Selective Large-to-Small Inference-Time Guidance for Efficient Reasoning | Haojin Wang et.al. | 2602.05307 | null |
| 2026-02-05 | High-Performance Moment-Encoded Lattice Boltzmann Method with Stability-Guided Quantization | Yixin Chen et.al. | 2602.05295 | null |
| 2026-02-05 | Unlocking Prototype Potential: An Efficient Tuning Framework for Few-Shot Class-Incremental Learning | Shengqin Jiang et.al. | 2602.05271 | null |
| 2026-02-05 | CORP: Closed-Form One-shot Representation-Preserving Structured Pruning for Vision Transformers | Boxiang Zhang et.al. | 2602.05243 | null |
| 2026-02-05 | Radon–Wasserstein Gradient Flows for Interacting-Particle Sampling in High Dimensions | Elias Hess-Childs et.al. | 2602.05227 | null |
| 2026-02-05 | Diffusion-aided Extreme Video Compression with Lightweight Semantics Guidance | Maojun Zhang et.al. | 2602.05201 | null |
| 2026-02-05 | An introduction to string states and their interactions | Chrysoula Markou et.al. | 2602.05173 | null |
| 2026-02-05 | CoSA: Compressed Sensing-Based Adaptation of Large Language Models | Songtao Wei et.al. | 2602.05148 | null |
| 2026-02-04 | Locas: Your Models are Principled Initializers of Locally-Supported Parametric Memories | Sidi Lu et.al. | 2602.05085 | null |
| 2026-02-04 | Gabor Fields: Orientation-Selective Level-of-Detail for Volume Rendering | Jorge Condor et.al. | 2602.05081 | null |
| 2026-02-04 | SynthForensics: A Multi-Generator Benchmark for Detecting Synthetic Video Deepfakes | Roberto Leotta et.al. | 2602.04939 | null |
| 2026-02-04 | TurboBoA: Faster and Exact Attention-aware Quantization without Backpropagation | Junhan Kim et.al. | 2602.04929 | null |
| 2026-02-04 | Pruning Minimal Reasoning Graphs for Efficient Retrieval-Augmented Generation | Ning Wang et.al. | 2602.04926 | null |
| 2026-02-04 | The Key to State Reduction in Linear Attention: A Rank-based Perspective | Philipp Nazari et.al. | 2602.04852 | null |
| 2026-02-04 | Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention | Chengtao Lv et.al. | 2602.04789 | null |
| 2026-02-04 | Knowledge Distillation for mmWave Beam Prediction Using Sub-6 GHz Channels | Sina Tavakolian et.al. | 2602.04703 | null |
| 2026-02-04 | REDistill: Robust Estimator Distillation for Balancing Robustness and Efficiency | Ondrej Tybl et.al. | 2602.04677 | null |
| 2026-02-04 | Delving into Muon and Beyond: Deep Analysis and Extensions | Xianbiao Qi et.al. | 2602.04669 | null |
| 2026-02-04 | Harmonia: Algorithm-Hardware Co-Design for Memory- and Compute-Efficient BFP-based LLM Inference | Xinyu Wang et.al. | 2602.04595 | null |
| 2026-02-04 | Rethinking Weight Tying: Pseudo-Inverse Tying for Stable LM Training and Updates | Jian Gu et.al. | 2602.04556 | null |
| 2026-02-04 | An Efficient Bayesian Framework for Inverse Problems via Optimization and Inversion: Surrogate Modeling, Parameter Inference, and Uncertainty Quantification | Mihaela Chiappetta et.al. | 2602.04537 | null |
| 2026-02-04 | Greedy-Gnorm: A Gradient Matrix Norm-Based Alternative to Attention Entropy for Head Pruning | Yuxi Guo et.al. | 2602.04491 | null |
| 2026-02-04 | Fine-tuning Pre-trained Vision-Language Models in a Human-Annotation-Free Manner | Qian-Wei Wang et.al. | 2602.04337 | null |
| 2026-02-04 | Canonical Quantization of Cylindrical Waveguides: A Gauge-Based Approach | Alexandre Delattre et.al. | 2602.04295 | null |
| 2026-02-04 | MiniRec: Data-Efficient Reinforcement Learning for LLM-based Recommendation | Lin Wang et.al. | 2602.04278 | null |
| 2026-02-04 | Decoupled Hierarchical Distillation for Multimodal Emotion Recognition | Yong Li et.al. | 2602.04260 | null |
| 2026-02-04 | Constructing Compact ADAPT Unitary Coupled-Cluster Ansatz with Parameter-Based Criterion | Runhong He et.al. | 2602.04253 | null |
| 2026-02-04 | Provable Target Sample Complexity Improvements as Pre-Trained Models Scale | Kazuto Fukuchi et.al. | 2602.04233 | null |
| 2026-02-04 | OAT: Ordered Action Tokenization | Chaoqi Liu et.al. | 2602.04215 | null |
| 2026-02-04 | LatentTune: Efficient Tuning of High Dimensional Database Parameters via Latent Representation Learning | Sein Kwon et.al. | 2602.04190 | null |
| 2026-02-04 | HoloEv-Net: Efficient Event-based Action Recognition via Holographic Spatial Embedding and Global Spectral Gating | Weidong Hao et.al. | 2602.04182 | null |
| 2026-02-04 | Topology-Aware Revival for Efficient Sparse Training | Meiling Jin et.al. | 2602.04166 | null |
| 2026-02-04 | BPDQ: Bit-Plane Decomposition Quantization on a Variable Grid for Large Language Models | Junyu Chen et.al. | 2602.04163 | null |
| 2026-02-04 | Pruning for Generalization: A Transfer-Oriented Spatiotemporal Graph Framework | Zihao Jing et.al. | 2602.04153 | null |
| 2026-02-04 | Interfaze: The Future of AI is built on Task-Specific Small Models | Harsha Vardhan Khurdula et.al. | 2602.04101 | null |
| 2026-02-03 | Efficient Subgroup Analysis via Optimal Trees with Global Parameter Fusion | Zhongming Xie et.al. | 2602.04077 | null |
| 2026-02-03 | Understanding and Guiding Layer Placement in Parameter-Efficient Fine-Tuning of Large Language Models | Yichen Xu et.al. | 2602.04019 | null |
| 2026-02-03 | Efficient Long-Horizon Vision-Language-Action Models via Static-Dynamic Disentanglement | Weikang Qiu et.al. | 2602.03983 | null |
| 2026-02-03 | Active Epistemic Control for Query-Efficient Verified Planning | Shuhui Qu et.al. | 2602.03974 | null |
| 2026-02-03 | Entropy Reveals Block Importance in Masked Self-Supervised Vision Transformers | Peihao Xiang et.al. | 2602.03918 | null |
| 2026-02-03 | Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing | Tong Zheng et.al. | 2602.03845 | null |
| 2026-02-03 | Understanding and Exploiting Weight Update Sparsity for Communication-Efficient Distributed RL | Erfan Miahi et.al. | 2602.03839 | null |
| 2026-02-03 | They Said Memes Were Harmless-We Found the Ones That Hurt: Decoding Jokes, Symbols, and Cultural References | Sahil Tripathi et.al. | 2602.03822 | null |
| 2026-02-03 | Fast-Slow Efficient Training for Multimodal Large Language Models via Visual Token Pruning | Dingkun Zhang et.al. | 2602.03815 | null |
| 2026-02-03 | On the Quantization-Dequantization Correspondence for (co)Poisson Hopf Algebras | Andrea Rivezzi et.al. | 2602.03810 | null |
| 2026-02-03 | QVLA: Not All Channels Are Equal in Vision-Language-Action Model’s Quantization | Yuhao Xu et.al. | 2602.03782 | null |
| 2026-02-03 | Edge-Optimized Vision-Language Models for Underground Infrastructure Assessment | Johny J. Lopez et.al. | 2602.03742 | null |
| 2026-02-03 | Conflict-Resolving and Sharpness-Aware Minimization for Generalized Knowledge Editing with Multiple Updates | Duy Nguyen et.al. | 2602.03696 | null |
| 2026-02-03 | Efficient Sequential Neural Network with Spatial-Temporal Attention and Linear LSTM for Robust Lane Detection Using Multi-Frame Images | Sandeep Patil et.al. | 2602.03669 | null |
| 2026-02-03 | CALM: A Self-Adaptive Orchestration Approach for QoS-Aware Routing in Small Language Model based Systems | Hemang Jain et.al. | 2602.03632 | null |
| 2026-02-03 | KTV: Keyframes and Key Tokens Selection for Efficient Training-Free Video LLMs | Baiyang Song et.al. | 2602.03615 | null |
| 2026-02-03 | Quantization-Aware Regularizers for Deep Neural Networks Compression | Dario Malchiodi et.al. | 2602.03614 | null |
| 2026-02-03 | APEX: Probing Neural Networks via Activation Perturbation | Tao Ren et.al. | 2602.03586 | null |
| 2026-02-03 | Constrained Dynamic Gaussian Splatting | Zihan Zheng et.al. | 2602.03538 | null |
| 2026-02-03 | MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization | Maximilian Kleinegger et.al. | 2602.03537 | null |
| 2026-02-03 | PnP-U3D: Plug-and-Play 3D Framework Bridging Autoregression and Diffusion for Unified Understanding and Generation | Yongwei Chen et.al. | 2602.03533 | null |
| 2026-02-03 | WARP Logic Neural Networks | Lino Gerlach et.al. | 2602.03527 | null |
| 2026-02-03 | Generative Decompression: Optimal Lossy Decoding Against Distribution Mismatch | Saeed R. Khosravirad et.al. | 2602.03505 | null |
| 2026-02-03 | DALI: A Workload-Aware Offloading Framework for Efficient MoE Inference on Local PCs | Zeyu Zhu et.al. | 2602.03495 | null |
| 2026-02-03 | Inlier-Centric Post-Training Quantization for Object Detection Models | Minsu Kim et.al. | 2602.03472 | null |
| 2026-02-03 | MeKi: Memory-based Expert Knowledge Injection for Efficient LLM Scaling | Ning Ding et.al. | 2602.03359 | null |
| 2026-02-03 | RDT2: Exploring the Scaling Limit of UMI Data Towards Zero-Shot Cross-Embodiment Generalization | Songming Liu et.al. | 2602.03310 | null |
| 2026-02-03 | POP: Prefill-Only Pruning for Efficient Large Model Inference | Junhui He et.al. | 2602.03295 | null |
| 2026-02-03 | Merging Beyond: Streaming LLM Updates via Activation-Guided Rotations | Yuxuan Yao et.al. | 2602.03237 | null |
| 2026-02-03 | PokeFusion Attention: Enhancing Reference-Free Style-Conditioned Generation | Jingbang Tang et.al. | 2602.03220 | null |
| 2026-02-03 | FARTrack: Fast Autoregressive Visual Tracking with High Performance | Guijie Wang et.al. | 2602.03214 | null |
| 2026-02-03 | WebSplatter: Enabling Cross-Device Efficient Gaussian Splatting in Web Browsers via WebGPU | Yudong Han et.al. | 2602.03207 | null |
| 2026-02-03 | LSGQuant: Layer-Sensitivity Guided Quantization for One-Step Diffusion Real-World Video Super-Resolution | Tianxing Wu et.al. | 2602.03182 | null |
| 2026-02-03 | BinaryDemoire: Moiré-Aware Binarization for Image Demoiréing | Zheng Chen et.al. | 2602.03176 | null |
| 2026-02-03 | FASA: Frequency-aware Sparse Attention | Yifei Wang et.al. | 2602.03152 | null |
| 2026-02-03 | Analyzing Zigbee Traffic: Datasets, Classification and Storage Trade-offs | Antonio Boiano et.al. | 2602.03140 | null |
| 2026-02-03 | SwiftVLM: Efficient Vision-Language Model Inference via Cross-Layer Token Bypass | Chen Qian et.al. | 2602.03134 | null |
| 2026-02-03 | Sharp $C^{1,\bar1}$ estimates in Kähler quantization and non-pluripolar Radon measures | Zbigniew Błocki et.al. | 2602.03111 | null |
| 2026-02-03 | IVC-Prune: Revealing the Implicit Visual Coordinates in LVLMs for Vision Token Pruning | Zhichao Sun et.al. | 2602.03060 | null |
| 2026-02-03 | SAES-SVD: Self-Adaptive Suppression of Accumulated and Local Errors for SVD-based LLM Compression | Xing Hu et.al. | 2602.03051 | null |
| 2026-02-03 | SAFE-KD: Risk-Controlled Early-Exit Distillation for Vision Backbones | Salim Khazem et.al. | 2602.03043 | null |
| 2026-02-03 | STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models | Jiliang Ni et.al. | 2602.03022 | null |
| 2026-02-03 | FedKRSO: Communication and Memory Efficient Federated Fine-Tuning of Large Language Models | Guohao Yang et.al. | 2602.03019 | null |
| 2026-02-03 | Agent Alpha: Tree Search Unifying Generation, Exploration and Evaluation for Computer-Use Agents | Sizhe Tang et.al. | 2602.02995 | null |
| 2026-02-03 | Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization | Haocheng Xi et.al. | 2602.02958 | null |
| 2026-02-03 | Nüwa: Mending the Spatial Integrity Torn by VLM Token Pruning | Yihong Huang et.al. | 2602.02951 | null |
| 2026-02-02 | TraceNAS: Zero-shot LLM Pruning via Gradient Trace Correlation | Prajna G. Malettira et.al. | 2602.02891 | null |
| 2026-02-02 | Efficiency Optimizations for Superblock-based Sparse Retrieval | Parker Carlson et.al. | 2602.02883 | null |
| 2026-02-02 | Zero Sum SVD: Balancing Loss Sensitivity for Low Rank LLM Compression | Ali Abbasi et.al. | 2602.02848 | null |
| 2026-02-02 | Col-Bandit: Zero-Shot Query-Time Pruning for Late-Interaction Retrieval | Roi Pony et.al. | 2602.02827 | null |
| 2026-02-02 | When Efficient Communication Explains Convexity | Ashvin Ranjan et.al. | 2602.02821 | null |
| 2026-02-02 | Efficient Counterfactual Estimation of Conditional Greeks via Malliavin-based Weak Derivatives | Vikram Krishnamurthy et.al. | 2602.02811 | null |
| 2026-02-02 | De-Linearizing Agent Traces: Bayesian Inference of Latent Partial Orders for Efficient Execution | Dongqing Li et.al. | 2602.02806 | null |
| 2026-02-02 | Scaling Small Agents Through Strategy Auctions | Lisa Alazraki et.al. | 2602.02751 | null |
| 2026-02-02 | TopoPrune: Robust Data Pruning via Unified Latent Space Topology | Arjun Roy et.al. | 2602.02739 | null |
| 2026-02-02 | Dynamic Mix Precision Routing for Efficient Multi-step LLM Interaction | Yuanzhe Li et.al. | 2602.02711 | null |
| 2026-02-02 | Graph-Augmented Reasoning with Large Language Models for Tobacco Pest and Disease Management | Siyu Li et.al. | 2602.02635 | null |
| 2026-02-02 | Rethinking Test-Time Training: Tilting The Latent Distribution For Few-Shot Source-Free Adaptation | Tahir Qasim Syed et.al. | 2602.02633 | null |
| 2026-02-02 | Performance of Small Language Model Pretraining on FABRIC: An Empirical Study | Praveen Rao et.al. | 2602.02632 | null |
| 2026-02-02 | Age-Aware Edge-Blind Federated Learning via Over-the-Air Aggregation | Ahmed M. Elshazly et.al. | 2602.02469 | null |
| 2026-02-02 | Hierarchical Federated Learning with SignSGD: A Highly Communication-Efficient Approach | Amirreza Kazemi et.al. | 2602.02355 | null |
| 2026-02-02 | Rethinking Generative Recommender Tokenizer: Recsys-Native Encoding and Semantic Quantization Beyond LLMs | Yu Liang et.al. | 2602.02338 | null |
| 2026-02-02 | Enhancing Indoor Occupancy Prediction via Sparse Query-Based Multi-Level Consistent Knowledge Distillation | Xiang Li et.al. | 2602.02318 | null |
| 2026-02-02 | MAIN-VLA: Modeling Abstraction of Intention and eNvironment for Vision-Language-Action Models | Zheyuan Zhou et.al. | 2602.02212 | null |
| 2026-02-02 | More Than a Quick Glance: Overcoming the Greedy Bias in KV-Cache Compression | Aryan Sood et.al. | 2602.02199 | null |
| 2026-02-02 | ECHO-2: A Large Scale Distributed Rollout Framework for Cost-efficient Reinforcement Learning | Jie Xiao et.al. | 2602.02192 | null |
| 2026-02-02 | Reg4Pru: Regularisation Through Random Token Routing for Token Pruning | Julian Wyatt et.al. | 2602.02163 | null |
| 2026-02-02 | Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing | Lingkun Long et.al. | 2602.02159 | null |
| 2026-02-02 | Revisiting Adaptive Rounding with Vectorized Reparameterization for LLM Quantization | Yuli Zhou et.al. | 2602.02151 | null |
| 2026-02-02 | Two-Stage Grid Optimization for Group-wise Quantization of LLMs | Junhan Kim et.al. | 2602.02126 | null |
| 2026-02-02 | An Empirical Study of World Model Quantization | Zhongqian Fu et.al. | 2602.02110 | null |
| 2026-02-02 | Teacher-Guided Student Self-Knowledge Distillation Using Diffusion Model | Yu Wang et.al. | 2602.02107 | null |
| 2026-02-02 | UrbanGS: A Scalable and Efficient Architecture for Geometrically Accurate Large-Scene Reconstruction | Changbai Li et.al. | 2602.02089 | null |
| 2026-02-02 | A global potential constrained by the Bohr-Sommerfeld quantization condition for $α$ -decay half-lives of even-even nuclei | Nguyen Gia Huy et.al. | 2602.02070 | null |
| 2026-02-02 | Ultrafast On-chip Online Learning via Spline Locality in Kolmogorov-Arnold Networks | Duc Hoang et.al. | 2602.02056 | link |
| 2026-02-02 | Dissecting Outlier Dynamics in LLM NVFP4 Pretraining | Peijie Dong et.al. | 2602.02047 | null |
| 2026-02-02 | Bandwidth-Efficient Multi-Agent Communication through Information Bottleneck and Vector Quantization | Ahmad Farooq et.al. | 2602.02035 | null |
| 2026-02-02 | Hippasus: Effective and Efficient Automatic Feature Augmentation for Machine Learning Tasks on Relational Data | Serafeim Papadias et.al. | 2602.02025 | null |
| 2026-02-02 | Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation | Zhanghao Hu et.al. | 2602.02007 | null |
| 2026-02-02 | Preserve-Then-Quantize: Balancing Rank Budgets for Quantization Error Reconstruction in LLMs | Yoonjun Cho et.al. | 2602.02001 | null |
| 2026-02-02 | On the Limits of Layer Pruning for Generative Reasoning in LLMs | Safal Shrestha et.al. | 2602.01997 | null |
| 2026-02-02 | FlyPrompt: Brain-Inspired Random-Expanded Routing with Temporal-Ensemble Experts for General Continual Learning | Hongwei Yan et.al. | 2602.01976 | null |
| 2026-02-02 | IntraSlice: Towards High-Performance Structural Pruning with Block-Intra PCA for LLMs | Meng Li et.al. | 2602.01975 | null |
| 2026-02-02 | Efficient Epistemic Uncertainty Estimation for Large Language Models via Knowledge Distillation | Seonghyeon Park et.al. | 2602.01956 | null |
| 2026-02-02 | Q Cache: Visual Attention is Valuable in Less than Half of Decode Layers for Multimodal Large Language Model | Jiedong Zhuang et.al. | 2602.01901 | null |
| 2026-02-02 | ProxyImg: Towards Highly-Controllable Image Representation via Hierarchical Disentangled Proxy Embedding | Ye Chen et.al. | 2602.01881 | null |
| 2026-02-02 | BTGenBot-2: Efficient Behavior Tree Generation with Small Language Models | Riccardo Andrea Izzo et.al. | 2602.01870 | null |
| 2026-02-02 | Zero-Shot Knowledge Base Resizing for Rate-Adaptive Digital Semantic Communication | Shumin Yao et.al. | 2602.01829 | null |
| 2026-02-02 | ParaGSE: Parallel Generative Speech Enhancement with Group-Vector-Quantization-based Neural Speech Codec | Fei Liu et.al. | 2602.01793 | null |
| 2026-02-02 | Efficient Cross-Architecture Knowledge Transfer for Large-Scale Online User Response Prediction | Yucheng Wu et.al. | 2602.01775 | null |
| 2026-02-02 | Reduced Phase Space Quantization and Quantum Corrected Entropy of Schwarzschild-de Sitter Horizons | S. Jalalzadeh et.al. | 2602.01767 | null |
| 2026-02-02 | Tail-Aware Post-Training Quantization for 3D Geometry Models | Sicheng Pan et.al. | 2602.01741 | null |
| 2026-02-02 | A Practical Tensor-Network Compression Pipeline for Production-Scale Large Language Models | Sergii Kozyrev et.al. | 2602.01613 | null |
| 2026-02-02 | Token Pruning for In-Context Generation in Diffusion Transformers | Junqing Lin et.al. | 2602.01609 | null |
| 2026-02-02 | Spectral-Aligned Pruning for Universal Error-Correcting Code Transformers | Sanghyeon Cho et.al. | 2602.01602 | null |
| 2026-02-02 | Plain Transformers are Surprisingly Powerful Link Predictors | Quang Truong et.al. | 2602.01553 | null |
| 2026-02-02 | NeuroAI Temporal Neural Networks (NeuTNNs): Microarchitecture and Design Framework for Specialized Neuromorphic Processing Units | Shanmuga Venkatachalam et.al. | 2602.01546 | null |
| 2026-02-02 | When Is Rank-1 Enough? Geometry-Guided Initialization for Parameter-Efficient Fine-Tuning | Haoran Zhao et.al. | 2602.01522 | null |
| 2026-02-02 | HDSense: An efficient method for ranking observable sensitivity | Benoît Assi et.al. | 2602.01509 | null |
| 2026-02-01 | ConPress: Learning Efficient Reasoning from Multi-Question Contextual Pressure | Jie Deng et.al. | 2602.01472 | null |
| 2026-02-01 | Rethinking Selective Knowledge Distillation | Almog Tavor et.al. | 2602.01395 | null |
| 2026-02-01 | The Enhanced Physics-Informed Kolmogorov-Arnold Networks: Applications of Newton’s Laws in Financial Deep Reinforcement Learning (RL) Algorithms | Trang Thoi et.al. | 2602.01388 | null |
| 2026-02-01 | Gradient-Aligned Calibration for Post-Training Quantization of Diffusion Models | Dung Anh Hoang et.al. | 2602.01289 | null |
| 2026-02-01 | Q-DiT4SR: Exploration of Detail-Preserving Diffusion Transformer Quantization for Real-World Image Super-Resolution | Xun Zhang et.al. | 2602.01273 | null |
| 2026-02-01 | Mixture-of-World Models: Scaling Multi-Task Reinforcement Learning with Modular Latent Dynamics | Boxuan Zhang et.al. | 2602.01270 | null |
| 2026-01-30 | Geometric Quantization by Paths, Part III: The Metaplectic Anomaly | Patrick Iglesias-Zemmour et.al. | 2601.23259 | null |
| 2026-01-30 | Agile Reinforcement Learning through Separable Neural Architecture | Rajib Mostakim et.al. | 2601.23225 | null |
| 2026-01-30 | High-quality generation of dynamic game content via small language models: A proof of concept | Morten I. K. Munk et.al. | 2601.23206 | null |
| 2026-01-30 | Segment Any Events with Language | Seungjun Lee et.al. | 2601.23159 | null |
| 2026-01-30 | Compressed BC-LISTA via Low-Rank Convolutional Decomposition | Han Wang et.al. | 2601.23148 | null |
| 2026-01-30 | Lossy Compression of Cellular Network KPIs | Andrea Pimpinella et.al. | 2601.23105 | null |
| 2026-01-30 | FlexLoRA: Entropy-Guided Flexible Low-Rank Adaptation | Muqing Liu et.al. | 2601.22905 | null |
| 2026-01-30 | Decomposing and Composing: Towards Efficient Vision-Language Continual Learning via Rank-1 Expert Pool in a Single LoRA | Zhan Fa et.al. | 2601.22828 | null |
| 2026-01-30 | CVeDRL: An Efficient Code Verifier via Difficulty-aware Reinforcement Learning | Ji Shi et.al. | 2601.22803 | null |
| 2026-01-30 | Float8@2bits: Entropy Coding Enables Data-Free Model Compression | Patrick Putzky et.al. | 2601.22787 | null |
| 2026-01-30 | Active Learning-Driven Lightweight YOLOv9: Enhancing Efficiency in Smart Agriculture | Hung-Chih Tu et.al. | 2601.22732 | null |
| 2026-01-30 | Breaking the Blocks: Continuous Low-Rank Decomposed Scaling for Unified LLM Quantization and Adaptation | Pingzhi Tang et.al. | 2601.22716 | null |
| 2026-01-30 | Gated Relational Alignment via Confidence-based Distillation for Efficient VLMs | Yanlong Chen et.al. | 2601.22709 | null |
| 2026-01-30 | A Unified Study of LoRA Variants: Taxonomy, Review, Codebase, and Empirical Evaluation | Haonan He et.al. | 2601.22708 | null |
| 2026-01-30 | FNF: Functional Network Fingerprint for Large Language Models | Yiheng Liu et.al. | 2601.22692 | null |
| 2026-01-30 | Fire on Motion: Optimizing Video Pass-bands for Efficient Spiking Action Recognition | Shuhan Ye et.al. | 2601.22675 | null |
| 2026-01-30 | DART-ing Through the Drift: Dynamic Tracing of Knowledge Neurons for Adaptive Inference-Time Pruning | Abhishek Tyagi et.al. | 2601.22632 | null |
| 2026-01-30 | PEFT-MuTS: A Multivariate Parameter-Efficient Fine-Tuning Framework for Remaining Useful Life Prediction based on Cross-domain Time Series Representation Model | En Fu et.al. | 2601.22631 | null |
| 2026-01-30 | Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry | Zhuochun Li et.al. | 2601.22588 | null |
| 2026-01-30 | EUGens: Efficient, Unified, and General Dense Layers | Sang Min Kim et.al. | 2601.22563 | null |
| 2026-01-29 | Understanding Efficiency: Quantization, Batching, and Serving Strategies in LLM Energy Use | Julien Delavande et.al. | 2601.22362 | null |
| 2026-01-29 | MixQuant: Pushing the Limits of Block Rotations in Post-Training Quantization | Sai Sanjeet et.al. | 2601.22347 | null |
| 2026-01-29 | Symmetry Breaking in Transformers for Efficient and Interpretable Training | Eva Silverstein et.al. | 2601.22257 | null |
| 2026-01-29 | Is Hierarchical Quantization Essential for Optimal Reconstruction? | Shirin Reyhanian et.al. | 2601.22244 | null |
| 2026-01-29 | Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts | Yingfa Chen et.al. | 2601.22156 | null |
| 2026-01-29 | Pay for Hints, Not Answers: LLM Shepherding for Cost-Efficient Inference | Ziming Dong et.al. | 2601.22132 | null |
| 2026-01-29 | A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine | Anran Li et.al. | 2601.22124 | null |
| 2026-01-29 | ReactEMG Stroke: Healthy-to-Stroke Few-shot Adaptation for sEMG-Based Intent Detection | Runsheng Wang et.al. | 2601.22090 | null |
| 2026-01-29 | Making Foundation Models Probabilistic via Singular Value Ensembles | Mehmet Ozgur Turkoglu et.al. | 2601.22068 | null |
| 2026-01-30 | PocketDP3: Efficient Pocket-Scale 3D Visuomotor Policy | Jinhao Zhang et.al. | 2601.22018 | null |
| 2026-01-29 | OVD: On-policy Verbal Distillation | Jing Xiong et.al. | 2601.21968 | null |
| 2026-01-29 | From Generative Modeling to Clinical Classification: A GPT-Based Architecture for EHR Notes | Fariba Afrin Irany et.al. | 2601.21955 | null |
| 2026-01-29 | KnowBias: Mitigating Social Bias in LLMs via Know-Bias Neuron Enhancement | Jinhao Pan et.al. | 2601.21864 | null |
| 2026-01-29 | Visual Disentangled Diffusion Autoencoders: Scalable Counterfactual Generation for Foundation Models | Sidney Bender et.al. | 2601.21851 | null |
| 2026-01-29 | Enhancing Language Models for Robust Greenwashing Detection | Neil Heinrich Braun et.al. | 2601.21722 | null |
| 2026-01-29 | Why Attention Patterns Exist: A Unifying Temporal Perspective Analysis | Qingyue Yang et.al. | 2601.21709 | null |
| 2026-01-29 | Can David Beat Goliath? On Multi-Hop Reasoning with Resource-Constrained Agents | Hojae Han et.al. | 2601.21699 | null |
| 2026-01-29 | Do Not Waste Your Rollouts: Recycling Search Experience for Efficient Test-Time Scaling | Xinglin Wang et.al. | 2601.21684 | null |
| 2026-01-29 | SWE-Spot: Building Small Repo-Experts with Repository-Centric Learning | Jinjun Peng et.al. | 2601.21649 | null |
| 2026-01-29 | Leveraging rapid parameter estimates for efficient gravitational-wave Bayesian inference via posterior repartitioning | Metha Prathaban et.al. | 2601.21630 | null |
| 2026-01-29 | HeRo-Q: A General Framework for Stable Low Bit Quantization via Hessian Conditioning | Jinhao Zhang Yunquan Zhang et.al. | 2601.21626 | null |
| 2026-01-29 | Thinking Broad, Acting Fast: Latent Reasoning Distillation from Multi-Perspective Chain-of-Thought for E-Commerce Relevance | Baopu Qiu et.al. | 2601.21611 | null |
| 2026-01-29 | Representation Unlearning: Forgetting through Information Compression | Antonio Almudévar et.al. | 2601.21564 | null |
| 2026-01-29 | On the Adversarial Robustness of Large Vision-Language Models under Visual Token Compression | Xinwei Zhang et.al. | 2601.21531 | null |
| 2026-01-29 | Adaptive Confidence Gating in Multi-Agent Collaboration for Efficient and Optimized Code Generation | Haoji Zhang et.al. | 2601.21469 | null |
| 2026-01-29 | ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation | Zihao Huang et.al. | 2601.21420 | null |
| 2026-01-29 | Rethinking Federated Graph Foundation Models: A Graph-Language Alignment-based Approach | Yinlin Zhu et.al. | 2601.21369 | null |
| 2026-01-29 | Small models, big threats: Characterizing safety challenges from low-compute AI models | Prateek Puri et.al. | 2601.21365 | null |
| 2026-01-29 | L2R: Low-Rank and Lipschitz-Controlled Routing for Mixture-of-Experts | Minghao Yang et.al. | 2601.21349 | null |
| 2026-01-29 | Semantic-Guided Dynamic Sparsification for Pre-Trained Model-based Class-Incremental Learning | Ruiqi Liu et.al. | 2601.21345 | null |
| 2026-01-29 | A Time-Domain Dual-Edge Asynchronous Pipelined SAR ADC Featuring Reset-Free Quantization at Multi-GS/s | Richard Zeng et.al. | 2601.21308 | null |
| 2026-01-29 | Mam-App: A Novel Parameter-Efficient Mamba Model for Apple Leaf Disease Classification | Md Nadim Mahamood et.al. | 2601.21307 | null |
| 2026-01-29 | Grounding and Enhancing Informativeness and Utility in Dataset Distillation | Shaobo Wang et.al. | 2601.21296 | null |
| 2026-01-29 | Drive-KD: Multi-Teacher Distillation for VLMs in Autonomous Driving | Weitong Lian et.al. | 2601.21288 | null |
| 2026-01-29 | An efficient implicit scheme for the multimaterial Euler equations in Lagrangian coordinates | Simone Chiocchetti et.al. | 2601.21241 | null |
| 2026-01-29 | PTQ4ARVG: Post-Training Quantization for AutoRegressive Visual Generation Models | Xuewen Liu et.al. | 2601.21238 | null |
| 2026-01-29 | Soft Quantization: Model Compression Via Weight Coupling | Daniel T. Bernstein et.al. | 2601.21219 | null |
| 2026-01-29 | Temporal Context and Architecture: A Benchmark for Naturalistic EEG Decoding | Mehmet Ergezer et.al. | 2601.21215 | null |
| 2026-01-29 | ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity Scheduling | Yuchen Yang et.al. | 2601.21198 | null |
| 2026-01-29 | Generative Recall, Dense Reranking: Learning Multi-View Semantic IDs for Efficient Text-to-Video Retrieval | Zecheng Zhao et.al. | 2601.21193 | null |
| 2026-01-28 | ChunkWise LoRA: Adaptive Sequence Partitioning for Memory-Efficient Low-Rank Adaptation and Accelerated LLM Inference | Ketan Thakkar et.al. | 2601.21109 | null |
| 2026-01-28 | CompSRT: Quantization and Pruning for Image Super Resolution Transformers | Dorsa Zeinali et.al. | 2601.21069 | null |
| 2026-01-28 | PatchFormer: A Patch-Based Time Series Foundation Model with Hierarchical Masked Reconstruction and Cross-Domain Transfer Learning for Zero-Shot Multi-Horizon Forecasting | Olaf Yunus Laitinen Imanov et.al. | 2601.20845 | null |
| 2026-01-28 | MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents | Vishnu Sashank Dorbala et.al. | 2601.20831 | null |
| 2026-01-28 | REASON: Accelerating Probabilistic Logical Reasoning for Scalable Neuro-Symbolic Intelligence | Zishen Wan et.al. | 2601.20784 | null |
| 2026-01-28 | Leveraging Second-Order Curvature for Efficient Learned Image Compression: Theory and Empirical Evidence | Yichi Zhang et.al. | 2601.20769 | null |
| 2026-01-28 | HESTIA: A Hessian-Guided Differentiable Quantization-Aware Training Framework for Extremely Low-Bit LLMs | Guoan Wang et.al. | 2601.20745 | null |
| 2026-01-28 | One Step Is Enough: Dispersive MeanFlow Policy Optimization | Guowei Zou et.al. | 2601.20701 | null |
| 2026-01-28 | When Vision Meets Texts in Listwise Reranking | Hongyi Cai et.al. | 2601.20623 | null |
| 2026-01-28 | DiffVC-RT: Towards Practical Real-Time Diffusion-based Perceptual Neural Video Compression | Wenzhuo Ma et.al. | 2601.20564 | null |
| 2026-01-28 | Weaker quantization dimension results for self-similar measures | Saurabh Verma et.al. | 2601.20531 | null |
| 2026-01-28 | IOTA: Corrective Knowledge-Guided Prompt Learning via Black-White Box Framework | Shaokun Wang et.al. | 2601.20526 | null |
| 2026-01-28 | AnomalyVFM – Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors | Matic Fučka et.al. | 2601.20524 | null |
| 2026-01-28 | CtrlCoT: Dual-Granularity Chain-of-Thought Compression for Controllable Reasoning | Zhenxuan Fan et.al. | 2601.20467 | null |
| 2026-01-28 | RepSFNet : A Single Fusion Network with Structural Reparameterization for Crowd Counting | Mas Nurul Achmadiah et.al. | 2601.20369 | null |
| 2026-01-28 | PalmBridge: A Plug-and-Play Feature Alignment Framework for Open-Set Palmprint Verification | Chenke Zhang et.al. | 2601.20351 | null |
| 2026-01-28 | Improving Diffusion Language Model Decoding through Joint Search in Generation Order and Token Space | Yangyi Shen et.al. | 2601.20339 | null |
| 2026-01-28 | Window-Diffusion: Accelerating Diffusion Language Model Inference with Windowed Token Pruning and Caching | Fengrui Zuo et.al. | 2601.20332 | null |
| 2026-01-28 | VersaQ-3D: A Reconfigurable Accelerator Enabling Feed-Forward and Generalizable 3D Reconstruction via Versatile Quantization | Yipu Zhang et.al. | 2601.20317 | null |
| 2026-01-28 | Towards Compact and Robust DNNs via Compression-aware Sharpness Minimization | Jialuo He et.al. | 2601.20301 | null |
| 2026-01-28 | MiLorE-SSL: Scaling Multilingual Capabilities in Self-Supervised Models without Forgetting | Jing Xu et.al. | 2601.20300 | null |
| 2026-01-28 | Quantum Cosmology as a Hydrogen atom: Discrete $Λ$ and cyclic Universes from Wheeler-DeWitt quantization | Dipayan Mukherjee et.al. | 2601.20286 | null |
| 2026-01-28 | SATA: Sparsity-Aware Scheduling for Selective Token Attention | Zhenkun Fan et.al. | 2601.20267 | null |
| 2026-01-28 | Shallow-π: Knowledge Distillation for Flow-based VLAs | Boseong Jeon et.al. | 2601.20262 | null |
| 2026-01-28 | Certificate-Guided Pruning for Stochastic Lipschitz Optimization | Ibne Farabi Shihab et.al. | 2601.20231 | null |
| 2026-01-28 | MERGE: Next-Generation Item Indexing Paradigm for Large-Scale Streaming Recommendation | Jing Yan et.al. | 2601.20199 | null |
| 2026-01-28 | Efficient Token Pruning for LLaDA-V | Zhewen Wan et.al. | 2601.20168 | null |
| 2026-01-27 | Look in the Middle: Structural Anchor Pruning for Scalable Visual RAG Indexing | Zhuchenyang Liu et.al. | 2601.20107 | null |
| 2026-01-27 | Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery | Meng Xin et.al. | 2601.20088 | null |
| 2026-01-27 | E2HiL: Entropy-Guided Sample Selection for Efficient Real-World Human-in-the-Loop Reinforcement Learning | Haoyuan Deng et.al. | 2601.19969 | null |
| 2026-01-27 | Melvin–Bonnor and Bertotti–Robinson spacetimes with Baryonic charge | José Barrientos et.al. | 2601.19858 | null |
| 2026-01-27 | A Latent Space Framework for Modeling Transient Engine Emissions Using Joint Embedding Predictive Architectures | Ganesh Sundaram et.al. | 2601.19822 | null |
| 2026-01-27 | Component-Aware Pruning Framework for Neural Network Controllers via Gradient-Based Importance Estimation | Ganesh Sundaram et.al. | 2601.19794 | null |
| 2026-01-27 | Interpretable and backpropagation-free Green Learning for efficient multi-task echocardiographic segmentation and classification | Jyun-Ping Kao et.al. | 2601.19743 | null |
| 2026-01-27 | LoPRo: Enhancing Low-Rank Quantization via Permuted Block-Wise Rotation | Hongyaoxing Gu et.al. | 2601.19675 | null |
| 2026-01-27 | AC^2-VLA: Action-Context-Aware Adaptive Computation in Vision-Language-Action Models for Efficient Robotic Manipulation | Wenda Yu et.al. | 2601.19634 | null |
| 2026-01-27 | GradPruner: Gradient-Guided Layer Pruning Enabling Efficient Fine-Tuning and Inference for LLMs | Wei Huang et.al. | 2601.19503 | null |
| 2026-01-27 | StableQAT: Stable Quantization-Aware Training at Ultra-Low Bitwidths | Tianyi Chen et.al. | 2601.19320 | null |
| 2026-01-27 | Reinforced Rate Control for Neural Video Compression via Inter-Frame Rate-Distortion Awareness | Wuyang Cong et.al. | 2601.19293 | null |
| 2026-01-27 | DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference | Fuliang Liu et.al. | 2601.19278 | null |
| 2026-01-27 | M $^{\text{2}}$ XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization | Weiming Hu et.al. | 2601.19213 | null |
| 2026-01-27 | Optimized $k$ -means color quantization of digital images in machine-based and human perception-based colorspaces | Ranjan Maitra et.al. | 2601.19117 | null |
| 2026-01-27 | EPAS: Efficient Training with Progressive Activation Sharing | Rezaul Karim et.al. | 2601.19089 | null |
| 2026-01-26 | Is Finer Better? The Limits of Microscaling Formats in Large Language Models | Andrea Fasoli et.al. | 2601.19026 | null |
| 2026-01-26 | EVEREST: An Evidential, Tail-Aware Transformer for Rare-Event Time-Series Forecasting | Antanas Zilinskas et.al. | 2601.19022 | null |
| 2026-01-26 | FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning | Haozheng Luo et.al. | 2601.19001 | null |
| 2026-01-26 | How Is Uncertainty Propagated in Knowledge Distillation? | Ziyao Cui et.al. | 2601.18909 | null |
| 2026-01-26 | XProvence: Zero-Cost Multilingual Context Pruning for Retrieval-Augmented Generation | Youssef Mohamed et.al. | 2601.18886 | null |
| 2026-01-26 | Low-Bit Quantization of Bandlimited Graph Signals via Iterative Methods | Felix Krahmer et.al. | 2601.18782 | null |
| 2026-01-26 | Goal-oriented Communication for Fast and Robust Robotic Fault Detection and Recovery | Shutong Chen et.al. | 2601.18765 | null |
| 2026-01-26 | Efficient Trotter-Suzuki Schemes for Long-time Quantum Dynamics | Marko Maležič et.al. | 2601.18756 | null |
| 2026-01-26 | Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models | Siyan Zhao et.al. | 2601.18734 | null |
| 2026-01-26 | AI-enabled Satellite Edge Computing: A Single-Pixel Feature based Shallow Classification Model for Hyperspectral Imaging | Li Fang et.al. | 2601.18560 | null |
| 2026-01-26 | XFit: Global Optimization and Degeneracy Mapping in X-ray Spectral Modeling | Austin MacMaster et.al. | 2601.18542 | null |
| 2026-01-26 | Hybrid Radar Fusion with Quantization: CRB-Rate Trade-offs and ADC Dynamic Range | Akhileswar Chowdary et.al. | 2601.18539 | null |
| 2026-01-26 | DisasterInsight: A Multimodal Benchmark for Function-Aware and Grounded Disaster Assessment | Sara Tehrani et.al. | 2601.18493 | null |
| 2026-01-26 | DV-VLN: Dual Verification for Reliable LLM-Based Vision-and-Language Navigation | Zijun Li et.al. | 2601.18492 | null |
| 2026-01-27 | An Adaptive Purification Controller for Quantum Networks: Dynamic Protocol Selection and Multipartite Distillation | Pranav Kulkarni et.al. | 2601.18351 | null |
| 2026-01-26 | Orchestrating Specialized Agents for Trustworthy Enterprise RAG | Xincheng You et.al. | 2601.18267 | null |
| 2026-01-26 | Facial Emotion Recognition on FER-2013 using an EfficientNetB2-Based Approach | Sahil Naik et.al. | 2601.18228 | null |
| 2026-01-26 | Multi-Perspective Subimage CLIP with Keyword Guidance for Remote Sensing Image-Text Retrieval | Yifan Li et.al. | 2601.18190 | null |
| 2026-01-27 | Quantum Recurrent Unit: A Parameter-Efficient Quantum Neural Network Architecture for NISQ Devices | Tzong-Daw Wu et.al. | 2601.18164 | null |
| 2026-01-26 | From LLMs to LRMs: Rethinking Pruning for Reasoning-Centric Models | Longwei Ding et.al. | 2601.18091 | null |
| 2026-01-25 | Systematic Characterization of Minimal Deep Learning Architectures: A Unified Analysis of Convergence, Pruning, and Quantization | Ziwei Zheng et.al. | 2601.17987 | null |
| 2026-01-25 | SD-E $^2$ : Semantic Exploration for Reasoning Under Token Budgets | Kshitij Mishra et.al. | 2601.17982 | null |
| 2026-01-25 | From Specialist to Generalist: Unlocking SAM’s Learning Potential on Unlabeled Medical Images | Vi Vu et.al. | 2601.17934 | null |
| 2026-01-25 | RemEdit: Efficient Diffusion Editing with Riemannian Geometry | Eashan Adhikarla et.al. | 2601.17927 | null |
| 2026-01-25 | ShapLoRA: Allocation of Low-rank Adaption on Large Language Models via Shapley Value Inspired Importance Estimation | Yi Zhao et.al. | 2601.17921 | null |
| 2026-01-25 | treaming-dLLM: Accelerating Diffusion LLMs via Suffix Pruning and Dynamic Decoding | Zhongyu Xiao et.al. | 2601.17917 | null |
| 2026-01-25 | Adaptive Weighting in Knowledge Distillation: An Axiomatic Framework for Multi-Scale Teacher Ensemble Optimization | Aaron R. Flouro et.al. | 2601.17910 | null |
| 2026-01-25 | Assessment of Generative Named Entity Recognition in the Era of Large Language Models | Qi Zhan et.al. | 2601.17898 | null |
| 2026-01-25 | VidLaDA: Bidirectional Diffusion Large Language Models for Efficient Video Understanding | Zhihao He et.al. | 2601.17868 | null |
| 2026-01-25 | ViTCoP: Accelerating Large Vision-Language Models via Visual and Textual Semantic Collaborative Pruning | Wen Luo et.al. | 2601.17818 | null |
| 2026-01-25 | Residual neural-field ptychography for dose-efficient electron, X-ray, and optical nanoscopy | Qianhao Zhao et.al. | 2601.17694 | null |
| 2026-01-24 | BrainDistill: Implantable Motor Decoding with Task-Specific Knowledge Distillation | Yuhan Xie et.al. | 2601.17625 | null |
| 2026-01-24 | Split-on-Share: Mixture of Sparse Experts for Task-Agnostic Continual Learning | Fatema Siddika et.al. | 2601.17616 | null |
| 2026-01-24 | Travelling Waves in Wolbachia Spread Dynamics | Zhuolin Qu et.al. | 2601.17590 | null |
| 2026-01-24 | Saliency Driven Imagery Preprocessing for Efficient Compression – Industrial Paper | Justin Downes et.al. | 2601.17555 | null |
| 2026-01-24 | Reconstructing Training Data from Adapter-based Federated Large Language Models | Silong Chen et.al. | 2601.17533 | null |
| 2026-01-24 | Less is More for RAG: Information Gain Pruning for Generator-Aligned Reranking and Evidence Selection | Zhipeng Song et.al. | 2601.17532 | null |
| 2026-01-24 | Efficient Dilated Squeeze and Excitation Neural Operator for Differential Equations | Prajwal Chauhan et.al. | 2601.17407 | null |
| 2026-01-24 | SMV-EAR: Bring Spatiotemporal Multi-View Representation Learning into Efficient Event-Based Action Recognition | Rui Fan et.al. | 2601.17391 | null |
| 2026-01-24 | Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws | Mohammed Fasha et.al. | 2601.17364 | null |
| 2026-01-24 | Spectral Geometry for Deep Learning: Compression and Hallucination Detection via Random Matrix Theory | Davide Ettori et.al. | 2601.17357 | null |
| 2026-01-24 | Dynamic Meta-Ensemble Framework for Efficient and Accurate Deep Learning in Plant Leaf Disease Detection on Resource-Constrained Edge Devices | Weloday Fikadu Moges et.al. | 2601.17290 | null |
| 2026-01-24 | Latent-Space Contrastive Reinforcement Learning for Stable and Efficient LLM Reasoning | Lianlei Shan et.al. | 2601.17275 | null |
| 2026-01-23 | JetFormer: A Scalable and Efficient Transformer for Jet Tagging from Offline Analysis to FPGA Triggers | Ruoqing Zheng et.al. | 2601.17215 | null |
| 2026-01-23 | AstroTimer: Rethinking Non-Access Stratum Timers in LEO Constellations | Arshiya Rezaie Hezaveh et.al. | 2601.17195 | null |
| 2026-01-23 | High-Rate Quantized Matrix Multiplication: Theory and Practice | Or Ordentlich et.al. | 2601.17187 | null |
| 2026-01-23 | Constrained Symplectic Quantization I: the Quantum Harmonic Oscillator | Martina Giachello et.al. | 2601.16963 | null |
| 2026-01-23 | Is BatchEnsemble a Single Model? On Calibration and Diversity of Efficient Ensembles | Anton Zamyatin et.al. | 2601.16936 | null |
| 2026-01-23 | Evaluating Large Vision-language Models for Surgical Tool Detection | Nakul Poudel et.al. | 2601.16895 | null |
| 2026-01-23 | PocketDVDNet: Realtime Video Denoising for Real Camera Noise | Crispian Morris et.al. | 2601.16780 | null |
| 2026-01-23 | SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents | Yuhang Wang et.al. | 2601.16746 | null |
| 2026-01-23 | Dirac-Bergmann algorithm and canonical quantization of $k$ -essence cosmology | Andrés Lueiza et.al. | 2601.16703 | null |
| 2026-01-23 | Fast, faithful and photorealistic diffusion-based image super-resolution with enhanced Flow Map models | Maxence Noble et.al. | 2601.16660 | null |
| 2026-01-23 | Typologically Informed Parameter Aggregation | Stef Accou et.al. | 2601.16629 | null |
| 2026-01-23 | AuroraEdge-V-2B: A Faster And Stronger Edge Visual Large Language Model | Xiang Chen et.al. | 2601.16615 | null |
| 2026-01-23 | Spiking Neural Networks for Communication Systems: Encoding Schemes, Learning Algorithms, and Equalization~Techniques | Eike-Manuel Edelmann et.al. | 2601.16550 | null |
| 2026-01-23 | LLM is Not All You Need: A Systematic Evaluation of ML vs. Foundation Models for text and image based Medical Classification | Meet Raval et.al. | 2601.16549 | null |
| 2026-01-23 | W4A16 Mixed-Precision Matrix Multiplication on Decoupled Architecture: Kernel Design and Memory Bottleneck Analysis for Ascend NPUs | Yuanhong He et.al. | 2601.16536 | null |
| 2026-01-23 | Indefinite Causal Order from Failure-to-Glue: Contextual Semantics and Parametric Time | Partha Ghose et.al. | 2601.16494 | null |
| 2026-01-23 | Log-Likelihood Loss for Semantic Compression | Anuj Kumar Yadav et.al. | 2601.16461 | null |
| 2026-01-22 | EdgeSpot: Efficient and High-Performance Few-Shot Model for Keyword Spotting | Oguzhan Buyuksolak et.al. | 2601.16316 | null |
| 2026-01-22 | Teaching and Evaluating LLMs to Reason About Polymer Design Related Tasks | Dikshya Mohanty et.al. | 2601.16312 | null |
| 2026-01-22 | LiDMaS: Architecture-Level Modeling of Fault-Tolerant Magic-State Injection in GKP Photonic Qubits | Dennis Delali Kwesi Wayo et.al. | 2601.16244 | null |
| 2026-01-22 | CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback | Wenhang Ge et.al. | 2601.16214 | null |
| 2026-01-22 | PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation | Onkar Susladkar et.al. | 2601.16210 | null |
| 2026-01-22 | Domain-Incremental Continual Learning for Robust and Efficient Keyword Spotting in Resource Constrained Systems | Prakash Dhungana et.al. | 2601.16158 | null |
| 2026-01-22 | SAMTok: Representing Any Mask with Two Words | Yikang Zhou et.al. | 2601.16093 | null |
| 2026-01-22 | DSFedMed: Dual-Scale Federated Medical Image Segmentation via Mutual Distillation Between Foundation and Lightweight Models | Hanwen Zhang et.al. | 2601.16073 | null |
| 2026-01-22 | DTP: A Simple yet Effective Distracting Token Pruning Framework for Vision-Language Action Models | Chenyang Li et.al. | 2601.16065 | null |
| 2026-01-22 | An Efficient Algorithm to Generate all Labeled Triangle-free Graphs with a given Graphical Degree Sequence | Kai Wang et.al. | 2601.15943 | null |
| 2026-01-22 | A Lightweight Brain-Inspired Machine Learning Framework for Coronary Angiography: Hybrid Neural Representation and Robust Learning Strategies | Jingsong Xia et.al. | 2601.15865 | null |
| 2026-01-22 | TinySense: Effective CSI Compression for Scalable and Accurate Wi-Fi Sensing | Toan Gian et.al. | 2601.15838 | null |
| 2026-01-22 | Improving the efficiency of QAOA using efficient parameter transfer initialization and targeted-single-layer regularized optimization with minimal performance degradation | Shubham Patel et.al. | 2601.15760 | null |
| 2026-01-22 | Communication-efficient Federated Graph Classification via Generative Diffusion Modeling | Xiuling Wang et.al. | 2601.15722 | null |
| 2026-01-22 | FlexLLM: Composable HLS Library for Flexible Hybrid LLM Accelerator Design | Jiahao Zhang et.al. | 2601.15710 | null |
| 2026-01-22 | D-Optimality-Guided Reinforcement Learning for Efficient Open-Loop Calibration of a 3-DOF Ankle Rehabilitation Robot | Qifan Hu et.al. | 2601.15707 | null |
| 2026-01-22 | Integrating Knowledge Distillation Methods: A Sequential Multi-Stage Framework | Yinxi Tian et.al. | 2601.15657 | null |
| 2026-01-22 | Scaling-Based Quantization of Spacetime Microstructure | Weihu Ma et.al. | 2601.15649 | null |
| 2026-01-21 | QUAIL: Quantization Aware Unlearning for Mitigating Misinformation in LLMs | Himanshu Mishra et.al. | 2601.15538 | null |
| 2026-01-21 | SAGE-FM: A lightweight and interpretable spatial transcriptomics foundation model | Xianghao Zhan et.al. | 2601.15504 | null |
| 2026-01-21 | Memorization Dynamics in Knowledge Distillation for Language Models | Jaydeep Borkar et.al. | 2601.15394 | null |
| 2026-01-21 | FedUMM: A General Framework for Federated Learning with Unified Multimodal Models | Zhaolong Su et.al. | 2601.15390 | null |
| 2026-01-21 | Towards Understanding Best Practices for Quantization of Vision-Language Models | Gautom Das et.al. | 2601.15287 | null |
| 2026-01-21 | Lightweight LLMs for Network Attack Detection in IoT Networks | Piyumi Bhagya Sudasinghe et.al. | 2601.15269 | null |
| 2026-01-21 | Metadata Conditioned Large Language Models for Localization | Anjishnu Mukherjee et.al. | 2601.15236 | null |
| 2026-01-21 | Overcoming In-Memory Bottlenecks in Graph Foundation Models via Retrieval-Augmented Generation | Haonan Yuan et.al. | 2601.15124 | null |
| 2026-01-21 | Parameter-Efficient Multi-Task Fine-Tuning in Code-Related Tasks | Md Zahidul Haque et.al. | 2601.15094 | null |
| 2026-01-21 | LoRAP: Low-Rank Aggregation Prompting for Quantized Graph Neural Networks Training | Chenyu Liu et.al. | 2601.15079 | null |
| 2026-01-21 | Efficient and Minimax-optimal In-context Nonparametric Regression with Transformers | Michelle Ching et.al. | 2601.15014 | null |
| 2026-01-21 | Solution-derived barium titanate waveguides for integrated electro-optic modulation | Virginia Falcone et.al. | 2601.14938 | null |
| 2026-01-21 | What Makes Low-Bit Quantization-Aware Training Work for Reasoning LLMs? A Systematic Study | Keyu Lv et.al. | 2601.14888 | null |
| 2026-01-21 | POTR: Post-Training 3DGS Compression | Bert Ramlot et.al. | 2601.14821 | null |
| 2026-01-21 | Efficient Beamforming for Discrete SIM-Aided Multiuser Systems Under Statistical CSI | Yuhui Jiao et.al. | 2601.14803 | null |
| 2026-01-21 | Training-Efficient Text-to-Music Generation with State-Space Modeling | Wei-Jaw Lee et.al. | 2601.14786 | null |
| 2026-01-21 | RefProtoFL: Communication-Efficient Federated Learning via External-Referenced Prototype Alignment | Hongyue Wu et.al. | 2601.14746 | null |
| 2026-01-21 | PULSE: Socially-Aware User Representation Modeling Toward Parameter-Efficient Graph Collaborative Filtering | Doyun Choi et.al. | 2601.14720 | null |
| 2026-01-21 | Triage knowledge distillation for speaker verification | Ju-ho Kim et.al. | 2601.14699 | null |
| 2026-01-21 | Maximum Edge-based Quasi-Clique: Novel Iterative Frameworks | Hongbo Xia et.al. | 2601.14619 | null |
| 2026-01-21 | IntelliSA: An Intelligent Static Analyzer for IaC Security Smell Detection Using Symbolic Rules and Neural Inference | Qiyue Mei et.al. | 2601.14595 | null |
| 2026-01-21 | Breaking the accuracy-resource dilemma: a lightweight adaptive video inference enhancement | Wei Ma et.al. | 2601.14568 | null |
| 2026-01-21 | QMC: Efficient SLM Edge Inference via Outlier-Aware Quantization and Emergent Memories Co-Design | Nilesh Prasad Pandey et.al. | 2601.14549 | null |
| 2026-01-22 | Structured Image-based Coding for Efficient Gaussian Splatting Compression | Pedro Martin et.al. | 2601.14510 | null |
| 2026-01-20 | Neutrino production mechanisms in strongly magnetized quark matter: Current status and open questions | Igor A. Shovkovy et.al. | 2601.14450 | null |
| 2026-01-20 | Layer-adaptive Expert Pruning for Pre-Training of Mixture-of-Experts Large Language Models | YuanLab. ai et.al. | 2601.14327 | null |
| 2026-01-20 | LRC-DHVC: Towards Local Rate Control in Neural Video Compression | Marc Windsheimer et.al. | 2601.14240 | null |
| 2026-01-20 | Domain-Adaptation through Synthetic Data: Fine-Tuning Large Language Models for German Law | Ali Hamza Bashir et.al. | 2601.14160 | null |
| 2026-01-20 | LLMOrbit: A Circular Taxonomy of Large Language Models -From Scaling Walls to Agentic AI Systems | Badri N. Patro et.al. | 2601.14053 | null |
| 2026-01-20 | Kakugo: Distillation of Low-Resource Languages into Small Language Models | Peter Devine et.al. | 2601.14051 | null |
| 2026-01-20 | Differentiable Logic Synthesis: Spectral Coefficient Selection via Sinkhorn-Constrained Composition | Gorgi Pavlov et.al. | 2601.13953 | null |
| 2026-01-21 | Chain-of-Thought Compression Should Not Be Blind: V-Skip for Efficient Multimodal Reasoning via Dual-Path Anchoring | Dongxu Zhang et.al. | 2601.13879 | null |
| 2026-01-20 | An efficient treatment of heat-flux boundary conditions in GSIS for rarefied gas flows | Yanbing Zhang et.al. | 2601.13870 | null |
| 2026-01-20 | MirageNet:A Secure, Efficient, and Scalable On-Device Model Protection in Heterogeneous TEE and GPU System | Huadi Zheng et.al. | 2601.13826 | null |
| 2026-01-20 | Three-dimensional properties of a coronal shock and the longitudinal distribution of its related solar energetic particles | Yue Zhou et.al. | 2601.13692 | null |
| 2026-01-20 | Ultra-Lightweight Network for Ship-Radiated Sound Classification on Embedded Deployment | Sangwon Park et.al. | 2601.13679 | null |
| 2026-01-20 | Direct Finite-Time Contraction (Step-Log) Profiling–Driven Optimization of Parallel Schemes for Nonlinear Problems on Multicore Architectures | Mudassir Shams et.al. | 2601.13637 | null |
| 2026-01-20 | A Kubernetes custom scheduler based on reinforcement learning for compute-intensive pods | Hanlin Zhou et.al. | 2601.13579 | null |
| 2026-01-21 | ButterflyMoE: Sub-Linear Ternary Experts via Structured Butterfly Orbits | Aryan Karmore et.al. | 2601.13563 | null |
| 2026-01-20 | DIS2: Disentanglement Meets Distillation with Classwise Attention for Robust Remote Sensing Segmentation under Missing Modalities | Nhi Kieu et.al. | 2601.13502 | null |
| 2026-01-19 | Quantum Circuit Pruning: Improving Fidelity via Compilation-Aware Circuit Approximation | Pau Escofet et.al. | 2601.13322 | null |
| 2026-01-19 | Verifying Local Robustness of Pruned Safety-Critical Networks | Minh Le et.al. | 2601.13303 | null |
| 2026-01-19 | An efficient model of cosmology dependence in the covariance matrix of the matter power spectrum | Theodore Steele et.al. | 2601.13245 | null |
| 2026-01-19 | Co-Channel Interference Mitigation Using Deep Learning for Drone-Based Large-Scale Antenna Measurements | Kadyrzhan Tortayev et.al. | 2601.13205 | null |
| 2026-01-19 | Onsager’s Mean Field Theory of Vortex Flows with Singular Sources: Blow-Up and Concentration without Quantization | Daniele Bartolucci et.al. | 2601.13192 | null |
| 2026-01-19 | Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference | Zimeng Wu et.al. | 2601.13155 | null |
| 2026-01-19 | Recursive Meta-Distillation: An Axiomatic Framework for Iterative Knowledge Refinement | Aaron R. Flouro et.al. | 2601.13100 | null |
| 2026-01-19 | PaperGuide: Making Small Language-Model Paper-Reading Agents More Efficient | Zijian Wang et.al. | 2601.12988 | null |
| 2026-01-19 | Sparse ActionGen: Accelerating Diffusion Policy with Real-time Pruning | Kangye Ji et.al. | 2601.12894 | null |
| 2026-01-19 | SCULPT: Constraint-Guided Pruned MCTS that Carves Efficient Paths for Mathematical Reasoning | Qitong Fang et.al. | 2601.12842 | null |
| 2026-01-19 | CSGaussian: Progressive Rate-Distortion Compression and Segmentation for 3D Gaussian Splatting | Yu-Jen Tseng et.al. | 2601.12814 | null |
| 2026-01-19 | Distilling Time Series Foundation Models for Efficient Forecasting | Yuqi Li et.al. | 2601.12785 | null |
| 2026-01-19 | CodeSep: Low-Bitrate Codec-Driven Speech Separation with Base-Token Disentanglement and Auxiliary-Token Serial Prediction | Hui-Peng Du et.al. | 2601.12757 | null |
| 2026-01-19 | P2L-CA: An Effective Parameter Tuning Framework for Rehearsal-Free Multi-Label Class-Incremental Learning | Songlin Dong et.al. | 2601.12714 | null |
| 2026-01-19 | BlocksecRT-DETR: Decentralized Privacy-Preserving and Token-Efficient Federated Transformer Learning for Secure Real-Time Object Detection in ITS | Mohoshin Ara Tahera et.al. | 2601.12693 | null |
| 2026-01-19 | Mixed Precision PointPillars for Efficient 3D Object Detection with TensorRT | Ninnart Fuengfusin et.al. | 2601.12638 | null |
| 2026-01-18 | Mixtenna: A Self-Biased Nonlinear Patch Antenna for Passive Third-Harmonic Radiation | Yishai Brill et.al. | 2601.12462 | null |
| 2026-01-18 | LiQSS: Post-Transformer Linear Quantum-Inspired State-Space Tensor Networks for Real-Time 6G | Farhad Rezazadeh et.al. | 2601.12375 | null |
| 2026-01-18 | Efficient classical simulation of time dynamics in Fermi-Hubbard models with imaginary interactions | Raul A. Santos et.al. | 2601.12368 | null |
| 2026-01-18 | FlowIID: Single-Step Intrinsic Image Decomposition via Latent Flow Matching | Mithlesh Singla et.al. | 2601.12329 | null |
| 2026-01-18 | Adaptive Multi-Scale Correlation Meta-Network for Few-Shot Remote Sensing Image Classification | Anurag Kaushish et.al. | 2601.12308 | null |
| 2026-01-18 | AgenticPruner: MAC-Constrained Neural Network Compression via LLM-Driven Strategy Search | Shahrzad Esmat et.al. | 2601.12272 | null |
| 2026-01-16 | MHA2MLA-VLM: Enabling DeepSeek’s Economical Multi-Head Latent Attention across Vision-Language Models | Xiaoran Fan et.al. | 2601.11464 | null |
| 2026-01-16 | IMS: Intelligent Hardware Monitoring System for Secure SoCs | Wadid Foudhaili et.al. | 2601.11447 | null |
| 2026-01-16 | FEATHer: Fourier-Efficient Adaptive Temporal Hierarchy Forecaster for Time-Series Forecasting | Jaehoon Lee et.al. | 2601.11350 | null |
| 2026-01-16 | X-Distill: Cross-Architecture Vision Distillation for Visuomotor Learning | Maanping Shao et.al. | 2601.11269 | null |
| 2026-01-16 | Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation | Pingzhi Tang et.al. | 2601.11258 | null |
| 2026-01-16 | Language-Agnostic Visual Embeddings for Cross-Script Handwriting Retrieval | Fangke Chen et.al. | 2601.11248 | null |
| 2026-01-16 | SDFLoRA: Selective Dual-Module LoRA for Federated Fine-tuning with Heterogeneous Clients | Zhikang Shen et.al. | 2601.11219 | null |
| 2026-01-16 | FAQ: Mitigating Quantization Error via Regenerating Calibration Data with Family-Aware Quantization | Haiyang Xiao et.al. | 2601.11200 | null |
| 2026-01-16 | Democratizing planetary-scale analysis: An ultra-lightweight Earth embedding database for accurate and flexible global land monitoring | Shuang Chen et.al. | 2601.11183 | null |
| 2026-01-16 | PruneRAG: Confidence-Guided Query Decomposition Trees for Efficient Retrieval-Augmented Generation | Shuguang Jiao et.al. | 2601.11024 | null |
| 2026-01-15 | EncodeRec: An Embedding Backbone for Recommendation Systems | Guy Hadad et.al. | 2601.10837 | null |
| 2026-01-15 | Mugi: Value Level Parallelism For Efficient LLMs | Daniel Price et.al. | 2601.10823 | null |
| 2026-01-15 | Towards Tensor Network Models for Low-Latency Jet Tagging on FPGAs | Alberto Coppi et.al. | 2601.10801 | null |
| 2026-01-15 | Astrometric microlensing probes of the isolated neutron star population with Roman | Zofia Kaczmarek et.al. | 2601.10789 | null |
| 2026-01-14 | Pruning as Evolution: Emergent Sparsity Through Selection Dynamics in Neural Networks | Zubair Shah et.al. | 2601.10765 | null |
| 2026-01-15 | From One-to-One to Many-to-Many: Dynamic Cross-Layer Injection for Deep Vision-Language Fusion | Cheng Chen et.al. | 2601.10710 | null |
| 2026-01-15 | Communication-Efficient and Privacy-Adaptable Mechanism – a Federated Learning Scheme with Convergence Analysis | Chun Hei Michael Shiu et.al. | 2601.10701 | null |
| 2026-01-15 | PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution | Minghao Yan et.al. | 2601.10657 | null |
| 2026-01-15 | Representation-Aware Unlearning via Activation Signatures: From Suppression to Knowledge-Signature Erasure | Syed Naveed Mahmood et.al. | 2601.10566 | null |
| 2026-01-15 | TF3-RO-50M: Training Compact Romanian Language Models from Scratch on Synthetic Moral Microfiction | Mihai Dan Nadas et.al. | 2601.10410 | null |
| 2026-01-15 | coTherapist: A Behavior-Aligned Small Language Model to Support Mental Healthcare Experts | Prottay Kumar Adhikary et.al. | 2601.10246 | null |
| 2026-01-15 | LOOKAT: Lookup-Optimized Key-Attention for Memory-Efficient Transformers | Aryan Karmore et.al. | 2601.10155 | null |
| 2026-01-15 | Privacy Enhanced PEFT: Tensor Train Decomposition Improves Privacy Utility Tradeoffs under DP-SGD | Pradip Kunwar et.al. | 2601.10045 | null |
| 2026-01-15 | Instruction Finetuning LLaMA-3-8B Model Using LoRA for Financial Named Entity Recognition | Zhiming Lian et.al. | 2601.10043 | null |
| 2026-01-15 | Resistive Memory based Efficient Machine Unlearning and Continual Learning | Ning Lin et.al. | 2601.10037 | null |
| 2026-01-15 | FaTRQ: Tiered Residual Quantization for LLM Vector Search in Far-Memory-Aware ANNS Systems | Tianqi Zhang et.al. | 2601.09985 | null |
| 2026-01-14 | Advancing Model Refinement: Muon-Optimized Distillation and Quantization for LLM Deployment | Jacob Sander et.al. | 2601.09865 | null |
| 2026-01-16 | NanoSD: Edge Efficient Foundation Model for Real Time Image Restoration | Subhajit Sanyal et.al. | 2601.09823 | null |
| 2026-01-14 | QFed: Parameter-Compact Quantum-Classical Federated Learning | Samar Abdelghani et.al. | 2601.09809 | null |
| 2026-01-14 | ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation | Sicong Liu et.al. | 2601.09703 | null |
| 2026-01-14 | COMPOSE: Hypergraph Cover Optimization for Multi-view 3D Human Pose Estimation | Tony Danjun Wang et.al. | 2601.09698 | null |
| 2026-01-14 | LLMs can Compress LLMs: Adaptive Pruning by Agents | Sai Varun Kodathala et.al. | 2601.09694 | null |
| 2026-01-14 | Disentangling Task Conflicts in Multi-Task LoRA via Orthogonal Gradient Projection | Ziyu Yang et.al. | 2601.09684 | null |
| 2026-01-14 | Quantization Commutes with Reduction of Chern-Simons Gauge Theory | Geyang Dai et.al. | 2601.09666 | null |
| 2026-01-14 | Exploring Fine-Tuning for Tabular Foundation Models | Aditya Tanna et.al. | 2601.09654 | null |
| 2026-01-14 | Benchmarking Post-Training Quantization of Large Language Models under Microscaling Floating Point Formats | Manyi Zhang et.al. | 2601.09555 | null |
| 2026-01-14 | Strange quark star I: the maximum gravitational mass and deformation of magnetized spinning model | Fatemeh Kayanikhoo et.al. | 2601.09529 | null |
| 2026-01-14 | CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion | Ralf Römer et.al. | 2601.09512 | null |
| 2026-01-14 | Unifying Search and Recommendation in LLMs via Gradient Multi-Subspace Tuning | Jujia Zhao et.al. | 2601.09496 | null |
| 2026-01-14 | How many users have been here for a long time? Efficient solutions for counting long aggregated visits | Peyman Afshani et.al. | 2601.09489 | null |
| 2026-01-14 | Analysis of the Maximum Prediction Gain of Short-Term Prediction on Sustained Speech | Reemt Hinrichs et.al. | 2601.09461 | null |
| 2026-01-14 | GeoRA: Geometry-Aware Low-Rank Adaptation for RLVR | Jiaying Zhang et.al. | 2601.09361 | null |
| 2026-01-14 | Spectral Complex Autoencoder Pruning: A Fidelity-Guided Criterion for Extreme Structured Channel Compression | Wei Liu et.al. | 2601.09352 | null |
| 2026-01-14 | Arbitrary fractional quantization in Dirac systems | Christos Papapanos et.al. | 2601.09331 | null |
| 2026-01-14 | On-Device Large Language Models for Sequential Recommendation | Xin Xia et.al. | 2601.09306 | null |
| 2026-01-14 | TIDI-GS: Floater Suppression in 3D Gaussian Splatting for Enhanced Indoor Scene Fidelity | Sooyeun Yang et.al. | 2601.09291 | null |
| 2026-01-14 | RISER: Orchestrating Latent Reasoning Skills for Adaptive Activation Steering | Wencheng Ye et.al. | 2601.09269 | null |
| 2026-01-14 | A Theoretical Framework for Rate-Distortion Limits in Learned Image Compression | Changshuo Wang et.al. | 2601.09254 | null |
| 2026-01-14 | Integrating Diverse Assignment Strategies into DETRs | Yiwei Zhang et.al. | 2601.09247 | null |
| 2026-01-14 | CLIDD: Cross-Layer Independent Deformable Description for Efficient and Discriminative Local Feature Representation | Haodi Yao et.al. | 2601.09230 | null |
| 2026-01-14 | Pairing-free Group-level Knowledge Distillation for Robust Gastrointestinal Lesion Classification in White-Light Endoscopy | Qiang Hu et.al. | 2601.09209 | null |
| 2026-01-14 | From Performance to Practice: Knowledge-Distilled Segmentator for On-Premises Clinical Workflows | Qizhen Lan et.al. | 2601.09191 | null |
| 2026-01-14 | OrthoGeoLoRA: Geometric Parameter-Efficient Fine-Tuning for Structured Social Science Concept Retrieval on theWeb | Zeqiang Wang et.al. | 2601.09185 | null |
| 2026-01-14 | $D^2Prune$ : Sparsifying Large Language Models via Dual Taylor Expansion and Attention Distribution Awareness | Lang Xiong et.al. | 2601.09176 | null |
| 2026-01-14 | N-EIoU-YOLOv9: A Signal-Aware Bounding Box Regression Loss for Lightweight Mobile Detection of Rice Leaf Diseases | Dung Ta Nguyen Duc et.al. | 2601.09170 | null |
| 2026-01-14 | Multi-Teacher Ensemble Distillation: A Mathematical Framework for Probability-Domain Knowledge Aggregation | Aaron R. Flouro et.al. | 2601.09165 | null |
| 2026-01-14 | SkinFlow: Efficient Information Transmission for Open Dermatological Diagnosis via Dynamic Visual Encoding and Staged RL | Lijun Liu et.al. | 2601.09136 | null |
| 2026-01-14 | LPCAN: Lightweight Pyramid Cross-Attention Network for Rail Surface Defect Detection Using RGB-D Data | Jackie Alex et.al. | 2601.09118 | null |
| 2026-01-14 | LP-LLM: End-to-End Real-World Degraded License Plate Text Recognition via Large Multimodal Models | Haoyan Gong et.al. | 2601.09116 | null |
| 2026-01-14 | Hidden States as Early Signals: Step-level Trace Evaluation and Pruning for Efficient Test-Time Scaling | Zhixiang Liang et.al. | 2601.09093 | null |
| 2026-01-14 | Efficient Multilingual Dialogue Processing via Translation Pipelines and Distilled Language Models | Santiago Martínez Novoa et.al. | 2601.09059 | null |
| 2026-01-13 | Semiparametric Efficient Data Integration Using the Dual-Frame Sampling Framework | Kosuke Morikawa et.al. | 2601.08707 | null |
| 2026-01-13 | Efficient Parameter Calibration of Numerical Weather Prediction Models via Evolutionary Sequential Transfer Optimization | Heping Fang et.al. | 2601.08663 | null |
| 2026-01-13 | SfMamba: Efficient Source-Free Domain Adaptation via Selective Scan Modeling | Xi Chen et.al. | 2601.08608 | null |
| 2026-01-13 | Bridging Theory and Experiment in Virtually Imaged Phased Array (VIPA) Spectrometers | Kiumars Aryana et.al. | 2601.08589 | null |
| 2026-01-13 | Ministral 3 | Alexander H. Liu et.al. | 2601.08584 | null |
| 2026-01-13 | JudgeRLVR: Judge First, Generate Second for Efficient Reasoning | Jiangshan Duo et.al. | 2601.08468 | null |
| 2026-01-13 | Hybrid Distillation with CoT Guidance for Edge-Drone Control Code Generation | Yizhan Feng et.al. | 2601.08412 | null |
| 2026-01-13 | An Efficient Algorithm to Sample Quantum Low-Density Parity-Check Codes | Paolo Santini et.al. | 2601.08387 | null |
| 2026-01-13 | RotCurves: A PYTHON package for efficient modelling and fitting of galactic rotation curves at high-z | A. Nestor Shachar et.al. | 2601.08348 | null |
| 2026-01-13 | ReCo-KD: Region- and Context-Aware Knowledge Distillation for Efficient 3D Medical Image Segmentation | Qizhen Lan et.al. | 2601.08301 | null |
| 2026-01-13 | Variable-Length Wideband CSI Feedback via Loewner Interpolation and Deep Learning | Meilin Li et.al. | 2601.08300 | null |
| 2026-01-13 | Human-inspired Global-to-Parallel Multi-scale Encoding for Lightweight Vision Models | Wei Xu et.al. | 2601.08190 | null |
| 2026-01-13 | Relational Knowledge Distillation Using Fine-tuned Function Vectors | Andrea Kang et.al. | 2601.08169 | null |
| 2026-01-13 | Q-realign: Piggybacking Realignment on Quantization for Safe and Efficient LLM Deployment | Qitao Tan et.al. | 2601.08089 | null |
| 2026-01-12 | LUT-Compiled Kolmogorov-Arnold Networks for Lightweight DoS Detection on IoT Edge Devices | Oleksandr Kuznetsov et.al. | 2601.08044 | null |
| 2026-01-12 | InfGraND: An Influence-Guided GNN-to-MLP Knowledge Distillation | Amir Eskandari et.al. | 2601.08033 | null |
| 2026-01-12 | DYCP: Dynamic Context Pruning for Long-Form Dialogue with LLMs | Nayoung Choi et.al. | 2601.07994 | null |
| 2026-01-12 | LWMSCNN-SE: A Lightweight Multi-Scale Network for Efficient Maize Disease Classification on Edge Devices | Fikadu Weloday et.al. | 2601.07957 | null |
| 2026-01-12 | Sherry: Hardware-Efficient 1.25-Bit Ternary Quantization via Fine-grained Sparsification | Hong Huang et.al. | 2601.07892 | null |
| 2026-01-12 | KVzap: Fast, Adaptive, and Faithful KV Cache Pruning | Simon Jegou et.al. | 2601.07891 | null |
| 2026-01-12 | Vision-Language Model for Accurate Crater Detection | Patrick Bauer et.al. | 2601.07795 | null |
| 2026-01-12 | Benchmarking Small Language Models and Small Reasoning Language Models on System Log Severity Classification | Yahya Masri et.al. | 2601.07790 | null |
| 2026-01-13 | Free-RBF-KAN: Kolmogorov-Arnold Networks with Adaptive Radial Basis Functions for Efficient Function Learning | Shao-Ting Chiu et.al. | 2601.07760 | null |
| 2026-01-12 | Tab-TRM: Tiny Recursive Model for Insurance Pricing on Tabular Data | Kishan Padayachy et.al. | 2601.07675 | null |
| 2026-01-12 | Adaptive Layer Selection for Layer-Wise Token Pruning in LLM Inference | Rei Taniguchi et.al. | 2601.07667 | null |
| 2026-01-12 | Quantization-scheme-Independent Energy and Its Implications for Holographic Bounds | Ze Li et.al. | 2601.07607 | null |
| 2026-01-12 | Vector Quantized-Aided XL-MIMO CSI Feedback with Channel Adaptive Transmission | Yuhang Ma et.al. | 2601.07584 | null |
| 2026-01-12 | Backpropagation-Free Test-Time Adaptation for Lightweight EEG-Based Brain-Computer Interfaces | Siyang Li et.al. | 2601.07556 | null |
| 2026-01-12 | High-Rank Structured Modulation for Parameter-Efficient Fine-Tuning | Yongkang Liu et.al. | 2601.07507 | null |
| 2026-01-12 | ARCQuant: Boosting NVFP4 Quantization with Augmented Residual Channels for LLMs | Haoqian Meng et.al. | 2601.07475 | null |
| 2026-01-12 | Knowledge Distillation for LLM-Based Human Activity Recognition in Homes | Julien Cumin et.al. | 2601.07469 | null |
| 2026-01-12 | From Sketch to Fresco: Efficient Diffusion Transformer with Progressive Resolution | Shikang Zheng et.al. | 2601.07462 | null |
| 2026-01-12 | SDHSI-Net: Learning Better Representations for Hyperspectral Images via Self-Distillation | Prachet Dev Singh et.al. | 2601.07416 | null |
| 2026-01-12 | Forecast the Principal, Stabilize the Residual: Subspace-Aware Feature Caching for Efficient Diffusion Transformers | Guantao Chen et.al. | 2601.07396 | null |
| 2026-01-12 | Software-Hardware Co-optimization for Modular E2E AV Paradigm: A Unified Framework of Optimization Approaches, Simulation Environment and Evaluation Metrics | Chengzhi Ji et.al. | 2601.07393 | null |
| 2026-01-12 | MI-PRUN: Optimize Large Language Model Pruning via Mutual Information | Hao Zhang et.al. | 2601.07212 | null |
| 2026-01-12 | Active Context Compression: Autonomous Memory Management in LLM Agents | Nikhil Verma et.al. | 2601.07190 | null |
| 2026-01-12 | Stable On-Policy Distillation through Adaptive Target Reformulation | Ijun Jang et.al. | 2601.07155 | null |
| 2026-01-11 | Robust Mean Estimation under Quantization | Pedro Abdalla et.al. | 2601.07074 | null |
| 2026-01-11 | Jasper: ANNS Quantized for Speed, Built for Change on GPU | Hunter McCoy et.al. | 2601.07048 | null |
| 2026-01-11 | Magnetic winds in resistive compact binary discs | Marc Van den Bossche et.al. | 2601.06994 | null |
| 2026-01-11 | HAS-VQ: Hessian-Adaptive Sparse Vector Quantization for High-Fidelity LLM Compression | Vladimer Khasia et.al. | 2601.06959 | null |
| 2026-01-11 | TagSpeech: End-to-End Multi-Speaker ASR and Diarization with Fine-Grained Temporal Grounding | Mingyue Huo et.al. | 2601.06896 | null |
| 2026-01-11 | SecMoE: Communication-Efficient Secure MoE Inference via Select-Then-Compute | Bowen Shen et.al. | 2601.06790 | null |
| 2026-01-11 | Artificial Entanglement in the Fine-Tuning of Large Language Models | Min Chen et.al. | 2601.06788 | null |
| 2026-01-11 | Garbage Attention in Large Language Models: BOS Sink Heads and Sink-aware Pruning | Jaewon Sok et.al. | 2601.06787 | null |
| 2026-01-10 | GRASP LoRA: GRPO Guided Adapter Sparsity Policy for Cross Lingual Transfer | Besher Hassan et.al. | 2601.06702 | null |
| 2026-01-10 | Families of Toeplitz operators, family index and deformation quantization | Clément Cren et.al. | 2601.06619 | null |
| 2026-01-10 | Joint Impact of ADC and Fronthaul Quantization in Cell-Free Massive MIMO-OFDM Uplink | Özlem Tuğfe Demir et.al. | 2601.06483 | null |
| 2026-01-10 | PRISP: Privacy-Safe Few-Shot Personalization via Lightweight Adaptation | Junho Park et.al. | 2601.06471 | null |
| 2026-01-10 | SecureDyn-FL: A Robust Privacy-Preserving Federated Learning Framework for Intrusion Detection in IoT Networks | Imtiaz Ali Soomro et.al. | 2601.06466 | null |
| 2026-01-10 | Gecko: An Efficient Neural Architecture Inherently Processing Sequences with Arbitrary Lengths | Xuezhe Ma et.al. | 2601.06463 | null |
| 2026-01-09 | Monkey Jump : MoE-Style PEFT for Efficient Multi-Task Learning | Nusrat Jahan Prottasha et.al. | 2601.06356 | null |
| 2026-01-09 | Why LoRA Fails to Forget: Regularized Low-Rank Adaptation Against Backdoors in Language Models | Hoang-Chau Luong et.al. | 2601.06305 | null |
| 2026-01-09 | Real-Time Image Processing Algorithms for Embedded Systems | Soundes Oumaima Boufaida et.al. | 2601.06243 | null |
| 2026-01-09 | Distilling Lightweight Domain Experts from Large ML Models by Identifying Relevant Subspaces | Pattarawat Chormai et.al. | 2601.05913 | null |
| 2026-01-09 | FLRQ: Faster LLM Quantization with Flexible Low-Rank Matrix Sketching | Hongyaoxing Gul et.al. | 2601.05684 | null |
| 2026-01-09 | Compressing image encoders via latent distillation | Caroline Mazini Rodrigues et.al. | 2601.05639 | null |
| 2026-01-09 | LatentVLA: Efficient Vision-Language Models for Autonomous Driving via Latent Action Prediction | Chengen Xie et.al. | 2601.05611 | null |
| 2026-01-09 | AntibodyDesignBFN: High-Fidelity Fixed-Backbone Antibody Design via Discrete Bayesian Flow Networks | Yue Hu et.al. | 2601.05605 | null |
| 2026-01-09 | Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection | Hanyi Wang et.al. | 2601.05580 | null |
| 2026-01-09 | One Language-Free Foundation Model Is Enough for Universal Vision Anomaly Detection | Bin-Bin Gao et.al. | 2601.05552 | null |
| 2026-01-09 | Discrete Homogeneity and Quantizer Design for Nonlinear Homogeneous Control Systems | Yu Zhou et.al. | 2601.05526 | null |
| 2026-01-08 | Efficient Inference for Noisy LLM-as-a-Judge Evaluation | Yiqun T Chen et.al. | 2601.05420 | null |
| 2026-01-08 | Interactive Distillation for Cooperative Multi-Agent Reinforcement Learning | Minwoo Cho et.al. | 2601.05407 | null |
| 2026-01-08 | Markovian Compression: Looking to the Past Helps Accelerate the Future | Andrey Veprikov et.al. | 2601.05398 | null |
| 2026-01-08 | Sketch&Patch++: Efficient Structure-Aware 3D Gaussian Representation | Yuang Shi et.al. | 2601.05394 | null |
| 2026-01-08 | Knowledge Distillation of a Protein Language Model Yields a Foundational Implicit Solvent Model | Justin Airas et.al. | 2601.05388 | null |
| 2026-01-08 | EdgeLDR: Quaternion Low-Displacement Rank Neural Networks for Edge-Efficient Deep Learning | Vladimir Frants et.al. | 2601.05379 | null |
| 2026-01-08 | MOSAIC-GS: Monocular Scene Reconstruction via Advanced Initialization for Complex Dynamic Environments | Svitlana Morkva et.al. | 2601.05368 | null |
| 2026-01-08 | STResNet & STYOLO : A New Family of Compact Classification and Object Detection Models for MCUs | Sudhakar Sah et.al. | 2601.05364 | null |
| 2026-01-08 | Microscopic Unitarity and the Quantization of Black Hole Evaporation Time | Ahmad Adel Abutaleb et.al. | 2601.05305 | null |
| 2026-01-08 | RelayLLM: Efficient Reasoning via Collaborative Decoding | Chengsong Huang et.al. | 2601.05167 | null |
| 2026-01-08 | Learning Mixture Models via Efficient High-dimensional Sparse Fourier Transforms | Alkis Kalavasis et.al. | 2601.05157 | null |
| 2026-01-08 | ArcAligner: Adaptive Recursive Aligner for Compressed Context Embeddings in RAG | Jianbo Li et.al. | 2601.05038 | link |
| 2026-01-08 | Guided Variational Network for Image Decomposition | Alessandro Lanza et.al. | 2601.04999 | null |
| 2026-01-08 | ConMax: Confidence-Maximizing Compression for Efficient Chain-of-Thought Reasoning | Minda Hu et.al. | 2601.04973 | null |
| 2026-01-08 | DR-LoRA: Dynamic Rank LoRA for Mixture-of-Experts Adaptation | Guanzhi Deng et.al. | 2601.04823 | null |
| 2026-01-08 | GPU-Accelerated INT8 Quantization for KV Cache Compression in Large Language Models | Maanas Taneja et.al. | 2601.04719 | null |
| 2026-01-08 | PROMISE: Process Reward Models Unlock Test-Time Scaling Laws in Generative Recommendations | Chengcheng Guo et.al. | 2601.04674 | null |
| 2026-01-08 | FedKDX: Federated Learning with Negative Knowledge Distillation for Enhanced Healthcare AI Systems | Quang-Tu Pham et.al. | 2601.04587 | null |
| 2026-01-08 | TokenSeg: Efficient 3D Medical Image Segmentation via Hierarchical Visual Token Compression | Sen Zeng et.al. | 2601.04519 | null |
| 2026-01-07 | Exact Multimode Quantization of Superconducting Circuits via Boundary Admittance | Mustafa Bakr et.al. | 2601.04407 | null |
| 2026-01-07 | SCAR-GS: Spatial Context Attention for Residuals in Progressive Gaussian Splatting | Diego Revilla et.al. | 2601.04348 | null |
| 2026-01-07 | MemKD: Memory-Discrepancy Knowledge Distillation for Efficient Time Series Classification | Nilushika Udayangani et.al. | 2601.04264 | null |
| 2026-01-07 | Learning to Reason: Temporal Saliency Distillation for Interpretable Knowledge Transfer | Nilushika Udayangani Hewa Dehigahawattage et.al. | 2601.04263 | null |
| 2026-01-07 | Safety-Utility Conflicts Are Not Global: Surgical Alignment via Head-Level Diagnosis | Wang Cai et.al. | 2601.04262 | null |
| 2026-01-07 | ToTMNet: FFT-Accelerated Toeplitz Temporal Mixing Network for Lightweight Remote Photoplethysmography | Vladimir Frants et.al. | 2601.04159 | null |
| 2026-01-07 | Hybrid Downlink Beamforming with Outage Constraints under Imperfect CSI using Model-Driven Deep Learning | Lukas Schynol et.al. | 2601.04069 | null |
| 2026-01-07 | A Scheduling Framework for Efficient MoE Inference on Edge GPU-NDP Systems | Qi Wu et.al. | 2601.03992 | null |
| 2026-01-07 | Using Small Language Models to Reverse-Engineer Machine Learning Pipelines Structures | Nicolas Lacroix et.al. | 2601.03988 | null |
| 2026-01-07 | FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection | Mingyu Ouyang et.al. | 2601.03928 | null |
| 2026-01-07 | Evaluating Small Decoder-Only Language Models for Grammar Correction and Text Simplification | Anthony Lamelas et.al. | 2601.03874 | null |
| 2026-01-07 | MPM-QIR: Measurement-Probability Matching for Quantum Image Representation and Compression via Variational Quantum Circuit | Chong-Wei Wang et.al. | 2601.03855 | null |
| 2026-01-07 | Rethinking Table Pruning in TableQA: From Sequential Revisions to Gold Trajectory-Supervised Parallel Search | Yu Guo et.al. | 2601.03851 | null |
| 2026-01-07 | Unified and Efficient Analysis of Machining Chatter and Surface Location Error | Woraphrut Kornmaneesang et.al. | 2601.03819 | null |
| 2026-01-07 | AI Generated Text Detection | Adilkhan Alikhanov et.al. | 2601.03812 | null |
| 2026-01-07 | Improving Compactness and Reducing Ambiguity of CFIRE Rule-Based Explanations | Sebastian Müller et.al. | 2601.03776 | null |
| 2026-01-07 | Topological quantization of vector meson anomalous couplings | Chao-Qiang Geng et.al. | 2601.03740 | null |
| 2026-01-07 | Investigating Knowledge Distillation Through Neural Networks for Protein Binding Affinity Prediction | Wajid Arshad Abbasi et.al. | 2601.03704 | null |
| 2026-01-07 | ELO: Efficient Layer-Specific Optimization for Continual Pretraining of Multilingual LLMs | HanGyeol Yoo et.al. | 2601.03648 | null |
| 2026-01-07 | PhysicsFormer: An Efficient and Fast Attention-Based Physics Informed Neural Network for Solving Incompressible Navier Stokes Equations | Biswanath Barman et.al. | 2601.03613 | null |
| 2026-01-07 | Policy-Guided Search on Tree-of-Thoughts for Efficient Problem Solving with Bounded Language Model Queries | Sumedh Pendurkar et.al. | 2601.03606 | null |
| 2026-01-07 | Stratified Pseudobundles and Quantization | Ethan Ross et.al. | 2601.03544 | null |
| 2026-01-07 | Cyberattack Detection in Virtualized Microgrids Using LightGBM and Knowledge-Distilled Classifiers | Osasumwen Cedric Ogiesoba-Eguakun et.al. | 2601.03495 | null |
| 2026-01-07 | From Bits to Chips: An LLM-based Hardware-Aware Quantization Agent for Streamlined Deployment of LLMs | Kaiyuan Deng et.al. | 2601.03484 | null |
| 2026-01-06 | Implicit Graph, Explicit Retrieval: Towards Efficient and Interpretable Long-horizon Memory for Large Language Models | Xin Zhang et.al. | 2601.03417 | null |
| 2026-01-06 | PIVONet: A Physically-Informed Variational Neuro ODE Model for Efficient Advection-Diffusion Fluid Simulation | Hei Shing Cheung et.al. | 2601.03397 | null |
| 2026-01-06 | Edit2Restore:Few-Shot Image Restoration via Parameter-Efficient Adaptation of Pre-trained Editing Models | M. Akın Yılmaz et.al. | 2601.03391 | null |
| 2026-01-06 | Optimal Quantization of Finite Uniform Data on the Sphere | Mrinal Kanti Roychowdhury et.al. | 2601.03333 | null |
| 2026-01-06 | LUT-KAN: Segment-wise LUT Quantization for Fast KAN Inference | Oleksandr Kuznetsov et.al. | 2601.03332 | null |
| 2026-01-06 | Fine-tuning Small Language Models as Efficient Enterprise Search Relevance Labelers | Yue Kang et.al. | 2601.03211 | null |
| 2026-01-06 | Sparse Knowledge Distillation: A Mathematical Framework for Probability-Domain Temperature Scaling and Multi-Stage Compression | Aaron R. Flouro et.al. | 2601.03195 | null |
| 2026-01-06 | Do LLMs Encode Functional Importance of Reasoning Tokens? | Janvijay Singh et.al. | 2601.03066 | null |
| 2026-01-06 | From Memorization to Creativity: LLM as a Designer of Novel Neural-Architectures | Waleed Khalid et.al. | 2601.02997 | null |
| 2026-01-06 | Few-shot learning for security bug report identification | Muhammad Laiq et.al. | 2601.02971 | null |
| 2026-01-06 | Reliability-Aware Adaptive Self-Consistency for Efficient Sampling in LLM Reasoning | Junseok Kim et.al. | 2601.02970 | null |
| 2026-01-06 | RPIQ: Residual-Projected Multi-Collaboration Closed-Loop and Single Instance Quantization for Visually Impaired Assistance | Xuanyu Wang et.al. | 2601.02888 | null |
| 2026-01-06 | Sample-Efficient Neurosymbolic Deep Reinforcement Learning | Celeste Veronese et.al. | 2601.02850 | null |
| 2026-01-06 | AnyDepth: Depth Estimation Made Easy | Zeyu Ren et.al. | 2601.02760 | null |
| 2026-01-06 | CRoPE: Efficient Parametrization of Rotary Positional Embedding | Beicheng Lou et.al. | 2601.02728 | null |
| 2026-01-06 | Transform and Entropy Coding in AV2 | Alican Nalci et.al. | 2601.02712 | null |
| 2026-01-06 | Adversarial Contrastive Learning for LLM Quantization Attacks | Dinghong Song et.al. | 2601.02680 | null |
| 2026-01-06 | Iterative Structured Pruning for Large Language Models with Multi-Domain Calibration | Guangxin Wu et.al. | 2601.02674 | null |
| 2026-01-05 | Compressed code: the hidden effects of quantization and distillation on programming tokens | Viacheslav Siniaev et.al. | 2601.02563 | null |
| 2026-01-05 | ModeX: Evaluator-Free Best-of-N Selection for Open-Ended Generation | Hyeong Kyu Choi et.al. | 2601.02535 | null |
| 2026-01-05 | GEM-Style Constraints for PEFT with Dual Gradient Projection in LoRA | Brian Tekmen et.al. | 2601.02500 | null |
| 2026-01-05 | TAP-ViTs: Task-Adaptive Pruning for On-Device Deployment of Vision Transformers | Zhibo Wang et.al. | 2601.02437 | null |
| 2026-01-04 | A Dynamic Retrieval-Augmented Generation System with Selective Memory and Remembrance | Okan Bursa et.al. | 2601.02428 | null |
| 2026-01-05 | DARC: Drum accompaniment generation with fine-grained rhythm control | Trey Brosnan et.al. | 2601.02357 | null |
| 2026-01-05 | Meta-Learning Guided Pruning for Few-Shot Plant Pathology on Edge Devices | Shahnawaz Alam et.al. | 2601.02353 | null |
| 2026-01-05 | Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling | Falcon LLM Team et.al. | 2601.02346 | null |
| 2026-01-05 | Power-of-Two Quantization-Aware-Training (PoT-QAT) in Large Language Models (LLMs) | Mahmoud Elgenedy et.al. | 2601.02298 | null |
| 2026-01-05 | TopoLoRA-SAM: Topology-Aware Parameter-Efficient Adaptation of Foundation Segmenters for Thin-Structure and Cross-Domain Binary Semantic Segmentation | Salim Khazem et.al. | 2601.02273 | null |
| 2026-01-05 | SLGNet: Synergizing Structural Priors and Language-Guided Modulation for Multimodal Object Detection | Xiantai Xiang et.al. | 2601.02249 | null |
| 2026-01-05 | Quantized SO(3)-Equivariant Graph Neural Networks for Efficient Molecular Property Prediction | Haoyu Zhou et.al. | 2601.02213 | null |
| 2026-01-05 | Parameter-Efficient Domain Adaption for CSI Crowd-Counting via Self-Supervised Learning with Adapter Modules | Oliver Custance et.al. | 2601.02203 | null |
| 2026-01-05 | HFRWKV: A High-Performance Fully On-Chip Hardware Accelerator for RWKV | Liu Shijie et.al. | 2601.02135 | null |
| 2026-01-05 | MindChat: A Privacy-preserving Large Language Model for Mental Health Support | Dong Xue et.al. | 2601.01993 | null |
| 2026-01-05 | Vector Search for the Future: From Memory-Resident, Static Heterogeneous Storage, to Cloud-Native Architectures | Yitong Song et.al. | 2601.01937 | null |
| 2026-01-05 | RRNet: Configurable Real-Time Video Enhancement with Arbitrary Local Lighting Variations | Wenlong Yang et.al. | 2601.01865 | null |
| 2026-01-05 | Causality-Aware Temporal Projection for Video Understanding in Video-LLMs | Zhengjian Kang et.al. | 2601.01804 | null |
| 2026-01-05 | Subsymmetry-protected compact edge states | Ruoqi Cheng et.al. | 2601.01721 | null |
| 2026-01-05 | Digital Twin-Driven Communication-Efficient Federated Anomaly Detection for Industrial IoT | Mohammed Ayalew Belay et.al. | 2601.01701 | null |
| 2026-01-05 | Real-Time Lane Detection via Efficient Feature Alignment and Covariance Optimization for Low-Power Embedded Systems | Yian Liu et.al. | 2601.01696 | null |
| 2026-01-04 | DiffKD-DCIS: Predicting Upgrade of Ductal Carcinoma In Situ with Diffusion Augmentation and Knowledge Distillation | Tao Li et.al. | 2601.01507 | null |
| 2026-01-04 | SGD-Based Knowledge Distillation with Bayesian Teachers: Theory and Guidelines | Itai Morad et.al. | 2601.01484 | null |
| 2026-01-04 | Efficient Cover Construction for Ball Mapper via Accelerated Range Queries | Jay-Anne Bulauan et.al. | 2601.01405 | null |
| 2026-01-04 | Empowering Small Language Models with Factual Hallucination-Aware Reasoning for Financial Classification | Han Yuan et.al. | 2601.01378 | null |
| 2026-01-03 | T3C: Test-Time Tensor Compression with Consistency Guarantees | Ismail Lamaakal et.al. | 2601.01299 | null |
| 2026-01-03 | MambaFormer: Token-Level Guided Routing Mixture-of-Experts for Accurate and Efficient Clinical Assistance | Hamad Khan et.al. | 2601.01260 | null |
| 2026-01-03 | Racka: Efficient Hungarian LLM Adaptation on Academic Infrastructure | Zsolt Csibi et.al. | 2601.01244 | null |
| 2026-01-03 | XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression | Zunhai Su et.al. | 2601.01204 | null |
| 2026-01-03 | EmoLoom-2B: Fast Base-Model Screening for Emotion Classification and VAD with Lexicon-Weak Supervision and KV-Off Evaluation | Zilin Li et.al. | 2601.01112 | null |
| 2026-01-03 | Efficient Hyperspectral Image Reconstruction Using Lightweight Separate Spectral Transformers | Jianan Li et.al. | 2601.01064 | null |
| 2026-01-03 | Multi-Dimensional Prompt Chaining to Improve Open-Domain Dialogue Generation | Livia Leong Hui Teng et.al. | 2601.01037 | null |
| 2026-01-02 | Lightweight Channel Attention for Efficient CNNs | Prem Babu Kanaparthi et.al. | 2601.01002 | null |
| 2026-01-02 | KDPhys: An Attention Guided 3D to 2D Knowledge Distillation for Real-time Video-Based Physiological Measurement | Nicky Nirlipta Sahoo et.al. | 2601.00714 | null |
| 2026-01-02 | QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models | Rachmad Vidya Wicaksana Putra et.al. | 2601.00679 | null |
| 2026-01-02 | Sparse FEONet: A Low-Cost, Memory-Efficient Operator Network via Finite-Element Local Sparsity for Parametric PDEs | Seungchan Ko et.al. | 2601.00672 | null |
| 2026-01-02 | CoCo-Fed: A Unified Framework for Memory- and Communication-Efficient Federated Learning at the Wireless Edge | Zhiheng Guo et.al. | 2601.00549 | null |
| 2026-01-02 | Variable Elimination in Hybrid Factor Graphs for Discrete-Continuous Inference & Estimation | Varun Agrawal et.al. | 2601.00545 | null |
| 2026-01-02 | ECR: Manifold-Guided Semantic Cues for Compact Language Models | Chung-Wei Victor Yuan et.al. | 2601.00543 | null |
| 2026-01-02 | Federated Customization of Large Models: Approaches, Experiments, and Insights | Yuchuan Ye et.al. | 2601.00526 | null |
| 2026-01-02 | Optimizing LSTM Neural Networks for Resource-Constrained Retail Sales Forecasting: A Model Compression Study | Ravi Teja Pagidoju et.al. | 2601.00525 | null |
| 2026-01-01 | Fisher-Information-Driven Adaptive Acquisition for Photon-Efficient FLIM: A Dual-Implementation Framework for TCSPC and Programmable Time-Gating | J. Sumaya-Martinez et.al. | 2601.00490 | null |
| 2026-01-01 | A Comparative Study of Adaptation Strategies for Time Series Foundation Models in Anomaly Detection | Miseon Park et.al. | 2601.00446 | null |
| 2026-01-01 | Time–to–Digital Converter (TDC)–Based Resonant Compute–in–Memory for INT8 CNNs with Layer–Optimized SRAM Mapping | Dhandeep Challagundla et.al. | 2601.00434 | null |
| 2026-01-01 | Efficient Prediction of Dense Visual Embeddings via Distillation and RGB-D Transformers | Söhnke Benedikt Fischedick et.al. | 2601.00359 | null |
| 2026-01-01 | Can Optimal Transport Improve Federated Inverse Reinforcement Learning? | David Millard et.al. | 2601.00309 | null |
| 2026-01-01 | VisNet: Efficient Person Re-Identification via Alpha-Divergence Loss, Feature Fusion and Dynamic Multi-Task Learning | Anns Ijaz et.al. | 2601.00307 | null |
| 2026-01-01 | Can Large Language Models Still Explain Themselves? Investigating the Impact of Quantization on Self-Explanations | Qianli Wang et.al. | 2601.00282 | null |
| 2026-01-01 | Equivariant Cohomology, BRST Quantization, and Analytic Localization: A Unified Framework | Lixin Xu et.al. | 2601.00256 | null |
| 2026-01-01 | An Empirical Evaluation of LLM-Based Approaches for Code Vulnerability Detection: RAG, SFT, and Dual-Agent Systems | Md Hasan Saju et.al. | 2601.00254 | null |
| 2026-01-01 | GRIT – Geometry-Aware PEFT with K-FACPreconditioning, Fisher-Guided Reprojection, andDynamic Rank Adaptation | Pritish Saha et.al. | 2601.00231 | null |
| 2026-01-01 | Robust Graph Fine-Tuning with Adversarial Graph Prompting | Ziyan Zhang et.al. | 2601.00229 | null |
| 2026-01-01 | LooC: Effective Low-Dimensional Codebook for Compositional Vector Quantization | Jie Li et.al. | 2601.00222 | null |
| 2026-01-01 | Knowledge Distillation for Temporal Knowledge Graph Reasoning with Large Language Models | Wang Xing et.al. | 2601.00202 | null |
| 2025-12-31 | Evaluating the Impact of Compression Techniques on the Robustness of CNNs under Natural Corruptions | Itallo Patrick Castro Alves Da Silva et.al. | 2512.24971 | null |
| 2025-12-31 | OFL-SAM2: Prompt SAM2 with Online Few-shot Learner for Efficient Medical Image Segmentation | Meng Lan et.al. | 2512.24861 | null |
| 2025-12-31 | HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment | Yunsheng Pang et.al. | 2512.24787 | null |
| 2025-12-31 | Control of Microrobots with Reinforcement Learning under On-Device Compute Constraints | Yichen Liu et.al. | 2512.24740 | null |
| 2025-12-31 | FPGA Co-Design for Efficient N:M Sparse and Quantized Model Inference | Fen-Yu Hsieh et.al. | 2512.24713 | null |
| 2025-12-31 | Average Consensus with Dynamic Quantization Framing and Finite-Time Termination over Limited-Bandwidth Directed Networks | Evagoras Makridis et.al. | 2512.24700 | null |
| 2025-12-31 | Distributed Bilevel Optimization with Dual Pruning for Resource-limited Clients | Mingyi Li et.al. | 2512.24667 | null |
| 2025-12-31 | Renormalization Group Guided Tensor Network Structure Search | Maolin Wang et.al. | 2512.24663 | null |
| 2025-12-31 | Geometric Quantization by Paths Part II: The General Case | Patrick Iglesias-Zemmour et.al. | 2512.24627 | null |
| 2025-12-31 | AutoFed: Manual-Free Federated Traffic Prediction via Personalized Prompt | Zijian Zhao et.al. | 2512.24625 | null |
| 2025-12-31 | Collaborative Low-Rank Adaptation for Pre-Trained Vision Transformers | Zheng Liu et.al. | 2512.24603 | null |
| 2025-12-31 | Hierarchical Vector-Quantized Latents for Perceptual Low-Resolution Video Compression | Manikanta Kotthapalli et.al. | 2512.24547 | null |
| 2025-12-31 | More Than Bits: Multi-Envelope Double Binary Factorization for Extreme Quantization | Yuma Ichikawa et.al. | 2512.24545 | null |
| 2025-12-30 | Implementing the three-neutron quantization condition | Wilder Schaaf et.al. | 2512.24508 | null |
| 2025-12-30 | Spectroscopy of Quantum Phase Slips: Visualizing Complex Real-Time Instantons | Foster Thompson et.al. | 2512.24495 | null |
| 2025-12-30 | PackKV: Reducing KV Cache Memory Footprint through LLM-Aware Lossy Compression | Bo Jiang et.al. | 2512.24449 | link |
| 2025-12-30 | FAST-IDS: A Fast Two-Stage Intrusion Detection System with Hybrid Compression for Real-Time Threat Detection in Connected and Autonomous Vehicles | Devika S et.al. | 2512.24391 | null |
| 2025-12-30 | Incremental Certificate Learning for Hybrid Neural Network Verification . A Solver Architecture for Piecewise-Linear Safety Queries | Chandrasekhar Gokavarapu et.al. | 2512.24379 | null |
| 2025-12-30 | Efficient Decoding of Twisted GRS Codes and Roth–Lempel Codes | Runtian Zhu et.al. | 2512.24217 | null |
| 2025-12-30 | OptRot: Mitigating Weight Outliers via Data-Free Rotations for Post-Training Quantization | Advait Gadhikar et.al. | 2512.24124 | null |
| 2025-12-30 | One-Shot Structured Pruning of Quantum Neural Networks via $q$ -Group Engineering and Quantum Geometric Metrics | Haijian Shao et.al. | 2512.24019 | null |
| 2025-12-30 | Structure-Guided Allocation of 2D Gaussians for Image Representation and Compression | Huanxiong Liang et.al. | 2512.24018 | null |
| 2025-12-30 | HERO-Sign: Hierarchical Tuning and Efficient Compiler-Time GPU Optimizations for SPHINCS+ Signature Generation | Yaoyun Zhou et.al. | 2512.23969 | null |
| 2025-12-30 | Hardware Acceleration for Neural Networks: A Comprehensive Survey | Bin Xu et.al. | 2512.23914 | null |
| 2025-12-29 | Efficient Deep Learning for Short-Term Solar Irradiance Time Series Forecasting: A Benchmark Study in Ho Chi Minh City | Tin Hoang et.al. | 2512.23898 | null |
| 2025-12-29 | Probing the Limits of Compressive Memory: A Study of Infini-Attention in Small-Scale Pretraining | Ruizhe Huang et.al. | 2512.23862 | null |
| 2025-12-29 | FRoD: Full-Rank Efficient Fine-Tuning with Rotational Degrees for Fast Convergence | Guoan Wan et.al. | 2512.23485 | null |
| 2025-12-29 | Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss | Ang Lv et.al. | 2512.23447 | null |
| 2025-12-29 | Mobile-Efficient Speech Emotion Recognition Using DistilHuBERT: A Cross-Corpus Validation Study | Saifelden M. Ismail et.al. | 2512.23435 | null |
| 2025-12-29 | Electro-optical modulation of light polarization in a nonlocal lithium niobate metasurface | Agostino Di Francescantonio et.al. | 2512.23393 | null |
| 2025-12-29 | Post-Training Quantization of OpenPangu Models for Efficient Deployment on Atlas A2 | Yilun Luo et.al. | 2512.23367 | null |
| 2025-12-29 | Deep learning for pedestrians: backpropagation in Transformers | Laurent Boué et.al. | 2512.23329 | null |
| 2025-12-29 | Interpretable Safety Alignment via SAE-Constructed Low-Rank Subspace Adaptation | Dianyun Wang et.al. | 2512.23260 | null |
| 2025-12-29 | RS-Prune: Training-Free Data Pruning at High Ratios for Efficient Remote Sensing Diffusion Foundation Models | Fan Wei et.al. | 2512.23239 | null |
| 2025-12-29 | Energy and Memory-Efficient Federated Learning With Ordered Layer Freezing | Ziru Niu et.al. | 2512.23200 | null |
| 2025-12-29 | A Simple, Optimal and Efficient Algorithm for Online Exp-Concave Optimization | Yi-Han Wang et.al. | 2512.23190 | null |
| 2025-12-30 | Evaluating Parameter Efficient Methods for RLVR | Qingyu Yin et.al. | 2512.23165 | null |
| 2025-12-28 | Efficient flip-chip and on-chip-based modulation of flux-tunable superconducting resonators | Achintya Paradkar et.al. | 2512.23119 | null |
| 2025-12-28 | Taming the Tail: Stable LLM Reinforcement Learning via Dynamic Vocabulary Pruning | Yingru Li et.al. | 2512.23087 | null |
| 2025-12-28 | Rethinking Fine-Tuning: Unlocking Hidden Capabilities in Vision-Language Models | Mingyuan Zhang et.al. | 2512.23073 | null |
| 2025-12-28 | Federated Learning With L0 Constraint Via Probabilistic Gates For Sparsity | Krishna Harsha Kovelakuntla Huthasana et.al. | 2512.23071 | null |
| 2025-12-28 | TYTAN: Taylor-series based Non-Linear Activation Engine for Deep Learning Accelerators | Soham Pramanik et.al. | 2512.23062 | null |
| 2025-12-28 | The topological life of Dynkin indices: universal scaling and matter selection | Mboyo Esole et.al. | 2512.23041 | null |
| 2025-12-28 | Interpretable Gallbladder Ultrasound Diagnosis: A Lightweight Web-Mobile Software Platform with Real-Time XAI | Fuyad Hasan Bhoyan et.al. | 2512.23033 | null |
| 2025-12-28 | Merge before Forget: A Single LoRA Continual Learning via Continual Merging | Fuli Qiao et.al. | 2512.23017 | null |
| 2025-12-28 | Improving Generalization in LLM Structured Pruning via Function-Aware Neuron Grouping | Tao Yu et.al. | 2512.23014 | null |
| 2025-12-28 | YOLO-IOD: Towards Real Time Incremental Object Detection | Shizhou Zhang et.al. | 2512.22973 | null |
| 2025-12-28 | Gauge Symmetry in Quantum Simulation | Masanori Hanada et.al. | 2512.22932 | null |
| 2025-12-28 | Covering in Hamming and Grassmann Spaces: New Bounds and Reed–Solomon-Based Constructions | Samin Riasat et.al. | 2512.22911 | null |
| 2025-12-28 | Hash Grid Feature Pruning | Yangzhi Ma et.al. | 2512.22882 | null |
| 2025-12-28 | Parallel Diffusion Solver via Residual Dirichlet Policy Optimization | Ruoyu Wang et.al. | 2512.22796 | null |
| 2025-12-28 | TrimTokenator-LC: Towards Adaptive Visual Token Pruning for Large Multimodal Models with Long Contexts | Hao Zhang et.al. | 2512.22748 | null |
| 2025-12-28 | Robust LLM-based Column Type Annotation via Prompt Augmentation with LoRA Tuning | Hanze Meng et.al. | 2512.22742 | null |
| 2025-12-27 | Fragile Knowledge, Robust Instruction-Following: The Width Pruning Dichotomy in Llama-3.2 | Pere Martra et.al. | 2512.22671 | null |
| 2025-12-27 | The Quest for Winning Tickets in Low-Rank Adapters | Hamed Damirchi et.al. | 2512.22495 | null |
| 2025-12-27 | Scalpel-SAM: A Semi-Supervised Paradigm for Adapting SAM to Infrared Small Object Detection | Zihan Liu et.al. | 2512.22483 | null |
| 2025-12-27 | AFA-LoRA: Enabling Non-Linear Adaptations in LoRA with Activation Function Annealing | Jiacheng Li et.al. | 2512.22455 | null |
| 2025-12-26 | Lightweight Inference-Time Personalization for Frozen Knowledge Graph Embeddings | Ozan Oguztuzun et.al. | 2512.22398 | null |
| 2025-12-26 | Integrating Wide and Deep Neural Networks with Squeeze-and-Excitation Blocks for Multi-Target Property Prediction in Additively Manufactured Fiber Reinforced Composites | Behzad Parvaresh et.al. | 2512.22397 | null |
| 2025-12-26 | Towards Efficient Post-Training via Fourier-Driven Adapter Architectures | Donggyun Bae et.al. | 2512.22378 | null |
| 2025-12-26 | The Effectiveness of Approximate Regularized Replay for Efficient Supervised Fine-Tuning of Large Language Models | Matthew Riemer et.al. | 2512.22337 | null |
| 2025-12-26 | PortionNet: Distilling 3D Geometric Knowledge for Food Nutrition Estimation | Darrin Bright et.al. | 2512.22304 | null |
| 2025-12-26 | Pruning as a Game: Equilibrium-Driven Sparsification of Neural Networks | Zubair Shah et.al. | 2512.22106 | null |
| 2025-12-26 | A Lightweight Multi-Scale Attention Framework for Real-Time Spinal Endoscopic Instance Segmentation | Qi Lai et.al. | 2512.21984 | null |
| 2025-12-26 | Breaking Alignment Barriers: TPS-Driven Semantic Correlation Learning for Alignment-Free RGB-T Salient Object Detection | Lupiao Hu et.al. | 2512.21856 | null |
| 2025-12-26 | Knowledge Reasoning of Large Language Models Integrating Graph-Structured Information for Pest and Disease Control in Tobacco | Siyu Li et.al. | 2512.21837 | null |
| 2025-12-26 | LIME:Accelerating Collaborative Lossless LLM Inference on Memory-Constrained Edge Devices | Mingyu Sun et.al. | 2512.21835 | null |
| 2025-12-25 | InstructMoLE: Instruction-Guided Mixture of Low-rank Experts for Multi-Conditional Image Generation | Jinqi Xiao et.al. | 2512.21788 | null |
| 2025-12-25 | An Information Theoretic Perspective on Agentic System Design | Shizhe He et.al. | 2512.21720 | null |
| 2025-12-25 | MoRAgent: Parameter Efficient Agent Tuning with Mixture-of-Roles | Jing Han et.al. | 2512.21708 | null |
| 2025-12-25 | Rethinking Output Alignment For 1-bit Post-Training Quantization of Large Language Models | Dung Anh Hoang et.al. | 2512.21651 | null |
| 2025-12-25 | UltraLBM-UNet: Ultralight Bidirectional Mamba-based Model for Skin Lesion Segmentation | Linxuan Fan et.al. | 2512.21584 | null |
| 2025-12-25 | Gamayun’s Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM | Alexander Podolskiy et.al. | 2512.21580 | null |
| 2025-12-25 | Quantum $SL^+(N,\mathbb{R})$ as a locally compact quantum group | K. De Commer et.al. | 2512.21579 | null |
| 2025-12-25 | Towards Long-window Anchoring in Vision-Language Model Distillation | Haoyi Zhou et.al. | 2512.21576 | null |
| 2025-12-25 | World-Coordinate Human Motion Retargeting via SAM 3D Body | Zhangzheng Tu et.al. | 2512.21573 | null |
| 2025-12-25 | RefineBridge: Generative Bridge Models Improve Financial Forecasting by Foundation Models | Anthony Bolton et.al. | 2512.21572 | null |
| 2025-12-25 | Hierarchy-Aware Fine-Tuning of Vision-Language Models | Jiayu Li et.al. | 2512.21529 | null |
| 2025-12-25 | Selective LLM-Guided Regularization for Enhancing Recommendation Models | Shanglin Yang et.al. | 2512.21526 | null |
| 2025-12-25 | Fixed-Budget Parameter-Efficient Training with Frozen Encoders Improves Multimodal Chest X-Ray Classification | Md Ashik Khan et.al. | 2512.21508 | null |
| 2025-12-24 | A Graph-Augmented knowledge Distillation based Dual-Stream Vision Transformer with Region-Aware Attention for Gastrointestinal Disease Classification with Explainable AI | Md Assaduzzaman et.al. | 2512.21372 | null |
| 2025-12-24 | Fast SAM2 with Text-Driven Token Pruning | Avilasha Mandal et.al. | 2512.21333 | null |
| 2025-12-24 | Model Merging via Multi-Teacher Knowledge Distillation | Seyed Arshan Dalili et.al. | 2512.21288 | null |
| 2025-12-24 | SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance | Divij Dudeja et.al. | 2512.21280 | null |
| 2025-12-24 | TGC-Net: A Structure-Aware and Semantically-Aligned Framework for Text-Guided Medical Image Segmentation | Gaoren Lin et.al. | 2512.21135 | null |
| 2025-12-24 | Classical reservoir approach for efficient molecular ground state preparation | Zekun He et.al. | 2512.21069 | null |
| 2025-12-24 | Formal O(N3) scaling GW calculations by block tensor decomposition for large molecule systems | Yueyang Zhang et.al. | 2512.21022 | null |
| 2025-12-24 | Efficient and Robust Video Defense Framework against 3D-field Personalized Talking Face | Rui-qing Sun et.al. | 2512.21019 | null |
| 2025-12-24 | Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation | Wei-Rui Chen et.al. | 2512.21002 | null |
| 2025-12-24 | Leveraging Overfitting for Low-Complexity and Modality-Agnostic Joint Source-Channel Coding | Haotian Wu et.al. | 2512.20981 | null |
| 2025-12-24 | Universal Transient Stability Analysis: A Large Language Model-Enabled Dynamics Prediction Framework | Chao Shen et.al. | 2512.20970 | null |
| 2025-12-24 | AirGS: Real-Time 4D Gaussian Streaming for Free-Viewpoint Video Experiences | Zhe Wang et.al. | 2512.20943 | null |
| 2025-12-24 | RevFFN: Memory-Efficient Full-Parameter Fine-Tuning of Mixture-of-Experts LLMs with Reversible Blocks | Ningyuan Liu et.al. | 2512.20920 | null |
| 2025-12-24 | Beyond Weight Adaptation: Feature-Space Domain Injection for Cross-Modal Ship Re-Identification | Tingfeng Xian et.al. | 2512.20892 | null |
| 2025-12-24 | Architectural Trade-offs in Small Language Models Under Compute Constraints | Shivraj Singh Bhatti et.al. | 2512.20877 | null |
| 2025-12-25 | Learning to Sense for Driving: Joint Optics-Sensor-Model Co-Design for Semantic Segmentation | Reeshad Khan et.al. | 2512.20815 | null |
| 2025-12-23 | Making Large Language Models Efficient Dense Retrievers | Yibin Lei et.al. | 2512.20612 | null |
| 2025-12-23 | FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models | Kaitong Cai et.al. | 2512.20561 | null |
| 2025-12-23 | Simplifying Multi-Task Architectures Through Task-Specific Normalization | Mihai Suteu et.al. | 2512.20420 | null |
| 2025-12-23 | Branch Learning in MRI: More Data, More Models, More Training | Yuyang Li et.al. | 2512.20330 | null |
| 2025-12-23 | Mixture-of-Experts with Gradient Conflict-Driven Subspace Topology Pruning for Emergent Modularity | Yuxing Gan et.al. | 2512.20291 | null |
| 2025-12-23 | Generative Latent Coding for Ultra-Low Bitrate Image Compression | Zhaoyang Jia et.al. | 2512.20194 | null |
| 2025-12-23 | HEART-VIT: Hessian-Guided Efficient Dynamic Attention and Token Pruning in Vision Transformer | Mohammad Helal Uddin et.al. | 2512.20120 | null |
| 2025-12-23 | Neural Compression of 360-Degree Equirectangular Videos using Quality Parameter Adaptation | Daichi Arai et.al. | 2512.20093 | null |
| 2025-12-23 | Rethinking Knowledge Distillation in Collaborative Machine Learning: Memory, Knowledge, and Their Interactions | Pengchao Han et.al. | 2512.19972 | null |
| 2025-12-22 | Fine-Tuned In-Context Learners for Efficient Adaptation | Jorg Bornschein et.al. | 2512.19879 | null |
| 2025-12-22 | Quantization for sequences of blow-up solutions to an elliptic equation having nonlocal exponential nonlinearity | Mathew Gluck et.al. | 2512.19865 | null |
| 2025-12-22 | Efficient Vision Mamba for MRI Super-Resolution via Hybrid Selective Scanning | Mojtaba Safari et.al. | 2512.19676 | null |
| 2025-12-22 | Quantization of Random Homogeneous Self-Similar Measures | Akash Banerjee et.al. | 2512.19628 | null |
| 2025-12-22 | Yang-Mills energy quantization over non-collapsed degenerating Einstein manifolds and applications | Youmin Chen et.al. | 2512.19552 | null |
| 2025-12-22 | Lightweight Intrusion Detection in IoT via SHAP-Guided Feature Pruning and Knowledge-Distilled Kronecker Networks | Hafsa Benaddi et.al. | 2512.19488 | null |
| 2025-12-22 | Sensitivity-Aware Mixed-Precision Quantization for ReRAM-based Computing-in-Memory | Guan-Cheng Chen et.al. | 2512.19445 | null |
| 2025-12-22 | D2Pruner: Debiased Importance and Structural Diversity for MLLM Token Pruning | Evelyn Zhang et.al. | 2512.19443 | null |
| 2025-12-22 | A Computationally Efficient Framework for Overlapping Community Detection in Large Bipartite Graphs | Yue Zeng et.al. | 2512.19426 | null |
| 2025-12-22 | Sprecher Networks: A Parameter-Efficient Kolmogorov-Arnold Architecture | Christian Hägg et.al. | 2512.19367 | null |
| 2025-12-22 | Are All Data Necessary? Efficient Data Pruning for Large-scale Autonomous Driving Dataset via Trajectory Entropy Maximization | Zhaoyang Liu et.al. | 2512.19270 | null |
| 2025-12-22 | Small Language Models as Compiler Experts: Auto-Parallelization for Heterogeneous Systems | Prathamesh Devadiga et.al. | 2512.19250 | null |
| 2025-12-22 | Towards Minimal Fine-Tuning of VLMs | Tiange Luo et.al. | 2512.19219 | null |
| 2025-12-22 | MixKVQ: Query-Aware Mixed-Precision KV Cache Quantization for Long-Context Reasoning | Tao Zhang et.al. | 2512.19206 | null |
| 2025-12-22 | SAP: Syntactic Attention Pruning for Transformer-based Language Models | Tzu-Yun Lee et.al. | 2512.19125 | null |
| 2025-12-22 | GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting | Tiantian Li et.al. | 2512.19108 | null |
| 2025-12-22 | Tool-Augmented Hybrid Ensemble Reasoning with Distillation for Bilingual Mathematical Problem Solving | Peiqing Lu et.al. | 2512.19093 | null |
| 2025-12-22 | Can abstract concepts from LLM improve SLM performance? | Siddharth Tandon et.al. | 2512.19069 | null |
| 2025-12-22 | When Less is More: 8-bit Quantization Improves Continual Learning in Large Language Models | Michael S. Zhang et.al. | 2512.18934 | null |
| 2025-12-21 | Stochastic quantization of the weighted exponential QFT | Seiichiro Kusuoka et.al. | 2512.18927 | null |
| 2025-12-21 | FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation | Ziyuan Tao et.al. | 2512.18809 | null |
| 2025-12-21 | IPCV: Information-Preserving Compression for MLLM Visual Encoders | Yuan Chen et.al. | 2512.18747 | null |
| 2025-12-21 | Uni-Neur2Img: Unified Neural Signal-Guided Image Generation, Editing, and Stylization via Diffusion Transformers | Xiyue Bai et.al. | 2512.18635 | null |
| 2025-12-21 | A Multi-agent Text2SQL Framework using Small Language Models and Execution Feedback | Thanh Dat Hoang et.al. | 2512.18622 | null |
| 2025-12-21 | Neologism Learning as a Parameter-Efficient Alternative to Fine-Tuning for Model Steering | Sungjoon Park et.al. | 2512.18551 | null |
| 2025-12-20 | Position-Resolved Resonance Quantization for Lossy Cavities | Lucas Weitzel et.al. | 2512.18478 | null |
| 2025-12-20 | Analog Quantum Image Representation with Qubit-Frugal Encoding | Vikrant Sharma et.al. | 2512.18451 | null |
| 2025-12-20 | MoE Pathfinder: Trajectory-driven Expert Pruning | Xican Yang et.al. | 2512.18425 | null |
| 2025-12-20 | Quantization for Vector Search under Streaming Updates | Ishaq Aden-Ali et.al. | 2512.18335 | null |
| 2025-12-20 | Asynchronous Pipeline Parallelism for Real-Time Multilingual Lip Synchronization in Video Communication Systems | Eren Caglar et.al. | 2512.18318 | null |
| 2025-12-20 | SG-RIFE: Semantic-Guided Real-Time Intermediate Flow Estimation with Diffusion-Competitive Perceptual Quality | Pan Ben Wong et.al. | 2512.18241 | null |
| 2025-12-19 | ACE-Sync: An Adaptive Cloud-Edge Synchronization Framework for Communication-Efficient Large-Scale Distributed Model Training | Yi Yang et.al. | 2512.18127 | null |
| 2025-12-19 | Efficient Mixture-of-Agents Serving via Tree-Structured Routing, Adaptive Pruning, and Dependency-Aware Prefill-Decode Overlap | Zijun Wang et.al. | 2512.18126 | null |
| 2025-12-19 | YolovN-CBi: A Lightweight and Efficient Architecture for Real-Time Detection of Small UAVs | Ami Pandat et.al. | 2512.18046 | null |
| 2025-12-19 | CoPE: A Small Language Model for Steerable and Scalable Content Labeling | Samidh Chakrabarti et.al. | 2512.18027 | null |
| 2025-12-19 | On General Linearly Implicit Quantized State System Methods | Mariana Bergonzi et.al. | 2512.17855 | null |
| 2025-12-19 | Two-photon light-sheet live imaging at kilohertz frame rate using birefringence-based pulse splitting | Lei Zhu et.al. | 2512.17783 | null |
| 2025-12-19 | Easy Adaptation: An Efficient Task-Specific Knowledge Injection Method for Large Models in Resource-Constrained Environments | Dong Chen et.al. | 2512.17771 | null |
| 2025-12-19 | AdaptPrompt: Parameter-Efficient Adaptation of VLMs for Generalizable Deepfake Detection | Yichen Jiang et.al. | 2512.17730 | null |
| 2025-12-19 | Mitigating Forgetting in Low Rank Adaptation | Joanna Sliwa et.al. | 2512.17720 | null |
| 2025-12-19 | Confidence-Credibility Aware Weighted Ensembles of Small LLMs Outperform Large LLMs in Emotion Detection | Menna Elgabry et.al. | 2512.17630 | null |
| 2025-12-19 | Guided progressive reconstructive imaging: a new quantization-based framework for low-dose, high-throughput and real-time analytical ptychography | Hoelen L. Lalandec Robert et.al. | 2512.17561 | null |
| 2025-12-19 | A 28nm 0.22 μJ/token memory-compute-intensity-aware CNN-Transformer accelerator with hybrid-attention-based layer-fusion and cascaded pruning for semanticsegmentation | Pingcheng Dong et.al. | 2512.17555 | null |
| 2025-12-19 | Voxel-GS: Quantized Scaffold Gaussian Splatting Compression with Run-Length Coding | Chunyang Fu et.al. | 2512.17528 | null |
| 2025-12-19 | Resource-efficient medical image classification for edge devices | Mahsa Lavaei et.al. | 2512.17515 | null |
| 2025-12-19 | A lightweight Spatial-Temporal Graph Neural Network for Long-term Time Series Forecasting | Henok Tenaw Moges et.al. | 2512.17453 | null |
| 2025-12-19 | Adaptive Graph Pruning with Sudden-Events Evaluation for Traffic Prediction using Online Semi-Decentralized ST-GNNs | Ivan Kralj et.al. | 2512.17352 | null |
| 2025-12-19 | Auxiliary Descriptive Knowledge for Few-Shot Adaptation of Vision-Language Model | SuBeen Lee et.al. | 2512.17313 | null |
| 2025-12-19 | Warmer for Less: A Cost-Efficient Strategy for Cold-Start Recommendations at Pinterest | Saeed Ebrahimi et.al. | 2512.17277 | null |
| 2025-12-19 | BumpNet: A Sparse Neural Network Framework for Learning PDE Solutions | Shao-Ting Chiu et.al. | 2512.17198 | null |
| 2025-12-18 | Atom: Efficient On-Device Video-Language Pipelines Through Modular Reuse | Kunjal Panchal et.al. | 2512.17108 | null |
| 2025-12-18 | Bandwidth-Efficient Adaptive Mixture-of-Experts via Low-Rank Compensation | Zhenyu Liu et.al. | 2512.17073 | null |
| 2025-12-18 | Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL | Khushboo Thaker et.al. | 2512.17053 | null |
| 2025-12-18 | UniRel-R1: RL-tuned LLM Reasoning for Knowledge Graph Relational Question Answering | Yinxu Tang et.al. | 2512.17043 | null |
| 2025-12-18 | Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection | Kaixin Ding et.al. | 2512.16905 | null |
| 2025-12-18 | TOGGLE: Temporal Logic-Guided Large Language Model Compression for Edge | Khurram Khalil et.al. | 2512.16855 | null |
| 2025-12-18 | Simulation-based inference with neural posterior estimation applied to X-ray spectral fitting - III Deriving exact posteriors with dimension reduction and importance sampling | Didier Barret et.al. | 2512.16709 | null |
| 2025-12-18 | Direct inversion of data-space Hessian for efficient time-domain extended-source waveform inversion using the multiplier method | Mahdi Sonbolestan et.al. | 2512.16642 | null |
| 2025-12-18 | Efficient CPU-GPU Collaborative Inference for MoE-based LLMs on Memory-Limited Systems | En-Ming Huang et.al. | 2512.16473 | null |
| 2025-12-18 | CKA-Guided Modular Quantization: Beyond Bit-Width to Algorithmic Diversity | Jinhao Zhang et.al. | 2512.16282 | null |
| 2025-12-19 | Trustworthy and Controllable Professional Knowledge Utilization in Large Language Models with TEE-GPU Execution | Yifeng Cai et.al. | 2512.16238 | null |
| 2025-12-18 | A Domain-Adapted Pipeline for Structured Information Extraction from Police Incident Announcements on Social Media | Mengfan Shen et.al. | 2512.16183 | null |
| 2025-12-18 | Tunneling in double-well potentials within stochastic quantization: Application to ammonia inversion | Danilo F. Schafaschek et.al. | 2512.16168 | null |
| 2025-12-18 | Antisymmetrization of composite fermionic states for quantum simulations of nuclear reactions in first-quantization mapping | Ionel Stetcu et.al. | 2512.16138 | null |
| 2025-12-18 | A Tri-Dynamic Preprocessing Framework for UGC Video Compression | Fei Zhao et.al. | 2512.16101 | null |
| 2025-12-18 | TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times | Jintao Zhang et.al. | 2512.16093 | null |
| 2025-12-18 | LAPX: Lightweight Hourglass Network with Global Context | Haopeng Zhao et.al. | 2512.16089 | null |
| 2025-12-17 | AIE4ML: An End-to-End Framework for Compiling Neural Networks for the Next Generation of AMD AI Engines | Dimitrios Danopoulos et.al. | 2512.15946 | null |
| 2025-12-17 | Small Language Models for Efficient Agentic Tool Calling: Outperforming Large Models with Targeted Fine-tuning | Polaris Jhandi et.al. | 2512.15943 | null |
| 2025-12-17 | End-to-End Training for Autoregressive Video Diffusion via Self-Resampling | Yuwei Guo et.al. | 2512.15702 | null |
| 2025-12-17 | How Much is Too Much? Exploring LoRA Rank Trade-offs for Retaining Knowledge and Domain Robustness | Darshita Rathore et.al. | 2512.15634 | null |
| 2025-12-17 | Bolmo: Byteifying the Next Generation of Language Models | Benjamin Minixhofer et.al. | 2512.15586 | null |
| 2025-12-17 | IMKD: Intensity-Aware Multi-Level Knowledge Distillation for Camera-Radar Fusion | Shashank Mishra et.al. | 2512.15581 | null |
| 2025-12-17 | An Efficient and Effective Encoder Model for Vision and Language Tasks in the Remote Sensing Domain | João Daniel Silva et.al. | 2512.15531 | null |
| 2025-12-17 | Photorealistic Phantom Roads in Real Scenes: Disentangling 3D Hallucinations from Physical Geometry | Hoang Nguyen et.al. | 2512.15423 | null |
| 2025-12-17 | Dual-Density Inference for Efficient Language Model Reasoning | Zhengyi Zhao et.al. | 2512.15358 | null |
| 2025-12-17 | Joint Activity Detection and Channel Estimation For Fluid Antenna System Exploiting Geographical and Angular Information | Zhentian Zhang et.al. | 2512.15342 | null |
| 2025-12-17 | Bits for Privacy: Evaluating Post-Training Quantization via Membership Inference | Chenxiang Zhang et.al. | 2512.15335 | null |
| 2025-12-17 | A Masked Reverse Knowledge Distillation Method Incorporating Global and Local Information for Image Anomaly Detection | Yuxin Jiang et.al. | 2512.15326 | null |
| 2025-12-17 | KD360-VoxelBEV: LiDAR and 360-degree Camera Cross Modality Knowledge Distillation for Bird’s-Eye-View Segmentation | Wenke E et.al. | 2512.15311 | null |
| 2025-12-17 | LLMQ: Efficient Lower-Precision Pretraining for Consumer GPUs | Erik Schultheis et.al. | 2512.15306 | null |
| 2025-12-17 | Generative Preprocessing for Image Compression with Pre-trained Diffusion Models | Mengxi Guo et.al. | 2512.15270 | null |
| 2025-12-18 | Null-LoRA: Low-Rank Adaptation on Null Space | Yi Zhang et.al. | 2512.15233 | null |
| 2025-12-17 | ERIENet: An Efficient RAW Image Enhancement Network under Low-Light Environment | Jianan Wang et.al. | 2512.15186 | null |
| 2025-12-18 | An updated efficient galaxy morphology classification model based on ConvNeXt encoding with UMAP dimensionality reduction | Guanwen Fang et.al. | 2512.15137 | null |
| 2025-12-17 | Large Model Enabled Embodied Intelligence for 6G Integrated Perception, Communication, and Computation Network | Zhuoran Li et.al. | 2512.15109 | null |
| 2025-12-17 | Quantization in mixed polarization via transverse Poincaré-Birkhoff-Witt theorem | Dan Wang et.al. | 2512.15060 | null |
| 2025-12-17 | Fractional quantization by interaction of arbitrary strength in gapless flat bands with divergent quantum geometry | Wenqi Yang et.al. | 2512.15041 | null |
| 2025-12-16 | Cross-Tokenizer Likelihood Scoring Algorithms for Language Model Distillation | Buu Phan et.al. | 2512.14954 | null |
| 2025-12-16 | Parameter Efficient Multimodal Instruction Tuning for Romanian Vision Language Models | George-Andrei Dima et.al. | 2512.14926 | null |
| 2025-12-16 | Compensation of Coarse Quantization Effects on Channel Estimation and BER in Massive MIMO | Reza Mohammadkhani et.al. | 2512.14893 | null |
| 2025-12-16 | Spherical Leech Quantization for Visual Tokenization and Generation | Yue Zhao et.al. | 2512.14697 | null |
| 2025-12-16 | Native and Compact Structured Latents for 3D Generation | Jianfeng Xiang et.al. | 2512.14692 | null |
| 2025-12-16 | Focus: A Streaming Concentration Architecture for Efficient Vision-Language Models | Chiyue Wei et.al. | 2512.14661 | null |
| 2025-12-16 | PruneX: A Hierarchical Communication-Efficient System for Distributed CNN Training with Structured Pruning | Alireza Olama et.al. | 2512.14628 | null |
| 2025-12-16 | Distill Video Datasets into Images | Zhenghao Zhao et.al. | 2512.14621 | null |
| 2025-12-16 | Polypersona: Persona-Grounded LLM for Synthetic Survey Responses | Tejaswani Dash et.al. | 2512.14562 | null |
| 2025-12-16 | VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse | Ying Nie et.al. | 2512.14531 | null |
| 2025-12-16 | SASQ: Static Activation Scaling for Quantization-Aware Training in Large Language Models | Shizhuo Mao et.al. | 2512.14481 | null |
| 2025-12-16 | Context-Picker: Dynamic context selection using multi-stage reinforcement learning | Siyuan Zhu et.al. | 2512.14465 | null |
| 2025-12-16 | HGS: Hybrid Gaussian Splatting with Static-Dynamic Decomposition for Compact Dynamic View Synthesis | Kaizhe Zhang et.al. | 2512.14352 | null |
| 2025-12-16 | Ladder Up, Memory Down: Low-Cost Fine-Tuning With Side Nets | Estelle Zheng et.al. | 2512.14237 | null |
| 2025-12-16 | Arithmetic-Intensity-Aware Quantization | Taig Singh et.al. | 2512.14090 | null |
| 2025-12-16 | HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices | HyperAI Team et.al. | 2512.14052 | null |
| 2025-12-16 | Evaluating Small Language Models for Agentic On-Farm Decision Support Systems | Enhong Liu et.al. | 2512.14043 | null |
| 2025-12-15 | Ensemble-Guided Distillation for Compact and Robust Acoustic Scene Classification on Edge Devices | Hossein Sharify et.al. | 2512.13905 | null |
| 2025-12-15 | OPTIMA: Optimal One-shot Pruning for LLMs via Quadratic Programming Reconstruction | Mohammad Mozaffari et.al. | 2512.13886 | null |
| 2025-12-15 | Improvise, Adapt, Overcome – Telescopic Adapters for Efficient Fine-tuning of Vision Language Models in Medical Imaging | Ujjwal Mishra et.al. | 2512.13855 | null |
| 2025-12-15 | Recurrent Video Masked Autoencoders | Daniel Zoran et.al. | 2512.13684 | null |
| 2025-12-15 | SEDULity: A Proof-of-Learning Framework for Distributed and Secure Blockchains with Efficient Useful Work | Weihang Cao et.al. | 2512.13666 | null |
| 2025-12-15 | Large-Language Memorization During the Classification of United States Supreme Court Cases | John E. Ortega et.al. | 2512.13654 | null |
| 2025-12-15 | Performance Limits of Hardware-Constrained THz Inter-Satellite MIMO-ISAC Systems | Haofan Dong et.al. | 2512.13652 | null |
| 2025-12-16 | MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning | Haoyu Fu et.al. | 2512.13636 | null |
| 2025-12-15 | LightTopoGAT: Enhancing Graph Attention Networks with Topological Features for Efficient Graph Classification | Ankit Sharma et.al. | 2512.13617 | null |
| 2025-12-15 | Null quantization, shadows and boost eigenfunctions in Lorentzian AdS | Núria Navarro et.al. | 2512.13541 | null |
| 2025-12-15 | SkipCat: Rank-Maximized Low-Rank Compression of Large Language Models via Shared Projection and Block Skipping | Yu-Chen Lu et.al. | 2512.13494 | null |
| 2025-12-15 | Element-wise Modulation of Random Matrices for Efficient Neural Layers | Maksymilian Szorc et.al. | 2512.13480 | null |
| 2025-12-15 | Automated Information Flow Selection for Multi-scenario Multi-task Recommendation | Chaohua Yang et.al. | 2512.13396 | null |
| 2025-12-15 | Space Efficient Algorithms for Parameterised Problems | Sheikh Shakil Akhtar et.al. | 2512.13342 | null |
| 2025-12-16 | KD-PINN: Knowledge-Distilled PINNs for ultra-low-latency real-time neural PDE solvers | Karim Bounja et.al. | 2512.13336 | null |
| 2025-12-15 | Distillation of continuous variable qudits from single photon sources: A cascaded approach | Devibala Esakkimuthu et.al. | 2512.13264 | null |
| 2025-12-15 | Seeing the Whole Picture: Distribution-Guided Data-Free Distillation for Semantic Segmentation | Hongxuan Sun et.al. | 2512.13175 | null |
| 2025-12-15 | An Optimal Alignment-Driven Iterative Closed-Loop Convergence Framework for High-Performance Ultra-Large Scale Layout Pattern Clustering | Shuo Liu et.al. | 2512.13133 | null |
| 2025-12-15 | SliceMoE: Bit-Sliced Expert Caching under Miss-Rate Constraints for Efficient MoE Inference | Yuseon Choi et.al. | 2512.12990 | null |
| 2025-12-15 | CoDeQ: End-to-End Joint Model Compression with Dead-Zone Quantizer for High-Sparsity and Low-Precision Networks | Jonathan Wenshøj et.al. | 2512.12981 | null |
| 2025-12-15 | Application of Deep Learning in Biological Data Compression | Chunyu Zou et.al. | 2512.12975 | null |
| 2025-12-15 | Investigating Data Pruning for Pretraining Biological Foundation Models at Scale | Yifan Wu et.al. | 2512.12932 | null |
| 2025-12-15 | SeVeDo: A Heterogeneous Transformer Accelerator for Low-Bit Inference via Hierarchical Group Quantization and SVD-Guided Mixed Precision | Yuseon Choi et.al. | 2512.12930 | null |
| 2025-12-14 | Improving Recursive Transformers with Mixture of LoRAs | Mohammadmahdi Nouriborji et.al. | 2512.12880 | null |
| 2025-12-14 | KANELÉ: Kolmogorov-Arnold Networks for Efficient LUT-based Evaluation | Duc Hoang et.al. | 2512.12850 | null |
| 2025-12-14 | HaShiFlex: A High-Throughput Hardened Shifter DNN Accelerator with Fine-Tuning Flexibility | Jonathan Herbst et.al. | 2512.12847 | null |
| 2025-12-14 | Adapting Multimodal Foundation Models for Few-Shot Learning: A Comprehensive Study on Contrastive Captioners | N. K. B. M. P. K. B. Narasinghe et.al. | 2512.12824 | null |
| 2025-12-14 | FuXi- $γ$ : Efficient Sequential Recommendation with Exponential-Power Temporal Encoder and Diagonal-Sparse Positional Mechanism | Dezhi Yi et.al. | 2512.12740 | null |
| 2025-12-14 | Self-Motivated Growing Neural Network for Adaptive Architecture via Local Structural Plasticity | Yiyang Jia et.al. | 2512.12713 | null |
| 2025-12-14 | Efficient Vision-Language Reasoning via Adaptive Token Pruning | Xue Li et.al. | 2512.12701 | null |
| 2025-12-14 | Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches | Amirhossein Yousefiramandi et.al. | 2512.12677 | null |
| 2025-12-14 | Patch-wise Retrieval: A Bag of Practical Techniques for Instance-level Matching | Wonseok Choi et.al. | 2512.12610 | null |
| 2025-12-14 | StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding | Xinqi Jin et.al. | 2512.12560 | null |
| 2025-12-14 | Effective Fine-Tuning with Eigenvector Centrality Based Pruning | Shaif Chowdhury et.al. | 2512.12543 | null |
| 2025-12-13 | Cross-Modal Representational Knowledge Distillation for Enhanced Spike-Informed LFP Modeling | Eray Erturk et.al. | 2512.12461 | null |
| 2025-12-13 | Complete Topological Quantization of Higher Gauge Fields | Hisham Sati et.al. | 2512.12431 | null |
| 2025-12-13 | Large and Small Model Collaboration for Air Interface | Yiming Cui et.al. | 2512.12170 | null |
| 2025-12-12 | Instruction-Tuning Open-Weight Language Models for BPMN Model Generation | Gökberk Çelikmasat et.al. | 2512.12063 | null |
| 2025-12-12 | HFS: Holistic Query-Aware Frame Selection for Efficient Video Reasoning | Yiqing Yang et.al. | 2512.11534 | null |
| 2025-12-12 | Quantization for Semipositive Adjoint Line Bundles | Yu-Chi Hou et.al. | 2512.11523 | null |
| 2025-12-12 | Enhanced Pruning for Distributed Closeness Centrality under Multi-Packet Messaging | Patrick D. Manya et.al. | 2512.11512 | null |
| 2025-12-12 | qa-FLoRA: Data-free query-adaptive Fusion of LoRAs for LLMs | Shreya Shukla et.al. | 2512.11366 | null |
| 2025-12-12 | Why cut-and-choose quantum state verification cannot be both efficient and secure | Fabian Wiesner et.al. | 2512.11358 | null |
| 2025-12-12 | PhraseVAE and PhraseLDM: Latent Diffusion for Full-Song Multitrack Symbolic Music Generation | Longshen Ou et.al. | 2512.11348 | null |
| 2025-12-12 | MLLM Machine Unlearning via Visual Knowledge Distillation | Yuhang Wang et.al. | 2512.11325 | null |
| 2025-12-12 | AdaSD: Adaptive Speculative Decoding for Efficient Language Model Inference | Kuan-Wei Lu et.al. | 2512.11280 | null |
| 2025-12-12 | Information-Theoretic Equivalences Across Rate-Distortion, Quantization, and Decoding | Bruno Macchiavello et.al. | 2512.11279 | null |
| 2025-12-11 | Network and Compiler Optimizations for Efficient Linear Algebra Kernels in Private Transformer Inference | Karthik Garimella et.al. | 2512.11135 | null |
| 2025-12-11 | Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching | Bowen Wen et.al. | 2512.11130 | null |
| 2025-12-11 | Information-driven Fusion of Pathology Foundation Models for Enhanced Disease Characterization | Brennan Flannery et.al. | 2512.11104 | null |
| 2025-12-11 | Q-BAR: Blogger Anomaly Recognition via Quantum-enhanced Manifold Learning | Maida Wang et.al. | 2512.11071 | null |
| 2025-12-11 | Weakly Supervised Tuberculosis Localization in Chest X-rays through Knowledge Distillation | Marshal Ashif Shawkat et.al. | 2512.11057 | null |
| 2025-12-11 | SparseSwaps: Tractable LLM Pruning Mask Refinement at Scale | Max Zimmer et.al. | 2512.10922 | null |
| 2025-12-11 | Multi-Granular Node Pruning for Circuit Discovery | Muhammad Umair Haider et.al. | 2512.10903 | null |
| 2025-12-11 | LDP: Parameter-Efficient Fine-Tuning of Multimodal LLM for Medical Report Generation | Tianyu Zhou et.al. | 2512.10750 | null |
| 2025-12-11 | Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution | Zouying Cao et.al. | 2512.10696 | null |
| 2025-12-11 | Master functions and hybrid quantization of perturbed nonrotating black hole interiors | Michele Lenzi et.al. | 2512.10692 | null |
| 2025-12-11 | Deep Photonic Reservoir Computing with On-chip Nonlinearity | Jinlong Xiang et.al. | 2512.10626 | null |
| 2025-12-11 | Phythesis: Physics-Guided Evolutionary Scene Synthesis for Energy-Efficient Data Center Design via LLMs | Minghao LI et.al. | 2512.10611 | null |
| 2025-12-11 | Uncertainty-Preserving QBNNs: Multi-Level Quantization of SVI-Based Bayesian Neural Networks for Image Classification | Hendrik Borras et.al. | 2512.10602 | null |
| 2025-12-11 | Quantization of massive Dirac neutrinos in external fields | Maxim Dvornikov et.al. | 2512.10587 | null |
| 2025-12-11 | Disentangled and Distilled Encoder for Out-of-Distribution Reasoning with Rademacher Guarantees | Zahra Rahiminasab et.al. | 2512.10522 | null |
| 2025-12-11 | Geometric quantization on big line bundles | Siarhei Finski et.al. | 2512.10466 | null |
| 2025-12-11 | Error-Propagation-Free Learned Video Compression With Dual-Domain Progressive Temporal Alignment | Han Li et.al. | 2512.10450 | null |
| 2025-12-11 | Clustered Federated Learning with Hierarchical Knowledge Distillation | Sabtain Ahmad et.al. | 2512.10443 | null |
| 2025-12-11 | A Kernel-based Resource-efficient Neural Surrogate for Multi-fidelity Prediction of Aerodynamic Field | Apurba Sarker et.al. | 2512.10287 | null |
| 2025-12-11 | An Efficient Graph-Transformer Operator for Learning Physical Dynamics with Manifolds Embedding | Pengwei Liu et.al. | 2512.10227 | null |
| 2025-12-10 | Generate-Then-Validate: A Novel Question Generation Approach Using Small Language Models | Yumou Wei et.al. | 2512.10110 | null |
| 2025-12-10 | Parallel Decoder Transformer: Model-Internal Parallel Decoding with Speculative Invariance via Note Conditioning | Logan Robbins et.al. | 2512.10054 | null |
| 2025-12-10 | Spatial Spiking Neural Networks Enable Efficient and Robust Temporal Computation | Lennart P. L. Landsmeer et.al. | 2512.10011 | null |
| 2025-12-10 | GoodSpeed: Optimizing Fair Goodput with Adaptive Speculative Decoding in Distributed Edge Inference | Phuong Tran et.al. | 2512.09963 | null |
| 2025-12-10 | Token Expand-Merge: Training-Free Token Compression for Vision-Language-Action Models | Yifan Ye et.al. | 2512.09927 | null |
| 2025-12-10 | Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach | Salvador Carrión et.al. | 2512.09910 | null |
| 2025-12-10 | SCOPE: Language Models as One-Time Teacher for Hierarchical Planning in Text Environments | Haoye Lu et.al. | 2512.09897 | null |
| 2025-12-10 | HPM-KD: Hierarchical Progressive Multi-Teacher Framework for Knowledge Distillation and Efficient Model Compression | Gustavo Coelho Haase et.al. | 2512.09886 | null |
| 2025-12-10 | FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning | Khurram Khalil et.al. | 2512.09872 | null |
| 2025-12-10 | RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning | Khurram Khalil et.al. | 2512.09829 | null |
| 2025-12-10 | Energy-Efficient Federated Learning with Relay-Assisted Aggregation in IIoT Networks | Hamid Reza Hashempour et.al. | 2512.09827 | null |
| 2025-12-10 | GLaD: Geometric Latent Distillation for Vision-Language-Action Models | Minghao Guo et.al. | 2512.09619 | null |
| 2025-12-10 | LiePrune: Lie Group and Quantum Geometric Dual Representation for One-Shot Structured Pruning of Quantum Neural Networks | Haijian Shao et.al. | 2512.09469 | null |
| 2025-12-10 | Black-Box Behavioral Distillation Breaks Safety Alignment in Medical LLMs | Sohely Jahan et.al. | 2512.09403 | null |
| 2025-12-10 | Are Hypervectors Enough? Single-Call LLM Reasoning over Knowledge Graphs | Yezi Liu et.al. | 2512.09369 | null |
| 2025-12-10 | NOC4SC: A Bandwidth-Efficient Multi-User Semantic Communication Framework for Interference-Resilient Transmission | Yunhao Wang et.al. | 2512.09356 | null |
| 2025-12-10 | Training-free Context-adaptive Attention for Efficient Long Context Modeling | Zeng You et.al. | 2512.09238 | null |
| 2025-12-10 | Tensor-Compressed and Fully-Quantized Training of Neural PDE Solvers | Jinming Lu et.al. | 2512.09202 | null |
| 2025-12-09 | GS-KAN: Parameter-Efficient Kolmogorov-Arnold Networks via Sprecher-Type Shared Basis Functions | Oscar Eliasson et.al. | 2512.09084 | null |
| 2025-12-09 | KD-OCT: Efficient Knowledge Distillation for Clinical-Grade Retinal OCT Classification | Erfan Nourbakhsh et.al. | 2512.09069 | null |
| 2025-12-09 | Towards Lossless Ultimate Vision Token Compression for VLMs | Dehua Zheng et.al. | 2512.09010 | null |
| 2025-12-10 | Efficiently Reconstructing Dynamic Scenes One D4RT at a Time | Chuhan Zhang et.al. | 2512.08924 | null |
| 2025-12-09 | Fed-SE: Federated Self-Evolution for Privacy-Constrained Multi-Environment LLM Agents | Xiang Chen et.al. | 2512.08870 | null |
| 2025-12-09 | PrivTune: Efficient and Privacy-Preserving Fine-Tuning of Large Language Models via Device-Cloud Collaboration | Yi Liu et.al. | 2512.08809 | null |
| 2025-12-09 | Skewness-Guided Pruning of Multimodal Swin Transformers for Federated Skin Lesion Classification on Edge Devices | Kuniko Paxton et.al. | 2512.08751 | null |
| 2025-12-09 | Scale-invariant and View-relational Representation Learning for Full Surround Monocular Depth | Kyumin Hwang et.al. | 2512.08700 | null |
| 2025-12-09 | Beyond Real Weights: Hypercomplex Representations for Stable Quantization | Jawad Ibn Ahad et.al. | 2512.08524 | null |
| 2025-12-10 | Solving Oversmoothing in GNNs via Nonlocal Message Passing: Algebraic Smoothing and Depth Scalability | Weiqi Guan et.al. | 2512.08475 | null |
| 2025-12-09 | Quantization and Security Parameter Design for Overflow-Free Confidential FRIT | Jungjin Park et.al. | 2512.08464 | null |
| 2025-12-09 | Nucleon Structure from Basis Light-Front Quantization : Status and Prospects | James P. Vary et.al. | 2512.08283 | null |
| 2025-12-09 | SOFA-FL: Self-Organizing Hierarchical Federated Learning with Adaptive Clustered Data Sharing | Yi Ni et.al. | 2512.08267 | null |
| 2025-12-09 | HybridToken-VLM: Hybrid Token Compression for Vision-Language Models | Jusheng Zhang et.al. | 2512.08240 | null |
| 2025-12-09 | MobileFineTuner: A Unified End-to-End Framework for Fine-Tuning LLMs on Mobile Phones | Jiaxiang Geng et.al. | 2512.08211 | null |
| 2025-12-09 | Animal Re-Identification on Microcontrollers | Yubo Chen et.al. | 2512.08198 | null |
| 2025-12-08 | Skein-valued mirror curves for toric CY3 strips | Mingyuan Hu et.al. | 2512.07762 | null |
| 2025-12-08 | PVeRA: Probabilistic Vector-Based Random Matrix Adaptation | Leo Fillioux et.al. | 2512.07703 | null |
| 2025-12-08 | Sharp values for all dynamical variables via Anti-Wick quantization | Simon Friederich et.al. | 2512.07616 | null |
| 2025-12-08 | Algorithm-hardware co-design of neuromorphic networks with dual memory pathways | Pengfei Sun et.al. | 2512.07602 | null |
| 2025-12-08 | All You Need Are Random Visual Tokens? Demystifying Token Pruning in VLLMs | Yahong Wang et.al. | 2512.07580 | null |
| 2025-12-08 | LIME: Making LLM Data More Efficient with Linguistic Metadata Embeddings | Sebastian Sztwiertnia et.al. | 2512.07522 | null |
| 2025-12-08 | Dictionary-Based Contrastive Learning for GNSS Jamming Detection | Zawar Hussain et.al. | 2512.07512 | null |
| 2025-12-08 | Persian-Phi: Efficient Cross-Lingual Adaptation of Compact LLMs via Curriculum Learning | Amir Mohammad Akhlaghi et.al. | 2512.07454 | null |
| 2025-12-08 | Revolutionizing Mixed Precision Quantization: Towards Training-free Automatic Proxy Discovery via Large Language Models | Haidong Kang et.al. | 2512.07419 | null |
| 2025-12-08 | GlimmerNet: A Lightweight Grouped Dilated Depthwise Convolutions for UAV-Based Emergency Monitoring | Đorđe Nedeljković et.al. | 2512.07391 | null |
| 2025-12-08 | Recover-to-Forget: Gradient Reconstruction from LoRA for Efficient LLM Unlearning | Yezi Liu et.al. | 2512.07374 | null |
| 2025-12-08 | Non-Intrusive Data-Free Parametric Reduced Order Model for Geometrically Nonlinear Structures | Alexander Saccani et.al. | 2512.07366 | null |
| 2025-12-08 | ReLKD: Inter-Class Relation Learning with Knowledge Distillation for Generalized Category Discovery | Fang Zhou et.al. | 2512.07229 | null |
| 2025-12-08 | Geometric Prior-Guided Federated Prompt Calibration | Fei Luo et.al. | 2512.07208 | null |
| 2025-12-08 | SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting | Seokhyun Youn et.al. | 2512.07197 | null |
| 2025-12-08 | HVQ-CGIC: Enabling Hyperprior Entropy Modeling for VQ-Based Controllable Generative Image Compression | Niu Yi et.al. | 2512.07192 | null |
| 2025-12-08 | MuSASplat: Efficient Sparse-View 3D Gaussian Splats via Lightweight Multi-Scale Adaptation | Muyu Xu et.al. | 2512.07165 | null |
| 2025-12-08 | Winning the Lottery by Preserving Network Training Dynamics with Concrete Ticket Search | Tanay Arora et.al. | 2512.07142 | null |
| 2025-12-08 | FOAM: Blocked State Folding for Memory-Efficient LLM Training | Ziqing Wen et.al. | 2512.07112 | null |
| 2025-12-08 | Leveraging KV Similarity for Online Structured Pruning in LLMs | Jungmin Lee et.al. | 2512.07090 | null |
| 2025-12-07 | DAUNet: A Lightweight UNet Variant with Deformable Convolutions and Parameter-Free Attention for Medical Image Segmentation | Adnan Munir et.al. | 2512.07051 | null |
| 2025-12-07 | PARIS: Pruning Algorithm via the Representer theorem for Imbalanced Scenarios | Enrico Camporeale et.al. | 2512.06950 | null |
| 2025-12-07 | SceneMixer: Exploring Convolutional Mixing Networks for Remote Sensing Scene Classification | Mohammed Q. Alkhatib et.al. | 2512.06877 | null |
| 2025-12-07 | Physics Informed Generative Machine Learning for Accelerated Quantum-centric Supercomputing | Chayan Patra et.al. | 2512.06858 | null |
| 2025-12-07 | RMAdapter: Reconstruction-based Multi-Modal Adapter for Vision-Language Models | Xiang Lin et.al. | 2512.06811 | null |
| 2025-12-07 | Parameter-Efficient Fine-Tuning with Differential Privacy for Robust Instruction Adaptation in Large Language Models | Yulin Huang et.al. | 2512.06711 | null |
| 2025-12-07 | Towards Small Language Models for Security Query Generation in SOC Workflows | Saleha Muzammil et.al. | 2512.06660 | null |
| 2025-12-07 | Quantum Temporal Convolutional Neural Networks for Cross-Sectional Equity Return Prediction: A Comparative Benchmark Study | Chi-Sheng Chen et.al. | 2512.06630 | null |
| 2025-12-07 | Vector Quantization using Gaussian Variational Autoencoder | Tongda Xu et.al. | 2512.06609 | null |
| 2025-12-06 | QL-LSTM: A Parameter-Efficient LSTM for Stable Long-Sequence Modeling | Isaac Kofi Nti et.al. | 2512.06582 | null |
| 2025-12-06 | BEACON: A Unified Behavioral-Tactical Framework for Explainable Cybercrime Analysis with Large Language Models | Arush Sachdeva et.al. | 2512.06555 | null |
| 2025-12-06 | ProSocialAlign: Preference Conditioned Test Time Alignment in Language Models | Somnath Banerjee et.al. | 2512.06515 | null |
| 2025-12-06 | Small Language Models Can Use Nuanced Reasoning For Health Science Research Classification: A Microbial-Oncogenesis Case Study | Muhammed Muaaz Dawood et.al. | 2512.06502 | null |
| 2025-12-06 | Optimizing LLMs Using Quantization for Mobile Execution | Agatsya Yadav et.al. | 2512.06490 | null |
| 2025-12-06 | Neural expressiveness for beyond importance model compression | Angelos-Christos Maroudis et.al. | 2512.06440 | null |
| 2025-12-06 | TreeQ: Pushing the Quantization Boundary of Diffusion Transformer via Tree-Structured Mixed-Precision Search | Kaicheng Yang et.al. | 2512.06353 | null |
| 2025-12-06 | Theoretical Compression Bounds for Wide Multilayer Perceptrons | Houssam El Cheairi et.al. | 2512.06288 | null |
| 2025-12-06 | Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models | Chen Yang et.al. | 2512.06266 | null |
| 2025-12-06 | Quantization Blindspots: How Model Compression Breaks Backdoor Defenses | Rohan Pandey et.al. | 2512.06243 | null |
| 2025-12-06 | LOCUS: A System and Method for Low-Cost Customization for Universal Specialization | Dhanasekar Sundararaman et.al. | 2512.06239 | null |
| 2025-12-06 | GPU-GLMB: Assessing the Scalability of GPU-Accelerated Multi-Hypothesis Tracking | Pranav Balakrishnan et.al. | 2512.06230 | null |
| 2025-12-05 | KQ-SVD: Compressing the KV Cache with Provable Guarantees on Attention Fidelity | Damien Lesens et.al. | 2512.05916 | null |
| 2025-12-05 | Hadronic Emissions from the Microquasar V4641 Sgr, SS433, and its implications in the Diffuse Galactic Emission | Basanti Paul et.al. | 2512.05839 | null |
| 2025-12-05 | HQ-DM: Single Hadamard Transformation-Based Quantization-Aware Training for Low-Bit Diffusion Models | Shizhuo Mao et.al. | 2512.05746 | null |
| 2025-12-05 | Efficient Text Classification with Conformal In-Context Learning | Ippokratis Pantelidis et.al. | 2512.05732 | null |
| 2025-12-05 | LeAD-M3D: Leveraging Asymmetric Distillation for Real-time Monocular 3D Detection | Johannes Meier et.al. | 2512.05663 | null |
| 2025-12-05 | Efficient sequential Bayesian inference for state-space epidemic models using ensemble data assimilation | Dhorasso Temfack et.al. | 2512.05650 | null |
| 2025-12-05 | DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model | Pasquale De Marinis et.al. | 2512.05613 | null |
| 2025-12-05 | Fast SceneScript: Accurate and Efficient Structured Language Model via Multi-Token Prediction | Ruihong Yin et.al. | 2512.05597 | null |
| 2025-12-05 | Rethinking Infrared Small Target Detection: A Foundation-Driven Efficient Paradigm | Chuang Yu et.al. | 2512.05511 | null |
| 2025-12-05 | TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression | Cheng-Yuan Ho et.al. | 2512.05446 | null |
| 2025-12-05 | BEAVER: An Efficient Deterministic LLM Verifier | Tarun Suresh et.al. | 2512.05439 | null |
| 2025-12-05 | Performance Evaluation of Deep Learning for Tree Branch Segmentation in Autonomous Forestry Systems | Yida Lin et.al. | 2512.05418 | null |
| 2025-12-05 | YOLO and SGBM Integration for Autonomous Tree Branch Detection and Depth Estimation in Radiata Pine Pruning Applications | Yida Lin et.al. | 2512.05412 | null |
| 2025-12-05 | SQ-format: A Unified Sparse-Quantized Hardware-friendly Data Format for LLMs | Ruixuan Huang et.al. | 2512.05409 | null |
| 2025-12-05 | LoC-Path: Learning to Compress for Pathology Multimodal Large Language Models | Qingqiao Hu et.al. | 2512.05391 | null |
| 2025-12-05 | ShaRP: SHAllow-LayeR Pruning for Video Large Language Models Acceleration | Yingjie Xia et.al. | 2512.05385 | null |
| 2025-12-05 | Group Orthogonal Low-Rank Adaptation for RGB-T Tracking | Zekai Shao et.al. | 2512.05359 | null |
| 2025-12-04 | Uncertainty-Aware Data-Efficient AI: An Information-Theoretic Perspective | Osvaldo Simeone et.al. | 2512.05267 | null |
| 2025-12-04 | Rethinking Tokenization for Clinical Time Series: When Less is More | Rafi Al Attrach et.al. | 2512.05217 | null |
| 2025-12-04 | Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning | Purbesh Mitra et.al. | 2512.05105 | null |
| 2025-12-04 | Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression | Jung Yi et.al. | 2512.05081 | null |
| 2025-12-04 | David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design? | Shashwat Shankar et.al. | 2512.05073 | null |
| 2025-12-04 | Meta-Learning for Quantum Optimization via Quantum Sequence Model | Yu-Cheng Lin et.al. | 2512.05058 | null |
| 2025-12-04 | Arbitrage: Efficient Reasoning via Advantage-Aware Speculation | Monishwaran Maheswaran et.al. | 2512.05033 | null |
| 2025-12-04 | Generative Neural Video Compression via Video Diffusion Prior | Qi Mao et.al. | 2512.05016 | null |
| 2025-12-04 | Plug-and-Play Homeostatic Spark: Zero-Cost Acceleration for SNN Training Across Paradigms | Rui Chen et.al. | 2512.05015 | null |
| 2025-12-04 | Efficient Generative Transformer Operators For Million-Point PDEs | Armand Kassaï Koupaï et.al. | 2512.04974 | null |
| 2025-12-04 | FASTer: Toward Efficient Autoregressive Vision Language Action Modeling via neural Action Tokenization | Yicheng Liu et.al. | 2512.04952 | null |
| 2025-12-04 | LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging | Zhijian Shu et.al. | 2512.04939 | null |
| 2025-12-04 | Autoregressive Image Generation Needs Only a Few Lines of Cached Tokens | Ziran Qin et.al. | 2512.04857 | null |
| 2025-12-05 | EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture | Xin He et.al. | 2512.04810 | null |
| 2025-12-04 | MemLoRA: Distilling Expert Adapters for On-Device Memory Systems | Massimo Bini et.al. | 2512.04763 | null |
| 2025-12-04 | Model Whisper: Steering Vectors Unlock Large Language Models’ Potential in Test-time | Xinyue Kang et.al. | 2512.04748 | null |
| 2025-12-04 | SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs | Wenhua Cheng et.al. | 2512.04746 | null |
| 2025-12-04 | TRINITY: An Evolved LLM Coordinator | Jinglue Xu et.al. | 2512.04695 | null |
| 2025-12-04 | Towards Ethical Multi-Agent Systems of Large Language Models: A Mechanistic Interpretability Perspective | Jae Hee Lee et.al. | 2512.04691 | null |
| 2025-12-04 | Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation | Yunhong Lu et.al. | 2512.04678 | null |
| 2025-12-04 | Rethinking Decoupled Knowledge Distillation: A Predictive Distribution Perspective | Bowen Zheng et.al. | 2512.04625 | null |
| 2025-12-04 | Metric dimension of Cartesian product of stars | Akbar Davoodi et.al. | 2512.04620 | null |
| 2025-12-04 | Infrared UAV Target Tracking with Dynamic Feature Refinement and Global Contextual Attention Knowledge Distillation | Houzhang Fang et.al. | 2512.04581 | null |
| 2025-12-04 | AdmTree: Compressing Lengthy Context with Adaptive Semantic Trees | Yangning Li et.al. | 2512.04550 | null |
| 2025-12-04 | Boundary-Aware Test-Time Adaptation for Zero-Shot Medical Image Segmentation | Chenlin Xu et.al. | 2512.04520 | null |
| 2025-12-04 | RapidUn: Influence-Driven Parameter Reweighting for Efficient Large Language Model Unlearning | Guoshenghui Zhao et.al. | 2512.04457 | null |
| 2025-12-04 | MD-SNN: Membrane Potential-aware Distillation on Quantized Spiking Neural Network | Donghyun Lee et.al. | 2512.04443 | null |
| 2025-12-04 | Dual-Stream Spectral Decoupling Distillation for Remote Sensing Object Detection | Xiangyi Gao et.al. | 2512.04413 | null |
| 2025-12-04 | Efficient Reinforcement Learning with Semantic and Token Entropy for LLM Reasoning | Hongye Cao et.al. | 2512.04359 | null |
| 2025-12-03 | GRASP: GRouped Activation Shared Parameterization for Parameter-Efficient Fine-Tuning and Robust Inference of Transformers | Malyaban Bal et.al. | 2512.04296 | null |
| 2025-12-03 | Constructing Low-Redundancy Codes via Distributed Graph Coloring | Yuting Li et.al. | 2512.04197 | null |
| 2025-12-03 | Quantum geometry and linear orbital response in arbitrary $SU(2)$ representation | Rhonald Burgos Atencia et.al. | 2512.04164 | null |
| 2025-12-03 | Minuet: A Diffusion Autoencoder for Compact Semantic Compression of Multi-Band Galaxy Images | Alexander T. Gagliano et.al. | 2512.04145 | null |
| 2025-12-03 | Solving N-Queen Problem using Las Vegas Algorithm with State Pruning | Susmita Sharma et.al. | 2512.04139 | null |
| 2025-12-03 | RELIC: Interactive Video World Model with Long-Horizon Memory | Yicong Hong et.al. | 2512.04040 | null |
| 2025-12-03 | Fast & Efficient Normalizing Flows and Applications of Image Generative Models | Sandeep Nagar et.al. | 2512.04039 | null |
| 2025-12-03 | PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation | Xiaolong Li et.al. | 2512.04025 | null |
| 2025-12-03 | Ultra-lightweight Neural Video Representation Compression | Ho Man Kwan et.al. | 2512.04019 | null |
| 2025-12-03 | DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation | Zexin Lin et.al. | 2512.03992 | null |
| 2025-12-03 | Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pre-trained Models | Taido Purason et.al. | 2512.03989 | null |
| 2025-12-03 | Parameter efficient hybrid spiking-quantum convolutional neural network with surrogate gradient and quantum data-reupload | Luu Trong Nhan et.al. | 2512.03895 | null |
| 2025-12-03 | Lean Unet: A Compact Model for Image Segmentation | Ture Hassler et.al. | 2512.03834 | null |
| 2025-12-03 | AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition | Zichuan Lin et.al. | 2512.03794 | null |
| 2025-12-03 | AR-Med: Automated Relevance Enhancement in Medical Search via LLM-Driven Information Augmentation | Chuyue Wang et.al. | 2512.03737 | null |
| 2025-12-03 | PosA-VLA: Enhancing Action Generation via Pose-Conditioned Anchor Attention | Ziwen Li et.al. | 2512.03724 | null |
| 2025-12-03 | ConvRot: Rotation-Based Plug-and-Play 4-bit Quantization for Diffusion Transformers | Feice Huang et.al. | 2512.03673 | null |
| 2025-12-03 | Multi-Scale Visual Prompting for Lightweight Small-Image Classification | Salim Khazem et.al. | 2512.03663 | null |
| 2025-12-03 | Optical Context Compression Is Just (Bad) Autoencoding | Ivan Yee Lee et.al. | 2512.03643 | null |
| 2025-12-03 | SELF: A Robust Singular Value and Eigenvalue Approach for LLM Fingerprinting | Hanxiu Zhang et.al. | 2512.03620 | null |
| 2025-12-03 | KVNAND: Efficient On-Device Large Language Model Inference Using DRAM-Free In-Flash Computing | Lishuo Deng et.al. | 2512.03608 | null |
| 2025-12-03 | Federated Learning and Trajectory Compression for Enhanced AIS Coverage | Thomas Gräupl et.al. | 2512.03584 | null |
| 2025-12-03 | Optimal Transportation and Alignment Between Gaussian Measures | Sanjit Dandapanthula et.al. | 2512.03579 | null |
| 2025-12-03 | Parameter-Efficient Augment Plugin for Class-Incremental Learning | Zhiming Xu et.al. | 2512.03537 | null |
| 2025-12-03 | NAS-LoRA: Empowering Parameter-Efficient Fine-Tuning for Visual Foundation Models with Searchable Adaptation | Renqi Chen et.al. | 2512.03499 | null |
| 2025-12-03 | Quantum Encrypted Control of Networked Systems | Zihao Ren et.al. | 2512.03434 | null |
| 2025-12-03 | Dual LoRA: Enhancing LoRA with Magnitude and Direction Updates | Yixing Xu et.al. | 2512.03402 | null |
| 2025-12-03 | UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs | Hung-Yueh Chiang et.al. | 2512.03383 | null |
| 2025-12-03 | Nexus: Higher-Order Attention Mechanisms in Transformers | Hanting Chen et.al. | 2512.03377 | null |
| 2025-12-03 | Hierarchical Attention for Sparse Volumetric Anomaly Detection in Subclinical Keratoconus | Lynn Kandakji et.al. | 2512.03346 | null |
| 2025-12-03 | Idea-Gated Transformers: Enforcing Semantic Coherence via Differentiable Vocabulary Pruning | Darshan Fofadiya et.al. | 2512.03343 | null |
| 2025-12-03 | Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs | Ngoc Bui et.al. | 2512.03324 | null |
| 2025-12-02 | InvertiTune: High-Quality Data Synthesis for Cost-Effective Single-Shot Text-to-Knowledge Graph Generation | Faezeh Faez et.al. | 2512.03197 | null |
| 2025-12-02 | A Mathematical Introduction to Geometric Quantization | Kadri İlker Berktav et.al. | 2512.03171 | null |
| 2025-12-02 | The Hilbert space of gauge theories: group averaging and the quantization of Jackiw-Teitelboim gravity | Elba Alonso-Monsalve et.al. | 2512.03030 | null |
| 2025-12-02 | TokenPowerBench: Benchmarking the Power Consumption of LLM Inference | Chenxu Niu et.al. | 2512.03024 | null |
| 2025-12-02 | Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks | Matthew Dutson et.al. | 2512.03014 | null |
| 2025-12-02 | Pruning AMR: Efficient Visualization of Implicit Neural Representations via Weight Matrix Analysis | Jennifer Zvonek et.al. | 2512.02967 | null |
| 2025-12-02 | A Lightweight Real-Time Low-Light Enhancement Network for Embedded Automotive Vision Systems | Yuhan Chen et.al. | 2512.02965 | null |
| 2025-12-02 | AutoNeural: Co-Designing Vision-Language Models for NPU Inference | Wei Chen et.al. | 2512.02924 | null |
| 2025-12-02 | FAIRY2I: Universal Extremely-Low Bit QAT framework via Widely-Linear Representation and Phase-Aware Quantization | Feiyu Wang et.al. | 2512.02901 | null |
| 2025-12-02 | MindGPT-4ov: An Enhanced MLLM via a Multi-Stage Post-Training Paradigm | Wei Chen et.al. | 2512.02895 | null |
| 2025-12-02 | Network Self-Configuration based on Fine-Tuned Small Language Models | Oscar G. Lira et.al. | 2512.02861 | null |
| 2025-12-02 | LumiX: Structured and Coherent Text-to-Intrinsic Generation | Xu Han et.al. | 2512.02781 | null |
| 2025-12-02 | PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models | Robert Belanec et.al. | 2512.02764 | null |
| 2025-12-02 | Menta: A Small Language Model for On-Device Mental Health Prediction | Tianyi Zhang et.al. | 2512.02716 | null |
| 2025-12-02 | G-PIFNN: A Generalizable Physics-informed Fourier Neural Network Framework for Electrical Circuits | Ibrahim Shahbaz et.al. | 2512.02712 | null |
| 2025-12-02 | CREST: Universal Safety Guardrails Through Cluster-Guided Cross-Lingual Transfer | Lavish Bansal et.al. | 2512.02711 | null |
| 2025-12-02 | VLM-Pruner: Buffering for Spatial Sparsity in an Efficient VLM Centrifugal Token Pruning Paradigm | Zhenkai Wu et.al. | 2512.02700 | null |
| 2025-12-02 | PGP-DiffSR: Phase-Guided Progressive Pruning for Efficient Diffusion-based Image Super-Resolution | Zhongbao Yang et.al. | 2512.02681 | null |
| 2025-12-02 | A Communication-Efficient Distributed Optimization Algorithm with Coupled Constraints | Yuzhu Duan et.al. | 2512.02634 | null |
| 2025-12-02 | Adapting Tensor Kernel Machines to Enable Efficient Transfer Learning for Seizure Detection | Seline J. S. de Rooij et.al. | 2512.02626 | null |
| 2025-12-02 | Stepwise Schema-Guided Prompting Framework with Parameter Efficient Instruction Tuning for Multimedia Event Extraction | Xiang Yuan et.al. | 2512.02584 | null |
| 2025-12-02 | ADORE: Autonomous Domain-Oriented Relevance Engine for E-commerce | Zheng Fang et.al. | 2512.02555 | null |
| 2025-12-02 | In-Context Distillation with Self-Consistency Cascades: A Simple, Training-Free Way to Reduce LLM Agent Costs | Vishnu Sarukkai et.al. | 2512.02543 | null |
| 2025-12-02 | Improved Ising Meson Spectroscopy Simulation on a Noisy Digital Quantum Device | Hao-Ti Hung et.al. | 2512.02516 | null |
| 2025-12-02 | TGDD: Trajectory Guided Dataset Distillation with Balanced Distribution | Fengli Ran et.al. | 2512.02469 | null |
| 2025-12-02 | Artificial Noise Aided Physical Layer Security for Near-Field MIMO with Fluid Antenna Systems | Peng Zhang et.al. | 2512.02461 | null |
| 2025-12-02 | Basis-Oriented Low-rank Transfer for Few-Shot and Test-Time Adaptation | Junghwan Park et.al. | 2512.02441 | null |
| 2025-12-02 | Boosting Medical Vision-Language Pretraining via Momentum Self-Distillation under Limited Computing Resources | Phuc Pham et.al. | 2512.02438 | null |
| 2025-12-02 | Generalizing Vision-Language Models with Dedicated Prompt Guidance | Xinyao Li et.al. | 2512.02421 | null |
| 2025-12-02 | Data Curation Through the Lens of Spectral Dynamics: Static Limits, Dynamic Acceleration, and Practical Oracles | Yizhou Zhang et.al. | 2512.02409 | null |
| 2025-12-02 | ESACT: An End-to-End Sparse Accelerator for Compute-Intensive Transformers via Local Similarity | Hongxiang Liu et.al. | 2512.02403 | null |
| 2025-12-02 | Understanding and Harnessing Sparsity in Unified Multimodal Models | Shwai He et.al. | 2512.02351 | null |
| 2025-12-01 | Fantasy: Efficient Large-scale Vector Search on GPU Clusters with GPUDirect Async | Yi Liu et.al. | 2512.02278 | null |
| 2025-12-01 | Adversarial Robustness of Traffic Classification under Resource Constraints: Input Structure Matters | Adel Chehade et.al. | 2512.02276 | null |
| 2025-12-01 | Lightweight Latent Reasoning for Narrative Tasks | Alexander Gurung et.al. | 2512.02240 | null |
| 2025-12-01 | Thermodynamic Entropy as Information – A compression-based demonstration of the Shannon-Boltzmann equivalence in condensed matter | Dallin Fisher et.al. | 2512.02221 | null |
| 2025-12-01 | Parameter-Efficient Subspace Optimization for LLM Fine-Tuning | Yuchen Lou et.al. | 2512.02216 | null |
| 2025-12-01 | Think Before You Prune: Self-Reflective Structured Pruning for Reasoning Language Models | Ziyan Wang et.al. | 2512.02185 | null |
| 2025-12-01 | TT-Stack: A Transformer-Based Tiered-Stacking Ensemble Framework with Meta-Learning for Automated Breast Cancer Detection in Mammography | Showkat Osman et.al. | 2512.02091 | null |
| 2025-12-01 | Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling | Jack Cook et.al. | 2512.02010 | null |
| 2025-12-01 | Feature-Based Semantics-Aware Scheduling for Energy-Harvesting Federated Learning | Eunjeong Jeong et.al. | 2512.01983 | null |
| 2025-12-01 | Low-Rank Prehab: Preparing Neural Networks for SVD Compression | Haoran Qin et.al. | 2512.01980 | null |
| 2025-12-01 | KV Pareto: Systems-Level Optimization of KV Cache and Model Compression for Long Context Inference | Sai Gokhale et.al. | 2512.01953 | null |
| 2025-12-01 | Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models | Zhongyu Yang et.al. | 2512.01949 | null |
| 2025-12-01 | SAM3-UNet: Simplified Adaptation of Segment Anything Model 3 | Xinyu Xiong et.al. | 2512.01789 | null |
| 2025-12-01 | Learned Image Compression for Earth Observation: Implications for Downstream Segmentation Tasks | Christian Mollière et.al. | 2512.01788 | null |
| 2025-12-01 | Resource Estimation for VQE on Small Molecules: Impact of Fermion Mappings and Hamiltonian Reductions | Anurag K. S. V. et.al. | 2512.01605 | null |
| 2025-12-01 | Neural Network Perturbation Theory (NNPT): Learning Residual Corrections from Exact Solutions | Zhenhao Chen et.al. | 2512.01558 | null |
| 2025-12-01 | LPCD: Unified Framework from Layer-Wise to Submodule Quantization | Yuma Ichikawa et.al. | 2512.01546 | null |
| 2025-12-01 | FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention | Zipeng Wang et.al. | 2512.01540 | null |
| 2025-12-01 | The Poisson-Fourier Transform for bicrossed products I: Abelian approximations and the quantum duality principle | A. Massar et.al. | 2512.01536 | null |
| 2025-12-01 | Diffusion Fuzzy System: Fuzzy Rule Guided Latent Multi-Path Diffusion Modeling | Hailong Yang et.al. | 2512.01533 | null |
| 2025-12-01 | MEGConformer: Conformer-Based MEG Decoder for Robust Speech and Phoneme Classification | Xabier de Zuazo et.al. | 2512.01443 | null |
| 2025-12-01 | Fantastic Features and Where to Find Them: A Probing Method to combine Features from Multiple Foundation Models | Benjamin Ramtoula et.al. | 2512.01405 | null |
| 2025-12-01 | Intrinsic Structure as a Proxy for Saliency: SVD-Based Weight Preservation for Mixed-Precision Quantization in Large Language Models | Shashank Landge et.al. | 2512.01343 | null |
| 2025-12-01 | EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly | Xiaokun Pan et.al. | 2512.01296 | null |
| 2025-12-01 | Diffusion Model in Latent Space for Medical Image Segmentation Task | Huynh Trinh Ngoc et.al. | 2512.01292 | null |
| 2025-12-01 | Efficient Training of Diffusion Mixture-of-Experts Models: A Practical Recipe | Yahui Liu et.al. | 2512.01252 | null |
| 2025-12-01 | First On-Orbit Demonstration of a Geospatial Foundation Model | Andrew Du et.al. | 2512.01181 | null |
| 2025-11-30 | Projection-Free CNN Pruning via Frank-Wolfe with Momentum: Sparser Models with Less Pretraining | Hamza ElMokhtar Shili et.al. | 2512.01147 | null |
| 2025-11-30 | Structural Prognostic Event Modeling for Multimodal Cancer Survival Analysis | Yilan Zhang et.al. | 2512.01116 | null |
| 2025-11-30 | Parameter Reduction Improves Vision Transformers: A Comparative Study of Sharing and Width Reduction | Anantha Padmanaban Krishna Kumar et.al. | 2512.01059 | null |
| 2025-11-30 | A Provably Efficient Method for Tensor Ring Decomposition and Its Applications | Han Chen et.al. | 2512.01016 | null |
| 2025-11-30 | WUSH: Near-Optimal Adaptive Transforms for LLM Quantization | Jiale Chen et.al. | 2512.00956 | null |
| 2025-11-28 | Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction | Bao Shu et.al. | 2511.23476 | null |
| 2025-11-28 | Visual Generation Tuning | Jiahao Guo et.al. | 2511.23469 | null |
| 2025-11-28 | Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning | Jiajun Guo et.al. | 2511.23402 | null |
| 2025-11-28 | FedSGT: Exact Federated Unlearning via Sequential Group-based Training | Bokang Zhang et.al. | 2511.23393 | null |
| 2025-11-28 | VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction | Sinan Du et.al. | 2511.23386 | null |
| 2025-11-28 | Optimizing Multimodal Language Models through Attention-based Interpretability | Alexander Sergeev et.al. | 2511.23375 | null |
| 2025-11-28 | Chart2Code-MoLA: Efficient Multi-Modal Code Generation via Adaptive Expert Routing | Yifei Wang et.al. | 2511.23321 | null |
| 2025-11-28 | Efficient Estimation of Sum-Parameters for Multi-Component Complex Exponential Signals with Theoretical Cramer-Rao Bound Analysis | Huiguang Zhang et.al. | 2511.23318 | null |
| 2025-11-28 | Closing the Generalization Gap in Parameter-efficient Federated Edge Learning | Xinnong Du et.al. | 2511.23282 | null |
| 2025-11-28 | Behavior-Equivalent Token: Single-Token Replacement for Long Prompts in LLMs | Jiancheng Dong et.al. | 2511.23271 | null |
| 2025-11-28 | PointCNN++: Performant Convolution on Native Points | Lihan Li et.al. | 2511.23227 | null |
| 2025-11-28 | TWEO: Transformers Without Extreme Outliers Enables FP8 Training And Quantization For Dummies | Guang Liang et.al. | 2511.23225 | null |
| 2025-11-28 | Pathryoshka: Compressing Pathology Foundation Models via Multi-Teacher Knowledge Distillation with Nested Embeddings | Christian Grashei et.al. | 2511.23204 | null |
| 2025-11-28 | InstanceV: Instance-Level Video Generation | Yuheng Chen et.al. | 2511.23146 | null |
| 2025-11-28 | Evolutionary Discovery of Heuristic Policies for Traffic Signal Control | Ruibing Wang et.al. | 2511.23122 | null |
| 2025-11-28 | Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM | Mengjie Liu et.al. | 2511.23119 | null |
| 2025-11-28 | Accent Placement Models for Rigvedic Sanskrit Text | Akhil Rajeev P et.al. | 2511.23088 | null |
| 2025-11-28 | EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model | Yuhao Xu et.al. | 2511.22935 | null |
| 2025-11-28 | AgentShield: Make MAS more secure and efficient | Kaixiang Wang et.al. | 2511.22924 | null |
| 2025-11-28 | ORION: Teaching Language Models to Reason Efficiently in the Language of Thought | Kumar Tanmay et.al. | 2511.22891 | null |
| 2025-11-28 | Serving Heterogeneous LoRA Adapters in Distributed LLM Inference Systems | Shashwat Jaiswal et.al. | 2511.22880 | null |
| 2025-11-28 | CNN-Based Framework for Pedestrian Age and Gender Classification Using Far-View Surveillance in Mixed-Traffic Intersections | Shisir Shahriar Arif et.al. | 2511.22873 | null |
| 2025-11-28 | PerfMamba: Performance Analysis and Pruning of Selective State Space Models | Abdullah Al Asif et.al. | 2511.22849 | null |
| 2025-11-27 | FPGA-Enabled Modulo ADC with x100 Dynamic-Range Expansion: Hardware Design and Performance Evaluation | Zeyuan Li et.al. | 2511.22752 | null |
| 2025-11-27 | All Centers Are at most a Few Tokens Apart: Knowledge Distillation with Domain Invariant Prompt Tuning | Amir Mohammad Ezzati et.al. | 2511.22739 | null |
| 2025-11-27 | Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer | Z-Image Team et.al. | 2511.22699 | null |
| 2025-11-27 | Smarter, not Bigger: Fine-Tuned RAG-Enhanced LLMs for Automotive HIL Testing | Chao Feng et.al. | 2511.22584 | null |
| 2025-11-27 | Diff-ICMH: Harmonizing Machine and Human Vision in Image Compression with Generative Prior | Ruoyu Feng et.al. | 2511.22549 | null |
| 2025-11-27 | Enhancing Trustworthiness with Mixed Precision: Benchmarks, Opportunities, and Challenges | Guanxi Lu et.al. | 2511.22483 | null |
| 2025-11-27 | OmniInfer: System-Wide Acceleration Techniques for Optimizing LLM Serving Throughput and Latency | Jun Wang et.al. | 2511.22481 | null |
| 2025-11-27 | RoadSceneBench: A Lightweight Benchmark for Mid-Level Road Scene Understanding | Xiyan Liu et.al. | 2511.22466 | null |
| 2025-11-27 | An Efficient Embedding Based Ad Retrieval with GPU-Powered Feature Interaction | Yifan Lei et.al. | 2511.22460 | null |
| 2025-11-27 | ITS3D: Inference-Time Scaling for Text-Guided 3D Diffusion Models | Zhenglin Zhou et.al. | 2511.22456 | null |
| 2025-11-27 | Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation | Weining Ren et.al. | 2511.22429 | null |
| 2025-11-27 | Efficient-Husformer: Efficient Multimodal Transformer Hyperparameter Optimization for Stress and Cognitive Loads | Merey Orazaly et.al. | 2511.22362 | null |
| 2025-11-26 | ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration | Hongjin Su et.al. | 2511.21689 | null |
| 2025-11-26 | Continual Error Correction on Low-Resource Devices | Kirill Paramonov et.al. | 2511.21652 | null |
| 2025-11-26 | Automated Protein Motif Localization using Concept Activation Vectors in Protein Language Model Embedding Space | Ahmad Shamail et.al. | 2511.21614 | null |
| 2025-11-26 | Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining | Dongyang Fan et.al. | 2511.21613 | null |
| 2025-11-26 | Multimodal Robust Prompt Distillation for 3D Point Cloud Models | Xiang Gu et.al. | 2511.21574 | null |
| 2025-11-26 | EoS-FM: Can an Ensemble of Specialist Models act as a Generalist Feature Extractor? | Pierre Adorni et.al. | 2511.21523 | null |
| 2025-11-26 | IntAttention: A Fully Integer Attention Pipeline for Efficient Edge Inference | Wanli Zhong et.al. | 2511.21513 | null |
| 2025-11-26 | CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation | Shizhe Sun et.al. | 2511.21503 | null |
| 2025-11-26 | MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices | Shuai Zhang et.al. | 2511.21475 | null |
| 2025-11-26 | VibraWave: Sensing the Pulse of Polluted Waters | Sagnik Ghosh et.al. | 2511.21456 | null |
| 2025-11-26 | Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning | Kaifeng Hong et.al. | 2511.21416 | null |
| 2025-11-26 | Knowledge Distillation for Continual Learning of Biomedical Neural Fields | Wouter Visser et.al. | 2511.21409 | null |
| 2025-11-26 | Prune4Web: DOM Tree Pruning Programming for Web Agent | Jiayuan Zhang et.al. | 2511.21398 | null |
| 2025-11-26 | FITRep: Attention-Guided Item Representation via MLLMs | Guoxiao Zhang et.al. | 2511.21389 | null |
| 2025-11-26 | An octree-based sampling algorithm for analyzing big simulation data | Janis Geise et.al. | 2511.21352 | null |
| 2025-11-26 | Helical Quasiperiodic Chains with Engineered Dissipation: Liouvillian Rapidity Diagnostics of Transport and Localization | Mohammad Pouranvari et.al. | 2511.21332 | null |
| 2025-11-26 | PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark | Robert Belanec et.al. | 2511.21285 | null |
| 2025-11-26 | DynamicAdaptiveClimb: Adaptive Cache Replacement with Dynamic Resizing | Daniel Berend et.al. | 2511.21235 | null |
| 2025-11-26 | Data Exfiltration by Compression Attack: Definition and Evaluation on Medical Image Data | Huiyu Li et.al. | 2511.21227 | null |
| 2025-11-26 | LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs | Shichu Sun et.al. | 2511.21150 | null |
| 2025-11-26 | Which Layer Causes Distribution Deviation? Entropy-Guided Adaptive Pruning for Diffusion and Flow Models | Changlin Li et.al. | 2511.21122 | null |
| 2025-11-26 | Quantum Hard Spheres with Affine Quantization | Riccardo Fantoni et.al. | 2511.21119 | null |
| 2025-11-26 | EM-KD: Distilling Efficient Multimodal Large Language Model with Unbalanced Vision Tokens | Ze Feng et.al. | 2511.21106 | null |
| 2025-11-26 | MLPMoE: Zero-Shot Architectural Metamorphosis of Dense LLM MLPs into Static Mixture-of-Experts | Ivan Novikov et.al. | 2511.21089 | null |
| 2025-11-26 | 5G Network Automation Using Local Large Language Models and Retrieval-Augmented Generation | Ahmadreza Majlesara et.al. | 2511.21084 | null |
| 2025-11-26 | G-Net: A Provably Easy Construction of High-Accuracy Random Binary Neural Networks | Alireza Aghasi et.al. | 2511.21063 | null |
| 2025-11-26 | RAVQ-HoloNet: Rate-Adaptive Vector-Quantized Hologram Compression | Shima Rafiei et.al. | 2511.21035 | null |
| 2025-11-26 | Lightweight Model Editing for LLMs to Correct Deprecated API Recommendations | Guancheng Lin et.al. | 2511.21022 | null |
| 2025-11-26 | ICPO: Intrinsic Confidence-Driven Group Relative Preference Optimization for Efficient Reinforcement Learning | Jinpeng Wang et.al. | 2511.21005 | null |
| 2025-11-25 | $Δ$ -NeRF: Incremental Refinement of Neural Radiance Fields through Residual Control and Knowledge Transfer | Kriti Ghosh et.al. | 2511.20804 | null |
| 2025-11-25 | Unleashing the Power of Vision-Language Models for Long-Tailed Multi-Label Visual Recognition | Wei Tang et.al. | 2511.20641 | null |
| 2025-11-25 | DiFR: Inference Verification Despite Nondeterminism | Adam Karvonen et.al. | 2511.20621 | null |
| 2025-11-25 | NVIDIA Nemotron Parse 1.1 | Kateryna Chumachenko et.al. | 2511.20478 | null |
| 2025-11-25 | Efficient Estimation of Multiple Temperatures via a Collisional Model | Srijon Ghosh et.al. | 2511.20448 | null |
| 2025-11-25 | Object-Centric Vision Token Pruning for Vision Language Models | Guangyuan Li et.al. | 2511.20439 | null |
| 2025-11-25 | BRIC: Bridging Kinematic Plans and Physical Control at Test Time | Dohun Lim et.al. | 2511.20431 | null |
| 2025-11-25 | Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs | Bao Tang et.al. | 2511.20410 | null |
| 2025-11-25 | MoRE: Batch-Robust Multi-Omics Representations from Frozen Pre-trained Transformers | Audrey Pei-Hsuan Chen et.al. | 2511.20382 | null |
| 2025-11-25 | From Passive Perception to Active Memory: A Weakly Supervised Image Manipulation Localization Framework Driven by Coarse-Grained Annotations | Zhiqing Guo et.al. | 2511.20359 | null |
| 2025-11-25 | Resistive switching and long-range filaments in metal/DMSO liquid systems for three-dimensional, multi-terminal connection schemes with on demand dynamic reconfigurability | Roshani Madurawala et.al. | 2511.20314 | null |
| 2025-11-25 | CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation | Shilei Cao et.al. | 2511.20302 | null |
| 2025-11-25 | Forgetting by Pruning: Data Deletion in Join Cardinality Estimation | Chaowei He et.al. | 2511.20293 | null |
| 2025-11-25 | Modality-Balanced Collaborative Distillation for Multi-Modal Domain Generalization | Xiaohan Wang et.al. | 2511.20258 | null |
| 2025-11-25 | Communication-Efficient Learning for Satellite Constellations | Ruxandra-Stefania Tudose et.al. | 2511.20220 | null |
| 2025-11-25 | Interactive AI NPCs Powered by LLMs: Technical Report for the CPDC Challenge 2025 | Yitian Huang et.al. | 2511.20200 | null |
| 2025-11-25 | Efficient multi-fidelity Gaussian process regression for noisy outputs and non-nested experimental designs | Nils Baillie et.al. | 2511.20183 | null |
| 2025-11-25 | KyrgyzBERT: A Compact, Efficient Language Model for Kyrgyz NLP | Adilet Metinov et.al. | 2511.20182 | null |
| 2025-11-25 | Hybrid Convolution and Frequency State Space Network for Image Compression | Haodong Pan et.al. | 2511.20151 | null |
| 2025-11-25 | Fusion of Simulation and Experiment Data for Hypersonic Flow Field Prediction via Pre-Training and Fine-Tuning | Yuan Jia et.al. | 2511.20149 | null |
| 2025-11-25 | IDAP++: Advancing Divergence-Based Pruning via Filter-Level and Layer-Level Optimization | Aleksei Samarin et.al. | 2511.20141 | null |
| 2025-11-25 | WPT: World-to-Policy Transfer via Online World Model Distillation | Guangfeng Jiang et.al. | 2511.20095 | null |
| 2025-11-25 | VICoT-Agent: A Vision-Interleaved Chain-of-Thought Framework for Interpretable Multimodal Reasoning and Scalable Remote Sensing Analysis | Chujie Wang et.al. | 2511.20085 | null |
| 2025-11-25 | FLaTEC: Frequency-Disentangled Latent Triplanes for Efficient Compression of LiDAR Point Clouds | Xiaoge Zhang et.al. | 2511.20065 | null |
| 2025-11-25 | On-Demand Multi-Task Sparsity for Efficient Large-Model Deployment on Edge Devices | Lianming Huang et.al. | 2511.19986 | null |
| 2025-11-25 | Error-structure-tailored early fault-tolerant quantum computing | Pei Zeng et.al. | 2511.19983 | null |
| 2025-11-25 | M $^3$ Prune: Hierarchical Communication Graph Pruning for Efficient Multi-Modal Multi-Agent Retrieval-Augmented Generation | Weizi Shao et.al. | 2511.19969 | null |
| 2025-11-25 | Towards Edge General Intelligence: Knowledge Distillation for Mobile Agentic AI | Yuxuan Wu et.al. | 2511.19947 | null |
| 2025-11-25 | EfficientXpert: Efficient Domain Adaptation for Large Language Models via Propagation-Aware Pruning | Songlin Zhao et.al. | 2511.19935 | null |
| 2025-11-25 | Context-Aware Token Pruning and Discriminative Selective Attention for Transformer Tracking | Janani Kugarajeevan et.al. | 2511.19928 | null |
| 2025-11-25 | Efficient Importance Sampling under Heston Model: Short Maturity and Deep Out-of-the-Money Options | Yun-Feng Tu et.al. | 2511.19826 | null |
| 2025-11-25 | Mosaic Pruning: A Hierarchical Framework for Generalizable Pruning of Mixture-of-Experts Models | Wentao Hu et.al. | 2511.19822 | null |
| 2025-11-24 | NOEM $^{3}$ A: A Neuro-Symbolic Ontology-Enhanced Method for Multi-Intent Understanding in Mobile Agents | Ioannis Tzachristas et.al. | 2511.19780 | null |
| 2025-11-24 | A Storage-Efficient Feature for 3D Concrete Defect Segmentation to Replace Normal Vector | Linxin Hua et.al. | 2511.19760 | null |
| 2025-11-24 | Leveraging Foundation Models for Histological Grading in Cutaneous Squamous Cell Carcinoma using PathFMTools | Abdul Rahman Diab et.al. | 2511.19751 | null |
| 2025-11-24 | Rethinking Vision Transformer Depth via Structural Reparameterization | Chengwei Zhou et.al. | 2511.19718 | null |
| 2025-11-24 | CafeQ: Calibration-free Quantization via Learned Transformations and Adaptive Rounding | Ziteng Sun et.al. | 2511.19705 | null |
| 2025-11-24 | RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models | Omar Alama et.al. | 2511.19704 | null |
| 2025-11-24 | INTERLACE: Interleaved Layer Pruning and Efficient Adaptation in Large Vision-Language Models | Parsa Madinei et.al. | 2511.19676 | null |
| 2025-11-24 | FISCAL: Financial Synthetic Claim-document Augmented Learning for Efficient Fact-Checking | Rishab Sharma et.al. | 2511.19671 | null |
| 2025-11-24 | SLMFix: Leveraging Small Language Models for Error Fixing with Reinforcement Learning | David Jiahao Fu et.al. | 2511.19422 | null |
| 2025-11-24 | Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens | Yiming Qin et.al. | 2511.19418 | null |
| 2025-11-24 | Be My Eyes: Extending Large Language Models to New Modalities Through Multi-Agent Collaboration | James Y. Huang et.al. | 2511.19417 | null |
| 2025-11-24 | Learning Plug-and-play Memory for Guiding Video Diffusion Models | Selena Song et.al. | 2511.19229 | null |
| 2025-11-24 | Communication: Modeling layered mosaic perovskite alloy microstructures across length scales via a packing algorithm | Murray Skolnick et.al. | 2511.19228 | null |
| 2025-11-24 | UMCL: Unimodal-generated Multimodal Contrastive Learning for Cross-compression-rate Deepfake Detection | Ching-Yi Lai et.al. | 2511.18983 | null |
| 2025-11-24 | FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning | Xin Yuan et.al. | 2511.18977 | null |
| 2025-11-24 | Compressor-VLA: Instruction-Guided Visual Token Compression for Efficient Robotic Manipulation | Juntao Gao et.al. | 2511.18950 | null |
| 2025-11-24 | SWAN: Sparse Winnowed Attention for Reduced Inference Memory via Decompression-Free KV-Cache Compression | Santhosh G S et.al. | 2511.18936 | null |
| 2025-11-24 | EventSTU: Event-Guided Efficient Spatio-Temporal Understanding for Video Large Language Models | Wenhao Xu et.al. | 2511.18920 | null |
| 2025-11-24 | Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models | Yonggan Fu et.al. | 2511.18890 | null |
| 2025-11-24 | HunyuanVideo 1.5 Technical Report | Bing Wu et.al. | 2511.18870 | null |
| 2025-11-24 | Think Before You Prune: Selective Self-Generated Calibration for Pruning Large Reasoning Models | Yang Xiang et.al. | 2511.18864 | null |
| 2025-11-24 | Optimizing LLM Code Suggestions: Feedback-Driven Timing with Lightweight State Bounds | Mohammad Nour Al Awad et.al. | 2511.18842 | null |
| 2025-11-24 | Auto-ML Graph Neural Network Hypermodels for Outcome Prediction in Event-Sequence Data | Fang Wang et.al. | 2511.18835 | null |
| 2025-11-24 | Concept than Document: Context Compression via AMR-based Conceptual Entropy | Kaize Shi et.al. | 2511.18832 | null |
| 2025-11-24 | VideoCompressa: Data-Efficient Video Understanding via Joint Temporal Compression and Spatial Reconstruction | Shaobo Wang et.al. | 2511.18831 | null |
| 2025-11-24 | Towards Characterizing Knowledge Distillation of PPG Heart Rate Estimation Models | Kanav Arora et.al. | 2511.18829 | null |
| 2025-11-24 | Uncertainty-Aware Dual-Student Knowledge Distillation for Efficient Image Classification | Aakash Gore et.al. | 2511.18826 | null |
| 2025-11-24 | DiP: Taming Diffusion Models in Pixel Space | Zhennan Chen et.al. | 2511.18822 | null |
| 2025-11-24 | HERMES: Towards Efficient and Verifiable Mathematical Reasoning in LLMs | Azim Ospanov et.al. | 2511.18760 | null |
| 2025-11-24 | CoD: A Diffusion Foundation Model for Image Compression | Zhaoyang Jia et.al. | 2511.18706 | null |
| 2025-11-24 | VLM in a flash: I/O-Efficient Sparsification of Vision-Language Model via Neuron Chunking | Kichang Yang et.al. | 2511.18692 | null |
| 2025-11-24 | EVCC: Enhanced Vision Transformer-ConvNeXt-CoAtNet Fusion for Classification | Kazi Reyazul Hasan et.al. | 2511.18691 | null |
| 2025-11-24 | QuantKAN: A Unified Quantization Framework for Kolmogorov Arnold Networks | Kazi Ahmed Asif Fuad et.al. | 2511.18689 | null |
| 2025-11-23 | Kitty: Accurate and Efficient 2-bit KV Cache Quantization with Dynamic Channel-wise Precision Boost | Haojun Xia et.al. | 2511.18643 | null |
| 2025-11-23 | AutoFocus-IL: VLM-based Saliency Maps for Data-Efficient Visual Imitation Learning without Extra Human Annotations | Litian Gong et.al. | 2511.18617 | null |
| 2025-11-23 | Quantum machine learning for efficient reduced order modelling of turbulent flows | Han Li et.al. | 2511.18552 | null |
| 2025-11-21 | Native 3D Editing with Full Attention | Weiwei Cai et.al. | 2511.17501 | null |
| 2025-11-21 | Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift | Björn Michele et.al. | 2511.17455 | null |
| 2025-11-21 | MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Models | Yuqi Li et.al. | 2511.17448 | null |
| 2025-11-21 | Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers | Christopher Boland et.al. | 2511.17421 | null |
| 2025-11-21 | DS-Span: Single-Phase Discriminative Subgraph Mining for Efficient Graph Embeddings | Yeamin Kaiser et.al. | 2511.17419 | null |
| 2025-11-21 | METIS: Multi-Source Egocentric Training for Integrated Dexterous Vision-Language-Action Model | Yankai Fu et.al. | 2511.17366 | null |
| 2025-11-21 | Efficient calculation of magnetic fields from ferromagnetic materials near strong electromagnets, and application to stellarator coil optimization | Matt Landreman et.al. | 2511.17305 | null |
| 2025-11-21 | Equivariant-Aware Structured Pruning for Efficient Edge Deployment: A Comprehensive Framework with Adaptive Fine-Tuning | Mohammed Alnemari et.al. | 2511.17242 | null |
| 2025-11-21 | E $^3$ -Pruner: Towards Efficient, Economical, and Effective Layer Pruning for Large Language Models | Tao Yuan et.al. | 2511.17205 | null |
| 2025-11-21 | Efficient Robot Design with Multi-Objective Black-Box Optimization and Large Language Models | Kento Kawaharazuka et.al. | 2511.17178 | null |
| 2025-11-21 | Magnetized particle motion and accretion process with shock cone morphology around a decoupled hairy black holes | G. Mustafa et.al. | 2511.17137 | null |
| 2025-11-21 | A Multi-Stage Optimization Framework for Deploying Learned Image Compression on FPGAs | Jiaxun Fang et.al. | 2511.17135 | null |
| 2025-11-21 | Learning to Compress: Unlocking the Potential of Large Language Models for Text Representation | Yeqin Zhang et.al. | 2511.17129 | null |
| 2025-11-21 | Layer-wise Weight Selection for Power-Efficient Neural Network Acceleration | Jiaxun Fang et.al. | 2511.17123 | null |
| 2025-11-21 | CLLMRec: LLM-powered Cognitive-Aware Concept Recommendation via Semantic Alignment and Prerequisite Knowledge Distillation | Xiangrui Xiong et.al. | 2511.17041 | null |
| 2025-11-21 | Supervised Fine Tuning of Large Language Models for Domain Specific Knowledge Graph Construction:A Case Study on Hunan’s Historical Celebrities | Junjie Hao et.al. | 2511.17012 | null |
| 2025-11-21 | Gradient-Driven Natural Selection for Compact 3D Gaussian Splatting | Xiaobin Deng et.al. | 2511.16980 | null |
| 2025-11-21 | RASTP: Representation-Aware Semantic Token Pruning for Generative Recommendation with Semantic Identifiers | Tianyu Zhan et.al. | 2511.16943 | null |
| 2025-11-21 | Berezin-Toeplitz quantization revisited | Kwokwai Chan et.al. | 2511.16889 | null |
| 2025-11-21 | Avoiding Quality Saturation in UGC Compression Using Denoised References | Xin Xiong et.al. | 2511.16876 | null |
| 2025-11-20 | Efficient Penalty-Based Bilevel Methods: Improved Analysis, Novel Updates, and Flatness Condition | Liuyuan Jiang et.al. | 2511.16796 | null |
| 2025-11-20 | Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach | Yaoxin Yang et.al. | 2511.16786 | null |
| 2025-11-20 | RampoNN: A Reachability-Guided System Falsification for Efficient Cyber-Kinetic Vulnerability Detection | Kohei Tsujio et.al. | 2511.16765 | null |
| 2025-11-20 | Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter | Qinghao Hu et.al. | 2511.16665 | null |
| 2025-11-20 | Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs | Ali Taghibakhshi et.al. | 2511.16664 | null |
| 2025-11-20 | Teacher-Guided One-Shot Pruning via Context-Aware Knowledge Distillation | Md. Samiul Alim et.al. | 2511.16653 | null |
| 2025-11-21 | You Only Forward Once: An Efficient Compositional Judging Paradigm | Tianlong Zhang et.al. | 2511.16600 | null |
| 2025-11-20 | TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding | Boshen Xu et.al. | 2511.16595 | null |
| 2025-11-20 | The Oracle and The Prism: A Decoupled and Efficient Framework for Generative Recommendation Explanation | Jiaheng Zhang et.al. | 2511.16543 | null |
| 2025-11-20 | Optimizing Federated Learning in the Era of LLMs: Message Quantization and Streaming | Ziyue Xu et.al. | 2511.16450 | null |
| 2025-11-20 | VLA-Pruner: Temporal-Aware Dual-Level Visual Token Pruning for Efficient Vision-Language-Action Inference | Ziyan Liu et.al. | 2511.16449 | null |
| 2025-11-20 | FreqFlow: Long-term forecasting using lightweight flow matching | Seyed Mohamad Moghadas et.al. | 2511.16426 | null |
| 2025-11-20 | TOFA: Training-Free One-Shot Federated Adaptation for Vision-Language Models | Li Zhang et.al. | 2511.16423 | null |
| 2025-11-20 | An Efficient LLM-based Evolutional Recommendation with Locate-Forget-Update Paradigm | Hao Liu et.al. | 2511.16414 | null |
| 2025-11-20 | VersaPants: A Loose-Fitting Textile Capacitive Sensing System for Lower-Body Motion Capture | Deniz Kasap et.al. | 2511.16346 | null |
| 2025-11-20 | Aerial View River Landform Video segmentation: A Weakly Supervised Context-aware Temporal Consistency Distillation Approach | Chi-Han Chen et.al. | 2511.16343 | null |
| 2025-11-20 | SDA: Steering-Driven Distribution Alignment for Open LLMs without Fine-Tuning | Wei Xia et.al. | 2511.16324 | null |
| 2025-11-20 | WWE-UIE: A Wavelet & White Balance Efficient Network for Underwater Image Enhancement | Ching-Heng Cheng et.al. | 2511.16321 | null |
| 2025-11-20 | SeSE: A Structural Information-Guided Uncertainty Quantification Framework for Hallucination Detection in LLMs | Xingtao Zhao et.al. | 2511.16275 | null |
| 2025-11-20 | Accelerating Reionization Constraints: An ANN-Emulator Framework for the SCRIPT Semi-numerical Model | Saptarshi Sarkar et.al. | 2511.16256 | null |
| 2025-11-20 | FT-NCFM: An Influence-Aware Data Distillation Framework for Efficient VLA Models | Kewei Chen et.al. | 2511.16233 | null |
| 2025-11-20 | Q-MLLM: Vector Quantization for Robust Multimodal Large Language Model Security | Wei Zhao et.al. | 2511.16229 | null |
| 2025-11-20 | Optical Waveguide-Pair Design for CMOS-Compatible Hybrid III-V-on-Silicon Quantum Dot Lasers | Peter Raymond Smith et.al. | 2511.16222 | null |
| 2025-11-20 | PIPHEN: Physical Interaction Prediction with Hamiltonian Energy Networks | Kewei Chen et.al. | 2511.16200 | null |
| 2025-11-20 | Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers | Jian Ma et.al. | 2511.16156 | null |
| 2025-11-20 | TS-PEFT: Token-Selective Parameter-Efficient Fine-Tuning with Learnable Threshold Gating | Dabiao Ma et.al. | 2511.16147 | null |
| 2025-11-20 | LEGO-SLAM: Language-Embedded Gaussian Optimization SLAM | Sibaek Lee et.al. | 2511.16144 | null |
| 2025-11-20 | Degradation-Aware Hierarchical Termination for Blind Quality Enhancement of Compressed Video | Li Yu et.al. | 2511.16137 | null |
| 2025-11-20 | Change-of-Basis Pruning via Rotational Invariance | Alex Ning et.al. | 2511.16061 | null |
| 2025-11-20 | LiSTAR: Ray-Centric World Models for 4D LiDAR Sequences in Autonomous Driving | Pei Liu et.al. | 2511.16049 | null |
| 2025-11-20 | Fairness in Multi-modal Medical Diagnosis with Demonstration Selection | Dawei Li et.al. | 2511.15986 | null |
| 2025-11-20 | JudgeBoard: Benchmarking and Enhancing Small Language Models for Reasoning Evaluation | Zhenyu Bi et.al. | 2511.15958 | null |
| 2025-11-20 | A Scalable NorthPole System with End-to-End Vertical Integration for Low-Latency and Energy-Efficient LLM Inference | Michael V. DeBole et.al. | 2511.15950 | null |
| 2025-11-19 | discretize_distributions: Efficient Quantization of Gaussian Mixtures with Guarantees in Wasserstein Distance | Steven Adams et.al. | 2511.15854 | null |
| 2025-11-19 | EfficientSAM3: Progressive Hierarchical Distillation for Video Concept Segmentation from SAM1, 2, and 3 | Chengxi Zeng et.al. | 2511.15833 | null |
| 2025-11-19 | Dimensional Phenomenology in Polymeric Quantization Framework | Kourosh Nozari et.al. | 2511.15826 | null |
| 2025-11-19 | UniUltra: Interactive Parameter-Efficient SAM2 for Universal Ultrasound Segmentation | Yue Li et.al. | 2511.15771 | null |
| 2025-11-19 | Joint Semantic-Channel Coding and Modulation for Token Communications | Jingkai Ying et.al. | 2511.15699 | null |
| 2025-11-19 | The Impact of Quantization on Large Reasoning Model Reinforcement Learning | Medha Kumar et.al. | 2511.15694 | null |
| 2025-11-19 | From Low-Rank Features to Encoding Mismatch: Rethinking Feature Distillation in Vision Transformers | Huiyuan Tian et.al. | 2511.15572 | null |
| 2025-11-19 | Learning to Expand Images for Efficient Visual Autoregressive Modeling | Ruiqing Yang et.al. | 2511.15499 | null |
| 2025-11-19 | Batalin-Fradkin-Vilkovisky Quantization of Quadratic Gravity | Jorge Bellorin et.al. | 2511.15474 | null |
| 2025-11-19 | Small Language Models for Phishing Website Detection: Cost, Performance, and Privacy Trade-Offs | Georg Goldenits et.al. | 2511.15434 | null |
| 2025-11-19 | D4C: Data-free Quantization for Contrastive Language-Image Pre-training Models | Wenlun Zhang et.al. | 2511.15411 | null |
| 2025-11-19 | Breaking Expert Knowledge Limits: Self-Pruning for Large Language Models | Haidong Kang et.al. | 2511.15390 | null |
| 2025-11-19 | Parameter Importance-Driven Continual Learning for Foundation Models | Lingxiang Wang et.al. | 2511.15375 | null |
| 2025-11-19 | IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers | Gihwan Kim et.al. | 2511.15369 | null |
| 2025-11-19 | Fidelity-Preserving Quantum Encoding for Quantum Neural Networks | Yuhu Lu et.al. | 2511.15363 | null |
| 2025-11-19 | Quant-Trim in Practice: Improved Cross-Platform Low-Bit Deployment on Edge NPUs | Rayen Dhahri et.al. | 2511.15300 | null |
| 2025-11-19 | Context Cascade Compression: Exploring the Upper Limits of Text Compression | Fanfan Liu et.al. | 2511.15244 | null |
| 2025-11-19 | SkinGPT-R1: Adapter-Only Dual Distillation for Efficient Dermatology Reasoning | Yuhao Shen et.al. | 2511.15242 | null |
| 2025-11-19 | Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning | Yuxuan Gu et.al. | 2511.15190 | null |
| 2025-11-19 | Efficient RF Passive Components Modeling with Bayesian Online Learning and Uncertainty Aware Sampling | Huifan Zhang et.al. | 2511.15125 | null |
| 2025-11-19 | Multi-Aspect Cross-modal Quantization for Generative Recommendation | Fuwei Zhang et.al. | 2511.15122 | null |
| 2025-11-19 | A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models | Duo Li et.al. | 2511.15098 | null |
| 2025-11-19 | Cement2: Temporal Hardware Transactions for High-Level and Efficient FPGA Programming | Youwei Xiao et.al. | 2511.15073 | null |
| 2025-11-19 | Learning Human-Like RL Agents Through Trajectory Optimization With Action Quantization | Jian-Ting Guo et.al. | 2511.15055 | null |
| 2025-11-19 | Dynamic Expert Quantization for Scalable Mixture-of-Experts Inference | Kexin Chu et.al. | 2511.15015 | null |
| 2025-11-19 | Compiling Set Queries into Work-Efficient Tree Traversals | Alexander J Root et.al. | 2511.15000 | null |
| 2025-11-18 | Logit-Based Losses Limit the Effectiveness of Feature Knowledge Distillation | Nicholas Cooper et.al. | 2511.14981 | null |
| 2025-11-18 | SparseST: Exploiting Data Sparsity in Spatiotemporal Modeling and Prediction | Junfeng Wu et.al. | 2511.14753 | null |
| 2025-11-18 | AdamHD: Decoupled Huber Decay Regularization for Language Model Pre-Training | Fu-Ming Guo et.al. | 2511.14721 | null |
| 2025-11-18 | Near-Lossless Model Compression Enables Longer Context Inference in DNA Large Language Models | Rui Zhu et.al. | 2511.14694 | null |
| 2025-11-18 | AutoTool: Efficient Tool Selection for Large Language Model Agents | Jingyi Jia et.al. | 2511.14650 | null |
| 2025-11-18 | Expert-Guided POMDP Learning for Data-Efficient Modeling in Healthcare | Marco Locatelli et.al. | 2511.14619 | null |
| 2025-11-18 | CCSD: Cross-Modal Compositional Self-Distillation for Robust Brain Tumor Segmentation with Missing Modalities | Dongqing Xie et.al. | 2511.14599 | null |
| 2025-11-18 | IMSE: Efficient U-Net-based Speech Enhancement using Inception Depthwise Convolution and Amplitude-Aware Linear Attention | Xinxin Tang et.al. | 2511.14515 | null |
| 2025-11-18 | Watch Out for the Lifespan: Evaluating Backdoor Attacks Against Federated Model Adaptation | Bastien Vuillod et.al. | 2511.14406 | null |
| 2025-11-18 | Jasper-Token-Compression-600M Technical Report | Dun Zhang et.al. | 2511.14405 | null |
| 2025-11-18 | SAM-Fed: SAM-Guided Federated Semi-Supervised Learning for Medical Image Segmentation | Sahar Nasirihaghighi et.al. | 2511.14302 | null |
| 2025-11-18 | Weight Variance Amplifier Improves Accuracy in High-Sparsity One-Shot Pruning | Vincent-Daniel Yun et.al. | 2511.14282 | null |
| 2025-11-18 | Entropy-Guided Reasoning Compression | Hourun Zhu et.al. | 2511.14258 | null |
| 2025-11-18 | Enhancing Generalization of Depth Estimation Foundation Model via Weakly-Supervised Adaptation with Regularization | Yan Huang et.al. | 2511.14238 | null |
| 2025-11-18 | Online Data Curation for Object Detection via Marginal Contributions to Dataset-level Average Precision | Zitang Sun et.al. | 2511.14197 | null |
| 2025-11-18 | Few-Shot Precise Event Spotting via Unified Multi-Entity Graph and Distillation | Zhaoyu Liu et.al. | 2511.14186 | null |
| 2025-11-18 | AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs | Xinliang Zhang et.al. | 2511.14169 | null |
| 2025-11-18 | Run, Ruminate, and Regulate: A Dual-process Thinking System for Vision-and-Language Navigation | Yu Zhong et.al. | 2511.14131 | null |
| 2025-11-18 | Canonical quantization for Equilibrium Thermodynamics | Luis F. Santos et.al. | 2511.14121 | null |
| 2025-11-18 | FailSafe: High-performance Resilient Serving | Ziyi Xu et.al. | 2511.14116 | null |
| 2025-11-18 | CascadedViT: Cascaded Chunk-FeedForward and Cascaded Group Attention Vision Transformer | Srivathsan Sivakumar et.al. | 2511.14111 | null |
| 2025-11-18 | RTS-Mono: A Real-Time Self-Supervised Monocular Depth Estimation Method for Real-World Deployment | Zeyu Cheng et.al. | 2511.14107 | null |
| 2025-11-18 | Lightweight Multi-task CNN for ECG Diagnosis with GRU-Diffusion | Lehuai Xu et.al. | 2511.14104 | null |
| 2025-11-18 | Zero-Training Task-Specific Model Synthesis for Few-Shot Medical Image Classification | Yao Qin et.al. | 2511.14082 | null |
| 2025-11-18 | CORE: Compact Object-centric REpresentations as a New Paradigm for Token Merging in LVLMs | Jingyu Lei et.al. | 2511.14072 | null |
| 2025-11-18 | ELiC: Efficient LiDAR Geometry Compression via Cross-Bit-depth Feature Propagation and Bag-of-Encoders | Junsik Kim et.al. | 2511.14070 | null |
| 2025-11-18 | Semantic Context Matters: Improving Conditioning for Autoregressive Models | Dongyang Jin et.al. | 2511.14063 | null |
| 2025-11-18 | ALEX:A Light Editing-knowledge Extractor | Minghu Wang et.al. | 2511.14018 | null |
| 2025-11-17 | T-SAR: A Full-Stack Co-design for CPU-Only Ternary LLM Inference via In-Place SIMD ALU Reorganization | Hyunwoo Oh et.al. | 2511.13676 | null |
| 2025-11-17 | CacheFlow: Compressive Streaming Memory for Efficient Long-Form Video Understanding | Shrenik Patel et.al. | 2511.13644 | null |
| 2025-11-17 | Compact Multimodal Language Models as Robust OCR Alternatives for Noisy Textual Clinical Reports | Nikita Neveditsin et.al. | 2511.13523 | null |
| 2025-11-17 | Spin-Adapted Fermionic Unitaries: From Lie Algebras to Compact Quantum Circuits | Ilias Magoulas et.al. | 2511.13485 | null |
| 2025-11-17 | A Novel Hierarchical Integration Method for Efficient Model Merging in Medical LLMs | Prakrit Timilsina et.al. | 2511.13373 | null |
| 2025-11-17 | Donors and Recipients: On Asymmetric Transfer Across Tasks and Languages with Parameter-Efficient Fine-Tuning | Kajetan Dymkiewicz et.al. | 2511.13368 | null |
| 2025-11-17 | TabFlash: Efficient Table Understanding with Progressive Question Conditioning and Token Focusing | Jongha Kim et.al. | 2511.13283 | null |
| 2025-11-17 | SF-Recon: Simplification-Free Lightweight Building Reconstruction via 3D Gaussian Splatting | Zihan Li et.al. | 2511.13278 | null |
| 2025-11-17 | SymGS : Leveraging Local Symmetries for 3D Gaussian Splatting Compression | Keshav Gupta et.al. | 2511.13264 | null |
| 2025-11-17 | TokenSqueeze: Performance-Preserving Compression for Reasoning LLMs | Yuxiang Zhang et.al. | 2511.13223 | null |
| 2025-11-17 | Personalized Federated Learning with Bidirectional Communication Compression via One-Bit Random Sketching | Jiacheng Cheng et.al. | 2511.13144 | null |
| 2025-11-17 | Low-Level Dataset Distillation for Medical Image Enhancement | Fengzhi Xu et.al. | 2511.13106 | null |
| 2025-11-17 | Self-Adaptive Graph Mixture of Models | Mohit Meena et.al. | 2511.13062 | null |
| 2025-11-17 | MACKO: Sparse Matrix-Vector Multiplication for Low Sparsity | Vladimír Macko et.al. | 2511.13061 | null |
| 2025-11-17 | Dimension vs. Precision: A Comparative Analysis of Autoencoders and Quantization for Efficient Vector Retrieval on BEIR SciFact | Satyanarayan Pati et.al. | 2511.13057 | null |
| 2025-11-17 | uCLIP: Parameter-Efficient Multilingual Extension of Vision-Language Models with Unpaired Data | Dahyun Chung et.al. | 2511.13036 | null |
| 2025-11-17 | SLMQuant:Benchmarking Small Language Model Quantization for Practical Deployment | Jiacheng Wang et.al. | 2511.13023 | null |
| 2025-11-17 | Fine-Tuned LLMs Know They Don’t Know: A Parameter-Efficient Approach to Recovering Honesty | Zeyu Shi et.al. | 2511.12991 | null |
| 2025-11-17 | UNSEEN: Enhancing Dataset Pruning from a Generalization Perspective | Furui Xu et.al. | 2511.12988 | null |
| 2025-11-17 | MCAQ-YOLO: Morphological Complexity-Aware Quantization for Efficient Object Detection with Curriculum Learning | Yoonjae Seo et.al. | 2511.12976 | null |
| 2025-11-17 | MedRule-KG: A Knowledge-Graph–Steered Scaffold for Reliable Mathematical and Biomedical Reasoning | Crystal Su et.al. | 2511.12963 | null |
| 2025-11-17 | CoS: Towards Optimal Event Scheduling via Chain-of-Scheduling | Yiming Zhao et.al. | 2511.12913 | null |
| 2025-11-17 | ActVAR: Activating Mixtures of Weights and Tokens for Efficient Visual Autoregressive Generation | Kaixin Zhang et.al. | 2511.12893 | null |
| 2025-11-17 | Quantization and Algebraic Index | Si Li et.al. | 2511.12875 | null |
| 2025-11-17 | View-aware Cross-modal Distillation for Multi-view Action Recognition | Trung Thanh Nguyen et.al. | 2511.12870 | null |
| 2025-11-17 | NeuroLex: A Lightweight Domain Language Model for EEG Report Understanding and Generation | Kang Yin et.al. | 2511.12851 | null |
| 2025-11-16 | Catastrophic Forgetting in Kolmogorov-Arnold Networks | Mohammad Marufur Rahman et.al. | 2511.12828 | null |
| 2025-11-16 | LoRA-Enhanced Vision Transformer for Single Image based Morphing Attack Detection via Knowledge Distillation from EfficientNet | Ria Shekhawat et.al. | 2511.12602 | null |
| 2025-11-14 | Data-efficient U-Net for Segmentation of Carbide Microstructures in SEM Images of Steel Alloys | Alinda Ezgi Gerçek et.al. | 2511.11485 | null |
| 2025-11-14 | Rethinking Efficient Mixture-of-Experts for Remote Sensing Modality-Missing Classification | Qinghao Gao et.al. | 2511.11460 | null |
| 2025-11-14 | DiffPro: Joint Timestep and Layer-Wise Precision Optimization for Efficient Diffusion Inference | Farhana Amin et.al. | 2511.11446 | null |
| 2025-11-14 | CURENet: Combining Unified Representations for Efficient Chronic Disease Prediction | Cong-Tinh Dao et.al. | 2511.11423 | null |
| 2025-11-14 | Low-Bit, High-Fidelity: Optimal Transport Quantization for Flow Matching | Dara Varam et.al. | 2511.11418 | null |
| 2025-11-14 | Coupled Proca theories: Green-hyperbolicity, quantization and applications to polarization measurement | Christopher J. Fewster et.al. | 2511.11348 | null |
| 2025-11-14 | DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding | Tanveer Hannan et.al. | 2511.11313 | null |
| 2025-11-14 | iMAD: Intelligent Multi-Agent Debate for Efficient and Accurate LLM Inference | Wei Fan et.al. | 2511.11306 | null |
| 2025-11-14 | EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment | Ruoxi Cheng et.al. | 2511.11301 | null |
| 2025-11-14 | Parameter-Efficient MoE LoRA for Few-Shot Multi-Style Editing | Cong Cao et.al. | 2511.11236 | null |
| 2025-11-14 | A Comparison of Lightweight Deep Learning Models for Particulate-Matter Nowcasting in the Indian Subcontinent & Surrounding Regions | Ansh Kushwaha et.al. | 2511.11185 | null |
| 2025-11-14 | Viper-F1: Fast and Fine-Grained Multimodal Understanding with Cross-Modal State-Space Modulation | Quoc-Huy Trinh et.al. | 2511.11177 | null |
| 2025-11-14 | Hindsight Distillation Reasoning with Knowledge Encouragement Preference for Knowledge-based Visual Question Answering | Yu Zhao et.al. | 2511.11132 | null |
| 2025-11-14 | SemanticNN: Compressive and Error-Resilient Semantic Offloading for Extremely Weak Devices | Jiaming Huang et.al. | 2511.11038 | null |
| 2025-11-14 | Rethinking Autoregressive Models for Lossless Image Compression via Hierarchical Parallelism and Progressive Adaptation | Daxin Li et.al. | 2511.10991 | null |
| 2025-11-14 | Heterogeneous Complementary Distillation | Liuchi Xu et.al. | 2511.10942 | null |
| 2025-11-14 | PhaseWin Search Framework Enable Efficient Object-Level Interpretation | Zihan Gu et.al. | 2511.10914 | null |
| 2025-11-13 | Accuracy-Preserving CNN Pruning Method under Limited Data Availability | Daisuke Yasui et.al. | 2511.10861 | null |
| 2025-11-13 | GFT: Graph Feature Tuning for Efficient Point Cloud Analysis | Manish Dhakal et.al. | 2511.10799 | null |
| 2025-11-13 | Structure-Aware Encodings of Argumentation Properties for Clique-width | Yasir Mahmood et.al. | 2511.10767 | null |
| 2025-11-13 | ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference | Yesheng Liang et.al. | 2511.10645 | null |
| 2025-11-13 | Black-Box On-Policy Distillation of Large Language Models | Tianzhu Ye et.al. | 2511.10643 | null |
| 2025-11-13 | Know Your Limits: Entropy Estimation Modeling for Compression and Generalization | Benjamin L. Badger et.al. | 2511.10618 | null |
| 2025-11-13 | Maximizing Efficiency of Dataset Compression for Machine Learning Potentials With Information Theory | Benjamin Yu et.al. | 2511.10561 | null |
| 2025-11-13 | A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space | Huijie Liu et.al. | 2511.10555 | null |
| 2025-11-13 | URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding | Yongxin Shi et.al. | 2511.10552 | null |
| 2025-11-13 | Learning Post-Newtonian Corrections from Numerical Relativity | Jooheon Yoo et.al. | 2511.10522 | null |
| 2025-11-13 | SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation | Wei Li et.al. | 2511.10518 | null |
| 2025-11-13 | Analogical Structure, Minimal Contextual Cues and Contrastive Distractors: Input Design for Sample-Efficient Linguistic Rule Induction | Chunyang Jiang et.al. | 2511.10441 | null |
| 2025-11-13 | AgentEvolver: Towards Efficient Self-Evolving Agent System | Yunpeng Zhai et.al. | 2511.10395 | null |
| 2025-11-13 | EDGC: Entropy-driven Dynamic Gradient Compression for Efficient LLM Training | Qingao Yi et.al. | 2511.10333 | null |
| 2025-11-13 | Semantic Communication with Hopfield Memories | Karim Nasreddine et.al. | 2511.10302 | null |
| 2025-11-13 | HeatV2X: Scalable Heterogeneous Collaborative Perception via Efficient Alignment and Interaction | Yueran Zhao et.al. | 2511.10211 | null |
| 2025-11-13 | LiNeXt: Revisiting LiDAR Completion with Efficient Non-Diffusion Architectures | Wenzhe He et.al. | 2511.10209 | null |
| 2025-11-13 | EffiReason-Bench: A Unified Benchmark for Evaluating and Advancing Efficient Reasoning in Large Language Models | Junquan Huang et.al. | 2511.10201 | null |
| 2025-11-13 | Microscopy X-ray Imaging enriched with Small Angle X-ray Scattering for few nanometer resolution reveals shock waves and compression in intense short pulse laser irradiation of solids | Thomas Kluge et.al. | 2511.10127 | null |
| 2025-11-13 | RobIA: Robust Instance-aware Continual Test-time Adaptation for Deep Stereo | Jueun Ko et.al. | 2511.10107 | null |
| 2025-11-13 | Balancing Centralized Learning and Distributed Self-Organization: A Hybrid Model for Embodied Morphogenesis | Takehiro Ishikawa et.al. | 2511.10101 | null |
| 2025-11-13 | GridPrune: From “Where to Look” to “What to Select” in Visual Token Pruning for MLLMs | Yuxiang Duan et.al. | 2511.10081 | null |
| 2025-11-13 | Image Aesthetic Reasoning via HCM-GRPO: Empowering Compact Model for Superior Performance | Zhiyuan Hu et.al. | 2511.10055 | null |
| 2025-11-13 | Efficient Thought Space Exploration through Strategic Intervention | Ziheng Li et.al. | 2511.10038 | null |
| 2025-11-13 | LampQ: Towards Accurate Layer-wise Mixed Precision Quantization for Vision Transformers | Minjun Kim et.al. | 2511.10004 | link |
| 2025-11-13 | Explore and Establish Synergistic Effects Between Weight Pruning and Coreset Selection in Neural Network Training | Weilin Wan et.al. | 2511.09901 | null |
| 2025-11-13 | Regional Attention-Enhanced Swin Transformer for Clinically Relevant Medical Image Captioning | Zubia Naz et.al. | 2511.09893 | null |
| 2025-11-13 | HCC-3D: Hierarchical Compensatory Compression for 98% 3D Token Reduction in Vision-Language Models | Liheng Zhang et.al. | 2511.09883 | null |
| 2025-11-13 | RWKV-PCSSC: Exploring RWKV Model for Point Cloud Semantic Scene Completion | Wenzhe He et.al. | 2511.09878 | null |
| 2025-11-13 | DP-GENG : Differentially Private Dataset Distillation Guided by DP-Generated Data | Shuo Shi et.al. | 2511.09876 | null |
| 2025-11-13 | HierRouter: Coordinated Routing of Specialized Large Language Models via Reinforcement Learning | Nikunj Gupta et.al. | 2511.09873 | null |
| 2025-11-13 | Steering Pretrained Drafters during Speculative Decoding | Frédéric Berdoz et.al. | 2511.09844 | null |
| 2025-11-12 | TARG: Training-Free Adaptive Retrieval Gating for Efficient RAG | Yufeng Wang et.al. | 2511.09803 | null |
| 2025-11-12 | How Small Can You Go? Compact Language Models for On-Device Critical Error Detection in Machine Translation | Muskaan Chopra et.al. | 2511.09748 | null |
| 2025-11-12 | Separating QMA from QCMA with a classical oracle | John Bostanci et.al. | 2511.09551 | null |
| 2025-11-10 | StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation | Tianrui Feng et.al. | 2511.07399 | null |
| 2025-11-10 | LeCoT: revisiting network architecture for two-view correspondence pruning | Luanyuan Dai et.al. | 2511.07078 | null |
| 2025-11-10 | GFix: Perceptually Enhanced Gaussian Splatting Video Compression | Siyue Teng et.al. | 2511.06953 | null |
| 2025-11-10 | A Closer Look at Knowledge Distillation in Spiking Neural Network Training | Xu Liu et.al. | 2511.06902 | null |
| 2025-11-10 | Joint Access Point Selection and Beamforming Design for Bistatic Backscatter Communication | Ahmet Kaplan et.al. | 2511.06866 | null |
| 2025-11-10 | Distillation Dynamics: Towards Understanding Feature-Based Distillation in Vision Transformers | Huiyuan Tian et.al. | 2511.06848 | null |
| 2025-11-10 | MI-to-Mid Distilled Compression (M2M-DC): An Hybrid-Information-Guided-Block Pruning with Progressive Inner Slicing Approach to Model Compression | Lionel Levine et.al. | 2511.06842 | null |
| 2025-11-10 | P3-LLM: An Integrated NPU-PIM Accelerator for LLM Inference Using Hybrid Numerical Formats | Yuzong Chen et.al. | 2511.06838 | null |
| 2025-11-10 | QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations | Zhixiong Zhao et.al. | 2511.06767 | null |
| 2025-11-10 | Sensitivity of Small Language Models to Fine-tuning Data Contamination | Nicy Scaria et.al. | 2511.06763 | null |
| 2025-11-10 | MobileLLM-Pro Technical Report | Patrick Huber et.al. | 2511.06719 | null |
| 2025-11-09 | You Had One Job: Per-Task Quantization Using LLMs’ Hidden Representations | Amit LeVi et.al. | 2511.06516 | null |
| 2025-11-09 | EASE: Practical and Efficient Safety Alignment for Small Language Models | Haonan Shi et.al. | 2511.06512 | null |
| 2025-11-09 | GHOST: Solving the Traveling Salesman Problem on Graphs of Convex Sets | Jingtao Tang et.al. | 2511.06471 | null |
| 2025-11-09 | Efficient LLM Safety Evaluation through Multi-Agent Debate | Dachuan Lin et.al. | 2511.06396 | null |
| 2025-11-09 | Ghost in the Transformer: Tracing LLM Lineage with SVD-Fingerprint | Suqing Wang et.al. | 2511.06390 | null |
| 2025-11-09 | Precision-Scalable Microscaling Datapaths with Optimized Reduction Tree for Efficient NPU Integration | Stef Cuyckens et.al. | 2511.06313 | null |
| 2025-11-09 | CAMP-HiVe: Cyclic Pair Merging based Efficient DNN Pruning with Hessian-Vector Approximation for Resource-Constrained Systems | Mohammad Helal Uddin et.al. | 2511.06265 | null |
| 2025-11-09 | VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving | Ruifei Zhang et.al. | 2511.06256 | null |
| 2025-11-09 | Explicit Knowledge-Guided In-Context Learning for Early Detection of Alzheimer’s Disease | Puzhen Su et.al. | 2511.06215 | null |
| 2025-11-09 | LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs | Zifan He et.al. | 2511.06174 | null |
| 2025-11-08 | Neodragon: Mobile Video Generation using Diffusion Transformer | Animesh Karnewar et.al. | 2511.06055 | null |
| 2025-11-08 | Lethe: Layer- and Time-Adaptive KV Cache Pruning for Reasoning-Intensive LLM Serving | Hui Zeng et.al. | 2511.06029 | null |
| 2025-11-08 | MoSKA: Mixture of Shared KV Attention for Efficient Long-Sequence LLM Inference | Myunghyun Rhee et.al. | 2511.06010 | null |
| 2025-11-08 | GABFusion: Rethinking Feature Fusion for Low-Bit Quantization of Multi-Task Networks | Zhaoyang Wang et.al. | 2511.05898 | null |
| 2025-11-08 | HarmoQ: Harmonized Post-Training Quantization for High-Fidelity Image | Hongjun Wang et.al. | 2511.05868 | null |
| 2025-11-08 | EGG-SR: Embedding Symbolic Equivalence into Symbolic Regression via Equality Graph | Nan Jiang et.al. | 2511.05849 | null |
| 2025-11-08 | Training-Free Adaptive Quantization for Variable Rate Image Coding for Machines | Yui Tatsumi et.al. | 2511.05836 | null |
| 2025-11-08 | MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling | Yu Zhang et.al. | 2511.05811 | null |
| 2025-11-07 | An Efficient Gradient-Aware Error-Bounded Lossy Compressor for Federated Learning | Zhijing Ye et.al. | 2511.05770 | null |
| 2025-11-11 | Compressing Multi-Task Model for Autonomous Driving via Pruning and Knowledge Distillation | Jiayuan Wang et.al. | 2511.05557 | null |
| 2025-11-07 | A Metamorphic Testing Perspective on Knowledge Distillation for Language Models of Code: Does the Student Deeply Mimic the Teacher? | Md. Abdul Awal et.al. | 2511.05476 | null |
| 2025-11-07 | APP: Accelerated Path Patching with Task-Specific Pruning | Frauke Andersen et.al. | 2511.05442 | null |
| 2025-11-07 | Efficient CNN Inference on Ultra-Low-Power MCUs via Saturation-Aware Convolution | Shiming Li et.al. | 2511.05347 | null |
| 2025-11-07 | Attention and Compression is all you need for Controllably Efficient Language Models | Jatin Prakash et.al. | 2511.05313 | null |
| 2025-11-07 | Optimal Quantization on Spherical Surfaces: Continuous and Discrete Models - A Beginner-Friendly Expository Study | Mrinal Kanti Roychowdhury et.al. | 2511.05099 | null |
| 2025-11-07 | An Efficient Proximity Graph-based Approach to Table Union Search | Yiming Xie et.al. | 2511.05082 | null |
| 2025-11-07 | Representational power of selected neural network quantum states in second quantization | Zhendong Li et.al. | 2511.04932 | null |
| 2025-11-06 | DMA: Online RAG Alignment with Human Feedback | Yu Bai et.al. | 2511.04880 | null |
| 2025-11-06 | Data Efficiency and Transfer Robustness in Biomedical Image Segmentation: A Study of Redundancy and Forgetting with Cellpose | Shuo Zhao et.al. | 2511.04803 | null |
| 2025-11-06 | Hardware-Accelerated GNN-based Hit Filtering for the Belle II Level-1 Trigger | Greta Heine et.al. | 2511.04731 | null |
| 2025-11-06 | Benchmark Designers Should “Train on the Test Set” to Expose Exploitable Non-Visual Shortcuts | Ellis Brown et.al. | 2511.04655 | null |
| 2025-11-06 | TT-Prune: Joint Model Pruning and Resource Allocation for Communication-efficient Time-triggered Federated Learning | Xinlu Zhang et.al. | 2511.04653 | null |
| 2025-11-06 | Enabling Dynamic Sparsity in Quantized LLM Inference | Rongxiang Wang et.al. | 2511.04477 | null |
| 2025-11-06 | Block Rotation is All You Need for MXFP4 Quantization | Yuantian Shao et.al. | 2511.04214 | null |
| 2025-11-06 | DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization | Yuantian Shao et.al. | 2511.04063 | null |
| 2025-11-06 | Tiny-WiFo: A Lightweight Wireless Foundation Model for Channel Prediction via Multi-Component Adaptive Knowledge Distillation | Haotian Zhang et.al. | 2511.04015 | null |
| 2025-11-06 | Memory- and Latency-Constrained Inference of Large Language Models via Adaptive Split Computing | Mingyu Sung et.al. | 2511.04002 | null |
| 2025-11-06 | TwIST: Rigging the Lottery in Transformers with Independent Subnetwork Training | Michael Menezes et.al. | 2511.03983 | null |
| 2025-11-09 | Temporal Zoom Networks: Distance Regression and Continuous Depth for Efficient Action Localization | Ibne Farabi Shihab et.al. | 2511.03943 | null |
| 2025-11-05 | Desert Waste Detection and Classification Using Data-Based and Model-Based Enhanced YOLOv12 DL Model | Abdulmumin Sa’ad et.al. | 2511.03888 | null |
| 2025-11-05 | Unconventional quantization of 2D plasmons in cavities formed by gate slots | Ilia Moiseenko et.al. | 2511.03829 | null |
| 2025-11-05 | Efficient Neural Networks with Discrete Cosine Transform Activations | Marc Martinez-Gost et.al. | 2511.03531 | null |
| 2025-11-05 | Kastor: Fine-tuned Small Language Models for Shape-based Active Relation Extraction | Ringwald Celian et.al. | 2511.03466 | null |
| 2025-11-05 | EQ-Negotiator: Dynamic Emotional Personas Empower Small Language Models for Edge-Deployable Credit Negotiation | Yunbo Long et.al. | 2511.03370 | null |
| 2025-11-05 | Incorporating QM/MM molecular dynamics into the few-mode quantization approach for light-matter interactions in nanophotonic structures | Ruth H. Tichauer et.al. | 2511.03303 | null |
| 2025-11-07 | Provable Separations between Memorization and Generalization in Diffusion Models | Zeqi Ye et.al. | 2511.03202 | null |
| 2025-11-05 | A Quantized VAE-MLP Botnet Detection Model: A Systematic Evaluation of Quantization-Aware Training and Post-Training Quantization Strategies | Hassan Wasswa et.al. | 2511.03201 | null |
| 2025-11-05 | LogicSparse: Enabling Engine-Free Unstructured Sparsity for Quantised Deep-learning Accelerators | Changhong Li et.al. | 2511.03079 | null |
| 2025-11-04 | Targeted Error Correction in Knowledge Distillation: Small Language Models Surpass GPT | Hee-Jin Lee et.al. | 2511.03005 | null |
| 2025-11-04 | Analog-to-Digital Converter Based on Voltage-controlled Superconducting Device | Md Mazharul Islam et.al. | 2511.02968 | null |
| 2025-11-04 | In Good GRACEs: Principled Teacher Selection for Knowledge Distillation | Abhishek Panigrahi et.al. | 2511.02833 | null |
| 2025-11-04 | A Non-Uniform Quantization Framework for Time-Encoding Machines | Kaluguri Yashaswini et.al. | 2511.02728 | null |
| 2025-11-04 | Can Visual Input Be Compressed? A Visual Token Compression Benchmark for Large Multimodal Models | Tianfan Peng et.al. | 2511.02650 | null |
| 2025-11-04 | LiteVoxel: Low-memory Intelligent Thresholding for Efficient Voxel Rasterization | Jee Won Lee et.al. | 2511.02510 | null |
| 2025-11-04 | FP8-Flow-MoE: A Casting-Free FP8 Recipe without Double Quantization Error | Fengjuan Wang et.al. | 2511.02302 | null |
| 2025-11-05 | IG-Pruning: Input-Guided Block Pruning for Large Language Models | Kangyu Qiao et.al. | 2511.02213 | null |
| 2025-11-03 | Testing Quantum Gravity with Gravitational Waves from the ringdown of binary Black Holes coalescences: A New Frontier in Fundamental Physics | Marco Danilo Claudio Torri et.al. | 2511.02056 | null |
| 2025-11-01 | Fibbinary-Based Compression and Quantization for Efficient Neural Radio Receivers | Roberta Fiandaca et.al. | 2511.01921 | null |
| 2025-11-03 | KV Cache Transform Coding for Compact Storage in LLM Inference | Konrad Staniszewski et.al. | 2511.01815 | null |
| 2025-11-03 | Random Initialization of Gated Sparse Adapters | Vi Retault et.al. | 2511.01794 | null |
| 2025-11-03 | Optimizing Movable Antenna Position and Transmissive RIS Phase for Efficient Base Station Design | Marjan Boloori et.al. | 2511.01575 | null |
| 2025-11-03 | Luminance-Aware Statistical Quantization: Unsupervised Hierarchical Learning for Illumination Enhancement | Derong Kong et.al. | 2511.01510 | null |
| 2025-11-03 | Thinking with DistilQwen: A Tale of Four Distilled Reasoning and Reward Model Series | Wenrui Cai et.al. | 2511.01354 | null |
| 2025-11-03 | FirstAidQA: A Synthetic Dataset for First Aid and Emergency Response in Low-Connectivity Settings | Saiyma Sittul Muna et.al. | 2511.01289 | null |
| 2025-11-03 | MoSa: Motion Generation with Scalable Autoregressive Modeling | Mengyuan Liu et.al. | 2511.01200 | null |
| 2025-11-03 | MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image Segmentation | Ziyi Wang et.al. | 2511.01143 | null |
| 2025-11-02 | All-in-one Graph-based Indexing for Hybrid Search on GPUs | Zhonggen Li et.al. | 2511.00855 | null |
| 2025-11-02 | Towards Ultra-Low Latency: Binarized Neural Network Architectures for In-Vehicle Network Intrusion Detection | Huiyao Dong et.al. | 2511.00828 | null |
| 2025-11-02 | Efficient Query Repair for Aggregate Constraints | Shatha Algarni et.al. | 2511.00826 | link |
| 2025-11-02 | REaR: Retrieve, Expand and Refine for Effective Multitable Retrieval | Rishita Agarwal et.al. | 2511.00805 | null |
| 2025-11-01 | Predicting Encoding Energy from Low-Pass Anchors for Green Video Streaming | Zoha Azimi et.al. | 2511.00707 | null |
| 2025-11-01 | Privacy-Aware Time Series Synthesis via Public Knowledge Distillation | Penghang Liu et.al. | 2511.00700 | null |
| 2025-11-04 | Inference-Time Chain-of-Thought Pruning with Latent Informativeness Signals | Sophie Li et.al. | 2511.00699 | null |
| 2025-11-01 | Outlier-Aware Post-Training Quantization for Image Super-Resolution | Hailing Wang et.al. | 2511.00682 | null |
| 2025-11-01 | Reviving Stale Updates: Data-Free Knowledge Distillation for Asynchronous Federated Learning | Baris Askin et.al. | 2511.00655 | null |
| 2025-11-01 | Leveraging Multi-Agent System (MAS) and Fine-Tuned Small Language Models (SLMs) for Automated Telecom Network Troubleshooting | Chenhua Shi et.al. | 2511.00651 | null |
| 2025-11-01 | Diluting Restricted Boltzmann Machines | C. Díaz-Faloh et.al. | 2511.00648 | null |
| 2025-11-05 | Towards 1000-fold Electron Microscopy Image Compression for Connectomics via VQ-VAE with Transformer Prior | Fuming Yang et.al. | 2511.00231 | null |
| 2025-10-31 | Vision Transformer for Robust Occluded Person Reidentification in Complex Surveillance Scenes | Bo Li et.al. | 2510.27677 | null |
| 2025-10-31 | SpecAttn: Speculating Sparse Attention | Harsh Shah et.al. | 2510.27641 | null |
| 2025-10-31 | Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications | Zixuan Hu et.al. | 2510.27186 | null |
| 2025-10-30 | Elastic Architecture Search for Efficient Language Models | Shang Wang et.al. | 2510.27037 | null |
| 2025-10-30 | LightPro: A Linear Photonic Processor with Full Programmability | Amin Shafiee et.al. | 2510.27013 | null |
| 2025-10-30 | STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization | Marco Federici et.al. | 2510.26771 | null |
| 2025-10-30 | LoRAQuant: Mixed-Precision Quantization of LoRA to Ultra-Low Bits | Amir Reza Mirzaei et.al. | 2510.26690 | null |
| 2025-10-30 | Knowledge Distillation of Noisy Force Labels for Improved Coarse-Grained Force Fields | Feranmi V. Olowookere et.al. | 2510.26650 | null |
| 2025-10-30 | ReSpec: Towards Optimizing Speculative Decoding in Reinforcement Learning Systems | Qiaoling Chen et.al. | 2510.26475 | null |
| 2025-10-30 | 1+1>2: A Synergistic Sparse and Low-Rank Compression Method for Large Language Models | Zeliang Zong et.al. | 2510.26446 | null |
| 2025-10-30 | Personalized Treatment Outcome Prediction from Scarce Data via Dual-Channel Knowledge Distillation and Adaptive Fusion | Wenjie Chen et.al. | 2510.26444 | null |
| 2025-10-30 | Discovering State Equivalences in UCT Search Trees By Action Pruning | Robin Schmöcker et.al. | 2510.26346 | null |
| 2025-10-30 | Do LLMs Signal When They’re Right? Evidence from Neuron Agreement | Kang Chen et.al. | 2510.26277 | null |
| 2025-10-30 | Distilling Multilingual Vision-Language Models: When Smaller Models Stay Multilingual | Sukrit Sriratanawilai et.al. | 2510.26271 | null |
| 2025-10-30 | BitSemCom: A Bit-Level Semantic Communication Framework with Learnable Probabilistic Mapping | Haoshuo Zhang et.al. | 2510.26225 | null |
| 2025-10-30 | STAR: A Privacy-Preserving, Energy-Efficient Edge AI Framework for Human Activity Recognition via Wi-Fi CSI in Mobile and Pervasive Computing Environments | Kexing Liu et.al. | 2510.26148 | null |
| 2025-10-30 | Do Students Debias Like Teachers? On the Distillability of Bias Mitigation Methods | Jiali Cheng et.al. | 2510.26038 | null |
| 2025-10-29 | Robust GNN Watermarking via Implicit Perception of Topological Invariants | Jipeng Li et.al. | 2510.25934 | null |
| 2025-10-29 | Humains-Junior: A 3.8B Language Model Achieving GPT-4o-Level Factual Accuracy by Directed Exoskeleton Reasoning | Nissan Yaron et.al. | 2510.25933 | null |
| 2025-10-28 | Group theoretic quantization of punctured plane | Manvendra Somvanshi et.al. | 2510.25794 | null |
| 2025-10-29 | INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats | Mengzhao Chen et.al. | 2510.25602 | null |
| 2025-10-30 | PureKV: Plug-and-Play KV Cache Optimization with Spatial-Temporal Sparse Attention for Vision-Language Large Models | Zhonghua Jiang et.al. | 2510.25600 | null |
| 2025-10-29 | Feedback Alignment Meets Low-Rank Manifolds: A Structured Recipe for Local Learning | Arani Roy et.al. | 2510.25594 | null |
| 2025-10-29 | Lightweight Federated Learning in Mobile Edge Computing with Statistical and Device Heterogeneity Awareness | Jinghong Tan et.al. | 2510.25342 | null |
| 2025-10-29 | Adapting Small Language Models to Low-Resource Domains: A Case Study in Hindi Tourism QA | Sandipan Majhi et.al. | 2510.25273 | null |
| 2025-10-29 | Energy-Efficient Autonomous Driving with Adaptive Perception and Robust Decision | Yuyang Xia et.al. | 2510.25205 | null |
| 2025-10-29 | Machine Learning and CPU (Central Processing Unit) Scheduling Co-Optimization over a Network of Computing Centers | Mohammadreza Doostmohammadian et.al. | 2510.25176 | null |
| 2025-10-28 | Resource-Efficient and Robust Inference of Deep and Bayesian Neural Networks on Embedded and Analog Computing Platforms | Bernhard Klein et.al. | 2510.24951 | null |
| 2025-10-28 | SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens | Yinhan He et.al. | 2510.24940 | null |
| 2025-10-28 | Send Less, Save More: Energy-Efficiency Benchmark of Embedded CNN Inference vs. Data Transmission in IoT | Benjamin Karic et.al. | 2510.24829 | null |
| 2025-10-27 | A Survey on Efficient Vision-Language-Action Models | Zhaoshu Yu et.al. | 2510.24795 | null |
| 2025-10-27 | ESCA: Enabling Seamless Codec Avatar Execution through Algorithm and Hardware Co-Optimization for Virtual Reality | Mingzhi Zhu et.al. | 2510.24787 | null |
| 2025-10-28 | All in one timestep: Enhancing Sparsity and Energy efficiency in Multi-level Spiking Neural Networks | Andrea Castagnetti et.al. | 2510.24637 | null |
| 2025-10-28 | Fast and accurate neural reflectance transformation imaging through knowledge distillation | Tinsae G. Dulecha et.al. | 2510.24486 | null |
| 2025-10-28 | MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation | Xiaoyu Kong et.al. | 2510.24431 | null |
| 2025-11-01 | Comprehensive and Efficient Distillation for Lightweight Sentiment Analysis Models | Guangyu Xie et.al. | 2510.24425 | null |
| 2025-10-29 | Odyssey: An End-to-End System for Pareto-Optimal Serverless Query Processing | Shyam Jesalpura et.al. | 2510.24307 | null |
| 2025-10-28 | SCOPE: Saliency-Coverage Oriented Token Pruning for Efficient Multimodel LLMs | Jinhong Deng et.al. | 2510.24214 | null |
| 2025-11-01 | Spectral-Geometric Deformations of Function Algebras on Manifolds | Amandip Sangha et.al. | 2510.24184 | null |
| 2025-10-28 | UHKD: A Unified Framework for Heterogeneous Knowledge Distillation via Frequency-Domain Representations | Fengming Yu et.al. | 2510.24116 | null |
| 2025-10-28 | FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic | Kanghyun Choi et.al. | 2510.24061 | null |
| 2025-10-28 | SpecKD: Speculative Decoding for Effective Knowledge Distillation of LLMs | Haiduo Huang et.al. | 2510.24021 | null |
| 2025-10-27 | Adaptive Training of INRs via Pruning and Densification | Diana Aldana et.al. | 2510.23943 | null |
| 2025-10-27 | BitSkip: An Empirical Analysis of Quantization and Early Exit Composition | Ramshankar Bhuvaneswaran et.al. | 2510.23766 | null |
| 2025-10-25 | The Structural Scalpel: Automated Contiguous Layer Pruning for Large Language Models | Yao Lu et.al. | 2510.23652 | null |
| 2025-10-25 | Efficient Low Rank Attention for Long-Context Inference in Large Language Models | Tenghui Li et.al. | 2510.23649 | null |
| 2025-10-24 | LLMComp: A Language Modeling Paradigm for Error-Bounded Scientific Data Compression | Guozhong Li et.al. | 2510.23632 | null |
| 2025-10-27 | Tighter CMI-Based Generalization Bounds via Stochastic Projection and Quantization | Milad Sefidgaran et.al. | 2510.23485 | null |
| 2025-10-27 | Enabling Vibration-Based Gesture Recognition on Everyday Furniture via Energy-Efficient FPGA Implementation of 1D Convolutional Networks | Koki Shibata et.al. | 2510.23156 | null |
| 2025-10-27 | DeepSalt: Bridging Laboratory and Satellite Spectra through Domain Adaptation and Knowledge Distillation for Large-Scale Soil Salinity Estimation | Rupasree Dey et.al. | 2510.23124 | null |
| 2025-10-27 | LightPFP: A Lightweight Route to Ab Initio Accuracy at Scale | Wenwen Li et.al. | 2510.23064 | null |
| 2025-10-27 | AirFed: Federated Graph-Enhanced Multi-Agent Reinforcement Learning for Multi-UAV Cooperative Mobile Edge Computing | Zhiyu Wang et.al. | 2510.23053 | null |
| 2025-10-27 | Sentinel: Dynamic Knowledge Distillation for Personalized Federated Intrusion Detection in Heterogeneous IoT Networks | Gurpreet Singh et.al. | 2510.23019 | null |
| 2025-10-28 | Switchable Token-Specific Codebook Quantization For Face Image Compression | Yongbo Wang et.al. | 2510.22943 | null |
| 2025-10-27 | Rethinking Inference Placement for Deep Learning across Edge and Cloud Platforms: A Multi-Objective Optimization Perspective and Future Directions | Zongshun Zhang et.al. | 2510.22909 | null |
| 2025-10-26 | TELL-TALE: Task Efficient LLMs with Task Aware Layer Elimination | Omar Naim et.al. | 2510.22767 | null |
| 2025-10-26 | Iterative Layer Pruning for Efficient Translation Inference | Yasmin Moslem et.al. | 2510.22763 | null |
| 2025-10-26 | TVMC: Time-Varying Mesh Compression via Multi-Stage Anchor Mesh Generation | He Huang et.al. | 2510.22646 | null |
| 2025-10-26 | Bag-of-Word-Groups (BoWG): A Robust and Efficient Loop Closure Detection Method Under Perceptual Aliasing | Xiang Fei et.al. | 2510.22529 | null |
| 2025-10-26 | Frustratingly Easy Task-aware Pruning for Large Language Models | Yuanhe Tian et.al. | 2510.22489 | null |
| 2025-10-26 | Single-Teacher View Augmentation: Boosting Knowledge Distillation via Angular Diversity | Seonghoon Yu et.al. | 2510.22480 | null |
| 2025-10-25 | GigaEmbeddings: Efficient Russian Language Embedding Model | Egor Kolodin et.al. | 2510.22369 | null |
| 2025-10-25 | Real-Time Semantic Segmentation on FPGA for Autonomous Vehicles Using LMIINet with the CGRA4ML Framework | Amir Mohammad Khadem Hosseini et.al. | 2510.22243 | null |
| 2025-10-25 | Synthetic-to-Real Transfer Learning for Chromatin-Sensitive PWS Microscopy | Jahidul Arafat et.al. | 2510.22239 | null |
| 2025-10-25 | When Fewer Layers Break More Chains: Layer Pruning Harms Test-Time Scaling in LLMs | Keyu Wang et.al. | 2510.22228 | null |
| 2025-10-25 | Scaling Up Efficient Small Language Models Serving and Deployment for Semantic Job Search | Kayhan Behdin et.al. | 2510.22101 | null |
| 2025-10-24 | Pruning and Quantization Impact on Graph Neural Networks | Khatoon Khedri et.al. | 2510.22058 | null |
| 2025-10-24 | Performance Trade-offs of Optimizing Small Language Models for E-Commerce | Josip Tomo Licardo et.al. | 2510.21970 | null |
| 2025-10-23 | TernaryCLIP: Efficiently Compressing Vision-Language Models with Ternary Weights and Distilled Knowledge | Shu-Hao Zhang et.al. | 2510.21879 | null |
| 2025-10-22 | KARIPAP: Quantum-Inspired Tensor Network Compression of Large Language Models Using Infinite Projected Entangled Pair States and Tensor Renormalization Group | Azree Nazri et.al. | 2510.21844 | null |
| 2025-10-22 | Restoring Pruned Large Language Models via Lost Component Compensation | Zijian Feng et.al. | 2510.21834 | null |
| 2025-10-24 | A Dynamic Knowledge Distillation Method Based on the Gompertz Curve | Han Yang et.al. | 2510.21649 | null |
| 2025-10-24 | Few-Shot Knowledge Distillation of LLMs With Counterfactual Explanations | Faisal Hamman et.al. | 2510.21631 | null |
| 2025-10-24 | Does Model Size Matter? A Comparison of Small and Large Language Models for Requirements Classification | Mohammad Amin Zadenoori et.al. | 2510.21443 | null |
| 2025-10-24 | A Convergence Analysis of Adaptive Optimizers under Floating-point Quantization | Xuan Tang et.al. | 2510.21314 | null |
| 2025-10-24 | Correlation Dimension of Auto-Regressive Large Language Models | Xin Du et.al. | 2510.21258 | null |
| 2025-10-24 | DictPFL: Efficient and Private Federated Learning on Encrypted Gradients | Jiaqi Xue et.al. | 2510.21086 | null |
| 2025-10-23 | Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression | Xi Zhang et.al. | 2510.20984 | link |
| 2025-10-23 | Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples | Shiva Sreeram et.al. | 2510.20800 | null |
| 2025-10-23 | Efficient Multi-bit Quantization Network Training via Weight Bias Correction and Bit-wise Coreset Sampling | Jinhee Kim et.al. | 2510.20673 | null |
| 2025-10-23 | xTime: Extreme Event Prediction with Hierarchical Knowledge Distillation and Expert Fusion | Quan Li et.al. | 2510.20651 | null |
| 2025-10-23 | Dynamic Weight Adjustment for Knowledge Distillation: Leveraging Vision Transformer for High-Accuracy Lung Cancer Detection and Real-Time Deployment | Saif Ur Rehman Khan et.al. | 2510.20438 | null |
| 2025-10-23 | AccuQuant: Simulating Multiple Denoising Steps for Quantizing Diffusion Models | Seunghoon Lee et.al. | 2510.20348 | null |
| 2025-10-24 | EditInfinity: Image Editing with Binary-Quantized Generative Models | Jiahuan Wang et.al. | 2510.20217 | null |
| 2025-10-23 | BoundRL: Efficient Structured Text Segmentation through Reinforced Boundary Generation | Haoyuan Li et.al. | 2510.20151 | null |
| 2025-10-22 | Improving Predictive Confidence in Medical Imaging via Online Label Smoothing | Kushan Choudhury et.al. | 2510.20011 | null |
| 2025-10-22 | From Large to Small: Transferring CUDA Optimization Expertise via Reasoning Graph | Junfeng Gong et.al. | 2510.19873 | null |
| 2025-10-21 | Foveated Compression for Immersive Telepresence Visualization | Max Schwarz et.al. | 2510.19848 | null |
| 2025-10-20 | Mechanics as a general-relativistic gauge field theory, and Relational Quantization | J. François et.al. | 2510.19845 | null |
| 2025-10-22 | AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders | Yuezhou Hu et.al. | 2510.19779 | null |
| 2025-10-22 | A flexible framework for structural plasticity in GPU-accelerated sparse spiking neural networks | James C. Knight et.al. | 2510.19764 | null |
| 2025-10-22 | Adaptive Distribution-aware Quantization for Mixed-Precision Neural Networks | Shaohang Jia et.al. | 2510.19760 | null |
| 2025-10-22 | Accelerating Moment Tensor Potentials through Post-Training Pruning | Zijian Meng et.al. | 2510.19737 | null |
| 2025-10-22 | Single-Scale Magnetoelastic Landau Quantization: Thermodynamics, Quantum Oscillations, and Metrology | Denise Assafrão et.al. | 2510.19637 | null |
| 2025-10-22 | HAD: Hierarchical Asymmetric Distillation to Bridge Spatio-Temporal Gaps in Event-Based Object Tracking | Yao Deng et.al. | 2510.19560 | null |
| 2025-10-22 | Energy-Efficient and Dequantization-Free Q-LLMs: A Spiking Neural Network Approach to Salient Value Mitigation | Chenyu Wang et.al. | 2510.19498 | null |
| 2025-10-22 | ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices | Xin Nie et.al. | 2510.19482 | null |
| 2025-10-22 | BLiSS 1.0: Evaluating Bilingual Learner Competence in Second Language Small Language Models | Yuan Gao et.al. | 2510.19419 | null |
| 2025-10-22 | CPSVD: Enhancing Large Language Model Compression via Column-Preserving Singular Value Decomposition | Lin Xv et.al. | 2510.19385 | null |
| 2025-10-27 | Multi-Rate Task-Oriented Communication for Multi-Edge Cooperative Inference | Dongwon Kim et.al. | 2510.19360 | null |
| 2025-10-24 | Knowledge Distillation of Uncertainty using Deep Latent Factor Model | Sehyun Park et.al. | 2510.19290 | null |
| 2025-10-22 | MobiAct: Efficient MAV Action Recognition Using MobileNetV4 with Contrastive Learning and Knowledge Distillation | Zhang Nengbo et.al. | 2510.19273 | null |
| 2025-10-23 | Data Efficient Any Transformer-to-Mamba Distillation via Attention Bridge | Penghao Wang et.al. | 2510.19266 | null |
| 2025-10-22 | Res-DPU: Resource-shared Digital Processing-in-memory Unit for Edge-AI Workloads | Mukul Lokhande et.al. | 2510.19260 | null |
| 2025-10-22 | Background Fades, Foreground Leads: Curriculum-Guided Background Pruning for Efficient Foreground-Centric Collaborative Perception | Yuheng Wu et.al. | 2510.19250 | null |
| 2025-10-22 | TinyUSFM: Towards Compact and Efficient Ultrasound Foundation Models | Chen Ma et.al. | 2510.19239 | null |
| 2025-10-22 | Enhancing Graph Neural Networks: A Mutual Learning Approach | Paul Agbaje et.al. | 2510.19223 | null |
| 2025-10-22 | MoE-GS: Mixture of Experts for Dynamic Gaussian Splatting | In-Hwan Jin et.al. | 2510.19210 | null |
| 2025-10-22 | PruneHal: Reducing Hallucinations in Multi-modal Large Language Models through Adaptive KV Cache Pruning | Fengyuan Sun et.al. | 2510.19183 | null |
| 2025-10-21 | Towards Universal Solvers: Using PGD Attack in Active Learning to Increase Generalizability of Neural Operators as Knowledge Distillation from Numerical PDE Solvers | Yifei Sun et.al. | 2510.18989 | null |
| 2025-10-21 | DuoLens: A Framework for Robust Detection of Machine-Generated Multilingual Text and Code | Shriyansh Agrawal et.al. | 2510.18904 | null |
| 2025-10-20 | CosmoCore Affective Dream-Replay Reinforcement Learning for Code Generation | Santhosh Kumar Ravindran et.al. | 2510.18895 | null |
| 2025-10-21 | Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health Monitoring | Shuxin Lin et.al. | 2510.18817 | null |
| 2025-10-21 | CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training | Soroush Tabesh et.al. | 2510.18784 | null |
| 2025-10-21 | Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression | Kyo Kuroki et.al. | 2510.18650 | null |
| 2025-10-21 | C-SWAP: Explainability-Aware Structured Pruning for Efficient Neural Networks Compression | Baptiste Bauvin et.al. | 2510.18636 | null |
| 2025-10-21 | Channel-Aware Vector Quantization for Robust Semantic Communication on Discrete Channels | Zian Meng et.al. | 2510.18604 | null |
| 2025-10-21 | Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation | Giovanni De Muri et.al. | 2510.18541 | null |
| 2025-10-21 | From Quarter to All: Accelerating Speculative LLM Decoding via Floating-Point Exponent Remapping and Parameter Sharing | Yushu Zhao et.al. | 2510.18525 | null |
| 2025-10-21 | DWaste: Greener AI for Waste Sorting using Mobile and Edge Devices | Suman Kunwar et.al. | 2510.18513 | null |
| 2025-10-21 | How2Compress: Scalable and Efficient Edge Video Analytics via Adaptive Granular Video Compression | Yuheng Wu et.al. | 2510.18409 | null |
| 2025-10-21 | MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models | ChangSu Choi et.al. | 2510.18383 | null |
| 2025-10-21 | S2AP: Score-space Sharpness Minimization for Adversarial Pruning | Giorgio Piras et.al. | 2510.18381 | null |
| 2025-10-21 | Ensembling Pruned Attention Heads For Uncertainty-Aware Efficient Transformers | Firas Gabetni et.al. | 2510.18358 | null |
| 2025-10-21 | StreamingTOM: Streaming Token Compression for Efficient Video Understanding | Xueyi Chen et.al. | 2510.18269 | null |
| 2025-10-21 | Learning under Quantization for High-Dimensional Linear Regression | Dechen Zhang et.al. | 2510.18259 | null |
| 2025-10-21 | DualHash: A Stochastic Primal-Dual Algorithm with Theoretical Guarantee for Deep Hashing | Luxuan Li et.al. | 2510.18218 | null |
| 2025-10-20 | Learning from Generalization Patterns: An Evaluation-Driven Approach to Enhanced Data Augmentation for Fine-Tuning Small Language Models | Huan Song et.al. | 2510.18143 | null |
| 2025-10-20 | CompactPrompt: A Unified Pipeline for Prompt Data Compression in LLM Workflows | Joong Ho Choi et.al. | 2510.18043 | null |
| 2025-10-20 | From Local to Global: Revisiting Structured Pruning Paradigms for Large Language Models | Ziyan Wang et.al. | 2510.18030 | null |
| 2025-10-20 | Quantum Computing Approach to Atomic and Molecular Three-Body Systems | Mohammad Haidar et.al. | 2510.18005 | null |
| 2025-10-20 | SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference | Samir Khaki et.al. | 2510.17777 | null |
| 2025-10-21 | Efficient Tensor Completion Algorithms for Highly Oscillatory Operators | Navjot Singh et.al. | 2510.17734 | null |
| 2025-10-20 | Elastic ViTs from Pretrained Models without Retraining | Walter Simoncini et.al. | 2510.17700 | null |
| 2025-10-20 | Deparametrization and Quantization of Scalar-Tensor Gravity and Its Cosmological Model | Faqiang Yuan et.al. | 2510.17663 | null |
| 2025-10-21 | TrajMamba: An Efficient and Semantic-rich Vehicle Trajectory Pre-training Model | Yichen Liu et.al. | 2510.17545 | null |
| 2025-10-20 | The Graphon Limit Hypothesis: Understanding Neural Network Pruning via Infinite Width Analysis | Hoang Pham et.al. | 2510.17515 | null |
| 2025-10-20 | $\mathcal{V}isi\mathcal{P}runer$ : Decoding Discontinuous Cross-Modal Dynamics for Efficient Multimodal LLMs | Yingqi Fan et.al. | 2510.17205 | null |
| 2025-10-20 | ZSPAPrune: Zero-Shot Prompt-Aware Token Pruning for Vision-Language Models | Pu Zhang et.al. | 2510.17197 | null |
| 2025-10-20 | SOLE: Hardware-Software Co-design of Softmax and LayerNorm for Efficient Transformer Inference | Wenxun Wang et.al. | 2510.17189 | null |
| 2025-10-20 | HyperSearch: Prediction of New Hyperedges through Unconstrained yet Efficient Search | Hyunjin Choo et.al. | 2510.17153 | null |
| 2025-10-19 | Foundation Models in Medical Image Analysis: A Systematic Review and Meta-Analysis | Praveenbalaji Rajendran et.al. | 2510.16973 | null |
| 2025-10-19 | Leave It to the Experts: Detecting Knowledge Distillation via MoE Expert Signatures | Pingzhi Li et.al. | 2510.16968 | null |
| 2025-10-19 | SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization | Wenxi Chen et.al. | 2510.16841 | null |
| 2025-10-19 | Mixed-Precision Quantization for Language Models: Techniques and Prospects | Mariam Rakka et.al. | 2510.16805 | null |
| 2025-10-19 | ELMM: Efficient Lightweight Multimodal Large Language Models for Multimodal Knowledge Graph Completion | Wei Huang et.al. | 2510.16753 | null |
| 2025-10-19 | DistilLock: Safeguarding LLMs from Unauthorized Knowledge Distillation on the Edge | Asmita Mohanty et.al. | 2510.16716 | null |
| 2025-10-19 | CLIP: Client-Side Invariant Pruning for Mitigating Stragglers in Secure Federated Learning | Anthony DiMaggio et.al. | 2510.16694 | null |
| 2025-10-19 | Pursuing Minimal Sufficiency in Spatial Reasoning | Yejie Guo et.al. | 2510.16688 | null |
| 2025-10-18 | HYDRA: HYbrid knowledge Distillation and spectral Reconstruction Algorithm for high channel hyperspectral camera applications | Christopher Thirgood et.al. | 2510.16664 | null |
| 2025-10-18 | Self-Supervised Learning to Fly using Efficient Semantic Segmentation and Metric Depth Estimation for Low-Cost Autonomous UAVs | Sebastian Mocanu et.al. | 2510.16624 | null |
| 2025-10-18 | A Deep Learning Framework for Real-Time Image Processing in Medical Diagnostics: Enhancing Accuracy and Speed in Clinical Applications | Melika Filvantorkaman et.al. | 2510.16611 | null |
| 2025-10-18 | SPLite Hand: Sparsity-Aware Lightweight 3D Hand Pose Estimation | Yeh Keng Hao et.al. | 2510.16396 | null |
| 2025-10-18 | QSVD: Efficient Low-rank Approximation for Unified Query-Key-Value Weight Compression in Low-Precision Vision-Language Models | Yutong Wang et.al. | 2510.16292 | null |
| 2025-10-17 | One-Bit Quantization for Random Features Models | Danil Akhtiamov et.al. | 2510.16250 | null |
| 2025-10-18 | Differentiable, Bit-shifting, and Scalable Quantization without training neural network from scratch | Zia Badar et.al. | 2510.16088 | null |
| 2025-10-17 | Optimization of the quantization of dense neural networks from an exact QUBO formulation | Sergio Muñiz Subiñas et.al. | 2510.16075 | null |
| 2025-10-16 | AMS-QUANT: Adaptive Mantissa Sharing for Floating-point Quantization | Mengtao Lv et.al. | 2510.16045 | null |
| 2025-10-16 | Vector Quantization in the Brain: Grid-like Codes in World Models | Xiangyuan Peng et.al. | 2510.16039 | null |
| 2025-10-17 | SANR: Scene-Aware Neural Representation for Light Field Image Compression with Rate-Distortion Optimization | Gai Zhang et.al. | 2510.15775 | null |
| 2025-10-17 | Evaluation of Novel Fast Machine Learning Algorithms for Knowledge-Distillation-Based Anomaly Detection at CMS | Lino Gerlach et.al. | 2510.15672 | null |
| 2025-10-17 | Time evolution of the Husimi and Glauber-Sudarshan functions in terms of complementary Hamiltonian symbols | Mritunjay Tyagi et.al. | 2510.15628 | null |
| 2025-10-17 | GRATING: Low-Latency and Memory-Efficient Semantic Selection on Device | Jiahao Zhou et.al. | 2510.15620 | null |
| 2025-10-17 | Quantized FCA: Efficient Zero-Shot Texture Anomaly Detection | Andrei-Timotei Ardelean et.al. | 2510.15602 | null |
| 2025-10-17 | SpikeFit: Towards Optimal Deployment of Spiking Networks on Neuromorphic Hardware | Ivan Kartashov et.al. | 2510.15542 | null |
| 2025-10-17 | Revisiting Knowledge Distillation: The Hidden Role of Dataset Size | Giulia Lanzillotta et.al. | 2510.15516 | null |
| 2025-10-17 | Quantization-Based Score Calibration for Few-Shot Keyword Spotting with Dynamic Time Warping in Noisy Environments | Kevin Wilkinghoff et.al. | 2510.15432 | null |
| 2025-10-17 | ParaFormer: Shallow Parallel Transformers with Progressive Approximation | Wei Wang et.al. | 2510.15425 | null |
| 2025-10-17 | Fine-Tuning MedGemma for Clinical Captioning to Enhance Multimodal RAG over Malaysia CPGs | Lee Qi Zun et.al. | 2510.15418 | null |
| 2025-10-17 | Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation | Fei Wang et.al. | 2510.15304 | null |
| 2025-10-17 | GRank: Towards Target-Aware and Streamlined Industrial Retrieval with a Generate-Rank Framework | Yijia Sun et.al. | 2510.15299 | null |
| 2025-10-17 | Exemplar-Guided Planing: Enhanced LLM Agent for KGQA | Jingao Xu et.al. | 2510.15283 | null |
| 2025-10-16 | Dyadic microlocal partition for anisotropic metrics and uniform Weyl quantization | Vicente Vergara et.al. | 2510.15183 | null |
| 2025-10-16 | SaLon3R: Structure-aware Long-term Generalizable 3D Reconstruction from Unposed Images | Jiaxin Guo et.al. | 2510.15072 | link |
| 2025-10-16 | MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning | Mattia Segu et.al. | 2510.15026 | null |
| 2025-10-16 | TASLA: Text-Aligned Speech Tokens with Multiple Layer-Aggregation | Ming-Hao Hsu et.al. | 2510.14934 | null |
| 2025-10-16 | Efficient and Robust Carathéodory-Steinitz Pruning of Positive Discrete Measures | Filip Bělík et.al. | 2510.14916 | null |
| 2025-10-16 | Dynamic-Key-Aware Co-Simulation Framework for Next Generation of SCADA Systems Encrypted by Quantum-Key-Distribution Techniques | Ziqing Zhu et.al. | 2510.14838 | null |
| 2025-10-16 | FraQAT: Quantization Aware Training with Fractional bits | Luca Morreale et.al. | 2510.14823 | null |
| 2025-10-16 | Dataset Pruning in RecSys and ML: Best Practice or Mal-Practice? | Leonie Winter et.al. | 2510.14704 | null |
| 2025-10-16 | WeCKD: Weakly-supervised Chained Distillation Network for Efficient Multimodal Medical Imaging | Md. Abdur Rahman et.al. | 2510.14668 | null |
| 2025-10-16 | Task-Based Quantization for Channel Estimation in RIS Empowered MmWave Systems | Gyoseung Lee et.al. | 2510.14649 | null |
| 2025-10-16 | GemiRec: Interest Quantization and Generation for Multi-Interest Recommendation | Zhibo Wu et.al. | 2510.14626 | null |
| 2025-10-16 | Efficient Video Sampling: Pruning Temporally Redundant Tokens for Faster VLM Inference | Natan Bagrov et.al. | 2510.14624 | null |
| 2025-10-16 | A Deep State-Space Model Compression Method using Upper Bound on Output Error | Hiroki Sakamoto et.al. | 2510.14542 | null |
| 2025-10-16 | Pruning Overparameterized Multi-Task Networks for Degraded Web Image Restoration | Thomas Katraouras et.al. | 2510.14463 | null |
| 2025-10-16 | A Free Lunch in LLM Compression: Revisiting Retraining after Pruning | Moritz Wagner et.al. | 2510.14444 | null |
| 2025-10-16 | Low Power Vision Transformer Accelerator with Hardware-Aware Pruning and Optimized Dataflow | Ching-Lin Hsiung et.al. | 2510.14393 | null |
| 2025-10-16 | DRBD-Mamba for Robust and Efficient Brain Tumor Segmentation with Analytical Insights | Danish Ali et.al. | 2510.14383 | null |
| 2025-10-16 | Computing-In-Memory Aware Model Adaption For Edge Devices | Ming-Han Lin et.al. | 2510.14379 | null |
| 2025-10-16 | Constraint-Driven Small Language Models Based on Agent and OpenAlex Knowledge Graph: Mining Conceptual Pathways and Discovering Innovation Points in Academic Papers | Ziye Xia et.al. | 2510.14303 | null |
| 2025-10-15 | Toward Cybersecurity-Expert Small Language Models | Matan Levi et.al. | 2510.14113 | null |
| 2025-10-15 | REAP the Experts: Why Pruning Prevails for One-Shot MoE compression | Mike Lasby et.al. | 2510.13999 | null |
| 2025-10-15 | Readability $\ne$ Learnability: Rethinking the Role of Simplicity in Training Small Language Models | Ivan Lee et.al. | 2510.13915 | null |
| 2025-10-14 | A Survey on Collaborating Small and Large Language Models for Performance, Cost-effectiveness, Cloud-edge Privacy, and Trustworthiness | Fali Wang et.al. | 2510.13890 | null |
| 2025-10-13 | What Layers When: Learning to Skip Compute in LLMs with Residual Gates | Filipe Laitenberger et.al. | 2510.13876 | null |
| 2025-10-13 | ShishuLM: Lightweight Language Model with Hybrid Decoder-MLP Architecture and Paired Weight Sharing | Shivanshu Kumar et.al. | 2510.13860 | null |
| 2025-10-15 | Invited Paper: BitMedViT: Ternary-Quantized Vision Transformer for Medical AI Assistants on the Edge | Mikolaj Walczak et.al. | 2510.13760 | null |
| 2025-10-15 | Don’t Be Greedy, Just Relax! Pruning LLMs via Frank-Wolfe | Christophe Roux et.al. | 2510.13713 | null |
| 2025-10-15 | XD-RCDepth: Lightweight Radar-Camera Depth Estimation with Explainability-Aligned and Distribution-Aware Distillation | Huawei Sun et.al. | 2510.13565 | null |
| 2025-10-15 | DistilCLIP-EEG: Enhancing Epileptic Seizure Detection Through Multi-modal Learning and Knowledge Distillation | Zexin Wang et.al. | 2510.13497 | null |
| 2025-10-15 | F-BFQ: Flexible Block Floating-Point Quantization Accelerator for LLMs | Jude Haris et.al. | 2510.13401 | null |
| 2025-10-15 | Energy-Efficient FPGA Framework for Non-Quantized Convolutional Neural Networks | Angelos Athanasiadis et.al. | 2510.13362 | null |
| 2025-10-15 | Behavioral Embeddings of Programs: A Quasi-Dynamic Approach for Optimization Prediction | Haolin Pan et.al. | 2510.13158 | null |
| 2025-10-15 | NeuroRVQ: Multi-Scale EEG Tokenization for Generative Large Brainwave Models | Konstantinos Barmpas et.al. | 2510.13068 | null |
| 2025-10-14 | Data to Certificate: Guaranteed Cost Control with Quantization-Aware System Identification | Shahab Ataei et.al. | 2510.13024 | null |
| 2025-10-14 | Pruning Cannot Hurt Robustness: Certified Trade-offs in Reinforcement Learning | James Pedley et.al. | 2510.12939 | null |
| 2025-10-14 | Emergent spin Hall quantization and high-order van Hove singularities in square-octagonal MA $_2$Z$_4$ | Rahul Verma et.al. | 2510.12935 | null |
| 2025-10-14 | Learning at the Speed of Physics: Equilibrium Propagation on Oscillator Ising Machines | Alex Gower et.al. | 2510.12934 | null |
| 2025-10-14 | Efficient Adaptive Transformer: An Empirical Study and Reproducible Framework | Jan Miller et.al. | 2510.12856 | null |
| 2025-10-14 | CARVQ: Corrective Adaptor with Group Residual Vector Quantization for LLM Embedding Compression | Dayin Gou et.al. | 2510.12721 | null |
| 2025-10-14 | Rethinking Knowledge Distillation: A Data Dependent Regulariser With a Negative Asymmetric Payoff | Israel Mason-Williams et.al. | 2510.12615 | null |
| 2025-10-14 | Automated Behavior Planning for Fruit Tree Pruning via Redundant Robot Manipulators: Addressing the Behavior Planning Challenge | Gaoyuan Liu et.al. | 2510.12509 | null |
| 2025-10-14 | SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression | Biao Zhang et.al. | 2510.12474 | null |
| 2025-10-14 | A Hierarchical Quantized Tokenization Framework for Task-Adaptive Graph Representation Learning | Yang Xiang et.al. | 2510.12369 | null |
| 2025-10-14 | Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval | Jianfeng Dong et.al. | 2510.12283 | null |
| 2025-10-14 | CompoDistill: Attention Distillation for Compositional Reasoning in Multimodal LLMs | Jiwan Kim et.al. | 2510.12184 | null |
| 2025-10-14 | Evolution of meta’s llama models and parameter-efficient fine-tuning of large language models: a survey | Abdulhady Abas Abdullah et.al. | 2510.12178 | null |
| 2025-10-14 | Compressibility Measures Complexity: Minimum Description Length Meets Singular Learning Theory | Einar Urdshals et.al. | 2510.12077 | null |
| 2025-10-14 | Multi-stage Prompt Refinement for Mitigating Hallucinations in Large Language Models | Jung-Woo Shim et.al. | 2510.12032 | null |
| 2025-10-13 | MosaicDiff: Training-free Structural Pruning for Diffusion Model Acceleration Reflecting Pretraining Dynamics | Bowei Guo et.al. | 2510.11962 | null |
| 2025-10-13 | Topological Vibration Analysis of Elastic Lattices via Bloch Sphere Mapping | Kazi Tahsin Mahmood et.al. | 2510.11930 | null |
| 2025-10-13 | QeRL: Beyond Efficiency – Quantization-enhanced Reinforcement Learning for LLMs | Wei Huang et.al. | 2510.11696 | null |
| 2025-10-13 | LLM-Oriented Token-Adaptive Knowledge Distillation | Xurong Xie et.al. | 2510.11615 | null |
| 2025-10-14 | AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model | Zhiwei Jin et.al. | 2510.11496 | null |
| 2025-10-13 | Rescaling-Aware Training for Efficient Deployment of Deep Learning Models on Full-Integer Hardware | Lion Mueller et.al. | 2510.11484 | null |
| 2025-10-13 | XQuant: Achieving Ultra-Low Bit KV Cache Quantization with Cross-Layer Compression | Haoqi Yang et.al. | 2510.11236 | null |
| 2025-10-13 | G2L:From Giga-Scale to Cancer-Specific Large-Scale Pathology Foundation Models via Knowledge Distillation | Yesung Cho et.al. | 2510.11176 | null |
| 2025-10-13 | Lightweight Facial Landmark Detection in Thermal Images via Multi-Level Cross-Modal Knowledge Transfer | Qiyi Tong et.al. | 2510.11128 | null |
| 2025-10-13 | DITTO: A Spoofing Attack Framework on Watermarked LLMs via Knowledge Distillation | Hyeseon Ahn et.al. | 2510.10987 | null |
| 2025-10-15 | Bit Allocation Transfer for Perceptual Quality Enhancement of VVC Intra Coding | Runyu Yang et.al. | 2510.10970 | null |
| 2025-10-13 | Not All Bits Are Equal: Scale-Dependent Memory Optimization Strategies for Reasoning Models | Junhyuck Kim et.al. | 2510.10964 | null |
| 2025-10-13 | MC#: Mixture Compressor for Mixture-of-Experts Large Models | Wei Huang et.al. | 2510.10962 | null |
| 2025-10-12 | PruneGCRN: Minimizing and explaining spatio-temporal problems through node pruning | Javier García-Sigüenza et.al. | 2510.10803 | null |
| 2025-10-12 | Bhasha-Rupantarika: Algorithm-Hardware Co-design approach for Multilingual Neural Machine Translation | Mukul Lokhande et.al. | 2510.10676 | null |
| 2025-10-12 | ADiP: Adaptive Precision Systolic Array for Matrix Multiplication Acceleration | Ahmed J. Abdelmaksoud et.al. | 2510.10623 | null |
| 2025-10-12 | Preserving LLM Capabilities through Calibration Data Curation: From Analysis to Optimization | Bowei He et.al. | 2510.10618 | link |
| 2025-10-12 | BitMar: Low-Bit Multimodal Fusion with Episodic Memory for Edge Devices | Euhid Aman et.al. | 2510.10560 | null |
| 2025-10-12 | MRS-YOLO Railroad Transmission Line Foreign Object Detection Based on Improved YOLO11 and Channel Pruning | Siyuan Liu et.al. | 2510.10553 | null |
| 2025-10-12 | Preserving Core Structures of Social Networks via Information Guided Multi-Step Graph Pruning | Yutong Hu et.al. | 2510.10499 | null |
| 2025-10-12 | AnyBCQ: Hardware Efficient Flexible Binary-Coded Quantization for Multi-Precision LLMs | Gunho Park et.al. | 2510.10467 | null |
| 2025-10-14 | Multi-View Graph Learning with Graph-Tuple | Shiyu Chen et.al. | 2510.10341 | null |
| 2025-10-11 | Grounded AI for Code Review: Resource-Efficient Large-Model Serving in Enterprise Pipelines | Sayan Mandal et.al. | 2510.10290 | null |
| 2025-10-11 | Opacity-Gradient Driven Density Control for Compact and Efficient Few-Shot 3D Gaussian Splatting | Abdelrhman Elrawy et.al. | 2510.10257 | null |
| 2025-10-11 | Efficient Mining of Low-Utility Sequential Patterns | Jian Zhu et.al. | 2510.10243 | null |
| 2025-10-11 | ImCoref-CeS: An Improved Lightweight Pipeline for Coreference Resolution with LLM-based Checker-Splitter Refinement | Kangyang Luo et.al. | 2510.10241 | null |
| 2025-10-11 | PermLLM: Learnable Channel Permutation for N:M Sparse Large Language Models | Lancheng Zou et.al. | 2510.10136 | null |
| 2025-10-11 | Preference-driven Knowledge Distillation for Few-shot Node Classification | Xing Wei et.al. | 2510.10116 | null |
| 2025-10-11 | Targeted Sequential Pattern Mining with High Average Utility | Kai Cao et.al. | 2510.10115 | null |
| 2025-10-11 | P-4DGS: Predictive 4D Gaussian Splatting with 90 $\times$ Compression | Henan Wang et.al. | 2510.10030 | null |
| 2025-10-11 | Conformal Sparsification for Bandwidth-Efficient Edge-Cloud Speculative Decoding | Payel Bhattacharjee et.al. | 2510.09942 | null |
| 2025-10-10 | DELTA: Dynamic Layer-Aware Token Attention for Efficient Long-Context Reasoning | Hossein Entezari Zarch et.al. | 2510.09883 | null |
| 2025-10-10 | Tensor-based compression of the sea temperature data | Ilya Kosolapov et.al. | 2510.09778 | null |
| 2025-10-10 | Secret-Key Agreement Through Hidden Markov Modeling of Wavelet Scattering Embeddings | Nora Basha et.al. | 2510.09773 | null |
| 2025-10-10 | ReaLM: Residual Quantization Bridging Knowledge Graph Embeddings and Large Language Models | Wenbin Guo et.al. | 2510.09711 | null |
| 2025-10-09 | Vanishing Contributions: A Unified Approach to Smoothly Transition Neural Models into Compressed Form | Lorenzo Nikiforos et.al. | 2510.09696 | null |
| 2025-10-10 | Automated Evolutionary Optimization for Resource-Efficient Neural Network Training | Ilia Revin et.al. | 2510.09566 | null |
| 2025-10-10 | Hierarchical Indexing with Knowledge Enrichment for Multilingual Video Corpus Retrieval | Yu Wang et.al. | 2510.09553 | null |
| 2025-10-10 | Quantization of charged fields in the presence of intense electromagnetic fields | Álvaro Álvarez-Domínguez et.al. | 2510.09447 | null |
| 2025-10-10 | ReTraceQA: Evaluating Reasoning Traces of Small Language Models in Commonsense Question Answering | Francesco Maria Molfese et.al. | 2510.09351 | null |
| 2025-10-10 | Serial Polar Automorphism Ensemble Decoders for Physical Unclonable Functions | Marvin Rübenacke et.al. | 2510.09220 | null |
| 2025-10-10 | DICE: Structured Reasoning in LLMs through SLM-Guided Chain-of-Thought Correction | Yiqi Li et.al. | 2510.09211 | null |
| 2025-10-10 | Co-designing a Programmable RISC-V Accelerator for MPC-based Energy and Thermal Management of Many-Core HPC Processors | Alessandro Ottaviano et.al. | 2510.09163 | null |
| 2025-10-10 | Cross-Representation Benchmarking in Time-Series Electronic Health Records for Clinical Outcome Prediction | Tianyi Chen et.al. | 2510.09159 | null |
| 2025-10-10 | Dense2MoE: Restructuring Diffusion Transformer to MoE for Efficient Text-to-Image Generation | Youwei Zheng et.al. | 2510.09094 | null |
| 2025-10-10 | HERO: Hardware-Efficient RL-based Optimization Framework for NeRF Quantization | Yipu Zhang et.al. | 2510.09010 | null |
| 2025-10-10 | SQS: Bayesian DNN Compression through Sparse Quantized Sub-distributions | Ziyi Wang et.al. | 2510.08999 | null |
| 2025-10-10 | FedL2T: Personalized Federated Learning with Two-Teacher Distillation for Seizure Prediction | Jionghao Lou et.al. | 2510.08984 | null |
| 2025-10-10 | Defense against Unauthorized Distillation in Image Restoration via Feature Space Perturbation | Han Hu et.al. | 2510.08925 | null |
| 2025-10-09 | FOLK: Fast Open-Vocabulary 3D Instance Segmentation via Label-guided Knowledge Distillation | Hongrui Wu et.al. | 2510.08849 | null |
| 2025-10-13 | TinyGraphEstimator: Adapting Lightweight Language Models for Graph Structure Inference | Michal Podstawski et.al. | 2510.08808 | null |
| 2025-10-09 | Learning What to Remember: Adaptive Probabilistic Memory Retention for Memory-Efficient Language Models | S M Rafiuddin et.al. | 2510.08798 | null |
| 2025-10-08 | From What to Why: Thought-Space Recommendation with Small Language Models | Prosenjit Biswas et.al. | 2510.08626 | null |
| 2025-10-09 | DeepPrune: Parallel Scaling without Inter-trace Redundancy | Shangqing Tu et.al. | 2510.08483 | null |
| 2025-10-09 | Don’t Run with Scissors: Pruning Breaks VLA Models but They Can Be Recovered | Jason Jabbour et.al. | 2510.08464 | null |
| 2025-10-09 | Hierarchical Spatial Algorithms for High-Resolution Image Quantization and Feature Extraction | Noor Islam S. Mohammad et.al. | 2510.08449 | null |
| 2025-10-09 | Continuous Variable Hamiltonian Learning at Heisenberg Limit via Displacement-Random Unitary Transformation | Xi Huang et.al. | 2510.08419 | null |
| 2025-10-10 | Fewer Weights, More Problems: A Practical Attack on LLM Pruning | Kazuki Egashira et.al. | 2510.07985 | null |
| 2025-10-09 | LightReasoner: Can Small Language Models Teach Large Language Models Reasoning? | Jingyuan Wang et.al. | 2510.07962 | null |
| 2025-10-09 | SimCast: Enhancing Precipitation Nowcasting with Short-to-Long Term Knowledge Distillation | Yifang Yin et.al. | 2510.07953 | null |
| 2025-10-09 | Synergy Between the Strong and the Weak: Spiking Neural Networks are Inherently Self-Distillers | Yongqi Ding et.al. | 2510.07924 | null |
| 2025-10-09 | STEPER: Step-wise Knowledge Distillation for Enhancing Reasoning Ability in Multi-Step Retrieval-Augmented Language Models | Kyumin Lee et.al. | 2510.07923 | null |
| 2025-10-09 | Balanced ternary formalism of second quantization | Yao Yao et.al. | 2510.07863 | null |
| 2025-10-09 | AdaSwitch: Adaptive Switching Generation for Knowledge Distillation | Jingyu Peng et.al. | 2510.07842 | null |
| 2025-10-09 | RCPU: Rotation-Constrained Error Compensation for Structured Pruning of a Large Language Model | Shuichiro Haruta et.al. | 2510.07782 | null |
| 2025-10-09 | From Noisy to Native: LLM-driven Graph Restoration for Test-Time Graph Domain Adaptation | Xiangwei Lv et.al. | 2510.07762 | null |
| 2025-10-09 | OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference | Yuzhe Gu et.al. | 2510.07651 | null |
| 2025-10-08 | Don’t Adapt Small Language Models for Tools; Adapt Tool Schemas to the Models | Jonggeun Lee et.al. | 2510.07248 | null |
| 2025-10-08 | Where to Begin: Efficient Pretraining via Subnetwork Selection and Distillation | Arjun Krishnakumar et.al. | 2510.07227 | null |
| 2025-10-08 | A Theoretically-Grounded Codebook for Digital Semantic Communications | Lingyi Wang et.al. | 2510.07108 | null |
| 2025-10-08 | Sharpness-Aware Data Generation for Zero-shot Quantization | Dung Hoang-Anh et.al. | 2510.07018 | null |
| 2025-10-08 | Efficient numeracy in language models through single-token number embeddings | Linus Kreitner et.al. | 2510.06824 | null |
| 2025-10-08 | OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot | Junhan Zhu et.al. | 2510.06751 | null |
| 2025-10-08 | Optimizing Fronthaul Quantization for Flexible User Load in Cell-Free Massive MIMO | Fabian Göttsch et.al. | 2510.06734 | null |
| 2025-10-08 | Distilling Lightweight Language Models for C/C++ Vulnerabilities | Zhiyuan Wei et.al. | 2510.06645 | null |
| 2025-10-08 | Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer | Ziyuan Huang et.al. | 2510.06590 | null |
| 2025-10-07 | GUIDE: Guided Initialization and Distillation of Embeddings | Khoa Trinh et.al. | 2510.06502 | null |
| 2025-10-05 | Dual-stage and Lightweight Patient Chart Summarization for Emergency Physicians | Jiajun Wu et.al. | 2510.06263 | null |
| 2025-10-07 | Training Dynamics Impact Post-Training Quantization Robustness | Albert Catalan-Tatjer et.al. | 2510.06213 | null |
| 2025-10-07 | Latent Speech-Text Transformer | Yen-Ju Lu et.al. | 2510.06195 | null |
| 2025-10-07 | VecInfer: Efficient LLM Inference with Low-Bit KV Cache via Outlier-Suppressed Vector Quantization | Dingyu Yao et.al. | 2510.06175 | null |
| 2025-10-07 | Downsized and Compromised?: Assessing the Faithfulness of Model Compression | Moumita Kamal et.al. | 2510.06125 | null |
| 2025-10-07 | Influence Functions for Efficient Data Selection in Reasoning | Prateek Humane et.al. | 2510.06108 | null |
| 2025-10-07 | The Valley of Code Reasoning: Scaling Knowledge Distillation of Large Language Models | Muyu He et.al. | 2510.06101 | null |
| 2025-10-07 | Adaptive Pruning for Increased Robustness and Reduced Computational Overhead in Gaussian Process Accelerated Saddle Point Searches | Rohit Goswami et.al. | 2510.06030 | null |
| 2025-10-07 | Distributed Platoon Control Under Quantization: Stability Analysis and Privacy Preservation | Kaixiang Zhang et.al. | 2510.05959 | null |
| 2025-10-07 | $\bf{D^3}$ QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection | Yanran Zhang et.al. | 2510.05891 | null |
| 2025-10-07 | Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer | Maxence Lasbordes et.al. | 2510.05846 | null |
| 2025-10-08 | OneVision: An End-to-End Generative Framework for Multi-view E-commerce Vision Search | Zexin Zheng et.al. | 2510.05759 | null |
| 2025-10-07 | Syn-Diag: An LLM-based Synergistic Framework for Generalizable Few-shot Fault Diagnosis on the Edge | Zijun Jia et.al. | 2510.05733 | null |
| 2025-10-07 | DecEx-RAG: Boosting Agentic Retrieval-Augmented Generation with Decision and Execution Optimization via Process Supervision | Yongqi Leng et.al. | 2510.05691 | null |
| 2025-10-07 | InstaGeo: Compute-Efficient Geospatial Machine Learning from Data to Deployment | Ibrahim Salihu Yusuf et.al. | 2510.05617 | null |
| 2025-10-07 | Deciphering Invariant Feature Decoupling in Source-free Time Series Forecasting with Proxy Denoising | Kangjia Yan et.al. | 2510.05589 | null |
| 2025-10-07 | Activation-Informed Pareto-Guided Low-Rank Compression for Efficient LLM/VLM | Ryan Solgi et.al. | 2510.05544 | null |
| 2025-10-07 | H1B-KV: Hybrid One-Bit Caches for Memory-Efficient Large Language Model Inference | Harshil Vejendla et.al. | 2510.05529 | null |
| 2025-10-07 | ARMOR: High-Performance Semi-Structured Pruning via Adaptive Matrix Factorization | Lawrence Liu et.al. | 2510.05528 | null |
| 2025-10-07 | LANTERN: Scalable Distillation of Large Language Models for Job-Person Fit and Explanation | Zhoutong Fu et.al. | 2510.05490 | null |
| 2025-10-07 | AMAQ: Adaptive Mixed-bit Activation Quantization for Collaborative Parameter Efficient Fine-tuning | Yurun Song et.al. | 2510.05468 | null |
| 2025-10-06 | KVLinC : KV Cache Quantization with Hadamard Rotation and Linear Correction | Utkarsh Saxena et.al. | 2510.05373 | null |
| 2025-10-06 | Gamma Mixture Modeling for Cosine Similarity in Small Language Models | Kevin Player et.al. | 2510.05309 | null |
| 2025-10-06 | DP-Adam-AC: Privacy-preserving Fine-Tuning of Localizable Language Models Using Adam Optimization with Adaptive Clipping | Ruoxing Yang et.al. | 2510.05288 | null |
| 2025-10-05 | OptiFLIDS: Optimized Federated Learning for Energy-Efficient Intrusion Detection in IoT | Saida Elouardi et.al. | 2510.05180 | null |
| 2025-10-05 | PatternKV: Flattening KV Representation Expands Quantization Headroom | Ji Zhang et.al. | 2510.05176 | null |
| 2025-10-04 | SATER: A Self-Aware and Token-Efficient Approach to Routing and Cascading | Yuanzhe Shen et.al. | 2510.05164 | null |
| 2025-10-06 | Slm-mux: Orchestrating small language models for reasoning | Chenyu Wang et.al. | 2510.05077 | null |
| 2025-10-06 | Boomerang Distillation Enables Zero-Shot Model Size Interpolation | Sara Kangaslahti et.al. | 2510.05064 | null |
| 2025-10-06 | ERDE: Entropy-Regularized Distillation for Early-exit | Martial Guidez et.al. | 2510.04856 | null |
| 2025-10-06 | Natural Language Edge Labelling: Decoupling Intent from Execution in Structured LM Reasoning | Abhinav Madahar et.al. | 2510.04817 | null |
| 2025-10-08 | Are BabyLMs Deaf to Gricean Maxims? A Pragmatic Evaluation of Sample-efficient Language Models | Raha Askari et.al. | 2510.04764 | null |
| 2025-10-06 | Dimensionally-Efficient Transmission and Storage of Unitary Matrices | Juan Vidal Alegría et.al. | 2510.04734 | null |
| 2025-10-06 | TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA | Chanjoo Jung et.al. | 2510.04682 | null |
| 2025-10-06 | FT-MDT: Extracting Decision Trees from Medical Texts via a Novel Low-rank Adaptation Method | Yuheng Li et.al. | 2510.04655 | null |
| 2025-10-06 | Compressed Concatenation of Small Embedding Models | Mohamed Ayoub Ben Ayad et.al. | 2510.04626 | null |
| 2025-10-06 | SpikingMamba: Towards Energy-Efficient Large Language Models via Knowledge Distillation from Mamba | Yulong Huang et.al. | 2510.04595 | null |
| 2025-10-06 | Post-training quantization of vision encoders needs prefixing registers | Seunghyeon Kim et.al. | 2510.04547 | null |
| 2025-10-05 | Diffusion-Assisted Distillation for Self-Supervised Graph Representation Learning with MLPs | Seong Jin Ahn et.al. | 2510.04241 | null |
| 2025-10-05 | Enhancing Speaker Verification with w2v-BERT 2.0 and Knowledge Distillation guided Structured Pruning | Ze Li et.al. | 2510.04213 | null |
| 2025-10-05 | Learning from All: Concept Alignment for Autonomous Distillation from Multiple Drifting MLLMs | Xiaoyu Yang et.al. | 2510.04142 | null |
| 2025-10-05 | Efficient Training of Spiking Neural Networks by Spike-aware Data Pruning | Chenxiang Ma et.al. | 2510.04098 | null |
| 2025-10-05 | QuantDemoire: Quantization with Outlier Aware for Image Demoiréing | Zheng Chen et.al. | 2510.04066 | null |
| 2025-10-05 | Quantization Range Estimation for Convolutional Neural Networks | Bingtao Yang et.al. | 2510.04044 | null |
| 2025-10-05 | Small Language Models for Emergency Departments Decision Support: A Benchmark Study | Zirui Wang et.al. | 2510.04032 | null |
| 2025-10-05 | Dual Pruning and Sorting-Free Overestimation for Average-Utility Sequential Pattern Mining | Kai Cao et.al. | 2510.04014 | null |
| 2025-10-04 | PsycholexTherapy: Simulating Reasoning in Psychotherapy with Small Language Models in Persian | Mohammad Amin Abbasi et.al. | 2510.03913 | null |
| 2025-10-04 | NoTVLA: Narrowing of Dense Action Trajectories for Generalizable Robot Manipulation | Zheng Huang et.al. | 2510.03895 | null |
| 2025-10-04 | SDAKD: Student Discriminator Assisted Knowledge Distillation for Super-Resolution Generative Adversarial Networks | Nikolaos Kaparinos et.al. | 2510.03870 | null |
| 2025-10-04 | Optimized Minimal 4D Gaussian Splatting | Minseo Lee et.al. | 2510.03857 | null |
| 2025-10-04 | Small Language Models for Agentic Systems: A Survey of Architectures, Capabilities, and Deployment Trade offs | Raghav Sharma et.al. | 2510.03847 | null |
| 2025-10-04 | MECKD: Deep Learning-Based Fall Detection in Multilayer Mobile Edge Computing With Knowledge Distillation | Wei-Lung Mao et.al. | 2510.03601 | null |
| 2025-10-04 | Decoupling Task-Solving and Output Formatting in LLM Generation | Haikang Deng et.al. | 2510.03595 | null |
| 2025-10-03 | RAPID: An Efficient Reinforcement Learning Algorithm for Small Language Models | Lianghuan Huang et.al. | 2510.03515 | null |
| 2025-10-03 | Conditional Pseudo-Supervised Contrast for Data-Free Knowledge Distillation | Renrong Shao et.al. | 2510.03375 | null |
| 2025-10-03 | FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents | Imene Kerboua et.al. | 2510.03204 | null |
| 2025-10-03 | Mixture of Many Zero-Compute Experts: A High-Rate Quantization Theory Perspective | Yehuda Dar et.al. | 2510.03151 | null |
| 2025-10-03 | Enhancing XAI Narratives through Multi-Narrative Refinement and Knowledge Distillation | Flavio Giorgi et.al. | 2510.03134 | null |
| 2025-10-03 | Studying $\textrm{QED}_3$ with radial quantization on the lattice – I. Free limit | Peter A. Boyle et.al. | 2510.03085 | null |
| 2025-10-03 | CHORD: Customizing Hybrid-precision On-device Model for Sequential Recommendation with Device-cloud Collaboration | Tianqi Liu et.al. | 2510.03038 | null |
| 2025-10-03 | PocketSR: The Super-Resolution Expert in Your Pocket Mobiles | Haoze Sun et.al. | 2510.03012 | null |
| 2025-10-03 | Don’t Just Chase “Highlighted Tokens” in MLLMs: Revisiting Visual Holistic Context Retention | Xin Zou et.al. | 2510.02912 | null |
| 2025-10-03 | FlexiQ: Adaptive Mixed-Precision Quantization for Latency/Accuracy Trade-Offs in Deep Neural Networks | Jaemin Kim et.al. | 2510.02822 | null |
| 2025-10-03 | Using Landau quantization to probe disorder in semiconductor heterostructures | Asser Elsayed et.al. | 2510.02794 | null |
| 2025-10-03 | GRNND: A GPU-Parallel Relative NN-Descent Algorithm for Efficient Approximate Nearest Neighbor Graph Construction | Xiang Li et.al. | 2510.02774 | null |
| 2025-10-03 | Rate-Adaptive Semantic Communication via Multi-Stage Vector Quantization | Jinsung Park et.al. | 2510.02646 | null |
| 2025-10-03 | HyperAdaLoRA: Accelerating LoRA Rank Allocation During Training via Hypernetworks without Sacrificing Performance | Hao Zhang et.al. | 2510.02630 | null |
| 2025-10-02 | SAGE: Streaming Agreement-Driven Gradient Sketches for Representative Subset Selection | Ashish Jha et.al. | 2510.02470 | null |
| 2025-10-02 | Assessing the Potential for Catastrophic Failure in Dynamic Post-Training Quantization | Logan Frank et.al. | 2510.02457 | null |
| 2025-10-02 | Knowledge Distillation Detection for Open-weights Models | Qin Shi et.al. | 2510.02302 | null |
| 2025-10-02 | BioX-Bridge: Model Bridging for Unsupervised Cross-Modal Knowledge Transfer across Biosignals | Chenqi Li et.al. | 2510.02276 | null |
| 2025-10-02 | More Than One Teacher: Adaptive Multi-Guidance Policy Optimization for Diverse Exploration | Xiaoyang Yuan et.al. | 2510.02227 | null |
| 2025-10-02 | Collaborative Edge Inference via Semantic Grouping under Wireless Channel Constraints | Mateus P. Mota et.al. | 2510.02222 | null |
| 2025-10-02 | Demystifying the Roles of LLM Layers in Retrieval, Knowledge, and Reasoning | Xinyuan Song et.al. | 2510.02091 | null |
| 2025-10-02 | Parallelism Empowered Guessing Random Additive Noise Decoding | Li Wan et.al. | 2510.01813 | null |
| 2025-10-02 | $C^0$ -rigidity of Legendrians and coisotropics via sheaf quantization | Tomohiro Asano et.al. | 2510.01746 | null |
| 2025-10-02 | ENLighten: Lighten the Transformer, Enable Efficient Optical Acceleration | Hanqing Zhu et.al. | 2510.01673 | null |
| 2025-10-02 | Shift-Invariant Attribute Scoring for Kolmogorov-Arnold Networks via Shapley Value | Wangxuan Fan et.al. | 2510.01663 | null |
| 2025-10-02 | Efficient Training of Robust Traditional Chinese LLaMA-1B on a Single Consumer GPU: Continual Pre-training, SFT, and DPO | Yu-Cheng Chih et.al. | 2510.01616 | null |
| 2025-10-02 | Think Right: Learning to Mitigate Under-Over Thinking via Adaptive, Attentive Compression | Joykirat Singh et.al. | 2510.01581 | null |
| 2025-10-03 | Ultra-Efficient Decoding for End-to-End Neural Compression and Reconstruction | Ethan G. Rogers et.al. | 2510.01407 | null |
| 2025-10-01 | ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models | Akshat Ramachandran et.al. | 2510.01290 | null |
| 2025-10-01 | Adaptive Event Stream Slicing for Open-Vocabulary Event-Based Object Detection via Vision-Language Knowledge Distillation | Jinchang Zhang et.al. | 2510.00681 | null |
| 2025-10-01 | Expected Attention: KV Cache Compression by Estimating Attention from Future Queries Distribution | Alessio Devoto et.al. | 2510.00636 | null |
| 2025-10-01 | Panorama: Fast-Track Nearest Neighbors | Vansh Ramani et.al. | 2510.00566 | null |
| 2025-10-01 | GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness | Kung-Hsiang Huang et.al. | 2510.00536 | null |
| 2025-10-01 | Has the Two-Decade-Old Prophecy Come True? Artificial Bad Intelligence Triggered by Merely a Single-Bit Flip in Large Language Models | Yu Yan et.al. | 2510.00490 | null |
| 2025-10-01 | LongCodeZip: Compress Long Context for Code Language Models | Yuling Shi et.al. | 2510.00446 | null |
| 2025-10-01 | Semantic-Driven AI Agent Communications: Challenges and Solutions | Kaiwen Yu et.al. | 2510.00381 | null |
| 2025-09-30 | DiSC-AMC: Token- and Parameter-Efficient Discretized Statistics In-Context Automatic Modulation Classification | Mohammad Rostami et.al. | 2510.00316 | null |
| 2025-09-30 | PrunedLoRA: Robust Gradient-Based structured pruning for Low-rank Adaptation in Fine-tuning | Xin Yu et.al. | 2510.00192 | null |
| 2025-09-30 | Continuum Fractons: Quantization and the Many Body Problem | Ylias Sadki et.al. | 2510.00110 | null |
| 2025-09-30 | Enhancing Certifiable Semantic Robustness via Robust Pruning of Deep Neural Networks | Hanjiang Hu et.al. | 2510.00083 | null |
| 2025-09-30 | Fairness Testing in Retrieval-Augmented Generation: How Small Perturbations Reveal Bias in Small Language Models | Matheus Vinicius da Silva de Oliveira et.al. | 2509.26584 | null |
| 2025-10-01 | Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation | Miao Rang et.al. | 2509.26497 | null |
| 2025-09-30 | DiVeQ: Differentiable Vector Quantization Using the Reparameterization Trick | Mohammad Hassan Vali et.al. | 2509.26469 | null |
| 2025-09-30 | Post-Training Quantization via Residual Truncation and Zero Suppression for Diffusion Models | Donghoon Kim et.al. | 2509.26436 | null |
| 2025-09-30 | Cat: Post-training quantization error reduction via cluster-based affine transformation | Ali Zoljodi et.al. | 2509.26277 | null |
| 2025-09-30 | Interpret, prune and distill Donut : towards lightweight VLMs for VQA on document | Adnan Ben Mansour et.al. | 2509.26235 | null |
| 2025-09-30 | CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models | Weiyu Huang et.al. | 2509.25996 | null |
| 2025-09-30 | Iterative Hypothesis Pruning and Distribution-based Early Labeling for Sequential Hypothesis Testing | George Vershinin et.al. | 2509.25908 | null |
| 2025-09-30 | PerQ: Efficient Evaluation of Multilingual Text Personalization Quality | Dominik Macko et.al. | 2509.25903 | null |
| 2025-09-30 | SAIL: SRAM-Accelerated LLM Inference System with Lookup-Table-based GEMV | Jingyao Zhang et.al. | 2509.25853 | null |
| 2025-09-30 | Distillation of Large Language Models via Concrete Score Matching | Yeongmin Kim et.al. | 2509.25837 | null |
| 2025-10-03 | Learning to Reason as Action Abstractions with Scalable Mid-Training RL | Shenao Zhang et.al. | 2509.25810 | null |
| 2025-09-30 | Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training | Yein Park et.al. | 2509.25758 | null |
| 2025-09-30 | Collaborative Compression for Large-Scale MoE Deployment on Edge | Yixiao Chen et.al. | 2509.25689 | link |
| 2025-09-30 | Growing Winning Subnetworks, Not Pruning Them: A Paradigm for Density Discovery in Sparse Neural Networks | Qihang Yao et.al. | 2509.25665 | null |
| 2025-09-30 | Effective Model Pruning | Yixuan Wang et.al. | 2509.25606 | null |
| 2025-09-29 | On-Premise AI for the Newsroom: Evaluating Small Language Models for Investigative Document Search | Nick Hagar et.al. | 2509.25494 | null |
| 2025-09-29 | Norm-Q: Effective Compression Method for Hidden Markov Models in Neuro-Symbolic Applications | Hanyuan Gao et.al. | 2509.25439 | null |
| 2025-09-29 | Renormalization of Chern-Simons Wilson Loops via Flux Quantization in Cohomotopy | Hisham Sati et.al. | 2509.25336 | null |
| 2025-09-27 | Knowledge distillation through geometry-aware representational alignment | Prajjwal Bhattarai et.al. | 2509.25253 | null |
| 2025-09-29 | BALF: Budgeted Activation-Aware Low-Rank Factorization for Fine-Tuning-Free Model Compression | David González-Martínez et.al. | 2509.25136 | null |
| 2025-09-29 | Towards Trustworthy Lexical Simplification: Exploring Safety and Efficiency with Small LLMs | Akio Hayakawa et.al. | 2509.25086 | null |
| 2025-09-29 | Light-SQ: Structure-aware Shape Abstraction with Superquadrics for Generated Meshes | Yuhan Wang et.al. | 2509.24986 | null |
| 2025-09-29 | Training-Free Token Pruning via Zeroth-Order Gradient Estimation in Vision-Language Models | Youngeun Kim et.al. | 2509.24837 | null |
| 2025-10-03 | ExGS: Extreme 3D Gaussian Compression with Diffusion Priors | Jiaqi Chen et.al. | 2509.24758 | null |
| 2025-09-29 | An asymptotic field approach for the control of dipole emission in integrated structures | Vincenzo Macri’ et.al. | 2509.24717 | null |
| 2025-09-29 | Discrete Variational Autoencoding via Policy Search | Michael Drolet et.al. | 2509.24716 | null |
| 2025-09-29 | Performance-Efficiency Trade-off for Fashion Image Retrieval | Julio Hurtado et.al. | 2509.24477 | null |
| 2025-09-29 | Generalist Multi-Class Anomaly Detection via Distillation to Two Heterogeneous Student Networks | Hangil Park et.al. | 2509.24448 | null |
| 2025-10-01 | Proxy-GS: Efficient 3D Gaussian Splatting via Proxy Mesh | Yuanyuan Gao et.al. | 2509.24421 | null |
| 2025-09-29 | CLQ: Cross-Layer Guided Orthogonal-based Quantization for Diffusion Transformers | Kai Liu et.al. | 2509.24416 | null |
| 2025-09-29 | S $^2$ NN: Sub-bit Spiking Neural Networks | Wenjie Wei et.al. | 2509.24266 | null |
| 2025-09-28 | A Second-Order Perspective on Pruning at Initialization and Knowledge Transfer | Leonardo Iurada et.al. | 2509.24066 | null |
| 2025-09-28 | The Hidden Costs of Translation Accuracy: Distillation, Quantization, and Environmental Impact | Dhaathri Vijay et.al. | 2509.23990 | null |
| 2025-09-28 | AutoPrune: Each Complexity Deserves a Pruning Policy | Hanshi Wang et.al. | 2509.23931 | null |
| 2025-09-30 | Differentiable Sparsity via $D$ -Gating: Simple and Versatile Structured Penalization | Chris Kolb et.al. | 2509.23898 | null |
| 2025-09-28 | DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning | Yibo Yan et.al. | 2509.23883 | null |
| 2025-09-28 | Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning | Shaobo Wang et.al. | 2509.23873 | null |
| 2025-09-28 | Taught Well Learned Ill: Towards Distillation-conditional Backdoor Attack | Yukun Chen et.al. | 2509.23871 | null |
| 2025-09-28 | Tequila: Trapping-free Ternary Quantization for Large Language Models | Hong Huang et.al. | 2509.23809 | null |
| 2025-09-30 | Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-Resolution | Qifan Li et.al. | 2509.23774 | null |
| 2025-09-28 | LUQ: Layerwise Ultra-Low Bit Quantization for Multimodal Large Language Models | Shubhang Bhatnagar et.al. | 2509.23729 | null |
| 2025-09-30 | QuantSparse: Comprehensively Compressing Video Diffusion Transformer with Model Quantization and Attention Sparsification | Weilun Feng et.al. | 2509.23681 | null |
| 2025-09-28 | Why Alignment Must Precede Distillation: A Minimal Working Explanation | Sungmin Cha et.al. | 2509.23667 | null |
| 2025-09-28 | HIVTP: A Training-Free Method to Improve VLMs Efficiency via Hierarchical Visual Token Pruning Using Middle-Layer-Based Importance Score | Jingqi Xu et.al. | 2509.23663 | null |
| 2025-10-01 | Reasoning Scaffolding: Distilling the Flow of Thought from LLMs | Xiangyu Wen et.al. | 2509.23619 | null |
| 2025-09-28 | RobuQ: Pushing DiTs to W1.58A2 via Robust Activation Quantization | Kaicheng Yang et.al. | 2509.23582 | null |
| 2025-09-28 | Towards Efficient CoT Distillation: Self-Guided Rationale Selector for Better Performance with Fewer Rationales | Jianzhi Yan et.al. | 2509.23574 | null |
| 2025-09-27 | Bohr-Sommerfeld quantization conditions for Schrodinger operator: the Method of Microlocal Wronskian and Gram Matrix | Abdelwaheb Ifa et.al. | 2509.23514 | null |
| 2025-09-27 | Beyond Outliers: A Study of Optimizers Under Quantization | Georgios Vlassis et.al. | 2509.23500 | null |
| 2025-09-27 | RestoRect: Degraded Image Restoration via Latent Rectified Flow & Feature Distillation | Shourya Verma et.al. | 2509.23480 | null |
| 2025-09-27 | Data-Efficient Training by Evolved Sampling | Ziheng Cheng et.al. | 2509.23461 | null |
| 2025-09-27 | Enhancing Communication Efficiency in FL with Adaptive Gradient Quantization and Communication Frequency Optimization | Asadullah Tariq et.al. | 2509.23419 | null |
| 2025-09-27 | CasPoinTr: Point Cloud Completion with Cascaded Networks and Knowledge Distillation | Yifan Yang et.al. | 2509.23375 | null |
| 2025-09-27 | MedCritical: Enhancing Medical Reasoning in Small Language Models via Self-Collaborative Correction | Xinchun Su et.al. | 2509.23368 | null |
| 2025-09-27 | Using AI on FPGAs for the CMS Overlap Muon Track Finder for the HL-LHC | Pelayo Leguina et.al. | 2509.23347 | null |
| 2025-09-27 | Scaling LLM Test-Time Compute with Mobile NPU on Smartphones | Zixu Hao et.al. | 2509.23324 | null |
| 2025-09-27 | Deformation quantization of a hessian KV- structure on $\mathbb{R}^2$ | Herguey Mopeng et.al. | 2509.23228 | null |
| 2025-09-27 | Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization | Vage Egiazarian et.al. | 2509.23202 | null |
| 2025-09-27 | Effective Quantization of Muon Optimizer States | Aman Gupta et.al. | 2509.23106 | null |
| 2025-09-27 | Desensitizing for Improving Corruption Robustness in Point Cloud Classification through Adversarial Training | Zhiqiang Tian et.al. | 2509.23010 | null |
| 2025-09-26 | SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights | Lorenz K. Müller et.al. | 2509.22944 | null |
| 2025-09-26 | Compute-Optimal Quantization-Aware Training | Aleksandr Dremov et.al. | 2509.22935 | null |
| 2025-09-26 | Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting | Yasmine Omri et.al. | 2509.22615 | null |
| 2025-09-26 | Linear Causal Representation Learning by Topological Ordering, Pruning, and Disentanglement | Hao Chen et.al. | 2509.22553 | null |
| 2025-09-26 | AxLLM: accelerator architecture for large language models with computation reuse capability | Soroush Ahadi et.al. | 2509.22512 | null |
| 2025-09-26 | IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method | Xinyu Liu et.al. | 2509.22463 | null |
| 2025-09-26 | $γ$ -Quant: Towards Learnable Quantization for Low-bit Pattern Recognition | Mishal Fatima et.al. | 2509.22448 | null |
| 2025-09-26 | Progressive Weight Loading: Accelerating Initial Inference and Gradually Boosting Performance on Resource-Constrained Environments | Hyunwoo Kim et.al. | 2509.22319 | null |
| 2025-09-26 | HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space | Ke Li et.al. | 2509.22299 | link |
| 2025-09-26 | A Multi-Level Framework for Multi-Objective Hypergraph Partitioning: Combining Minimum Spanning Tree and Proximal Gradient | Yingying Li et.al. | 2509.22294 | null |
| 2025-09-26 | InfiMed-Foundation: Pioneering Advanced Multimodal Medical Models with Compute-Efficient Pre-Training and Multi-Stage Fine-Tuning | Guanghao Zhu et.al. | 2509.22261 | null |
| 2025-09-26 | Lightweight error mitigation strategies for post-training N:M activation sparsity in LLMs | Shirin Alanova et.al. | 2509.22166 | null |
| 2025-09-26 | Pushing Toward the Simplex Vertices: A Simple Remedy for Code Collapse in Smoothed Vector Quantization | Takashi Morita et.al. | 2509.22161 | null |
| 2025-09-26 | Joint graph entropy knowledge distillation for point cloud classification and robustness against corruptions | Zhiqiang Tian et.al. | 2509.22150 | null |
| 2025-09-26 | Action-aware Dynamic Pruning for Efficient Vision-Language-Action Manipulation | Xiaohuan Pei et.al. | 2509.22093 | null |
| 2025-09-26 | COSPADI: Compressing LLMs via Calibration-Guided Sparse Dictionary Learning | Dmitriy Shopkhoev et.al. | 2509.22075 | null |
| 2025-09-26 | Enriching Knowledge Distillation with Intra-Class Contrastive Learning | Hua Yuan et.al. | 2509.22053 | null |
| 2025-09-26 | Multicollinearity-Aware Parameter-Free Strategy for Hyperspectral Band Selection: A Dependence Measures-Based Approach | Dibyabha Deb et.al. | 2509.21973 | null |
| 2025-09-26 | Real-time Anomaly Detection for Liquid Argon Time Projection Chambers | Seokju Chung et.al. | 2509.21817 | null |
| 2025-09-26 | SubZeroCore: A Submodular Approach with Zero Training for Coreset Selection | Brian B. Moser et.al. | 2509.21748 | null |
| 2025-09-26 | HyperCore: Coreset Selection under Noise via Hypersphere Models | Brian B. Moser et.al. | 2509.21746 | null |
| 2025-09-26 | Brain PathoGraph Learning | Ciyuan Peng et.al. | 2509.21742 | null |
| 2025-09-26 | Optimizing the non-Clifford-count in unitary synthesis using Reinforcement Learning | David Kremer et.al. | 2509.21709 | null |
| 2025-09-25 | Scalable Foundation Interatomic Potentials via Message-Passing Pruning and Graph Partitioning | Lingyu Kong et.al. | 2509.21694 | null |
| 2025-09-25 | General Pruning Criteria for Fast SBL | Jakob Möderl et.al. | 2509.21572 | null |
| 2025-09-25 | SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models | Arani Roy et.al. | 2509.21498 | null |
| 2025-09-25 | Residual Vector Quantization For Communication-Efficient Multi-Agent Perception | Dereje Shenkut et.al. | 2509.21464 | null |
| 2025-09-24 | Skeleton Sparsification and Densification Scale-Spaces | Julia Gierke et.al. | 2509.21398 | null |
| 2025-09-24 | Large AI Model-Enabled Generative Semantic Communications for Image Transmission | Qiyu Ma et.al. | 2509.21394 | null |
| 2025-09-23 | Do Sparse Subnetworks Exhibit Cognitively Aligned Attention? Effects of Pruning on Saliency Map Fidelity, Sparsity, and Concept Coherence | Sanish Suwal et.al. | 2509.21387 | null |
| 2025-09-25 | SD3.5-Flash: Distribution-Guided Distillation of Generative Flows | Hmrishav Bandyopadhyay et.al. | 2509.21318 | null |
| 2025-09-25 | Interactive Recommendation Agent with Active User Commands | Jiakai Tang et.al. | 2509.21317 | null |
| 2025-09-25 | Efficient Digital Methods to Quantify Sensor Output Uncertainty | Orestis Kaparounakis et.al. | 2509.21311 | null |
| 2025-09-25 | Hybrid RIS-Aided Digital Over-the-Air Computing for Edge AI Inference: Joint Feature Quantization and Active-Passive Beamforming Design | Yang Fu et.al. | 2509.21201 | null |
| 2025-09-26 | GEP: A GCG-Based method for extracting personally identifiable information from chatbots built on small language models | Jieli Zhu et.al. | 2509.21192 | null |
| 2025-09-25 | Can Less Precise Be More Reliable? A Systematic Evaluation of Quantization’s Impact on CLIP Beyond Accuracy | Aymen Bouguerra et.al. | 2509.21173 | null |
| 2025-09-25 | On the geometric quantization of $θ$ -almost twisted Poisson manifold | Nasser Saipele Nansidi et.al. | 2509.21168 | null |
| 2025-09-25 | Fast-SEnSeI: Lightweight Sensor-Independent Cloud Masking for On-board Multispectral Sensors | Jan Kněžík et.al. | 2509.20991 | null |
| 2025-09-25 | Rejuvenating Cross-Entropy Loss in Knowledge Distillation for Recommender Systems | Zhangchi Zhu et.al. | 2509.20989 | null |
| 2025-09-25 | Punching Above Precision: Small Quantized Model Distillation with Learnable Regularizer | Abdur Rehman et.al. | 2509.20854 | null |
| 2025-09-26 | Real-Time Object Detection Meets DINOv3 | Shihua Huang et.al. | 2509.20787 | null |
| 2025-09-25 | RAM-NAS: Resource-aware Multiobjective Neural Architecture Search Method for Robot Vision Tasks | Shouren Mao et.al. | 2509.20688 | null |
| 2025-09-24 | Function Spaces Without Kernels: Learning Compact Hilbert Space Representations | Su Ann Low et.al. | 2509.20605 | null |
| 2025-09-24 | Seedream 4.0: Toward Next-generation Multimodal Image Generation | Team Seedream et.al. | 2509.20427 | null |
| 2025-09-24 | EmbeddingGemma: Powerful and Lightweight Text Representations | Henrique Schechter Vera et.al. | 2509.20354 | null |
| 2025-09-24 | Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment | Deokjae Lee et.al. | 2509.20214 | null |
| 2025-09-24 | Play by the Type Rules: Inferring Constraints for LLM Functions in Declarative Programs | Parker Glenn et.al. | 2509.20208 | null |
| 2025-09-24 | Smaller is Better: Enhancing Transparency in Vehicle AI Systems via Pruning | Sanish Suwal et.al. | 2509.20148 | null |
| 2025-09-23 | Nano Bio-Agents (NBA): Small Language Model Agents for Genomics | George Hong et.al. | 2509.19566 | null |
| 2025-09-23 | Adversarially-Refined VQ-GAN with Dense Motion Tokenization for Spatio-Temporal Heatmaps | Gabriel Maldonado et.al. | 2509.19252 | null |
| 2025-09-23 | PPG-Distill: Efficient Photoplethysmography Signals Analysis via Foundation Model Distillation | Juntong Ni et.al. | 2509.19215 | null |
| 2025-09-23 | Exact WKB Formulation of Quantization and Particle Production in Time-Dependent Backgrounds | Ryo Namba et.al. | 2509.19194 | null |
| 2025-09-23 | Data-Free Knowledge Distillation for LiDAR-Aided Beam Tracking in MmWave Systems | Abolfazl Zakeri et.al. | 2509.19092 | null |
| 2025-09-23 | Enhancing Noise Robustness for Neural Speech Codecs through Resource-Efficient Progressive Quantization Perturbation Simulation | Rui-Chen Zheng et.al. | 2509.19025 | null |
| 2025-09-23 | Otters: An Energy-Efficient SpikingTransformer via Optical Time-to-First-Spike Encoding | Zhanglu Yan et.al. | 2509.18968 | null |
| 2025-09-23 | VGGT-DP: Generalizable Robot Control via Vision Foundation Models | Shijia Ge et.al. | 2509.18778 | null |
| 2025-09-23 | DiSSECT: Structuring Transfer-Ready Medical Image Representations through Discrete Self-Supervision | Azad Singh et.al. | 2509.18765 | null |
| 2025-09-23 | Bi-VLM: Pushing Ultra-Low Precision Post-Training Quantization Boundaries in Vision-Language Models | Xijun Wang et.al. | 2509.18763 | null |
| 2025-09-23 | Enhanced Survival Trees | Ruiwen Zhou et.al. | 2509.18494 | null |
| 2025-09-23 | Codebook-Based Adaptive Feature Compression With Semantic Enhancement for Edge-Cloud Systems | Xinyu Wang et.al. | 2509.18481 | null |
| 2025-09-22 | Individualized non-uniform quantization for vector search | Mariano Tepper et.al. | 2509.18471 | null |
| 2025-09-22 | TinyBEV: Cross Modal Knowledge Distillation for Efficient Multi Task Bird’s Eye View Perception and Planning | Reeshad Khan et.al. | 2509.18372 | null |
| 2025-09-21 | nDNA – the Semantic Helix of Artificial Cognition | Amitava Das et.al. | 2509.18216 | null |
| 2025-09-19 | MMCD: Multi-Modal Collaborative Decision-Making for Connected Autonomy with Knowledge Distillation | Rui Liu et.al. | 2509.18198 | null |
| 2025-09-19 | TinyEcoWeedNet: Edge Efficient Real-Time Aerial Agricultural Weed Detection | Omar H. Khater et.al. | 2509.18193 | null |
| 2025-09-22 | Visual Detector Compression via Location-Aware Discriminant Analysis | Qizhen Lan et.al. | 2509.17968 | null |
| 2025-09-23 | Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark | Siu Hang Ho et.al. | 2509.17894 | null |
| 2025-09-23 | Breaking Token Into Concepts: Exploring Extreme Compression in Token Representation Via Compositional Shared Semantics | Kavin R V et.al. | 2509.17737 | null |
| 2025-09-22 | RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion | Geonho Bang et.al. | 2509.17712 | null |
| 2025-09-22 | Stratification of the half-density quantization of the Jeffrey-Weitsman-Witten invariants | Adrian Chitan et.al. | 2509.17656 | null |
| 2025-09-22 | Evaluating the Energy Efficiency of NPU-Accelerated Machine Learning Inference on Embedded Microcontrollers | Anastasios Fanariotis et.al. | 2509.17533 | null |
| 2025-09-22 | MapCoder-Lite: Squeezing Multi-Agent Coding into a Single Small LLM | Woongkyu Lee et.al. | 2509.17489 | null |
| 2025-09-22 | Learning Dexterous Manipulation with Quantized Hand State | Ying Feng et.al. | 2509.17450 | null |
| 2025-09-23 | QWHA: Quantization-Aware Walsh-Hadamard Adaptation for Parameter-Efficient Fine-Tuning on Large Language Models | Hyesung Jeon et.al. | 2509.17428 | null |
| 2025-09-22 | Physics-Informed Operator Learning for Hemodynamic Modeling | Ryan Chappell et.al. | 2509.17293 | null |
| 2025-09-25 | On the Quantization of the Electromagnetic Field with Magnetic Monopoles | Kanan Anwar et.al. | 2509.17284 | null |
| 2025-09-21 | PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models | He Xiao et.al. | 2509.16989 | null |
| 2025-09-24 | Equip Pre-ranking with Target Attention by Residual Quantization | Yutong Li et.al. | 2509.16931 | null |
| 2025-09-21 | PRISM: Precision-Recall Informed Data-Free Knowledge Distillation via Generative Diffusion | Xuewan He et.al. | 2509.16897 | null |
| 2025-09-20 | Knowledge Distillation for Variational Quantum Convolutional Neural Networks on Heterogeneous Data | Kai Yu et.al. | 2509.16699 | null |
| 2025-09-20 | When Big Models Train Small Ones: Label-Free Model Parity Alignment for Efficient Visual Question Answering using Small VLMs | Abhirama Subramanyam Penamakuri et.al. | 2509.16633 | null |
| 2025-09-20 | The Role of Vocabularies in Learning Sparse Representations for Ranking | Hiun Kim et.al. | 2509.16621 | null |
| 2025-09-20 | Federated Learning with Ad-hoc Adapter Insertions: The Case of Soft-Embeddings for Training Classifier-as-Retriever | Marijan Fofonjka et.al. | 2509.16508 | null |
| 2025-09-20 | PrediPrune: Reducing Verification Overhead in Souper with Machine Learning Driven Pruning | Ange-Thierry Ishimwe et.al. | 2509.16497 | null |
| 2025-09-20 | Eye Gaze Tells You Where to Compute: Gaze-Driven Efficient VLMs | Qinyu Chen et.al. | 2509.16476 | null |
| 2025-09-19 | Locally Purified Maximally Mixed States At Scale: Entanglement Pruning and Symmetries | Amit Jamadagni et.al. | 2509.16439 | null |
| 2025-09-19 | Pico: A Modular Framework for Hypothesis-Driven Small Language Model Research | Richard Diehl Martinez et.al. | 2509.16413 | null |
| 2025-09-19 | A Unified AI Approach for Continuous Monitoring of Human Health and Diseases from Intensive Care Unit to Home with Physiological Foundation Models (UNIPHY+) | Minxiao Wang et.al. | 2509.16348 | null |
| 2025-09-24 | The Role of High-Performance GPU Resources in Large Language Model Based Radiology Imaging Diagnosis | Jyun-Ping Kao et.al. | 2509.16328 | null |
| 2025-09-18 | Language Modeling with Learned Meta-Tokens | Alok N. Shah et.al. | 2509.16278 | null |
| 2025-09-19 | DiEP: Adaptive Mixture-of-Experts Compression through Differentiable Expert Pruning | Sikai Bai et.al. | 2509.16105 | null |
| 2025-09-19 | DistillMatch: Leveraging Knowledge Distillation from Vision Foundation Model for Multimodal Image Matching | Meng Yang et.al. | 2509.16017 | null |
| 2025-09-19 | DISPATCH: Distilling Selective Patches for Speech Enhancement | Dohwan Kim et.al. | 2509.15922 | null |
| 2025-09-19 | RMT-KD: Random Matrix Theoretic Causal Knowledge Distillation | Davide Ettori et.al. | 2509.15724 | null |
| 2025-09-19 | Once Upon a Time: Interactive Learning for Storytelling with Small Language Models | Jonas Mayer Martins et.al. | 2509.15714 | null |
| 2025-09-19 | Training-Free Pyramid Token Pruning for Efficient Large Vision-Language Models via Region, Token, and Instruction-Guided Importance | Yuxuan Liang et.al. | 2509.15704 | null |
| 2025-09-19 | pFedSAM: Personalized Federated Learning of Segment Anything Model for Medical Image Segmentation | Tong Wang et.al. | 2509.15638 | null |
| 2025-09-19 | MEC-Quant: Maximum Entropy Coding for Extremely Low Bit Quantization-Aware Training | Junbiao Pang et.al. | 2509.15514 | null |
| 2025-09-19 | Mental Accounts for Actions: EWA-Inspired Attention in Decision Transformers | Zahra Aref et.al. | 2509.15498 | null |
| 2025-09-19 | Backdoor Mitigation via Invertible Pruning Masks | Kealan Dunnett et.al. | 2509.15497 | null |
| 2025-09-18 | IMPQ: Interaction-Aware Layerwise Mixed Precision Quantization for LLMs | Junchen Zhao et.al. | 2509.15455 | null |
| 2025-09-18 | Fair-GPTQ: Bias-Aware Quantization for Large Language Models | Irina Proskurina et.al. | 2509.15206 | null |
| 2025-09-18 | MaRVIn: A Cross-Layer Mixed-Precision RISC-V Framework for DNN Inference, from ISA Extension to Hardware Acceleration | Giorgos Armeniakos et.al. | 2509.15187 | null |
| 2025-09-18 | No Modality Left Behind: Adapting to Missing Modalities via Knowledge Distillation for Brain Tumor Segmentation | Shenghao Zhu et.al. | 2509.15017 | null |
| 2025-09-19 | MeanFlowSE: one-step generative speech enhancement via conditional mean flow | Duojia Li et.al. | 2509.14858 | null |
| 2025-09-18 | Delta Knowledge Distillation for Large Language Models | Yihan Cao et.al. | 2509.14526 | null |
| 2025-09-17 | NIRVANA: Structured pruning reimagined for large language models compression | Mengting Ai et.al. | 2509.14230 | null |
| 2025-09-17 | Where Do Tokens Go? Understanding Pruning Behaviors in STEP at High Resolutions | Michal Szczepanski et.al. | 2509.14165 | null |
| 2025-09-17 | SV-Mixer: Replacing the Transformer Encoder with Lightweight MLPs for Self-Supervised Model Compression in Speaker Verification | Jungwoo Heo et.al. | 2509.14136 | null |
| 2025-09-17 | MOCHA: Multi-modal Objects-aware Cross-arcHitecture Alignment | Elena Camuffo et.al. | 2509.14001 | null |
| 2025-09-17 | Asymptotic Analysis of Nonlinear One-Bit Precoding in Massive MIMO Systems via Approximate Message Passing | Zheyu Wu et.al. | 2509.13955 | null |
| 2025-09-19 | Efficient Quantization-Aware Neural Receivers: Beyond Post-Training Quantization | SaiKrishna Saketh Yellapragada et.al. | 2509.13786 | null |
| 2025-09-17 | TENET: An Efficient Sparsity-Aware LUT-Centric Architecture for Ternary LLM Inference On Edge | Zhirui Huang et.al. | 2509.13765 | null |
| 2025-09-18 | DSPC: Dual-Stage Progressive Compression Framework for Efficient Long-Context Reasoning | Yaxin Gao et.al. | 2509.13723 | null |
| 2025-09-17 | InfraMind: A Novel Exploration-based GUI Agentic Framework for Mission-critical Industrial Management | Liangtao Lin et.al. | 2509.13704 | null |
| 2025-09-17 | A High-Quality and Low-Complexity Streamable Neural Speech Codec with Knowledge Distillation | En-Wei Zhang et.al. | 2509.13670 | null |
| 2025-09-16 | AQUA-LLM: Evaluating Accuracy, Quantization, and Adversarial Robustness Trade-offs in LLMs for Cybersecurity Question Answering | Onat Gungor et.al. | 2509.13514 | null |
| 2025-09-16 | Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization | Hao Xu et.al. | 2509.13482 | null |
| 2025-09-16 | LLMs for energy and macronutrients estimation using only text data from 24-hour dietary recalls: a parameter-efficient fine-tuning experiment using a 10-shot prompt | Rodrigo M Carrillo-Larco et.al. | 2509.13268 | null |
| 2025-09-18 | HAM: Hierarchical Adapter Merging for Scalable Continual Learning | Eric Nuertey Coleman et.al. | 2509.13211 | null |
| 2025-09-16 | Vi-SAFE: A Spatial-Temporal Framework for Efficient Violence Detection in Public Surveillance | Ligang Chang et.al. | 2509.13210 | null |
| 2025-09-16 | Multi-Model Synthetic Training for Mission-Critical Small Language Models | Nolan Platt et.al. | 2509.13047 | null |
| 2025-09-16 | Investigating ReLoRA: Effects on the Learning Dynamics of Small Language Models | Yuval Weiss et.al. | 2509.12960 | null |
| 2025-09-17 | A Novel Compression Framework for YOLOv8: Achieving Real-Time Aerial Object Detection on Edge Devices via Structured Pruning and Channel-Wise Distillation | Melika Sabaghian et.al. | 2509.12918 | null |
| 2025-09-16 | Energy-Efficient Quantized Federated Learning for Resource-constrained IoT devices | Wilfrid Sougrinoma Compaoré et.al. | 2509.12814 | null |
| 2025-09-16 | NEFT: A Unified Transformer Framework for Efficient Near-Field CSI Feedback in XL-MIMO Systems | Haiyang Li et.al. | 2509.12748 | null |
| 2025-09-16 | Effective Gaussian Management for High-fidelity Object Reconstruction | Jiateng Liu et.al. | 2509.12742 | null |
| 2025-09-16 | ZTree: A Subgroup Identification Based Decision Tree Learning Framework | Eric Cheng et.al. | 2509.12688 | null |
| 2025-09-16 | The Better You Learn, The Smarter You Prune: Towards Efficient Vision-language-action Models via Differentiable Token Pruning | Titong Jiang et.al. | 2509.12594 | null |
| 2025-09-16 | iCD: A Implicit Clustering Distillation Mathod for Structural Information Mining | Xiang Xue et.al. | 2509.12553 | null |
| 2025-09-16 | LEAF: Knowledge Distillation of Text Embedding Models with Teacher-Aligned Representations | Robin Vujanic et.al. | 2509.12539 | null |
| 2025-09-15 | Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction | Ryan Lucas et.al. | 2509.12464 | null |
| 2025-09-15 | GhostNetV3-Small: A Tailored Architecture and Comparative Study of Distillation Strategies for Tiny Images | Florian Zager et.al. | 2509.12380 | null |
| 2025-09-15 | Unsupervised Atomic Data Mining via Multi-Kernel Graph Autoencoders for Machine Learning Force Fields | Hong Sun et.al. | 2509.12358 | null |
| 2025-09-15 | SAQ: Pushing the Limits of Vector Quantization through Code Adjustment and Dimension Segmentation | Hui Li et.al. | 2509.12086 | null |
| 2025-09-15 | AMQ: Enabling AutoML for Mixed-precision Weight-Only Quantization of Large Language Models | Sangjun Lee et.al. | 2509.12019 | null |
| 2025-09-15 | CLAIRE: A Dual Encoder Network with RIFT Loss and Phi-3 Small Language Model Based Interpretability for Cross-Modality Synthetic Aperture Radar and Optical Land Cover Segmentation | Debopom Sutradhar et.al. | 2509.11952 | null |
| 2025-09-16 | Enriched text-guided variational multimodal knowledge distillation network (VMD) for automated diagnosis of plaque vulnerability in 3D carotid artery MRI | Bo Cao et.al. | 2509.11924 | null |
| 2025-09-15 | SpecVLM: Fast Speculative Decoding in Vision-Language Models | Haiduo Huang et.al. | 2509.11815 | null |
| 2025-09-15 | Visualization and Analysis of the Loss Landscape in Graph Neural Networks | Samir Moustafa et.al. | 2509.11792 | null |
| 2025-09-15 | Quantization Errors, Human–AI Interaction, and Approximate Fixed Points in $L^1(μ)$ | Faruk Alpay et.al. | 2509.11700 | null |
| 2025-09-15 | DARD: Dice Adversarial Robustness Distillation against Adversarial Attacks | Jing Zou et.al. | 2509.11525 | null |
| 2025-09-14 | Knowledge Distillation for Sensing-Assisted Long-Term Beam Tracking in mmWave Communications | Mengyuan Ma et.al. | 2509.11419 | null |
| 2025-09-14 | Investigating the Lottery Ticket Hypothesis for Variational Quantum Circuits | Michael Kölle et.al. | 2509.11190 | null |
| 2025-09-16 | Optimal Brain Restoration for Joint Quantization and Sparsification of LLMs | Hang Guo et.al. | 2509.11177 | null |
| 2025-09-14 | SVR-GS: Spatially Variant Regularization for Probabilistic Masks in 3D Gaussian Splatting | Ashkan Taghipour et.al. | 2509.11116 | null |
| 2025-09-13 | GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings | Yixuan Tang et.al. | 2509.10844 | null |
| 2025-09-12 | Automated MCQA Benchmarking at Scale: Evaluating Reasoning Traces as Retrieval Sources for Domain Adaptation of Small Language Models | Ozan Gokdemir et.al. | 2509.10744 | null |
| 2025-09-12 | Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs | Yixiao Zhou et.al. | 2509.10377 | null |
| 2025-09-12 | Efficient Learned Image Compression Through Knowledge Distillation | Fabien Allemand et.al. | 2509.10366 | null |
| 2025-09-12 | I-Segmenter: Integer-Only Vision Transformer for Efficient Semantic Segmentation | Jordan Sassoon et.al. | 2509.10334 | null |
| 2025-09-12 | Investigating Language Model Capabilities to Represent and Process Formal Knowledge: A Preliminary Study to Assist Ontology Engineering | Hanna Abi Akl et.al. | 2509.10249 | null |
| 2025-09-12 | FedBiF: Communication-Efficient Federated Learning via Bits Freezing | Shiwei Li et.al. | 2509.10161 | null |
| 2025-09-12 | Scalable Training for Vector-Quantized Networks with 100% Codebook Utilization | Yifan Chang et.al. | 2509.10140 | null |
| 2025-09-12 | Efficient and Accurate Downfacing Visual Inertial Odometry | Jonas Kühne et.al. | 2509.10021 | null |
| 2025-09-12 | Toward Green Code: Prompting Small Language Models for Energy-Efficient Code Generation | Humza Ashraf et.al. | 2509.09947 | null |
| 2025-09-12 | Acoustic Scene Classification Using CNN-GRU Model Without Knowledge Distillation | Ee-Leng Tan et.al. | 2509.09931 | null |
| 2025-09-11 | ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms | Bingxin Xu et.al. | 2509.09679 | null |
| 2025-09-11 | ReBaNO: Reduced Basis Neural Operator Mitigating Generalization Gaps and Achieving Discretization Invariance | Haolan Zheng et.al. | 2509.09611 | null |
| 2025-09-11 | Combating the Memory Walls: Optimization Pathways for Long-Context Agentic LLM Inference | Haoran Wu et.al. | 2509.09505 | null |
| 2025-09-11 | Unified Start, Personalized End: Progressive Pruning for Efficient 3D Medical Image Segmentation | Linhao Li et.al. | 2509.09267 | link |
| 2025-09-11 | Adaptive Knowledge Distillation using a Device-Aware Teacher for Low-Complexity Acoustic Scene Classification | Seung Gyu Jeong et.al. | 2509.09262 | null |
| 2025-09-11 | SQAP-VLA: A Synergistic Quantization-Aware Pruning Framework for High-Performance Vision-Language-Action Models | Hengyu Fang et.al. | 2509.09090 | null |
| 2025-09-10 | CSI Compression Beyond Latents: End-to-End Hybrid Attention-CNN Networks with Entropy Regularization | Maryam Ansarifard et.al. | 2509.08776 | null |
| 2025-09-10 | Compressing CNN models for resource-constrained systems by channel and layer pruning | Ahmed Sadaqa et.al. | 2509.08714 | null |
| 2025-09-10 | BitROM: Weight Reload-Free CiROM Architecture Towards Billion-Parameter 1.58-bit LLM Inference | Wenlun Zhang et.al. | 2509.08542 | null |
| 2025-09-12 | SINDI: an Efficient Index for Approximate Maximum Inner Product Search on Sparse Vectors | Ruoxuan Li et.al. | 2509.08395 | null |
| 2025-09-10 | Mitigating Catastrophic Forgetting in Large Language Models with Forgetting-aware Pruning | Wei Huang et.al. | 2509.08255 | null |
| 2025-09-10 | Strategies for Improving Communication Efficiency in Distributed and Federated Learning: Compression, Local Training, and Personalization | Kai Yi et.al. | 2509.08233 | null |
| 2025-09-09 | Risk-Bounded Multi-Agent Visual Navigation via Dynamic Budget Allocation | Viraj Parimi et.al. | 2509.08157 | null |
| 2025-09-09 | Tensor-Train Operator Inference | Engin Danis et.al. | 2509.08071 | null |
| 2025-09-09 | SA-OOSC: A Multimodal LLM-Distilled Semantic Communication Framework for Enhanced Coding Efficiency with Scenario Understanding | Feifan Zhang et.al. | 2509.07436 | null |
| 2025-09-09 | The Role of Exploration Modules in Small Language Models for Knowledge Graph Question Answering | Yi-Jie Cheng et.al. | 2509.07399 | null |
| 2025-09-09 | Knowledge Distillation Driven Semantic NOMA for Image Transmission with Diffusion Model | Qifei Wang et.al. | 2509.07363 | null |
| 2025-09-09 | Word2Spike: Poisson Rate Coding for Associative Memories and Neuromorphic Algorithms | Archit Kalra et.al. | 2509.07361 | null |
| 2025-09-09 | Quantization of the electromagnetic fields from single atomic or molecular radiators | Valerica Raicu et.al. | 2509.07359 | null |
| 2025-09-08 | Recursive algorithm for constructing antisymmetric fermionic states in first quantization mapping | E. Rule et.al. | 2509.07279 | null |
| 2025-09-08 | HealthSLM-Bench: Benchmarking Small Language Models for Mobile and Wearable Healthcare Monitoring | Xin Wang et.al. | 2509.07260 | null |
| 2025-09-08 | Efficient Multi-Agent Coordination via Dynamic Joint-State Graph Construction | Yanlin Zhou et.al. | 2509.07234 | null |
| 2025-09-08 | Efficient Low-Memory Fast Stack Decoding with Variance Polarization for PAC Codes | Mohsen Moradi et.al. | 2509.07231 | null |
| 2025-09-08 | Explaining How Quantization Disparately Skews a Model | Abhimanyu Bellam et.al. | 2509.07222 | null |
| 2025-09-07 | MEGS $^{2}$ : Memory-Efficient Gaussian Splatting via Spherical Gaussians and Unified Pruning | Jiarui Chen et.al. | 2509.07021 | null |
| 2025-09-08 | H $_{2}$ OT: Hierarchical Hourglass Tokenizer for Efficient Video Pose Transformers | Wenhao Li et.al. | 2509.06956 | null |
| 2025-10-13 | COMPACT: Common-token Optimized Model Pruning Across Channels and Tokens | Eugene Kwek et.al. | 2509.06836 | null |
| 2025-09-08 | Tree of Agents: Improving Long-Context Capabilities of Large Language Models through Multi-Perspective Reasoning | Song Yu et.al. | 2509.06436 | null |
| 2025-09-08 | Index-Preserving Lightweight Token Pruning for Efficient Document Understanding in Vision-Language Models | Jaemin Son et.al. | 2509.06415 | null |
| 2025-09-08 | 3DOF+Quantization: 3DGS quantization for large scenes with limited Degrees of Freedom | Matthieu Gendrin et.al. | 2509.06400 | null |
| 2025-09-08 | Variational Garrote for Statistical Physics-based Sparse and Robust Variable Selection | Hyungjoon Soh et.al. | 2509.06383 | null |
| 2025-09-08 | Mask-GCG: Are All Tokens in Adversarial Suffixes Necessary for Jailbreak Attacks? | Junjie Mu et.al. | 2509.06350 | null |
| 2025-09-08 | LoaQ: Layer-wise Output Approximation Quantization | Li Lin et.al. | 2509.06297 | null |
| 2025-09-15 | FineServe: Precision-Aware KV Slab and Two-Level Scheduling for Heterogeneous Precision LLM Serving | Kyungmin Bin et.al. | 2509.06261 | null |
| 2025-09-10 | BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models | Yuming Li et.al. | 2509.06040 | null |
| 2025-09-07 | StripDet: Strip Attention-Based Lightweight 3D Object Detection from Point Cloud | Weichao Wang et.al. | 2509.05954 | null |
| 2025-09-07 | Quantization of bounded symplectic domains associated with compact Lie groups | Alexey A. Sharapov et.al. | 2509.05931 | null |
| 2025-09-06 | Batalin-Fradkin-Vilkovisky Quantization of FLPR model | Ansha S. Nair et.al. | 2509.05632 | null |
| 2025-09-06 | Quantization of spin circular photogalvanic effect in altermagnetic Weyl semimetals | Hiroki Yoshida et.al. | 2509.05620 | null |
| 2025-09-06 | SpecPrune-VLA: Accelerating Vision-Language-Action Models via Action-Aware Self-Speculative Pruning | Hanzhen Wang et.al. | 2509.05614 | null |
| 2025-09-09 | Mitigating Spurious Correlations Between Question and Answer via Chain-of-Thought Correctness Perception Distillation | Hongyan Xie et.al. | 2509.05602 | null |
| 2025-09-06 | ProfilingAgent: Profiling-Guided Agentic Reasoning for Adaptive Model Optimization | Sadegh Jafari et.al. | 2509.05584 | null |
| 2025-09-06 | Sensitivity-Aware Post-Training Quantization for Deep Neural Networks | Zekang Zheng et.al. | 2509.05576 | null |
| 2025-09-05 | SuperSNN: A Hardware-Aware Framework for Physically Realizable, High-Performance Superconducting Spiking Neural Network Chips | Changxu Song et.al. | 2509.05532 | null |
| 2025-09-05 | Dynamic Sensitivity Filter Pruning using Multi-Agent Reinforcement Learning For DCNN’s | Iftekhar Haider Chowdhury et.al. | 2509.05446 | null |
| 2025-09-05 | Accuracy-Constrained CNN Pruning for Efficient and Reliable EEG-Based Seizure Detection | Mounvik K et.al. | 2509.05190 | null |
| 2025-09-05 | FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies | Moritz Reuss et.al. | 2509.04996 | null |
| 2025-09-05 | PLaMo 2 Technical Report | Preferred Networks et.al. | 2509.04897 | null |
| 2025-09-05 | AI-Driven Fronthaul Link Compression in Wireless Communication Systems: Review and Method Design | Keqin Zhang et.al. | 2509.04805 | null |
| 2025-09-05 | STADI: Fine-Grained Step-Patch Diffusion Parallelism for Heterogeneous GPUs | Han Liang et.al. | 2509.04719 | null |
| 2025-09-08 | Advancing SLM Tool-Use Capability using Reinforcement Learning | Dhruvi Paprunia et.al. | 2509.04518 | null |
| 2025-09-02 | ProST: Progressive Sub-task Training for Pareto-Optimal Multi-agent Systems Using Small Language Models | Biddut Sarker Bijoy et.al. | 2509.04508 | null |
| 2025-09-04 | PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference | Krishna Teja Chitty-Venkata et.al. | 2509.04377 | null |
| 2025-09-04 | Integrating Pruning with Quantization for Efficient Deep Neural Networks Compression | Sara Makenali et.al. | 2509.04244 | null |
| 2025-09-04 | Real Time FPGA Based Transformers & VLMs for Vision Tasks: SOTA Designs and Optimizations | Safa Mohammed Sali et.al. | 2509.04162 | null |
| 2025-09-04 | Real Time FPGA Based CNNs for Detection, Classification, and Tracking in Autonomous Systems: State of the Art Designs and Optimizations | Safa Mohammed Sali et.al. | 2509.04153 | null |
| 2025-09-04 | Duality between polyhedral approximation of value functions and optimal quantization of measures | Abdellah Bulaich Mehamdi et.al. | 2509.04101 | null |
| 2025-09-04 | Robust MIMO Semantic Communication with Imperfect CSI via Knowledge Distillation | Mingze Gong et.al. | 2509.04005 | null |
| 2025-09-04 | Data-Augmented Quantization-Aware Knowledge Distillation | Justin Kur et.al. | 2509.03850 | null |
| 2025-09-03 | QuantV2X: A Fully Quantized Multi-Agent System for Cooperative Perception | Seth Z. Zhao et.al. | 2509.03704 | null |
| 2025-09-03 | DPQuant: Efficient and Differentially-Private Model Training via Dynamic Quantization Scheduling | Yubo Gao et.al. | 2509.03472 | null |
| 2025-09-08 | Amplifying Effective CXL Memory Bandwidth for LLM Inference via Transparent Near-Data Processing | Rui Xie et.al. | 2509.03377 | null |
| 2025-09-03 | NeurStore: Efficient In-database Deep Learning Model Management System | Siqi Xiang et.al. | 2509.03228 | null |
| 2025-09-03 | BAMG: A Block-Aware Monotonic Graph Index for Disk-Based Approximate Nearest Neighbor Search | Huiling Li et.al. | 2509.03226 | null |
| 2025-09-03 | CapsBeam: Accelerating Capsule Network based Beamformer for Ultrasound Non-Steered Plane Wave Imaging on Field Programmable Gate Array | Abdul Rahoof et.al. | 2509.03201 | null |
| 2025-09-03 | Deep Self-knowledge Distillation: A hierarchical supervised learning for coronary artery segmentation | Mingfeng Lin et.al. | 2509.03173 | null |
| 2025-09-03 | FastCaps: A Design Methodology for Accelerating Capsule Network on Field Programmable Gate Arrays | Abdul Rahoof et.al. | 2509.03103 | null |
| 2025-09-03 | Binary Quantization For LLMs Through Dynamic Grouping | Xinzhe Zheng et.al. | 2509.03054 | null |
| 2025-09-02 | LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference | Krishna Teja Chitty-Venkata et.al. | 2509.02753 | null |
| 2025-09-02 | A quantization of the $\operatorname{SL}_2(\mathbb{C})$ -Chern-Simons invariant of tangle exteriors | Calvin McPhail-Snyder et.al. | 2509.02365 | null |
| 2025-09-02 | All-optical band structure reconstruction and onset of Landau quantization of Dirac fermions | Josef Riepl et.al. | 2509.02362 | null |
| 2025-09-02 | Operator Algebras and Third Quantization | Yidong Chen et.al. | 2509.02293 | null |
| 2025-08-11 | Less is More: Selective Reflection for Compatible and Efficient Knowledge Distillation in Large Language Models | Lingyuan Liu et.al. | 2508.06135 | null |
| 2025-08-06 | Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models | He Xiao et.al. | 2508.03332 | null |
| 2025-07-29 | Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study | Yiran Huang et.al. | 2507.20749 | null |
| 2025-07-22 | Collaborative Distillation Strategies for Parameter-Efficient Language Model Deployment | Xiandong Meng et.al. | 2507.15198 | null |
| 2025-07-11 | Exploring the Limits of Model Compression in LLMs: A Knowledge Distillation Study on QA Tasks | Joyeeta Datta et.al. | 2507.07630 | null |
| 2025-07-08 | Put Teacher in Student’s Shoes: Cross-Distillation for Ultra-compact Model Compression Framework | Maolin Wang et.al. | 2507.04636 | null |
| 2025-06-17 | TensorSLM: Energy-efficient Embedding Compression of Sub-billion Parameter Language Models on Low-end Devices | Mingxue Xu et.al. | 2506.13514 | null |
| 2025-05-30 | Small Language Models: Architectures, Techniques, Evaluation, Problems and Future Adaptation | Tanjil Hasan Sakib et.al. | 2505.19529 | null |
| 2025-10-14 | Shifting AI Efficiency From Model-Centric to Data-Centric Compression | Xuyang Liu et.al. | 2505.19147 | null |
| 2025-05-27 | Knowledge Grafting of Large Language Models | Guodong Du et.al. | 2505.18502 | null |
| 2025-04-25 | Does Knowledge Distillation Matter for Large Language Model based Bundle Generation? | Kaidong Feng et.al. | 2504.17220 | null |
| 2025-04-24 | Case Study: Fine-tuning Small Language Models for Accurate and Private CWE Detection in Python Code | Md. Azizul Hakim Bappy et.al. | 2504.16584 | null |
| 2025-04-22 | Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions | Luyang Fang et.al. | 2504.14772 | null |
| 2025-04-09 | Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression | Ivan Ilin et.al. | 2504.05346 | null |
| 2025-07-01 | Distillation and Refinement of Reasoning in Small Language Models for Document Re-ranking | Chris Samarinas et.al. | 2504.03947 | null |
| 2025-03-17 | Small Vision-Language Models: A Survey on Compact Architectures and Techniques | Nitesh Patnaik et.al. | 2503.10665 | null |
| 2025-10-23 | Using (Not-so) Large Language Models to Generate Simulation Models in a Formal DSL: A Study on Reaction Networks | Justin N. Kreikemeyer et.al. | 2503.01675 | null |
| 2025-03-06 | Rethinking Data: Towards Better Performing Domain-Specific Small Language Models | Boris Nazarov et.al. | 2503.01464 | null |
| 2025-03-04 | ReaderLM-v2: Small Language Model for HTML to Markdown and JSON | Feng Wang et.al. | 2503.01151 | null |
| 2025-05-27 | Beyond the Tip of Efficiency: Uncovering the Submerged Threats of Jailbreak Attacks in Small Language Models | Sibo Yi et.al. | 2502.19883 | null |
| 2025-02-26 | AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages | Joshua Sakthivel Raju et.al. | 2502.18020 | null |
| 2025-06-17 | Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training | Rui Pan et.al. | 2502.03460 | null |
| 2025-03-03 | TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models | Makoto Shing et.al. | 2501.16937 | null |
| 2025-06-09 | GRASP: Replace Redundant Layers with Adaptive Singular Parameters for Efficient Model Compression | Kainan Liu et.al. | 2501.00339 | link |
| 2024-12-30 | Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models | Shuo Wang et.al. | 2412.19449 | null |
| 2024-12-23 | PruneVid: Visual Token Pruning for Efficient Video Large Language Models | Xiaohu Huang et.al. | 2412.16117 | null |
| 2024-11-22 | Hymba: A Hybrid-head Architecture for Small Language Models | Xin Dong et.al. | 2411.13676 | null |
| 2025-02-19 | Efficient Alignment of Large Language Models via Data Sampling | Amrit Khera et.al. | 2411.10545 | null |
| 2024-11-27 | SlimLM: An Efficient Small Language Model for On-Device Document Assistance | Thang M. Pham et.al. | 2411.09944 | null |
| 2025-02-26 | LLM-NEO: Parameter Efficient Knowledge Distillation for Large Language Models | Runming Yang et.al. | 2411.06839 | null |
| 2024-11-12 | Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation | Yu-Liang Zhan et.al. | 2411.06448 | null |
| 2025-04-09 | Fox-1: Open Small Language Model for Cloud and Edge | Zijian Hu et.al. | 2411.05281 | null |
| 2024-10-29 | KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation | Rambod Azimi et.al. | 2410.20777 | link |
| 2024-10-29 | A Survey of Small Language Models | Chien Van Nguyen et.al. | 2410.20011 | link |
| 2025-07-15 | Self-calibration for Language Model Quantization and Pruning | Miles Williams et.al. | 2410.17170 | null |
| 2024-10-21 | Efficient Vision-Language Models by Summarizing Visual Tokens into Compact Registers | Yuxin Wen et.al. | 2410.14072 | null |
| 2025-06-03 | RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates | Md Kowsher et.al. | 2410.10075 | null |
| 2024-09-20 | Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights | Mohamad Ballout et.al. | 2409.12586 | null |
| 2024-10-04 | FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation | KaShun Shum et.al. | 2408.12168 | null |
| 2024-11-05 | Compact Language Models via Pruning and Knowledge Distillation | Saurav Muralidharan et.al. | 2407.14679 | null |
| 2024-07-09 | Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations | Bowen Shen et.al. | 2407.05690 | null |
| 2025-04-22 | SLMRec: Distilling Large Language Models into Small for Sequential Recommendation | Wujiang Xu et.al. | 2405.17890 | null |
| 2024-05-17 | Densely Distilling Cumulative Knowledge for Continual Learning | Zenglin Shi et.al. | 2405.09820 | null |
| 2024-04-09 | What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models | Busayo Awobade et.al. | 2404.04759 | null |
| 2024-04-05 | Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought | Jooyoung Lee et.al. | 2404.03414 | null |
| 2024-06-26 | Telecom Language Models: Must They Be Large? | Nicola Piovesan et.al. | 2403.04666 | null |
| 2024-05-31 | Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models | Xudong Lu et.al. | 2402.14800 | null |
| 2024-02-16 | Model Compression and Efficient Inference for Large Language Models: A Survey | Wenxiao Wang et.al. | 2402.09748 | null |
| 2024-04-09 | A Survey on Transformer Compression | Yehui Tang et.al. | 2402.05964 | null |
| 2025-07-23 | L4Q: Parameter Efficient Quantization-Aware Fine-Tuning on Large Language Models | Hyesung Jeon et.al. | 2402.04902 | null |
| 2024-06-25 | Shortened LLaMA: Depth Pruning for Large Language Models with Comparison of Retraining Methods | Bo-Kyeong Kim et.al. | 2402.02834 | null |
| 2024-02-07 | Dual Knowledge Distillation for Efficient Sound Event Detection | Yang Xiao et.al. | 2402.02781 | null |
| 2024-02-02 | EPSD: Early Pruning with Self-Distillation for Efficient Model Compression | Dong Chen et.al. | 2402.00084 | null |
| 2024-06-05 | APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference | Bowen Zhao et.al. | 2401.12200 | null |
| 2024-06-05 | TinyLlama: An Open-Source Small Language Model | Peiyuan Zhang et.al. | 2401.02385 | null |
| 2024-02-23 | LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model | Yichen Zhu et.al. | 2401.02330 | null |
| 2024-06-24 | TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones | Zhengqing Yuan et.al. | 2312.16862 | null |
| 2024-03-19 | Language Model Knowledge Distillation for Efficient Question Answering in Spanish | Adrián Bazaga et.al. | 2312.04193 | link |
| 2024-02-07 | Compressed Context Memory For Online Language Model Interaction | Jang-Hyun Kim et.al. | 2312.03414 | null |
| 2023-11-09 | PB-LLM: Partially Binarized Large Language Models | Yuzhang Shang et.al. | 2310.00034 | null |
| 2023-08-29 | Improving Knowledge Distillation for BERT Models: Loss Functions, Mapping Methods, and Weight Tuning | Apoorv Dankar et.al. | 2308.13958 | null |
| 2023-06-27 | Low-Rank Prune-And-Factorize for Language Model Compression | Siyu Ren et.al. | 2306.14152 | null |
| 2023-06-21 | Categories of Response-Based, Feature-Based, and Relation-Based Knowledge Distillation | Chuanguang Yang et.al. | 2306.10687 | null |
| 2023-05-29 | Neural Architecture Search for Parameter-Efficient Fine-tuning of Large Pre-trained Language Models | Neal Lawton et.al. | 2305.16597 | null |
| 2023-05-22 | Parameter-Efficient Fine-Tuning with Layer Pruning on Free-Text Sequence-to-Sequence Modeling | Yunqi Zhu et.al. | 2305.08285 | null |
| 2023-04-20 | An Empirical Study of Leveraging Knowledge Distillation for Compressing Multilingual Neural Machine Translation Models | Varun Gumma et.al. | 2304.09388 | null |
| 2023-02-15 | Multi-teacher knowledge distillation as an effective method for compressing ensembles of neural networks | Konrad Zuchniak et.al. | 2302.07215 | null |
| 2022-10-17 | EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning | Tiannan Wang et.al. | 2210.07795 | null |
| 2022-10-11 | AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models | Se Jung Kwon et.al. | 2210.03858 | null |
| 2022-08-04 | Efficient Fine-Tuning of Compressed Language Models with Learners | Danilo Vucetic et.al. | 2208.02070 | null |
| 2022-06-01 | Parameter-Efficient and Student-Friendly Knowledge Distillation | Jun Rao et.al. | 2205.15308 | null |
| 2022-05-24 | Parameter-Efficient Sparsity for Large Language Models Fine-Tuning | Yuchao Li et.al. | 2205.11005 | null |
| 2022-05-04 | Structured Pruning Learns Compact and Accurate Models | Mengzhou Xia et.al. | 2204.00408 | null |
| 2022-03-23 | DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization | Zheng Li et.al. | 2203.11239 | null |
| 2022-03-09 | HyperPELT: Unified Parameter-Efficient Language Model Tuning for Both Language and Vision-and-Language Tasks | Zhengkun Zhang et.al. | 2203.03878 | null |
| 2022-04-05 | CHIP: CHannel Independence-based Pruning for Compact Neural Networks | Yang Sui et.al. | 2110.13981 | null |
| 2022-02-03 | Towards a Unified View of Parameter-Efficient Transfer Learning | Junxian He et.al. | 2110.04366 | null |
| 2023-07-18 | Pruning Ternary Quantization | Dan Liu et.al. | 2107.10998 | null |
| 2021-06-29 | PQK: Model Compression via Pruning, Quantization, and Knowledge Distillation | Jangho Kim et.al. | 2106.14681 | null |
| 2022-02-11 | An Information-Theoretic Justification for Model Pruning | Berivan Isik et.al. | 2102.08329 | null |
| 2022-11-03 | Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains | Haojie Pan et.al. | 2012.01266 | null |
| 2020-11-18 | Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning | Bingbing Li et.al. | 2009.08065 | null |
| 2020-08-04 | Differentiable Feature Aggregation Search for Knowledge Distillation | Yushuo Guan et.al. | 2008.00506 | null |
| 2020-05-19 | MicroNet for Efficient Language Modeling | Zhongxia Yan et.al. | 2005.07877 | null |
| 2020-04-20 | Triplet Loss for Knowledge Distillation | Hideki Oki et.al. | 2004.08116 | null |
| 2019-10-18 | Training Compact Models for Low Resource Entity Tagging using Pre-trained Language Models | Peter Izsak et.al. | 1910.06294 | null |
| 2021-03-05 | Revisiting Knowledge Distillation via Label Smoothing Regularization | Li Yuan et.al. | 1909.11723 | null |
| 2019-08-27 | Patient Knowledge Distillation for BERT Model Compression | Siqi Sun et.al. | 1908.09355 | null |
| 2019-06-18 | Scalable Syntax-Aware Language Models Using Knowledge Distillation | Adhiguna Kuncoro et.al. | 1906.06438 | null |
| 2019-05-07 | Creating Lightweight Object Detectors with Model Compression for Deployment on Edge Devices | Yiwu Yao et.al. | 1905.01787 | null |
| 2018-12-04 | Knowledge Distillation with Feature Maps for Image Classification | Wei-Chun Chen et.al. | 1812.00660 | null |
| 2018-11-07 | Compact Personalized Models for Neural Machine Translation | Joern Wuebker et.al. | 1811.01990 | null |
| 2016-08-17 | Fast, Small and Exact: Infinite-order Language Modelling with Compressed Suffix Trees | Ehsan Shareghi et.al. | 1608.04465 | null |
📊 992 papers
| 📅 Publish Date | 📝 Title | 👥 Authors | 💻 Code | |
|---|---|---|---|---|
| 2026-04-01 | PDA: Text-Augmented Defense Framework for Robust Vision-Language Models against Adversarial Image Attacks | Jingning Xu et.al. | 2604.01010 | null |
| 2026-04-01 | Mine-JEPA: In-Domain Self-Supervised Learning for Mine-Like Object Classification in Side-Scan Sonar | Taeyoun Kwon et.al. | 2604.00383 | null |
| 2026-03-30 | UltraG-Ray: Physics-Based Gaussian Ray Casting for Novel Ultrasound View Synthesis | Felix Duelmer et.al. | 2603.29022 | null |
| 2026-03-30 | LDDMM stochastic interpolants: an application to domain uncertainty quantification in hemodynamics | Sarah Katz et.al. | 2603.28324 | null |
| 2026-03-28 | GIFT: Bootstrapping Image-to-CAD Program Synthesis via Geometric Feedback | Giorgio Giannone et.al. | 2603.27448 | null |
| 2026-03-28 | Evaluating Large and Lightweight Vision Models for Irregular Component Segmentation in E-Waste Disassembly | Xinyao Zhang et.al. | 2603.27441 | null |
| 2026-03-28 | Hybrid Deep Learning with Temporal Data Augmentation for Accurate Remaining Useful Life Prediction of Lithium-Ion Batteries | Yun Tian et.al. | 2603.27186 | null |
| 2026-03-27 | Hybrid Diffusion Model for Breast Ultrasound Image Augmentation | Farhan Fuad Abir et.al. | 2603.26834 | null |
| 2026-03-27 | Central-to-Local Adaptive Generative Diffusion Framework for Improving Gene Expression Prediction in Data-Limited Spatial Transcriptomics | Yaoyu Fang et.al. | 2603.26827 | null |
| 2026-03-25 | PhyDCM: A Reproducible Open-Source Framework for AI-Assisted Brain Tumor Classification from Multi-Sequence MRI | Hayder Saad Abdulbaqi et.al. | 2603.26794 | null |
| 2026-03-26 | A generalized Bayesian approach to multiple changepoint analysis | Yuhui Wang et.al. | 2603.25668 | null |
| 2026-03-26 | Insights on back marking for the automated identification of animals | David Brunner et.al. | 2603.25535 | null |
| 2026-03-26 | Lightweight GenAI for Network Traffic Synthesis: Fidelity, Augmentation, and Classification | Giampaolo Bovenzi et.al. | 2603.25507 | null |
| 2026-03-26 | Translation Asymmetry in LLMs as a Data Augmentation Factor: A Case Study for 6 Romansh Language Varieties | Jannis Vamvas et.al. | 2603.25489 | null |
| 2026-03-26 | MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness | Yuto Matsuo et.al. | 2603.25109 | null |
| 2026-03-26 | $π$ , But Make It Fly: Physics-Guided Transfer of VLA Models to Aerial Manipulation | Johnathan Tucker et.al. | 2603.25038 | null |
| 2026-03-26 | Toward domain-specific machine translation and quality estimation systems | Javad Pourmostafa Roshan Sharami et.al. | 2603.24955 | null |
| 2026-03-26 | CVA: Context-aware Video-text Alignment for Video Temporal Grounding | Sungho Moon et.al. | 2603.24934 | null |
| 2026-03-26 | Decoding Market Emotions in Cryptocurrency Tweets via Predictive Statement Classification with Machine Learning and Transformers | Moein Shahiki Tash et.al. | 2603.24933 | null |
| 2026-03-25 | Amplified Patch-Level Differential Privacy for Free via Random Cropping | Kaan Durmaz et.al. | 2603.24695 | null |
| 2026-03-29 | BCMDA: Bidirectional Correlation Maps Domain Adaptation for Mixed Domain Semi-Supervised Medical Image Segmentation | Bentao Song et.al. | 2603.24691 | null |
| 2026-03-25 | How unconstrained machine-learning models learn physical symmetries | Michelangelo Domina et.al. | 2603.24638 | null |
| 2026-03-25 | A Bayesian Dynamic Latent Space Model for Weighted Networks | Roberto Casarin et.al. | 2603.24201 | null |
| 2026-03-25 | Enhancing and Reporting Robustness Boundary of Neural Code Models for Intelligent Code Understanding | Tingxu Han et.al. | 2603.24119 | null |
| 2026-03-25 | SynMVCrowd: A Large Synthetic Benchmark for Multi-view Crowd Counting and Localization | Qi Zhang et.al. | 2603.23956 | null |
| 2026-03-25 | 3D-LLDM: Label-Guided 3D Latent Diffusion Model for Improving High-Resolution Synthetic MR Imaging in Hepatic Structure Segmentation | Kyeonghun Kim et.al. | 2603.23845 | null |
| 2026-03-30 | Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG | Seungju Han et.al. | 2603.23562 | null |
| 2026-03-24 | Parametric Knowledge and Retrieval Behavior in RAG Fine-Tuning for Electronic Design Automation | Julian Oestreich et.al. | 2603.23047 | null |
| 2026-03-25 | Instrument-Splatting++: Towards Controllable Surgical Instrument Digital Twin Using Gaussian Splatting | Shuojue Yang et.al. | 2603.22792 | null |
| 2026-03-24 | DALDALL: Data Augmentation for Lexical and Semantic Diverse in Legal Domain by leveraging LLM-Persona | Janghyeok Choi et.al. | 2603.22765 | null |
| 2026-03-23 | Generalized multi-object classification and tracking with sparse feature resonator networks | Lazar Supic et.al. | 2603.22539 | null |
| 2026-03-23 | SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection | Kexian Tang et.al. | 2603.22213 | null |
| 2026-03-23 | Data Curation for Machine Learning Interatomic Potentials by Determinantal Point Processes | Joanna Zou et.al. | 2603.22160 | null |
| 2026-03-23 | ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention | Xinyan Wang et.al. | 2603.22016 | null |
| 2026-03-23 | Ctrl-A: Control-Driven Online Data Augmentation | Jesper B. Christensen et.al. | 2603.21819 | null |
| 2026-03-23 | HACMatch Semi-Supervised Rotation Regression with Hardness-Aware Curriculum Pseudo Labeling | Mei Li et.al. | 2603.21583 | null |
| 2026-03-22 | AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling | Liang Ding et.al. | 2603.21357 | null |
| 2026-03-22 | Positional Segmentor-Guided Counterfactual Fine-Tuning for Spatially Localized Image Synthesis | Tian Xia et.al. | 2603.21213 | null |
| 2026-03-21 | negMIX: Negative Mixup for OOD Generalization in Open-Set Node Classification | Junwei Gong et.al. | 2603.20798 | null |
| 2026-03-11 | Abjad-Kids: An Arabic Speech Classification Dataset for Primary Education | Abdul Aziz Snoubara et.al. | 2603.20255 | null |
| 2026-03-19 | A Novel Solution for Zero-Day Attack Detection in IDS using Self-Attention and Jensen-Shannon Divergence in WGAN-GP | Ziyu Mu et.al. | 2603.19350 | null |
| 2026-03-10 | Maximizing mutual information between user-contexts and responses improve LLM personalization with no additional data | Hyunji Nam et.al. | 2603.19294 | null |
| 2026-03-19 | PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment | Tianci Luo et.al. | 2603.18891 | null |
| 2026-03-19 | Data-efficient pre-training by scaling synthetic megadocs | Konwoo Kim et.al. | 2603.18534 | null |
| 2026-03-19 | R&D: Balancing Reliability and Diversity in Synthetic Data Augmentation for Semantic Segmentation | Huy Che et.al. | 2603.18427 | null |
| 2026-03-19 | Where are the Hidden Gems? Applying Transformer Models for Design Discussion Detection | Lawrence Arkoh et.al. | 2603.18393 | null |
| 2026-03-18 | Synthetic Data, Information, and Prior Knowledge: Why Synthetic Data Augmentation to Boost Sample Doesn’t Work for Statistical Inference | Reid Dale et.al. | 2603.18345 | null |
| 2026-03-20 | R2-Dreamer: Redundancy-Reduced World Models without Decoders or Augmentation | Naoki Morihira et.al. | 2603.18202 | null |
| 2026-03-18 | Towards Motion-aware Referring Image Segmentation | Chaeyun Kim et.al. | 2603.17413 | null |
| 2026-03-17 | Machine intelligence supports the full chain of 2D dendrite synthesis | Wenqiang Huang et.al. | 2603.16959 | null |
| 2026-03-17 | Spectral Property-Driven Data Augmentation for Hyperspectral Single-Source Domain Generalization | Taiqin Chen et.al. | 2603.16662 | null |
| 2026-03-17 | Dexterous grasp data augmentation based on grasp synthesis with fingertip workspace cloud and contact-aware sampling | Liqi Wu et.al. | 2603.16609 | null |
| 2026-03-17 | AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object Detection | Hongwei Lin et.al. | 2603.16261 | null |
| 2026-03-17 | When Generative Augmentation Hurts: A Benchmark Study of GAN and Diffusion Models for Bias Correction in AI Classification Systems | Shesh Narayan Gupta et.al. | 2603.16134 | null |
| 2026-03-16 | Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech | Jaesung Bae et.al. | 2603.15988 | null |
| 2026-03-16 | Benchmarking Machine Learning Approaches for Polarization Mapping in Ferroelectrics Using 4D-STEM | Matej Martinc et.al. | 2603.15582 | null |
| 2026-03-16 | Low-Complexity and Consistent Graphon Estimation from Multiple Networks | Roland Boniface Sogan et.al. | 2603.15578 | null |
| 2026-03-16 | Data Augmentation via Causal-Residual Bootstrapping | Mateusz Gajewski et.al. | 2603.15335 | null |
| 2026-03-16 | Datasets for Verb Alternations across Languages: BLM Templates and Data Augmentation Strategies | Giuseppe Samo et.al. | 2603.15295 | null |
| 2026-03-18 | ViSA: Visited-State Augmentation for Generalized Goal-Space Contrastive Reinforcement Learning | Issa Nakamura et.al. | 2603.14887 | null |
| 2026-03-17 | Topology-Preserving Data Augmentation for Ring-Type Polygon Annotations | Sudip Laudari et.al. | 2603.14764 | null |
| 2026-03-15 | A Heterogeneous Ensemble for Multi-Center COVID-19 Classification from Chest CT Scans | Aadit Nilay et.al. | 2603.14621 | null |
| 2026-03-15 | Infinite Problem Generator: Verifiably Scaling Physics Reasoning Data with Agentic Workflows | Aditya Sharan et.al. | 2603.14486 | null |
| 2026-03-15 | PGcGAN: Pathological Gait-Conditioned GAN for Human Gait Synthesis | Mritula Chandrasekaran et.al. | 2603.14409 | null |
| 2026-03-15 | A Physically-Grounded Attack and Adaptive Defense Framework for Real-World Low-Light Image Enhancement | Tongshun Zhang et.al. | 2603.14304 | null |
| 2026-03-14 | EchoLVFM: One-Step Video Generation via Latent Flow Matching for Echocardiogram Synthesis | Emmanuel Oladokun et.al. | 2603.13967 | null |
| 2026-03-14 | Close to Reality: Interpretable and Feasible Data Augmentation for Imbalanced Learning | Matheus Camilo da Silva et.al. | 2603.13927 | null |
| 2026-03-14 | FMS $^2$ : Unified Flow Matching for Segmentation and Synthesis of Thin Structures | Babak Asadi et.al. | 2603.13659 | null |
| 2026-03-11 | Layout-Guided Controllable Pathology Image Generation with In-Context Diffusion Transformers | Yuntao Shou et.al. | 2603.13386 | null |
| 2026-03-10 | A Computer-aided Framework for Detecting Osteosarcoma in Computed Tomography Scans | Maximo Rodriguez-Herrero et.al. | 2603.13376 | null |
| 2026-03-09 | Multimodal Deep Learning for Dynamic and Static Neuroimaging: Integrating MRI and fMRI for Alzheimer Disease Analysis | Anima Kujur et.al. | 2603.13367 | null |
| 2026-03-13 | Surrogates for Physics-based and Data-driven Modelling of Parametric Systems: Review and New Perspectives | Matteo Giacomini et.al. | 2603.12870 | null |
| 2026-03-13 | On Using Machine Learning to Early Detect Catastrophic Failures in Marine Diesel Engines | Francesco Maione et.al. | 2603.12733 | null |
| 2026-03-16 | Overcoming the Modality Gap in Context-Aided Forecasting | Vincent Zhihao Zheng et.al. | 2603.12451 | null |
| 2026-03-16 | Room Impulse Response Completion Using Signal-Prediction Diffusion Models Conditioned on Simulated Early Reflections | Zeyu Xu et.al. | 2603.12442 | null |
| 2026-03-12 | LLM-Augmented Therapy Normalization and Aspect-Based Sentiment Analysis for Treatment-Resistant Depression on Reddit | Yuxin Zhu et.al. | 2603.12343 | null |
| 2026-03-24 | Multi-Station WiFi CSI Sensing Framework Robust to Station-wise Feature Missingness and Limited Labeled Data | Keita Kayano et.al. | 2603.11858 | null |
| 2026-03-12 | In the LLM era, Word Sense Induction remains unsolved | Anna Mosolova et.al. | 2603.11686 | null |
| 2026-03-12 | FBCIR: Balancing Cross-Modal Focuses in Composed Image Retrieval | Chenchen Zhao et.al. | 2603.11520 | null |
| 2026-03-12 | Dynamic Bayesian regression quantile synthesis for forecasting outlook-at-risk | Genya Kobayashi et.al. | 2603.11474 | null |
| 2026-03-11 | Data Augmentation and Convolutional Network Architecture Influence on Distributed Learning | Victor Forattini Jansen et.al. | 2603.10902 | null |
| 2026-03-11 | Riemannian Geometry-Preserving Variational Autoencoder for MI-BCI Data Augmentation | Viktorija Poļaka et.al. | 2603.10563 | link |
| 2026-03-11 | Sparse Task Vector Mixup with Hypernetworks for Efficient Knowledge Transfer in Whole-Slide Image Prognosis | Pei Liu et.al. | 2603.10526 | link |
| 2026-03-11 | FAR-Dex: Few-shot Data Augmentation and Adaptive Residual Policy Refinement for Dexterous Manipulation | Yushan Bai et.al. | 2603.10451 | null |
| 2026-03-10 | Finetuning a Text-to-Audio Model for Room Impulse Response Generation | Kirak Kim et.al. | 2603.09708 | null |
| 2026-03-10 | Improving 3D Foot Motion Reconstruction in Markerless Monocular Human Motion Capture | Tom Wehrbein et.al. | 2603.09681 | null |
| 2026-03-10 | Grounding Synthetic Data Generation With Vision and Language Models | Ümit Mert Çağlar et.al. | 2603.09625 | null |
| 2026-03-10 | Contrastive Bayesian Inference for Unnormalized Models | Naruki Sonobe et.al. | 2603.09306 | null |
| 2026-03-10 | Multi-model approach for autonomous driving: A comprehensive study on traffic sign-, vehicle- and lane detection and behavioral cloning | Kanishkha Jaisankar et.al. | 2603.09255 | null |
| 2026-03-10 | Acoustic and Semantic Modeling of Emotion in Spoken Language | Soumya Dutta et.al. | 2603.09212 | null |
| 2026-03-10 | Wrong Code, Right Structure: Learning Netlist Representations from Imperfect LLM-Generated RTL | Siyang Cai et.al. | 2603.09161 | null |
| 2026-03-10 | Scalable Neural Vocoder from Range-Null Space Decomposition | Andong Li et.al. | 2603.08574 | null |
| 2026-03-09 | Diffusion-Based Data Augmentation for Image Recognition: A Systematic Analysis and Evaluation | Zekun Li et.al. | 2603.08364 | null |
| 2026-03-09 | Seed2Scale: A Self-Evolving Data Engine for Embodied AI via Small to Large Model Synergy and Multimodal Evaluation | Cong Tai et.al. | 2603.08260 | null |
| 2026-03-09 | WhispEar: A Bi-directional Framework for Scaling Whispered Speech Conversion via Pseudo-Parallel Whisper Generation | Zihao Fang et.al. | 2603.08046 | null |
| 2026-03-09 | Hard/Soft NLoS Detection via Combinatorial Data Augmentation for 6G Positioning | Sang-Hyeok Kim et.al. | 2603.07932 | null |
| 2026-03-08 | Nwāchā Munā: A Devanagari Speech Corpus and Proximal Transfer Benchmark for Nepal Bhasha ASR | Rishikesh Kumar Sharma et.al. | 2603.07554 | null |
| 2026-03-08 | An efficient method of posterior sampling for Poisson INGARCH models | Yixuan Fan et.al. | 2603.07527 | null |
| 2026-03-08 | InterReal: A Unified Physics-Based Imitation Framework for Learning Human-Object Interaction Skills | Dayang Liang et.al. | 2603.07516 | null |
| 2026-03-07 | Neural Control and Learning of Simulated Hand Movements With an EMG-Based Closed-Loop Interface | Balint K. Hodossy et.al. | 2603.07364 | null |
| 2026-03-07 | MedSteer: Counterfactual Endoscopic Synthesis via Training-Free Activation Steering | Trong-Thang Pham et.al. | 2603.07066 | null |
| 2026-03-07 | OV-DEIM: Real-time DETR-Style Open-Vocabulary Object Detection with GridSynthetic Augmentation | Leilei Wang et.al. | 2603.07022 | null |
| 2026-03-06 | Learning From Design Procedure To Generate CAD Programs for Data Augmentation | Yan-Ying Chen et.al. | 2603.06894 | null |
| 2026-03-06 | Physics-Informed Diffusion Model for Generating Synthetic Extreme Rare Weather Events Data | Marawan Yakout et.al. | 2603.06782 | null |
| 2026-03-05 | On the Generalization Capacities of MLLMs for Spatial Intelligence | Gongjie Zhang et.al. | 2603.06704 | null |
| 2026-03-02 | EnsAug: Augmentation-Driven Ensembles for Human Motion Sequence Analysis | Bikram De et.al. | 2603.06661 | null |
| 2026-03-06 | NOBLE: Accelerating Transformers with Nonlinear Low-Rank Branches | Ethan Smith et.al. | 2603.06492 | null |
| 2026-03-06 | Computer vision-based estimation of invertebrate biomass | Mikko Impiö et.al. | 2603.06362 | null |
| 2026-03-06 | MLLMRec-R1: Incentivizing Reasoning Capability in Large Language Models for Multimodal Sequential Recommendation | Yu Wang et.al. | 2603.06243 | null |
| 2026-03-06 | AnyCamVLA: Zero-Shot Camera Adaptation for Viewpoint Robust Vision-Language-Action Models | Hyeongjun Heo et.al. | 2603.05868 | null |
| 2026-03-09 | SAIL: Similarity-Aware Guidance and Inter-Caption Augmentation-based Learning for Weakly-Supervised Dense Video Captioning | Ye-Chan Kim et.al. | 2603.05437 | null |
| 2026-03-05 | CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection | Zhaonian Kuang et.al. | 2603.05042 | null |
| 2026-03-05 | Why Is RLHF Alignment Shallow? A Gradient Analysis | Robin Young et.al. | 2603.04851 | null |
| 2026-03-05 | Revisiting Shape from Polarization in the Era of Vision Foundation Models | Chenhao Li et.al. | 2603.04817 | null |
| 2026-03-10 | Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling | Yong Liu et.al. | 2603.04791 | null |
| 2026-03-05 | Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement | Brian Jing Hong Nge et.al. | 2603.04698 | null |
| 2026-03-04 | Structure-Guided Histopathology Synthesis via Dual-LoRA Diffusion | Xuan Xu et.al. | 2603.04565 | null |
| 2026-03-04 | Balancing Fidelity, Utility, and Privacy in Synthetic Cardiac MRI Generation: A Comparative Study | Madhura Edirisooriya et.al. | 2603.04340 | null |
| 2026-03-04 | A Multi-Fidelity Parametric Framework for Reduced-Order Modeling using Optimal Transport-based Interpolation: Applications to Diffused-Interface Two-Phase Flows | Moaad Khamlich et.al. | 2603.04232 | null |
| 2026-03-04 | Mask-Guided Attention Regulation for Anatomically Consistent Counterfactual CXR Synthesis | Zichun Zhang et.al. | 2603.04130 | null |
| 2026-03-16 | QD-PCQA: Quality-Aware Domain Adaptation for Point Cloud Quality Assessment | Guohua Zhang et.al. | 2603.03726 | null |
| 2026-03-03 | An Effective Data Augmentation Method by Asking Questions about Scene Text Images | Xu Yao et.al. | 2603.03580 | null |
| 2026-03-05 | AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis | Pei Yang et.al. | 2603.03378 | null |
| 2026-03-03 | Joint Training Across Multiple Activation Sparsity Regimes | Haotian Wang et.al. | 2603.03131 | null |
| 2026-03-03 | Single Microphone Own Voice Detection based on Simulated Transfer Functions for Hearing Aids | Mathuranathan Mayuravaani et.al. | 2603.02724 | null |
| 2026-03-03 | Maximizing Generalization: The Effect of Different Augmentation Techniques on Lightweight Vision Transformer for Bengali Character Classification | Rafi Hassan Chowdhury et.al. | 2603.02591 | null |
| 2026-03-02 | Symbol-Equivariant Recurrent Reasoning Models | Richard Freinschlag et.al. | 2603.02193 | null |
| 2026-03-02 | CTForensics: A Comprehensive Dataset and Method for AI-Generated CT Image Detection | Yiheng Li et.al. | 2603.01878 | null |
| 2026-03-02 | Investigating Group Relative Policy Optimization for Diffusion Transformer based Text-to-Audio Generation | Yi Gu et.al. | 2603.01565 | null |
| 2026-03-16 | Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing | Zijin Yin et.al. | 2603.01535 | null |
| 2026-03-02 | Conversational Speech Naturalness Predictor | Anfeng Xu et.al. | 2603.01467 | null |
| 2026-03-05 | Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards | Seungwook Kim et.al. | 2603.00918 | null |
| 2026-02-28 | Revisiting the machine-learning density functional for the one-dimensional Hubbard model with random external potential | Octavio D. R. Salmon et.al. | 2603.00802 | null |
| 2026-02-28 | TGM-VLA: Task-Guided Mixup for Sampling-Efficient and Robust Robotic Manipulation | Fanqi Pu et.al. | 2603.00615 | null |
| 2026-02-28 | LangGap: Diagnosing and Closing the Language Gap in Vision-Language-Action Models | Yuchen Hou et.al. | 2603.00592 | null |
| 2026-02-27 | Synthetic Priors | Nick Polson et.al. | 2603.00347 | null |
| 2026-02-27 | NAU-QMUL: Utilizing BERT and CLIP for Multi-modal AI-Generated Image Detection | Xiaoyu Guo et.al. | 2602.23863 | null |
| 2026-02-27 | BRIDGE the Gap: Mitigating Bias Amplification in Automated Scoring of English Language Learners via Inter-group Data Augmentation | Yun Wang et.al. | 2602.23580 | null |
| 2026-02-26 | Towards Better RL Training Data Utilization via Second-Order Rollout | Zhe Yang et.al. | 2602.22765 | null |
| 2026-02-26 | TabDLM: Free-Form Tabular Data Generation via Joint Numerical-Language Diffusion | Donghong Cai et.al. | 2602.22586 | null |
| 2026-02-26 | DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation | Zhechao Wang et.al. | 2602.22549 | null |
| 2026-02-24 | Computing a Characteristic Orientation for Rotation-Independent Image Analysis | Cristian Valero-Abundio et.al. | 2602.20930 | null |
| 2026-02-24 | Federated Learning for Cross-Modality Medical Image Segmentation via Augmentation-Driven Generalization | Sachin Dudda Nagaraju et.al. | 2602.20773 | null |
| 2026-02-23 | Shape-informed cardiac mechanics surrogates in data-scarce regimes via geometric encoding and generative augmentation | Davide Carrara et.al. | 2602.20306 | null |
| 2026-02-23 | The Sim-to-Real Gap in MRS Quantification: A Systematic Deep Learning Validation for GABA | Zien Ma et.al. | 2602.20289 | null |
| 2026-03-02 | Adaptive Data Augmentation with Multi-armed Bandit: Sample-Efficient Embedding Calibration for Implicit Pattern Recognition | Minxue Tang et.al. | 2602.19385 | null |
| 2026-02-22 | PerSoMed: A Large-Scale Balanced Dataset for Persian Social Media Text Classification | Isun Chehreh et.al. | 2602.19333 | null |
| 2026-02-22 | RetinaVision: XAI-Driven Augmented Regulation for Precise Retinal Disease Classification using deep learning framework | Mohammad Tahmid Noor et.al. | 2602.19324 | null |
| 2026-02-22 | Controlled Face Manipulation and Synthesis for Data Augmentation | Joris Kirchner et.al. | 2602.19219 | null |
| 2026-02-21 | YOLOv10-Based Multi-Task Framework for Hand Localization and Laterality Classification in Surgical Videos | Kedi Sun et.al. | 2602.18959 | null |
| 2026-02-19 | MARS: Margin-Aware Reward-Modeling with Self-Refinement | Payel Bhattacharjee et.al. | 2602.17658 | null |
| 2026-02-19 | Reverso: Efficient Time Series Foundation Models for Zero-shot Forecasting | Xinghong Fu et.al. | 2602.17634 | null |
| 2026-02-19 | RPDR: A Round-trip Prediction-Based Data Augmentation Framework for Long-Tail Question Answering | Yiming Zhang et.al. | 2602.17366 | null |
| 2026-02-18 | Learning to unfold cloth: Scaling up world models to deformable object manipulation | Jack Rome et.al. | 2602.16675 | null |
| 2026-02-18 | Label-Consistent Data Generation for Aspect-Based Sentiment Analysis Using LLM Agents | Mohammad H. A. Monfared et.al. | 2602.16379 | null |
| 2026-02-18 | Spatial Audio Question Answering and Reasoning on Dynamic Source Movements | Arvind Krishna Sridhar et.al. | 2602.16334 | null |
| 2026-02-18 | Peeking Ahead of the Field Study: Exploring VLM Personas as Support Tools for Embodied Studies in HCI | Xinyue Gui et.al. | 2602.16157 | null |
| 2026-02-03 | IT-OSE: Exploring Optimal Sample Size for Industrial Data Augmentation | Mingchun Sun et.al. | 2602.15878 | null |
| 2026-02-17 | RaCo: Ranking and Covariance for Practical Learned Keypoints | Abhiram Shenoi et.al. | 2602.15755 | null |
| 2026-02-22 | Refine Now, Query Fast: A Decoupled Refinement Paradigm for Implicit Neural Fields | Tianyu Xiong et.al. | 2602.15155 | null |
| 2026-02-16 | Hidden Markov Individual-level Models of Infectious Disease Transmission | Dirk Douwes-Schultz et.al. | 2602.15007 | null |
| 2026-02-16 | Data Augmentation for Pathological Speech Enhancement | Mingchi Hou et.al. | 2602.14671 | null |
| 2026-02-16 | Breaking Data Efficiency Dilemma: A Federated and Augmented Learning Framework For Alzheimer’s Disease Detection via Speech | Xiao Wei et.al. | 2602.14655 | null |
| 2026-02-23 | MedVAR: Towards Scalable and Efficient Medical Image Generation via Next-scale Autoregressive Prediction | Zhicheng He et.al. | 2602.14512 | null |
| 2026-02-16 | The geometry of invariant learning: an information-theoretic analysis of data augmentation and generalization | Abdelali Bouyahia et.al. | 2602.14423 | null |
| 2026-02-16 | A Generative AI Approach for Reducing Skin Tone Bias in Skin Cancer Classification | Areez Muhammed Shabu et.al. | 2602.14356 | null |
| 2026-02-18 | Bridging the Urban Divide: Adaptive Cross-City Learning for Disaster Sentiment Understanding | Zihui Ma et.al. | 2602.14352 | null |
| 2026-02-15 | RoboAug: One Annotation to Hundreds of Scenes via Region-Contrastive Data Augmentation for Robotic Manipulation | Xinhua Wang et.al. | 2602.14032 | null |
| 2026-02-14 | Synthetic Dataset Generation and Validation for Robotic Surgery Instrument Segmentation | Giorgio Chiesa et.al. | 2602.13844 | null |
| 2026-02-13 | Backdooring Bias in Large Language Models | Anudeep Das et.al. | 2602.13427 | null |
| 2026-02-04 | Deep Learning CNN for Pneumonia Detection: Advancing Digital Health in Society 5.0 | Hadi Almohab et.al. | 2602.13270 | null |
| 2026-02-13 | Data Augmentation and Attention for massive MIMO-based Indoor Localization in Changing Environments | Luisa Schuhmacher et.al. | 2602.12954 | null |
| 2026-02-13 | Beyond Benchmarks of IUGC: Rethinking Requirements of Deep Learning Methods for Intrapartum Ultrasound Biometry from Fetal Ultrasound Videos | Jieyun Bai et.al. | 2602.12922 | null |
| 2026-02-13 | Robustness of Object Detection of Autonomous Vehicles in Adverse Weather Conditions | Fox Pettersen et.al. | 2602.12902 | null |
| 2026-02-12 | GAN-based data augmentation for rare and exotic hadron searches in Pb–Pb collisions in ALICE | Anisa Khatun et.al. | 2602.12088 | null |
| 2026-02-12 | CSEval: A Framework for Evaluating Clinical Semantics in Text-to-Image Generation | Robert Cronshaw et.al. | 2602.12004 | null |
| 2026-02-11 | LCIP: Loss-Controlled Inverse Projection of High-Dimensional Image Data | Yu Wang et.al. | 2602.11141 | null |
| 2026-02-11 | Healthy Harvests: A Comparative Look at Guava Disease Classification Using InceptionV3 | Samanta Ghosh et.al. | 2602.10967 | null |
| 2026-02-11 | AugVLA-3D: Depth-Driven Feature Augmentation for Vision-Language-Action Models | Zhifeng Rao et.al. | 2602.10698 | null |
| 2026-02-11 | Enhancing Weakly Supervised Multimodal Video Anomaly Detection through Text Guidance | Shengyang Sun et.al. | 2602.10549 | null |
| 2026-02-11 | LakeMLB: Data Lake Machine Learning Benchmark | Feiyu Pan et.al. | 2602.10441 | null |
| 2026-02-10 | MalMoE: Mixture-of-Experts Enhanced Encrypted Malicious Traffic Detection Under Graph Drift | Yunpeng Tan et.al. | 2602.10157 | null |
| 2026-02-09 | MPA: Multimodal Prototype Augmentation for Few-Shot Learning | Liwen Wu et.al. | 2602.10143 | null |
| 2026-02-10 | DexImit: Learning Bimanual Dexterous Manipulation from Monocular Human Videos | Juncheng Mu et.al. | 2602.10105 | null |
| 2026-02-10 | Context-Aware Counterfactual Data Augmentation for Gender Bias Mitigation in Language Models | Shweta Parihar et.al. | 2602.09590 | null |
| 2026-02-26 | HLGFA: High-Low Resolution Guided Feature Alignment for Unsupervised Anomaly Detection | Han Zhou et.al. | 2602.09524 | null |
| 2026-02-10 | Empowering Contrastive Federated Sequential Recommendation with LLMs | Thi Minh Chau Nguyen et.al. | 2602.09306 | null |
| 2026-02-09 | One RNG to Rule Them All: How Randomness Becomes an Attack Vector in Machine Learning | Kotekar Annapoorna Prabhu et.al. | 2602.09182 | null |
| 2026-02-04 | The SJTU X-LANCE Lab System for MSR Challenge 2025 | Jinxuan Zhu et.al. | 2602.09042 | null |
| 2026-02-09 | SynSacc: A Blender-to-V2E Pipeline for Synthetic Neuromorphic Eye-Movement Data and Sim-to-Real Spiking Model Training | Khadija Iddrisu et.al. | 2602.08726 | null |
| 2026-02-09 | Chamelion: Reliable Change Detection for Long-Term LiDAR Mapping in Transient Environments | Seoyeon Jang et.al. | 2602.08189 | null |
| 2026-02-08 | Enhancing Bandit Algorithms with LLMs for Time-varying User Preferences in Streaming Recommendations | Chenglei Shen et.al. | 2602.08067 | null |
| 2026-02-08 | DINO-Mix: Distilling Foundational Knowledge with Cross-Domain CutMix for Semi-supervised Class-imbalanced Medical Image Segmentation | Xinyu Liu et.al. | 2602.07819 | null |
| 2026-02-07 | ComPass: Contrastive Learning for Automated Patch Correctness Assessment in Program Repair | Quanjun Zhang et.al. | 2602.07561 | null |
| 2026-02-07 | Fine-Grained Cat Breed Recognition with Global Context Vision Transformer | Mowmita Parvin Hera et.al. | 2602.07534 | null |
| 2026-02-07 | Pull Requests as a Training Signal for Repo-Level Code Editing | Qinglin Zhu et.al. | 2602.07457 | null |
| 2026-02-07 | Echoes in the Loop: Diagnosing Risks in LLM-Powered Recommender Systems under Feedback Loops | Donguk Park et.al. | 2602.07442 | null |
| 2026-02-06 | Sequences as Nodes for Contrastive Multimodal Graph Recommendation | Bucher Sahyouni et.al. | 2602.07208 | null |
| 2026-02-06 | Calibrating Generative AI to Produce Realistic Essays for Data Augmentation | Edward W. Wolfe et.al. | 2602.06772 | null |
| 2026-02-06 | Diffeomorphism-Equivariant Neural Networks | Josephine Elisabeth Oettinger et.al. | 2602.06695 | null |
| 2026-02-06 | AlertBERT: A noise-robust alert grouping framework for simultaneous cyber attacks | Lukas Karner et.al. | 2602.06534 | null |
| 2026-02-05 | InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions | Sirui Xu et.al. | 2602.06035 | null |
| 2026-02-05 | Mapper-GIN: Lightweight Structural Graph Abstraction for Corrupted 3D Point Cloud Classification | Jeongbin You et.al. | 2602.05522 | null |
| 2026-02-05 | Balanced Anomaly-guided Ego-graph Diffusion Model for Inductive Graph Anomaly Detection | Chunyu Wei et.al. | 2602.05232 | null |
| 2026-02-04 | Fast Compute via MC Boosting | Sarah Polson et.al. | 2602.05032 | null |
| 2026-02-04 | Smart Diagnosis and Early Intervention in PCOS: A Deep Learning Approach to Women’s Reproductive Health | Shayan Abrar et.al. | 2602.04944 | null |
| 2026-02-04 | Speaker-Aware Simulation Improves Conversational Speech Recognition | Máté Gedeon et.al. | 2602.04776 | null |
| 2026-02-04 | Turbulence teaches equivariance to neural networks | Ryley McConkey et.al. | 2602.04695 | null |
| 2026-02-04 | LatentTune: Efficient Tuning of High Dimensional Database Parameters via Latent Representation Learning | Sein Kwon et.al. | 2602.04190 | null |
| 2026-02-03 | SEIS: Subspace-based Equivariance and Invariance Scores for Neural Representations | Huahua Lin et.al. | 2602.04054 | null |
| 2026-02-03 | Quasi-multimodal-based pathophysiological feature learning for retinal disease diagnosis | Lu Zhang et.al. | 2602.03622 | null |
| 2026-02-03 | Cut to the Mix: Simple Data Augmentation Outperforms Elaborate Ones in Limited Organ Segmentation Datasets | Chang Liu et.al. | 2602.03555 | null |
| 2026-02-03 | Invisible Clean-Label Backdoor Attacks for Generative Data Augmentation | Ting Xiang et.al. | 2602.03316 | null |
| 2026-02-03 | PQTNet: Pixel-wise Quantitative Thermography Neural Network for Estimating Defect Depth in Polylactic Acid Parts by Additive Manufacturing | Lei Deng et.al. | 2602.03314 | null |
| 2026-02-03 | Convolutional Neural Networks for classifying galaxy mergers: Can faint tidal features aid in classifying mergers? | Yeonkyung Lee et.al. | 2602.03312 | null |
| 2026-02-03 | Beyond Cropping and Rotation: Automated Evolution of Powerful Task-Specific Augmentations with Generative Models | Judah Goldfeder et.al. | 2602.03123 | null |
| 2026-02-03 | The High Cost of Data Augmentation for Learning Equivariant Models | Henri Klintebäck et.al. | 2602.03118 | null |
| 2026-02-03 | Structuring Value Representations via Geometric Coherence in Markov Decision Processes | Zuyuan Zhang et.al. | 2602.02978 | null |
| 2026-02-03 | Synthetic Data Augmentation for Medical Audio Classification: A Preliminary Evaluation | David McShannon et.al. | 2602.02955 | null |
| 2026-02-03 | 3D-Learning: Diffusion-Augmented Distributionally Robust Decision-Focused Learning | Jiaqi Wen et.al. | 2602.02943 | null |
| 2026-02-09 | Semantics-Aware Generative Latent Data Augmentation for Learning in Low-Resource Domains | Jaesung Bae et.al. | 2602.02841 | null |
| 2026-01-30 | Auto-Augmentation Contrastive Learning for Wearable-based Human Activity Recognition | Qingyu Wu et.al. | 2602.02542 | null |
| 2026-02-02 | HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos | Yinhuai Wang et.al. | 2602.02473 | null |
| 2026-02-02 | Transfer Learning Through Conditional Quantile Matching | Yikun Zhang et.al. | 2602.02358 | null |
| 2026-02-02 | Enhancing Generalization in Evolutionary Feature Construction for Symbolic Regression through Vicinal Jensen Gap Minimization | Hengzhe Zhang et.al. | 2602.01510 | null |
| 2026-02-01 | Understanding vision transformer robustness through the lens of out-of-distribution detection | Joey Kuang et.al. | 2602.01459 | null |
| 2026-02-01 | PedagoSense: A Pedology Grounded LLM System for Pedagogical Strategy Detection and Contextual Response Generation in Learning Dialogues | Shahem Sultan et.al. | 2602.01169 | null |
| 2026-02-01 | Key Principles of Graph Machine Learning: Representation, Robustness, and Generalization | Yassine Abbahaddou et.al. | 2602.01139 | null |
| 2026-02-03 | Data Augmentation for High-Fidelity Generation of CAR-T/NK Immunological Synapse Images | Xiang Zhang et.al. | 2602.00949 | null |
| 2026-01-31 | Safety-Efficacy Trade Off: Robustness against Data-Poisoning | Diego Granziol et.al. | 2602.00822 | null |
| 2026-01-27 | 1S-DAug: One-Shot Data Augmentation for Robust Few-Shot Generalization | Yunwei Bai et.al. | 2602.00114 | null |
| 2026-01-30 | Adaptive Edge Learning for Density-Aware Graph Generation | Seyedeh Ava Razi Razavi et.al. | 2601.23052 | null |
| 2026-01-30 | Improving Supervised Machine Learning Performance in Optical Quality Control via Generative AI for Dataset Expansion | Dennis Sprute et.al. | 2601.22961 | null |
| 2026-01-30 | WED-Net: A Weather-Effect Disentanglement Network with Causal Augmentation for Urban Flow Prediction | Qian Hong et.al. | 2601.22586 | null |
| 2026-01-30 | Cross-Domain Few-Shot Learning for Hyperspectral Image Classification Based on Mixup Foundation Model | Naeem Paeedeh et.al. | 2601.22581 | null |
| 2026-01-30 | CoDCL: Counterfactual Data Augmentation Contrastive Learning for Continuous-Time Dynamic Network Link Prediction | Hantong Feng et.al. | 2601.22427 | null |
| 2026-01-29 | Mechanistic Data Attribution: Tracing the Training Origins of Interpretable LLM Units | Jianhui Chen et.al. | 2601.21996 | null |
| 2026-01-29 | Localizing Speech Deepfakes Beyond Transitions via Segment-Aware Learning | Yuchen Mao et.al. | 2601.21925 | null |
| 2026-01-29 | Generative Design of Ship Propellers using Conditional Flow Matching | Patrick Kruger et.al. | 2601.21637 | null |
| 2026-01-29 | Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes | Yang Zhou et.al. | 2601.21551 | null |
| 2026-02-06 | inversedMixup: Data Augmentation via Inverting Mixed Embeddings | Fanshuang Kong et.al. | 2601.21543 | null |
| 2026-01-20 | BioNIC: Biologically Inspired Neural Network for Image Classification Using Connectomics Principles | Diya Prasanth et.al. | 2601.20876 | null |
| 2026-01-28 | Cross-Country Learning for National Infectious Disease Forecasting Using European Data | Zacharias Komodromos et.al. | 2601.20771 | null |
| 2026-01-28 | Replicating weak-lensing summary-statistic covariances with normalizing flows | Joaquin Armijo et.al. | 2601.20669 | null |
| 2026-01-28 | IoT Device Identification with Machine Learning: Common Pitfalls and Best Practices | Kahraman Kostas et.al. | 2601.20548 | null |
| 2026-01-28 | PalmBridge: A Plug-and-Play Feature Alignment Framework for Open-Set Palmprint Verification | Chenke Zhang et.al. | 2601.20351 | null |
| 2026-01-28 | Demonstration-Free Robotic Control via LLM Agents | Brian Y. Tsui et.al. | 2601.20334 | null |
| 2026-01-16 | oculomix: Hierarchical Sampling for Retinal-Based Systemic Disease Prediction | Hyunmin Kim et.al. | 2601.19939 | null |
| 2026-01-29 | Real-Time Pulsatile Flow Prediction for Realistic, Diverse Intracranial Aneurysm Morphologies using a Graph Transformer and Steady-Flow Data Augmentation | Yiying Sheng et.al. | 2601.19876 | null |
| 2026-01-27 | Grasynda: Graph-based Synthetic Time Series Generation | Luis Amorim et.al. | 2601.19668 | null |
| 2026-01-27 | Algorithmic Prompt-Augmentation for Efficient LLM-Based Heuristic Design for A* Search | Thomas Bömer et.al. | 2601.19622 | null |
| 2026-01-27 | DSTCS: Dual-Student Teacher Framework with Segment Anything Model for Semi-Supervised Pubic Symphysis Fetal Head Segmentation | Yalin Luo et.al. | 2601.19446 | null |
| 2026-01-27 | High-quality data augmentation for code comment classification | Thomas Borsani et.al. | 2601.19383 | null |
| 2026-01-27 | Binary Token-Level Classification with DeBERTa for All-Type MWE Identification: A Lightweight Approach with Linguistic Enhancement | Diego Rossini et.al. | 2601.19360 | null |
| 2026-01-27 | Implicit Non-Causal Factors are Out via Dataset Splitting for Domain Generalization Object Detection | Zhilong Zhang et.al. | 2601.19127 | null |
| 2026-01-27 | Leveraging Sentence-oriented Augmentation and Transformer-Based Architecture for Vietnamese-Bahnaric Translation | Tan Sang Nguyen et.al. | 2601.19124 | null |
| 2026-01-27 | Exploring Weaknesses in Function Call Models via Reinforcement Learning: An Adversarial Data Augmentation Approach | Weiran Guo et.al. | 2601.19122 | null |
| 2026-01-26 | OATS: Online Data Augmentation for Time Series Foundation Models | Junwei Deng et.al. | 2601.19040 | null |
| 2026-01-26 | ExoGS: A 4D Real-to-Sim-to-Real Framework for Scalable Manipulation Data Collection | Yiming Wang et.al. | 2601.18629 | null |
| 2026-01-26 | Generative Diffusion Augmentation with Quantum-Enhanced Discrimination for Medical Image Diagnosis | Jingsong Xia et.al. | 2601.18556 | null |
| 2026-01-26 | A Dataset for Automatic Vocal Mode Classification | Reemt Hinrichs et.al. | 2601.18339 | null |
| 2026-01-26 | Analytic Incremental Learning For Sound Source Localization With Imbalance Rectification | Zexia Fan et.al. | 2601.18335 | null |
| 2026-01-26 | Facial Emotion Recognition on FER-2013 using an EfficientNetB2-Based Approach | Sahil Naik et.al. | 2601.18228 | null |
| 2026-01-25 | MarketGANs: Multivariate financial time-series data augmentation using generative adversarial networks | Jeonggyu Huh et.al. | 2601.17773 | null |
| 2026-01-25 | Training-Free Text-to-Image Compositional Food Generation via Prompt Grafting | Xinyue Pan et.al. | 2601.17666 | null |
| 2026-01-24 | Stylizing ViT: Anatomy-Preserving Instance Style Transfer for Domain Generalization | Sebastian Doerrich et.al. | 2601.17586 | null |
| 2026-01-24 | SpatialMath: Spatial Comprehension-Infused Symbolic Reasoning for Mathematical Problem-Solving | Ashutosh Bajpai et.al. | 2601.17489 | null |
| 2026-01-23 | Semi-Supervised Domain Adaptation with Latent Diffusion for Pathology Image Classification | Tengyue Zhang et.al. | 2601.17228 | null |
| 2026-01-23 | Fully 3D Unrolled Magnetic Resonance Fingerprinting Reconstruction via Staged Pretraining and Implicit Gridding | Yonatan Urman et.al. | 2601.17143 | null |
| 2026-01-22 | A Computer Vision Pipeline for Iterative Bullet Hole Tracking in Rifle Zeroing | Robert M. Belcher et.al. | 2601.17062 | null |
| 2026-01-22 | Frequency-aware Adaptive Contrastive Learning for Sequential Recommendation | Zhikai Wang et.al. | 2601.17057 | null |
| 2026-01-20 | Arabic Sign Language Recognition using Multimodal Approach | Ghadeer Alanazi et.al. | 2601.17041 | null |
| 2026-01-23 | Latent Diffusion for Internet of Things Attack Data Generation in Intrusion Detection | Estela Sánchez-Carballo et.al. | 2601.16976 | null |
| 2026-01-23 | A Novel Transfer Learning Approach for Mental Stability Classification from Voice Signal | Rafiul Islam et.al. | 2601.16793 | null |
| 2026-01-22 | Synthetic Augmentation in Imbalanced Learning: When It Helps, When It Hurts, and How Much to Add | Zhengchi Ma et.al. | 2601.16120 | null |
| 2026-01-22 | synthocr-gen: A synthetic ocr dataset generator for low-resource languages- breaking the data barrier | Haq Nawaz Malik et.al. | 2601.16113 | null |
| 2026-01-23 | Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model | Chenghao Fan et.al. | 2601.15892 | null |
| 2026-01-22 | Beyond Off-the-Shelf Models: A Lightweight and Accessible Machine Learning Pipeline for Ecologists Working with Image Data | Clare Chemery et.al. | 2601.15813 | null |
| 2026-01-22 | Diffusion Model-Based Data Augmentation for Enhanced Neuron Segmentation | Liuyun Jiang et.al. | 2601.15779 | null |
| 2026-01-22 | Materealize: a multi-agent deliberation system for end-to-end material design and synthesis | Seongmin Kim et.al. | 2601.15743 | null |
| 2026-01-21 | AI-Based Culvert-Sewer Inspection | Christina Thrainer et.al. | 2601.15366 | null |
| 2026-01-21 | Synthetic Data Augmentation for Multi-Task Chinese Porcelain Classification: A Stable Diffusion Approach | Ziyao Ling et.al. | 2601.14791 | null |
| 2026-01-21 | Context Patch Fusion With Class Token Enhancement for Weakly Supervised Semantic Segmentation | Yiyang Fu et.al. | 2601.14718 | null |
| 2026-01-21 | Counterfactual Modeling with Fine-Tuned LLMs for Health Intervention Design and Sensor Data Augmentation | Shovito Barua Soumma et.al. | 2601.14590 | null |
| 2026-01-20 | Attention-Based Offline Reinforcement Learning and Clustering for Interpretable Sepsis Treatment | Punit Kumar et.al. | 2601.14228 | null |
| 2026-01-21 | RL-BioAug: Label-Efficient Reinforcement Learning for Self-Supervised EEG Representation Learning | Cheol-Hui Lee et.al. | 2601.13964 | null |
| 2026-01-20 | Towards Effective Negation Modeling in Joint Audio-Text Models for Music | Yannis Vasilakis et.al. | 2601.13931 | null |
| 2026-01-20 | Inverting Self-Organizing Maps: A Unified Activation-Based Framework | Alessandro Londei et.al. | 2601.13851 | null |
| 2026-01-19 | Discrete-Time Optimal Control of Species Augmentation for Predator-Prey Model | Munkaila Dasumani et.al. | 2601.13394 | null |
| 2026-01-30 | Diffusion-Driven Synthetic Tabular Data Generation for Enhanced DoS/DDoS Attack Classification | Aravind B et.al. | 2601.13197 | null |
| 2026-01-19 | NeuroShield: A Neuro-Symbolic Framework for Adversarial Robustness | Ali Shafiee Sarvestani et.al. | 2601.13162 | null |
| 2026-01-19 | Adaptive Speaker Embedding Self-Augmentation for Personal Voice Activity Detection with Short Enrollment Speech | Fuyuan Feng et.al. | 2601.12769 | null |
| 2026-01-18 | Single-index Semiparametric Transformation Cure Models with Interval-censored Data | Xiaoru Huang et.al. | 2601.12370 | null |
| 2026-01-18 | Beyond Human Annotation: Recent Advances in Data Generation Methods for Document Intelligence | Dehao Ying et.al. | 2601.12318 | null |
| 2026-01-18 | An Innovative Framework for Breast Cancer Detection Using Pyramid Adaptive Atrous Convolution, Transformer Integration, and Multi-Scale Feature Fusion | Ehsan Sadeghi Pour et.al. | 2601.12249 | null |
| 2026-01-16 | Isotropy-Optimized Contrastive Learning for Semantic Course Recommendation | Ali Khreis et.al. | 2601.11427 | null |
| 2026-01-16 | How DDAIR you? Disambiguated Data Augmentation for Intent Recognition | Galo Castillo-López et.al. | 2601.11234 | null |
| 2026-01-16 | Tail-Aware Data Augmentation for Long-Tail Sequential Recommendation | Yizhou Dang et.al. | 2601.10933 | null |
| 2026-01-15 | A Unified 3D Object Perception Framework for Real-Time Outside-In Multi-Camera Systems | Yizhou Wang et.al. | 2601.10819 | null |
| 2026-01-15 | Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models | Zirui Ren et.al. | 2601.10679 | null |
| 2026-01-15 | History Is Not Enough: An Adaptive Dataflow System for Financial Time-Series Synthesis | Haochong Xia et.al. | 2601.10143 | null |
| 2026-01-14 | Explainable Deep Learning for Pediatric Pneumonia Detection in Chest X-Ray Images | Adil O. Khadidos et.al. | 2601.09814 | null |
| 2026-01-30 | WiFo-M $^2$ : Empower Wireless Communications With Plug-and-Play Environment Sensing via Foundation Model | Haotian Zhang et.al. | 2601.09179 | null |
| 2026-01-14 | From Snow to Rain: Evaluating Robustness, Calibration, and Complexity of Model-Based Robust Training | Josué Martínez-Martínez et.al. | 2601.09153 | null |
| 2026-01-14 | Enhancing Imbalanced Electrocardiogram Classification: A Novel Approach Integrating Data Augmentation through Wavelet Transform and Interclass Fusion | Haijian Shao et.al. | 2601.09103 | null |
| 2026-01-09 | Bias Detection and Rotation-Robustness Mitigation in Vision-Language Models and Generative Image Models | Tarannum Mithila et.al. | 2601.08860 | null |
| 2026-01-13 | Get away with less: Need of source side data curation to build parallel corpus for low resource Machine Translation | Saumitra Yadav et.al. | 2601.08629 | null |
| 2026-01-13 | REVNET: Rotation-Equivariant Point Cloud Completion via Vector Neuron Anchor Transformer | Zhifan Ni et.al. | 2601.08558 | null |
| 2026-01-13 | Hybrid Distillation with CoT Guidance for Edge-Drone Control Code Generation | Yizhan Feng et.al. | 2601.08412 | null |
| 2026-01-13 | VGG Induced Deep Hand Sign Language Detection | Subham Sharma et.al. | 2601.08262 | null |
| 2026-01-13 | Towards Cross-Platform Generalization: Domain Adaptive 3D Detection with Augmentation and Pseudo-Labeling | Xiyan Feng et.al. | 2601.08174 | null |
| 2026-01-13 | PathoGen: Diffusion-Based Synthesis of Realistic Lesions in Histopathology Images | Mohamad Koohi-Moghadam et.al. | 2601.08127 | null |
| 2026-01-12 | Bayesian nonparametric models for zero-inflated count-compositional data using ensembles of regression trees | André F. B. Menezes et.al. | 2601.08067 | null |
| 2026-01-12 | AdaField: Generalizable Surface Pressure Modeling with Physics-Informed Pre-training and Flow-Conditioned Adaptation | Junhong Zou et.al. | 2601.07139 | null |
| 2026-01-11 | Paraphrasing Adversarial Attack on LLM-as-a-Reviewer | Masahiro Kaneko et.al. | 2601.06884 | null |
| 2026-01-04 | AIS-CycleGen: A CycleGAN-Based Framework for High-Fidelity Synthetic AIS Data Generation and Augmentation | SM Ashfaq uz Zaman et.al. | 2601.06127 | null |
| 2026-01-09 | Cedalion Tutorial: A Python-based framework for comprehensive analysis of multimodal fNIRS & DOT from the lab to the everyday world | E. Middell et.al. | 2601.05923 | null |
| 2026-01-09 | Data Augmented Pipeline for Legal Information Extraction and Reasoning | Nguyen Minh Phuong et.al. | 2601.05609 | null |
| 2026-01-09 | Learn to Evolve: Self-supervised Neural JKO Operator for Wasserstein Gradient Flow | Xue Feng et.al. | 2601.05583 | null |
| 2026-01-09 | Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection | Hanyi Wang et.al. | 2601.05580 | null |
| 2026-01-09 | LEAPS: An LLM-Empowered Adaptive Plugin for Taobao AI Search | Lei Wang et.al. | 2601.05513 | null |
| 2026-01-08 | FlowLet: Conditional 3D Brain MRI Synthesis using Wavelet Flow Matching | Danilo Danese et.al. | 2601.05212 | null |
| 2026-01-08 | SimuAgent: An LLM-Based Simulink Modeling Assistant Enhanced with Reinforcement Learning | Yanchang Liang et.al. | 2601.05187 | null |
| 2026-01-08 | Approximate equivariance via projection-based regularisation | Torben Berndt et.al. | 2601.05028 | null |
| 2026-01-08 | A new method for augmenting short time series, with application to pain events in sickle cell disease | Kumar Utkarsh et.al. | 2601.04538 | null |
| 2026-01-20 | Causal Data Augmentation for Robust Fine-Tuning of Tabular Foundation Models | Magnus Bühler et.al. | 2601.04110 | null |
| 2026-01-07 | Investigation into respiratory sound classification for an imbalanced data set using hybrid LSTM-KAN architectures | Nithinkumar K. et.al. | 2601.03610 | null |
| 2026-01-07 | Artificial Intelligence and Skills: Evidence from Contrastive Learning in Online Job Vacancies | Hangyu Chen et.al. | 2601.03558 | null |
| 2026-01-07 | Persona-aware and Explainable Bikeability Assessment: A Vision-Language Model Approach | Yilong Dai et.al. | 2601.03534 | null |
| 2026-01-10 | Improving Indigenous Language Machine Translation with Synthetic Data and Language-Specific Preprocessing | Aashish Dhawan et.al. | 2601.03135 | null |
| 2026-01-06 | ToxiGAN: Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation | Peiran Li et.al. | 2601.03121 | null |
| 2026-01-06 | Enhancing Multilingual RAG Systems with Debiased Language Preference-Guided Query Fusion | Jeonghyun Park et.al. | 2601.02956 | null |
| 2026-01-13 | Training Language Models with homotokens Leads to Delayed Overfitting | Adrian Cosma et.al. | 2601.02867 | null |
| 2026-01-06 | Adversarial Question Answering Robustness: A Multi-Level Error Analysis and Mitigation Study | Agniv Roy Choudhury et.al. | 2601.02700 | null |
| 2026-01-05 | API: Empowering Generalizable Real-World Image Dehazing via Adaptive Patch Importance Learning | Chen Zhu et.al. | 2601.01992 | null |
| 2026-01-05 | Thinking with Blueprints: Assisting Vision-Language Models in Spatial Reasoning via Structured Object Representation | Weijian Ma et.al. | 2601.01984 | null |
| 2026-01-05 | Theoretical Convergence of SMOTE-Generated Samples | Firuz Kamalov et.al. | 2601.01927 | null |
| 2026-01-05 | AlignDrive: Aligned Lateral-Longitudinal Planning for End-to-End Autonomous Driving | Yanhao Wu et.al. | 2601.01762 | null |
| 2026-01-04 | FALCON: Few-Shot Adversarial Learning for Cross-Domain Medical Image Segmentation | Abdur R. Fayjie et.al. | 2601.01687 | null |
| 2026-01-04 | DiffKD-DCIS: Predicting Upgrade of Ductal Carcinoma In Situ with Diffusion Augmentation and Knowledge Distillation | Tao Li et.al. | 2601.01507 | null |
| 2026-01-04 | DeepInv: A Novel Self-supervised Learning Approach for Fast and Accurate Diffusion Inversion | Ziyue Zhang et.al. | 2601.01487 | null |
| 2026-01-04 | iFlip: Iterative Feedback-driven Counterfactual Example Refinement | Yilong Wang et.al. | 2601.01446 | null |
| 2026-01-04 | In defense of the two-stage framework for open-set domain adaptive semantic segmentation | Wenqi Ren et.al. | 2601.01439 | null |
| 2026-01-03 | DST-Calib: A Dual-Path, Self-Supervised, Target-Free LiDAR-Camera Extrinsic Calibration Network | Zhiwei Huang et.al. | 2601.01188 | null |
| 2026-01-03 | Comparative Evaluation of VAE, GAN, and SMOTE for Tor Detection in Encrypted Network Traffic | Saravanan A et.al. | 2601.01183 | null |
| 2026-01-03 | 600k-ks-ocr: a large-scale synthetic dataset for optical character recognition in kashmiri script | Haq Nawaz Malik et.al. | 2601.01088 | null |
| 2026-01-03 | Enhanced Leukemic Cell Classification Using Attention-Based CNN and Data Augmentation | Douglas Costa Braga et.al. | 2601.01026 | null |
| 2026-01-02 | A Deep Learning Approach for Automated Skin Lesion Diagnosis with Explainable AI | Md. Maksudul Haque et.al. | 2601.00964 | null |
| 2026-01-01 | Four-Stage Alzheimer’s Disease Classification from MRI Using Topological Feature Extraction, Feature Selection, and Ensemble Learning | Faisal Ahmed et.al. | 2601.00918 | null |
| 2025-12-25 | ShrimpXNet: A Transfer Learning Framework for Shrimp Disease Classification with Augmented Regularization, Adversarial Training, and Explainable AI | Israk Hasan Jone et.al. | 2601.00832 | null |
| 2026-01-08 | RoboReward: General-Purpose Vision-Language Reward Models for Robotics | Tony Lee et.al. | 2601.00675 | null |
| 2026-01-01 | Detecting Spike Wave Discharges (SWD) using 1-dimensional Residual UNet | Saurav Sengupta et.al. | 2601.00459 | null |
| 2026-01-01 | ReMA: A Training-Free Plug-and-Play Mixing Augmentation for Video Behavior Recognition | Feng-Qi Cui et.al. | 2601.00311 | null |
| 2026-01-01 | Towards Automated Differential Diagnosis of Skin Diseases Using Deep Learning and Imbalance-Aware Strategies | Ali Anaissi et.al. | 2601.00286 | null |
| 2026-01-01 | Parallel Universes, Parallel Languages: A Comprehensive Study on LLM-based Multilingual Counterfactual Example Generation | Qianli Wang et.al. | 2601.00263 | null |
| 2026-01-01 | Application Research of a Deep Learning Model Integrating CycleGAN and YOLO in PCB Infrared Defect Detection | Chao Yang et.al. | 2601.00237 | null |
| 2025-12-31 | MUSIC: MUlti-Step Instruction Contrast for Multi-Turn Reward Models | Wenzhe Li et.al. | 2512.24693 | null |
| 2025-12-30 | Comparing Approaches to Automatic Summarization in Less-Resourced Languages | Chester Palen-Michel et.al. | 2512.24410 | null |
| 2025-12-30 | One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training | Jia Yu et.al. | 2512.24278 | null |
| 2025-12-30 | Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes | Shuyun Wang et.al. | 2512.24227 | null |
| 2026-01-13 | Enhanced Web Payload Classification Using WAMM: An AI-Based Framework for Dataset Refinement and Model Evaluation | Heba Osama et.al. | 2512.23610 | null |
| 2025-12-29 | Detection Fire in Camera RGB-NIR | Nguyen Truong Khai et.al. | 2512.23594 | null |
| 2025-12-29 | GeoTeacher: Geometry-Guided Semi-Supervised 3D Object Detection | Jingyu Li et.al. | 2512.23147 | null |
| 2025-12-31 | A Context-Aware Temporal Modeling through Unified Multi-Scale Temporal Encoding and Hierarchical Sequence Learning for Single-Channel EEG Sleep Staging | Amirali Vakili et.al. | 2512.22976 | null |
| 2025-12-28 | Robust LLM-based Column Type Annotation via Prompt Augmentation with LoRA Tuning | Hanze Meng et.al. | 2512.22742 | null |
| 2025-12-28 | Data Augmentation for Classification of Negative Pregnancy Outcomes in Imbalanced Data | Md Badsha Biswas et.al. | 2512.22732 | null |
| 2025-12-20 | SAMM2D: Scale-Aware Multi-Modal 2D Dual-Encoder for High-Sensitivity Intracrania Aneurysm Screening | Antara Titikhsha et.al. | 2512.22185 | null |
| 2025-12-26 | High-Fidelity and Long-Duration Human Image Animation with Diffusion Transformer | Shen Zheng et.al. | 2512.21905 | null |
| 2025-12-26 | Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning | YuXiang Kong et.al. | 2512.21828 | null |
| 2025-12-25 | AVP-Fusion: Adaptive Multi-Modal Fusion and Contrastive Learning for Two-Stage Antiviral Peptide Identification | Xinru Wen et.al. | 2512.21544 | null |
| 2025-12-25 | Intelligent recognition of GPR road hidden defect images based on feature fusion and attention mechanism | Haotian Lv et.al. | 2512.21452 | null |
| 2025-12-24 | Granular-ball Guided Masking: Structure-aware Data Augmentation | Shuyin Xia et.al. | 2512.21011 | null |
| 2025-12-23 | Convergence analysis of data augmentation algorithms in Bayesian lasso models with log-concave likelihoods | Jingkai Cui et.al. | 2512.20041 | null |
| 2025-12-23 | GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators | Jiacheng Guo et.al. | 2512.19682 | null |
| 2025-12-22 | No Data? No Problem: Robust Vision-Tabular Learning with Missing Values | Marta Hasny et.al. | 2512.19602 | null |
| 2025-12-22 | srvar-toolkit: A Python Implementation of Shadow-Rate Vector Autoregressions with Stochastic Volatility | Charles Shaw et.al. | 2512.19589 | null |
| 2025-12-22 | BabyFlow: 3D modeling of realistic and expressive infant faces | Antonia Alomar et.al. | 2512.19560 | null |
| 2025-12-22 | GANeXt: A Fully ConvNeXt-Enhanced Generative Adversarial Network for MRI- and CBCT-to-CT Synthesis | Siyuan Mei et.al. | 2512.19336 | null |
| 2025-12-22 | IndoorUAV: Benchmarking Vision-Language UAV Navigation in Continuous Indoor Environments | Xu Liu et.al. | 2512.19024 | null |
| 2025-12-22 | DTCCL: Disengagement-Triggered Contrastive Continual Learning for Autonomous Bus Planners | Yanding Yang et.al. | 2512.18988 | null |
| 2025-12-20 | Generalization Gaps in Political Fake News Detection: An Empirical Study on the LIAR Dataset | S Mahmudul Hasan et.al. | 2512.18533 | null |
| 2025-12-10 | Continual Learning for Acoustic Event Classification | Yang Xiao et.al. | 2512.17932 | null |
| 2026-01-09 | SCOPE: Sequential Causal Optimization of Process Interventions | Jakob De Moor et.al. | 2512.17629 | null |
| 2025-12-19 | SkinGenBench: Generative Model and Preprocessing Effects for Synthetic Dermoscopic Augmentation in Melanoma Diagnosis | N. A. Adarsh Pritam et.al. | 2512.17585 | null |
| 2025-12-19 | SCAR: Semantic Cardiac Adversarial Representation via Spatiotemporal Manifold Optimization in ECG | Shunbo Jia et.al. | 2512.17423 | null |
| 2025-12-18 | Data Augmentation Supporting a Conversational Agent Designed for Smoking Cessation Support Groups | Salar Hashemitaheri et.al. | 2512.17092 | null |
| 2025-12-18 | Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation | Min-Jung Kim et.al. | 2512.17040 | null |
| 2025-12-23 | Do Generalized-Gamma Scale Mixtures of Normals Fit Large Image Datasets? | Brandon Marks et.al. | 2512.17038 | null |
| 2025-12-18 | Exploration of Augmentation Strategies in Multi-modal Retrieval-Augmented Generation for the Biomedical Domain: A Case Study Evaluating Question Answering in Glycobiology | Primož Kocbek et.al. | 2512.16802 | null |
| 2025-12-13 | Two-Step Data Augmentation for Masked Face Detection and Recognition: Turning Fake Masks to Real | Yan Yang et.al. | 2512.15774 | null |
| 2025-12-12 | Data-Chain Backdoor: Do You Trust Diffusion Models as Generative Data Supplier? | Junchi Lu et.al. | 2512.15769 | null |
| 2025-12-19 | Stylized Synthetic Augmentation further improves Corruption Robustness | Georg Siedel et.al. | 2512.15675 | null |
| 2025-12-17 | BEAT2AASIST model with layer fusion for ESDD 2026 Challenge | Sanghyeok Chung et.al. | 2512.15180 | null |
| 2025-12-16 | Bayesian Latent Class Regression and Variable Selection with Applications to Sleep Patterns Data | Matthew Heaney et.al. | 2512.14903 | null |
| 2025-12-15 | Revisiting the Reliability of Language Models in Instruction-Following | Jianshuo Dong et.al. | 2512.14754 | null |
| 2025-12-16 | CHIP: Adaptive Compliance for Humanoid Control through Hindsight Perturbation | Sirui Chen et.al. | 2512.14689 | null |
| 2025-12-16 | Robust Training of Singing Voice Synthesis Using Prior and Posterior Uncertainty | Yiwen Zhao et.al. | 2512.14653 | null |
| 2025-12-16 | Attention-Based Preprocessing Framework for Improving Rare Transient Classification | Xinyue Sheng et.al. | 2512.14644 | null |
| 2025-12-18 | Synthetic Electrogram Generation with Variational Autoencoders for ECGI | Miriam Gutiérrez-Fernández et.al. | 2512.14537 | null |
| 2025-12-16 | Mimicking Human Visual Development for Learning Robust Image Representations | Ankita Raj et.al. | 2512.14360 | null |
| 2025-12-16 | 4D-RaDiff: Latent Diffusion for 4D Radar Point Cloud Generation | Jimmie Kwok et.al. | 2512.14235 | null |
| 2025-12-16 | AsarRec: Adaptive Sequential Augmentation for Robust Self-supervised Sequential Recommendation | Kaike Zhang et.al. | 2512.14047 | null |
| 2025-12-15 | Comparative Evaluation of Embedding Representations for Financial News Sentiment Analysis | Joyjit Roy et.al. | 2512.13749 | null |
| 2025-12-15 | Advancing Machine Learning Optimization of Chiral Photonic Metasurface: Comparative Study of Neural Network and Genetic Algorithm Approaches | Davide Filippozzi et.al. | 2512.13656 | null |
| 2026-01-05 | Test-Time Modification: Inverse Domain Transformation for Robust Perception | Arpit Jadon et.al. | 2512.13454 | null |
| 2025-12-15 | Measurement of Material Volume Fractions in a Microwave Resonant Cavity Sensor Using Convolutional Neural Network | Mojtaba Joodaki et.al. | 2512.13233 | null |
| 2025-12-15 | Comprehensive Deployment-Oriented Assessment for Cross-Environment Generalization in Deep Learning-Based mmWave Radar Sensing | Tomoya Tanaka et.al. | 2512.13018 | null |
| 2025-12-15 | BLADE: A Behavior-Level Data Augmentation Framework with Dual Fusion Modeling for Multi-Behavior Sequential Recommendation | Yupeng Li et.al. | 2512.12964 | null |
| 2025-12-14 | Adapting Multimodal Foundation Models for Few-Shot Learning: A Comprehensive Study on Contrastive Captioners | N. K. B. M. P. K. B. Narasinghe et.al. | 2512.12824 | null |
| 2025-12-14 | Personalized QoE Prediction: A Demographic-Augmented Machine Learning Framework for 5G Video Streaming Networks | Syeda Zunaira Ahmed et.al. | 2512.12736 | null |
| 2025-12-14 | Supervised Contrastive Frame Aggregation for Video Representation Learning | Shaif Chowdhury et.al. | 2512.12549 | null |
| 2025-12-14 | Generative Spatiotemporal Data Augmentation | Jinfan Zhou et.al. | 2512.12508 | null |
| 2025-12-12 | Towards Channel-Robust and Receiver-Independent Radio Frequency Fingerprint Identification | Jie Ma et.al. | 2512.12070 | null |
| 2025-12-07 | Pseudo-Label Refinement for Robust Wheat Head Segmentation via Two-Stage Hybrid Training | Jiahao Jiang et.al. | 2512.11874 | null |
| 2025-12-12 | Depth-Copy-Paste: Multimodal and Depth-Aware Compositing for Robust Face Detection | Qiushi Guo et.al. | 2512.11683 | null |
| 2025-12-12 | Kinetic Mining in Context: Few-Shot Action Synthesis via Text-to-Motion Distillation | Luca Cazzola et.al. | 2512.11654 | null |
| 2025-12-12 | DREAM-B3P: Dual-Stream Transformer Network Enhanced by Feedback Diffusion Model for Blood-Brain Barrier Penetrating Peptide Prediction | Kaijie Wang et.al. | 2512.11511 | null |
| 2025-12-12 | Towards Logic-Aware Manipulation: A Knowledge Primitive for VLM-Based Assistants in Smart Manufacturing | Suchang Chen et.al. | 2512.11275 | null |
| 2025-12-15 | Template-Free Retrosynthesis with Graph-Prior Augmented Transformers | Youjun Zhao et.al. | 2512.10770 | null |
| 2025-12-12 | Textual Data Bias Detection and Mitigation – An Extensible Pipeline with Experimental Evaluation | Rebekka Görge et.al. | 2512.10734 | null |
| 2025-12-11 | A Conditional Generative Framework for Synthetic Data Augmentation in Segmenting Thin and Elongated Structures in Biological Images | Yi Liu et.al. | 2512.10334 | null |
| 2025-12-11 | CIEGAD: Cluster-Conditioned Interpolative and Extrapolative Framework for Geometry-Aware and Domain-Aligned Data Augmentation | Keito Inoshita et.al. | 2512.10178 | null |
| 2025-12-10 | Knowledge Graph Enrichment and Reasoning for Nobel Laureates | Thanh-Lam T. Nguyen et.al. | 2512.09707 | null |
| 2025-12-10 | Hands-on Evaluation of Visual Transformers for Object Recognition and Detection | Dimitrios N. Vlachogiannis et.al. | 2512.09579 | null |
| 2025-12-09 | Protein Secondary Structure Prediction Using Transformers | Manzi Kevin Maxime et.al. | 2512.08613 | null |
| 2025-12-09 | LLM-based Vulnerable Code Augmentation: Generate or Refactor? | Dyna Soumhane Ouchebara et.al. | 2512.08493 | null |
| 2025-12-09 | FastBEV++: Fast by Algorithm, Deployable by Design | Yuanpeng Chen et.al. | 2512.08237 | null |
| 2025-12-08 | Coherent Audio-Visual Editing via Conditional Audio Generation Following Video Edits | Masato Ishii et.al. | 2512.07209 | null |
| 2025-12-07 | OXtal: An All-Atom Diffusion Model for Organic Crystal Structure Prediction | Emily Jin et.al. | 2512.06987 | null |
| 2025-12-07 | RMAdapter: Reconstruction-based Multi-Modal Adapter for Vision-Language Models | Xiang Lin et.al. | 2512.06811 | null |
| 2025-12-07 | Stitch and Tell: A Structured Multimodal Data Augmentation Method for Spatial Understanding | Hang Yin et.al. | 2512.06769 | null |
| 2025-12-07 | XM-ALIGN: Unified Cross-Modal Embedding Alignment for Face-Voice Association | Zhihua Fang et.al. | 2512.06757 | null |
| 2025-12-20 | Monotone data augmentation algorithm for longitudinal continuous, binary and ordinal outcomes: a unifying approach | Yongqiang Tang et.al. | 2512.06621 | null |
| 2025-12-12 | Less Is More for Multi-Step Logical Reasoning of LLM Generalisation Under Rule Removal, Paraphrasing, and Compression | Qiming Bao et.al. | 2512.06393 | null |
| 2025-12-06 | DaGRPO: Rectifying Gradient Conflict in Reasoning via Distinctiveness-Aware Group Relative Policy Optimization | Xuan Xie et.al. | 2512.06337 | null |
| 2025-12-10 | Stronger is not better: Better Augmentations in Contrastive Learning for Medical Image Segmentation | Azeez Idris et.al. | 2512.05992 | null |
| 2025-12-05 | LeAD-M3D: Leveraging Asymmetric Distillation for Real-time Monocular 3D Detection | Johannes Meier et.al. | 2512.05663 | null |
| 2025-12-05 | Matching Ranks Over Probability Yields Truly Deep Safety Alignment | Jason Vega et.al. | 2512.05518 | null |
| 2025-12-04 | The Erosion of LLM Signatures: Can We Still Distinguish Human and LLM-Generated Scientific Ideas After Iterative Paraphrasing? | Sadat Shahriar et.al. | 2512.05311 | null |
| 2025-12-04 | Uncertainty-Aware Data-Efficient AI: An Information-Theoretic Perspective | Osvaldo Simeone et.al. | 2512.05267 | null |
| 2025-12-04 | Multi-Loss Learning for Speech Emotion Recognition with Energy-Adaptive Mixup and Frame-Level Attention | Cong Wang et.al. | 2512.04551 | null |
| 2025-12-04 | EvoEdit: Lifelong Free-Text Knowledge Editing through Latent Perturbation Augmentation and Knowledge-driven Parameter Fusion | Pengfei Cao et.al. | 2512.04545 | null |
| 2025-12-03 | Text-Only Training for Image Captioning with Retrieval Augmentation and Modality Gap Correction | Rui Fonseca et.al. | 2512.04309 | null |
| 2025-12-03 | Studying Various Activation Functions and Non-IID Data for Machine Learning Model Robustness | Long Dang et.al. | 2512.04264 | null |
| 2025-12-03 | SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows | Qinyu Zhao et.al. | 2512.04084 | null |
| 2025-12-03 | SELF: A Robust Singular Value and Eigenvalue Approach for LLM Fingerprinting | Hanxiu Zhang et.al. | 2512.03620 | null |
| 2025-12-03 | MAGE-ID: A Multimodal Generative Framework for Intrusion Detection Systems | Mahdi Arab Loodaricheh et.al. | 2512.03375 | null |
| 2025-12-02 | OmniPerson: Unified Identity-Preserving Pedestrian Generation | Changxiao Ma et.al. | 2512.02554 | null |
| 2025-12-02 | VibOmni: Towards Scalable Bone-conduction Speech Enhancement on Earables | Lixing He et.al. | 2512.02515 | null |
| 2025-12-02 | VACoT: Rethinking Visual Data Augmentation with VLMs | Zhengzhuo Xu et.al. | 2512.02361 | null |
| 2025-12-02 | Training Dynamics of Learning 3D-Rotational Equivariance | Max W. Shen et.al. | 2512.02303 | null |
| 2025-12-01 | Feature Selection Empowered BERT for Detection of Hate Speech with Vocabulary Augmentation | Pritish N. Desai et.al. | 2512.02141 | null |
| 2025-12-01 | StyleYourSmile: Cross-Domain Face Retargeting Without Paired Multi-Style Data | Avirup Dey et.al. | 2512.01895 | null |
| 2025-12-01 | GRASP: Guided Residual Adapters with Sample-wise Partitioning | Felix Nützel et.al. | 2512.01675 | null |
| 2025-12-01 | Neural Networks for Predicting Permeability Tensors of 2D Porous Media: Comparison of Convolution- and Transformer-based Architectures | Sigurd Vargdal et.al. | 2512.01517 | null |
| 2025-12-01 | ChronosObserver: Taming 4D World with Hyperspace Diffusion Sampling | Qisen Wang et.al. | 2512.01481 | null |
| 2025-12-01 | $\mathbf{M^3A}$ Policy: Mutable Material Manipulation Augmentation Policy through Photometric Re-rendering | Jiayi Li et.al. | 2512.01446 | null |
| 2025-12-01 | MEGConformer: Conformer-Based MEG Decoder for Robust Speech and Phoneme Classification | Xabier de Zuazo et.al. | 2512.01443 | null |
| 2025-12-01 | Teaching by Failure: Counter-Example-Driven Curricula for Transformer Self-Improvement | Harshil Vejendla et.al. | 2512.01187 | null |
| 2025-11-30 | MS-PPO: Morphological-Symmetry-Equivariant Policy for Legged Robot Locomotion | Sizhe Wei et.al. | 2512.00727 | null |
| 2025-11-30 | Graph Data Augmentation with Contrastive Learning on Covariate Distribution Shift | Fanlong Zeng et.al. | 2512.00716 | null |
| 2025-12-03 | XAI-Driven Skin Disease Classification: Leveraging GANs to Augment ResNet-50 Performance | Kim Gerard A. Villanueva et.al. | 2512.00626 | null |
| 2025-11-29 | Explainable Multi-Modal Deep Learning for Automatic Detection of Lung Diseases from Respiratory Audio Signals | S M Asiful Islam Saky et.al. | 2512.00563 | null |
| 2025-11-28 | SD-CGAN: Conditional Sinkhorn Divergence GAN for DDoS Anomaly Detection in IoT Networks | Henry Onyeka et.al. | 2512.00251 | null |
| 2025-11-28 | Mesh Augmentation of LoRaWAN-based IoT Networks | Ram Ramanathan et.al. | 2512.00161 | null |
| 2025-11-28 | Local and Global Context-and-Object-part-Aware Superpixel-based Data Augmentation for Deep Visual Recognition | Fadi Dornaika et.al. | 2512.00130 | null |
| 2025-12-16 | ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts | Hang Yu et.al. | 2511.23442 | null |
| 2025-11-28 | Robust In-the-Wild Exercise Recognition from a Single Wearable: Data-Side Fusion, Sensor Rotation, and Feature Engineering | Hoang Khang Phan et.al. | 2511.23173 | null |
| 2025-11-28 | A General Bayesian Nonparametric Approach for Estimating Population-Level and Conditional Causal Effects | Yongseok Hur et.al. | 2511.23085 | null |
| 2025-11-28 | RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video | Haiyang Mei et.al. | 2511.22950 | null |
| 2025-11-27 | Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield | Dongyang Liu et.al. | 2511.22677 | null |
| 2025-11-27 | Selecting User Histories to Generate LLM Users for Cold-Start Item Recommendation | Nachiket Subbaraman et.al. | 2511.21989 | null |
| 2025-11-26 | Deep Learning Architectures for Code-Modulated Visual Evoked Potentials Detection | Kiran Nair et.al. | 2511.21940 | null |
| 2025-11-26 | A Comparative Study of LLM Prompting and Fine-Tuning for Cross-genre Authorship Attribution on Chinese Lyrics | Yuxin Li et.al. | 2511.21930 | null |
| 2025-11-26 | Advancing Marine Bioacoustics with Deep Generative Models: A Hybrid Augmentation Strategy for Southern Resident Killer Whale Detection | Bruno Padovese et.al. | 2511.21872 | null |
| 2025-11-26 | Revolutionizing Glioma Segmentation & Grading Using 3D MRI - Guided Hybrid Deep Learning Models | Pandiyaraju V et.al. | 2511.21673 | null |
| 2025-11-29 | Deep Learning-Based Multiclass Classification of Oral Lesions with Stratified Augmentation | Joy Naoum et.al. | 2511.21582 | null |
| 2025-11-26 | Shift-Equivariant Complex-Valued Convolutional Neural Networks | Quentin Gabot et.al. | 2511.21250 | null |
| 2025-11-26 | Dynamic Stratified Contrastive Learning with Upstream Augmentation for MILP Branching | Tongkai Lu et.al. | 2511.21107 | null |
| 2025-11-26 | A Probabilistic Framework for Temporal Distribution Generalization in Industry-Scale Recommender Systems | Yuxuan Zhu et.al. | 2511.21032 | null |
| 2025-11-26 | FANoise: Singular Value-Adaptive Noise Modulation for Robust Multimodal Representation Learning | Jiaoyang Li et.al. | 2511.20997 | null |
| 2025-11-25 | Winning with Less for Low Resource Languages: Advantage of Cross-Lingual English_Persian Argument Mining Model over LLM Augmentation | Ali Jahan et.al. | 2511.20872 | null |
| 2025-11-25 | Bridging the Language Gap: Synthetic Voice Diversity via Latent Mixup for Equitable Speech Recognition | Wesley Bian et.al. | 2511.20534 | null |
| 2025-11-25 | Data Augmentation Techniques to Reverse-Engineer Neural Network Weights from Input-Output Queries | Alexander Beiser et.al. | 2511.20312 | null |
| 2025-11-25 | Robust 3D Brain MRI Inpainting with Random Masking Augmentation | Juexin Zhang et.al. | 2511.20202 | null |
| 2025-11-25 | SEDA: A Self-Adapted Entity-Centric Data Augmentation for Boosting Gird-based Discontinuous NER Models | Wen-Fang Su et.al. | 2511.20143 | null |
| 2025-11-25 | BERT-APC: A Reference-free Framework for Automatic Pitch Correction via Musical Context Inference | Sungjae Kim et.al. | 2511.20006 | null |
| 2025-11-25 | Rethinking Semi-Supervised Node Classification with Self-Supervised Graph Clustering | Songbo Wang et.al. | 2511.19976 | null |
| 2025-11-26 | TiCT: A Synthetically Pre-Trained Foundation Model for Time Series Classification | Chin-Chia Michael Yeh et.al. | 2511.19694 | null |
| 2025-11-24 | Blinking Beyond EAR: A Stable Eyelid Angle Metric for Driver Drowsiness Detection and Data Augmentation | Mathis Wolter et.al. | 2511.19519 | null |
| 2025-11-22 | A Multi-Stage Deep Learning Framework with PKCP-MixUp Augmentation for Pediatric Liver Tumor Diagnosis Using Multi-Phase Contrast-Enhanced CT | Wanqi Wang et.al. | 2511.19478 | null |
| 2025-11-24 | BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation | Rachit Saluja et.al. | 2511.19394 | null |
| 2025-11-24 | Tiny-TSM: Efficiently Training a Lightweight SOTA Time Series Foundation Model | Felix Birkel et.al. | 2511.19272 | null |
| 2025-11-24 | Experimental insights into data augmentation techniques for deep learning-based multimode fiber imaging: limitations and success | Jawaria Maqbool et.al. | 2511.19072 | null |
| 2025-11-24 | Skeletons Matter: Dynamic Data Augmentation for Text-to-Query | Yuchen Ji et.al. | 2511.18934 | null |
| 2025-11-24 | Multidimensional Music Aesthetic Evaluation via Semantically Consistent C-Mixup Augmentation | Shuyang Liu et.al. | 2511.18869 | null |
| 2025-11-24 | Higgs Production Classifier using Weak Supervision | Kai-Feng Chen et.al. | 2511.18726 | null |
| 2025-11-24 | Data Augmentation Strategies for Robust Lane Marking Detection | Flora Lian et.al. | 2511.18668 | null |
| 2025-11-23 | Re(Visiting) Time Series Foundation Models in Finance | Eghbal Rahimikia et.al. | 2511.18578 | null |
| 2025-11-23 | Stro-VIGRU: Defining the Vision Recurrent-Based Baseline Model for Brain Stroke Classification | Subhajeet Das et.al. | 2511.18316 | null |
| 2025-11-23 | MultiDiffNet: A Multi-Objective Diffusion Framework for Generalizable Brain Decoding | Mengchun Zhang et.al. | 2511.18294 | null |
| 2025-11-22 | Enhancing Large Language Models for Automated Homework Assessment in Undergraduate Circuit Analysis | Liangliang Chen et.al. | 2511.18221 | null |
| 2025-11-22 | Generating Synthetic Human Blastocyst Images for In-Vitro Fertilization Blastocyst Grading | Pavan Narahari et.al. | 2511.18204 | null |
| 2025-11-22 | LocaGen: Low-Overhead Indoor Localization Through Spatial Augmentation | Abdelrahman Abdelmotlb et.al. | 2511.18158 | null |
| 2025-11-22 | Diverse Instance Generation via Diffusion Models for Enhanced Few-Shot Object Detection in Remote Sensing Images | Yanxing Liu et.al. | 2511.18031 | null |
| 2025-11-21 | Group Equivariant Convolutional Networks for Pathloss Estimation | Ziyue Yang et.al. | 2511.17841 | null |
| 2025-11-21 | Addressing A Posteriori Performance Degradation in Neural Network Subgrid Stress Models | Andy Wu et.al. | 2511.17475 | null |
| 2025-11-21 | ATAC: Augmentation-Based Test-Time Adversarial Correction for CLIP | Linxiang Su et.al. | 2511.17362 | null |
| 2025-11-20 | Motion Transfer-Enhanced StyleGAN for Generating Diverse Macaque Facial Expressions | Takuya Igaue et.al. | 2511.16711 | null |
| 2025-11-20 | Boosting Predictive Performance on Tabular Data through Data Augmentation with Latent-Space Flow-Based Diffusion | Md. Tawfique Ihsan et.al. | 2511.16571 | null |
| 2025-11-20 | Contrastive vision-language learning with paraphrasing and negation | Kwun Ho Ngan et.al. | 2511.16527 | null |
| 2025-11-20 | Physics-Informed Machine Learning for Efficient Sim-to-Real Data Augmentation in Micro-Object Pose Estimation | Zongcai Tan et.al. | 2511.16494 | null |
| 2025-11-25 | Prediction of atomic H adsorption energies in metalloid doped MSSe (M = Mo/W) Janus layers: A combined DFT and machine learning study | G. Tejaswini et.al. | 2511.16263 | null |
| 2025-11-20 | LLMs-based Augmentation for Domain Adaptation in Long-tailed Food Datasets | Qing Wang et.al. | 2511.16037 | null |
| 2025-11-26 | KRAL: Knowledge and Reasoning Augmented Learning for LLM-assisted Clinical Antimicrobial Therapy | Zhe Li et.al. | 2511.15974 | null |
| 2025-11-19 | Learning from Imperfect Labels: A Physics-Aware Neural Operator with Application to DAS Data Denoising | Yang Cui et.al. | 2511.15638 | null |
| 2025-11-19 | A Hybrid CNN-ViT-GNN Framework with GAN-Based Augmentation for Intelligent Weed Detection in Precision Agriculture | Pandiyaraju V et.al. | 2511.15535 | null |
| 2025-11-19 | Advancing Identification method of Gamma-Ray Bursts with Data and Feature Enhancement | Peng Zhang et.al. | 2511.15470 | null |
| 2025-11-20 | Selective Mixup for Debiasing Question Selection in Computerized Adaptive Testing | Mi Tian et.al. | 2511.15241 | null |
| 2025-11-19 | Data-driven Prediction of Species-Specific Plant Responses to Spectral-Shifting Films from Leaf Phenotypic and Photosynthetic Traits | Jun Hyeun Kang et.al. | 2511.15173 | null |
| 2025-11-19 | Deep Learning Assisted Prediction of Electrochemical Lithiation State in Spinel Lithium Titanium Oxide Thin Films | Devin Chugh et.al. | 2511.15109 | null |
| 2025-11-18 | Structured Contrastive Learning for Interpretable Latent Representations | Zhengyang Shen et.al. | 2511.14920 | null |
| 2025-11-18 | Tell Me: An LLM-powered Mental Well-being Assistant with RAG, Synthetic Dialogue Generation, and Agentic Planning | Trishala Jayesh Ahalpara et.al. | 2511.14445 | null |
| 2025-11-18 | Continuous Vision-Language-Action Co-Learning with Semantic-Physical Alignment for Behavioral Cloning | Xiuxiu Qi et.al. | 2511.14396 | null |
| 2025-11-18 | H-LDM: Hierarchical Latent Diffusion Models for Controllable and Interpretable PCG Synthesis from Clinical Metadata | Chenyang Xu et.al. | 2511.14312 | null |
| 2025-11-18 | A Comprehensive Study of Implicit and Explicit Biases in Large Language Models | Fatima Kazi et.al. | 2511.14153 | null |
| 2025-11-17 | Segment Anything Across Shots: A Method and Benchmark | Hengrui Hu et.al. | 2511.13715 | null |
| 2025-11-17 | MMWSTM-ADRAN+: A Novel Hybrid Deep Learning Architecture for Enhanced Climate Time Series Forecasting and Extreme Event Prediction | Shaheen Mohammed Saleh Ahmed et.al. | 2511.13419 | null |
| 2025-11-17 | A Lightweight 3D Anomaly Detection Method with Rotationally Invariant Features | Hanzhe Liang et.al. | 2511.13115 | null |
| 2025-11-17 | Learning from the Undesirable: Robust Adaptation of Language Models without Forgetting | Yunhun Nam et.al. | 2511.13052 | null |
| 2025-11-17 | Medal S: Spatio-Textual Prompt Model for Medical Segmentation | Pengcheng Shi et.al. | 2511.13001 | null |
| 2025-11-17 | CalibrateMix: Guided-Mixup Calibration of Image Semi-Supervised Models | Mehrab Mustafy Rahman et.al. | 2511.12964 | null |
| 2025-11-16 | Evaluating Autoformalization Robustness via Semantically Similar Paraphrasing | Hayden Moore et.al. | 2511.12784 | null |
| 2025-11-16 | Mitigating Length Bias in RLHF through a Causal Lens | Hyeonji Kim et.al. | 2511.12573 | null |
| 2025-11-16 | HiGFA: Hierarchical Guidance for Fine-grained Data Augmentation with Diffusion Models | Zhiguang Lu et.al. | 2511.12547 | null |
| 2025-11-16 | Designing-with More-than-Human Through Human Augmentation | Botao ‘Amber’ Hu et.al. | 2511.12533 | null |
| 2025-11-16 | Task-Aware Retrieval Augmentation for Dynamic Recommendation | Zhen Tao et.al. | 2511.12495 | null |
| 2025-11-15 | Leveraging Quantum-Based Architectures for Robust Diagnostics | Shabnam Sodagari et.al. | 2511.12386 | null |
| 2025-11-15 | Rethinking Bias in Generative Data Augmentation for Medical AI: a Frequency Recalibration Method | Chi Liu et.al. | 2511.12301 | null |
| 2025-11-15 | Understanding InfoNCE: Transition Probability Matrix Induced Feature Clustering | Ge Cheng et.al. | 2511.12180 | null |
| 2025-11-15 | FIA-Edit: Frequency-Interactive Attention for Efficient and High-Fidelity Inversion-Free Text-Guided Image Editing | Kaixiang Yang et.al. | 2511.12151 | null |
| 2025-11-15 | Breaking the Modality Wall: Time-step Mixup for Efficient Spiking Knowledge Transfer from Static to Event Domain | Yuqi Xie et.al. | 2511.12150 | null |
| 2025-11-15 | Did Models Sufficient Learn? Attribution-Guided Training via Subset-Selected Counterfactual Augmentation | Yannan Chen et.al. | 2511.12100 | null |
| 2025-11-15 | Treatment Stitching with Schrödinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment Strategies | Dong-Hee Shin et.al. | 2511.12075 | null |
| 2025-11-15 | Informed Bootstrap Augmentation Improves EEG Decoding | Woojae Jeong et.al. | 2511.12073 | null |
| 2025-11-14 | Augmenting The Weather: A Hybrid Counterfactual-SMOTE Algorithm for Improving Crop Growth Prediction When Climate Changes | Mohammed Temraz et.al. | 2511.11945 | null |
| 2025-11-14 | CLUE: Controllable Latent space of Unprompted Embeddings for Diversity Management in Text-to-Image Synthesis | Keunwoo Park et.al. | 2511.10993 | null |
| 2025-11-14 | Automated Analysis of Learning Outcomes and Exam Questions Based on Bloom’s Taxonomy | Ramya Kumar et.al. | 2511.10903 | null |
| 2025-11-12 | Graph Neural Field with Spatial-Correlation Augmentation for HRTF Personalization | De Hu et.al. | 2511.10697 | null |
| 2025-11-13 | Panda: Test-Time Adaptation with Negative Data Augmentation | Ruxi Deng et.al. | 2511.10481 | null |
| 2025-11-13 | Causal Model-Based Reinforcement Learning for Sample-Efficient IoT Channel Access | Aswin Arun et.al. | 2511.10291 | null |
| 2025-11-13 | MTP: Exploring Multimodal Urban Traffic Profiling with Modality Augmentation and Spectrum Fusion | Haolong Xiang et.al. | 2511.10218 | null |
| 2025-11-14 | Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL | Qifeng Cai et.al. | 2511.10192 | null |
| 2025-11-13 | ELYADATA & LIA at NADI 2025: ASR and ADI Subtasks | Haroun Elleuch et.al. | 2511.10090 | null |
| 2025-11-13 | Opinion: Towards Unified Expressive Policy Optimization for Robust Robot Learning | Haidong Huang et.al. | 2511.10087 | null |
| 2025-11-13 | Learning phase diversity for solving ill-posed inverse problems in imaging | Jasleen Birdi et.al. | 2511.09952 | null |
| 2025-11-13 | A Study on Enhancing the Generalization Ability of Visuomotor Policies via Data Augmentation | Hanwen Wang et.al. | 2511.09932 | null |
| 2025-11-13 | Simulating Distribution Dynamics: Liquid Temporal Feature Evolution for Single-Domain Generalized Object Detection | Zihao Zhang et.al. | 2511.09909 | null |
| 2025-11-12 | PANDA - Patch And Distribution-Aware Augmentation for Long-Tailed Exemplar-Free Continual Learning | Siddeshwar Raghavan et.al. | 2511.09791 | null |
| 2025-11-12 | LLM-Guided Dynamic-UMAP for Personalized Federated Graph Learning | Sai Puppala et.al. | 2511.09438 | null |
| 2025-11-12 | AuthSig: Safeguarding Scanned Signatures Against Unauthorized Reuse in Paperless Workflows | RuiQiang Zhang et.al. | 2511.08967 | null |
| 2025-11-11 | 3D-TDA – Topological feature extraction from 3D images for Alzheimer’s disease classification | Faisal Ahmed et.al. | 2511.08663 | null |
| 2025-11-14 | Bot Meets Shortcut: How Can LLMs Aid in Handling Unknown Invariance OOD Scenarios? | Shiyan Zheng et.al. | 2511.08455 | null |
| 2025-11-12 | SASG-DA: Sparse-Aware Semantic-Guided Diffusion Augmentation For Myoelectric Gesture Recognition | Chen Liu et.al. | 2511.08344 | null |
| 2025-11-11 | Learning Omnidirectional Locomotion for a Salamander-Like Quadruped Robot | Zhiang Liu et.al. | 2511.08299 | null |
| 2025-11-11 | Forgetting Alternation and Blossoms: A New Framework for Fast Matching Augmentation and Its Applications to Sequential/Distributed/Streaming Computation | Taisuke Izumi et.al. | 2511.08210 | null |
| 2025-11-11 | I2E: Real-Time Image-to-Event Conversion for High-Performance Spiking Neural Networks | Ruichen Ma et.al. | 2511.08065 | null |
| 2025-11-11 | From LLMs to Agents: A Comparative Evaluation of LLMs and LLM-based Agents in Security Patch Detection | Junxiao Han et.al. | 2511.08060 | null |
| 2025-11-11 | Computational Blueprints: Generating Isomorphic Mathematics Problems with Large Language Models | Jeong-Hoon Kim et.al. | 2511.07932 | null |
| 2025-11-11 | IBMA: An Imputation-Based Mixup Augmentation Using Self-Supervised Learning for Time Series Data | Dang Nha Nguyen et.al. | 2511.07930 | null |
| 2025-11-10 | ACE-ICD: Acronym Expansion As Data Augmentation For Automated ICD Coding | Tuan-Dung Le et.al. | 2511.07311 | null |
| 2025-11-10 | Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation | Yuxuan Zhou et.al. | 2511.07051 | null |
| 2025-11-10 | Evaluating LLMs for Anxiety, Depression, and Stress Detection Evaluating Large Language Models for Anxiety, Depression, and Stress Detection: Insights into Prompting Strategies and Synthetic Data | Mihael Arcan et.al. | 2511.07044 | null |
| 2025-11-10 | Hybrid Autoencoders for Tabular Data: Leveraging Model-Based Augmentation in Low-Label Settings | Erel Naor et.al. | 2511.06961 | null |
| 2025-11-10 | GNN-Enabled Robust Hybrid Beamforming with Score-Based CSI Generation and Denoising | Yuhang Li et.al. | 2511.06663 | null |
| 2025-11-10 | On the Potential of Digital Twins for Distribution System State Estimation with Randomly Missing Data in Heterogeneous Measurements | Ying Zhang et.al. | 2511.06583 | null |
| 2025-11-09 | Adaptive PID Control for Robotic Systems via Hierarchical Meta-Learning and Reinforcement Learning with Physics-Based Data Augmentation | JiaHao Wu et.al. | 2511.06500 | null |
| 2025-11-09 | LLM-Driven Completeness and Consistency Evaluation for Cultural Heritage Data Augmentation in Cross-Modal Retrieval | Jian Zhang et.al. | 2511.06268 | null |
| 2025-11-09 | Analyzing and Mitigating Negation Artifacts using Data Augmentation for Improving ELECTRA-Small Model Accuracy | Mojtaba Noghabaei et.al. | 2511.06234 | null |
| 2025-11-08 | Reperio-rPPG: Relational Temporal Graph Neural Networks for Periodicity Learning in Remote Physiological Measurement | Ba-Thinh Nguyen et.al. | 2511.05946 | null |
| 2025-11-07 | Persian Musical Instruments Classification Using Polyphonic Data Augmentation | Diba Hadi Esfangereh et.al. | 2511.05717 | null |
| 2025-11-07 | Robust Neural Audio Fingerprinting using Music Foundation Models | Shubhr Singh et.al. | 2511.05399 | null |
| 2025-11-07 | Entropy-Rank Ratio: A Novel Entropy-Based Perspective for DNA Complexity and Classification | Emmanuel Pio Pastore et.al. | 2511.05300 | null |
| 2025-11-07 | Embedding-Space Data Augmentation to Prevent Membership Inference Attacks in Clinical Time Series Forecasting | Marius Fracarolli et.al. | 2511.05289 | null |
| 2025-11-07 | Less Is More: Generating Time Series with LLaMA-Style Autoregression in Simple Factorized Latent Spaces | Siyuan Li et.al. | 2511.04973 | null |
| 2025-11-06 | PromptSep: Generative Audio Separation via Multimodal Prompting | Yutong Wen et.al. | 2511.04623 | null |
| 2025-11-06 | Comparative Study of CNN Architectures for Binary Classification of Horses and Motorcycles in the VOC 2008 Dataset | Muhammad Annas Shaikh et.al. | 2511.04344 | null |
| 2025-11-06 | Black-Box Guardrail Reverse-engineering Attack | Hongwei Yao et.al. | 2511.04215 | null |
| 2025-11-06 | MedDChest: A Content-Aware Multimodal Foundational Vision Model for Thoracic Imaging | Mahmoud Soliman et.al. | 2511.04016 | null |
| 2025-11-05 | Desert Waste Detection and Classification Using Data-Based and Model-Based Enhanced YOLOv12 DL Model | Abdulmumin Sa’ad et.al. | 2511.03888 | null |
| 2025-11-05 | A Lightweight 3D-CNN for Event-Based Human Action Recognition with Privacy-Preserving Potential | Mehdi Sefidgar Dilmaghani et.al. | 2511.03665 | null |
| 2025-11-05 | Towards Transparent Stance Detection: A Zero-Shot Approach Using Implicit and Explicit Interpretability | Apoorva Upadhyaya et.al. | 2511.03635 | null |
| 2025-11-05 | The Bradley-Terry Stochastic Block Model | Lapo Santi et.al. | 2511.03467 | null |
| 2025-11-05 | Knowledge-Augmented Question Error Correction for Chinese Question Answer System with QuestionRAG | Longpeng Qiu et.al. | 2511.03410 | null |
| 2025-11-05 | Overcoming the Generalization Limits of SLM Finetuning for Shape-Based Extraction of Datatype and Object Properties | Célian Ringwald et.al. | 2511.03407 | null |
| 2025-11-05 | LFC-DA: Logical Formula-Controlled Data Augmentation for Enhanced Logical Reasoning | Shenghao Li et.al. | 2511.03372 | null |
| 2025-11-05 | Decoupling Augmentation Bias in Prompt Learning for Vision-Language Models | Gahyeon Kim et.al. | 2511.03367 | null |
| 2025-11-05 | An Augmentation Overlap Theory of Contrastive Learning | Qi Zhang et.al. | 2511.03114 | null |
| 2025-11-04 | Generative Hints | Andy Dimnaku et.al. | 2511.02933 | null |
| 2025-11-04 | IllumFlow: Illumination-Adaptive Low-Light Enhancement via Conditional Rectified Flow and Retinex Decomposition | Wenyang Wei et.al. | 2511.02411 | null |
| 2025-10-29 | An Experimental Comparison of Alternative Techniques for Event-Log Augmentation | Alessandro Padella et.al. | 2511.01896 | null |
| 2025-11-03 | DINO-MX: A Modular & Flexible Framework for Self-Supervised Learning | Mahmut Selman Gokmen et.al. | 2511.01610 | null |
| 2025-11-03 | Driving scenario generation and evaluation using a structured layer representation and foundational models | Arthur Hubert et.al. | 2511.01541 | null |
| 2025-11-03 | Difficulty-Controllable Cloze Question Distractor Generation | Seokhoon Kang et.al. | 2511.01526 | null |
| 2025-11-03 | Conditional Diffusion Model-Enabled Scenario-Specific Neural Receivers for Superimposed Pilot Schemes | Xingyu Zhou et.al. | 2511.01173 | null |
| 2025-11-02 | A Distributed Plug-and-Play MCMC Algorithm for High-Dimensional Inverse Problems | Maxime Bouton et.al. | 2511.00870 | null |
| 2025-11-07 | Med-Banana-50K: A Cross-modality Large-Scale Dataset for Text-guided Medical Image Editing | Zhihui Chen et.al. | 2511.00801 | null |
| 2025-11-01 | Exploring and Mitigating Gender Bias in Encoder-Based Transformer Models | Ariyan Hossain et.al. | 2511.00519 | null |
| 2025-11-04 | Simple and Behavior-Driven Augmentation for Recommendation with Rich Collaborative Signals | Doyun Choi et.al. | 2511.00436 | null |
| 2025-10-31 | Casing Collar Identification using AlexNet-based Neural Networks for Depth Measurement in Oil and Gas Wells | Siyu Xiao et.al. | 2511.00129 | null |
| 2025-10-26 | Mutual Information guided Visual Contrastive Learning | Hanyang Chen et.al. | 2511.00028 | null |
| 2025-10-31 | Effect of Domain Generalization Techniques in Low Resource Systems | Mahi Aminu et.al. | 2510.27512 | null |
| 2025-10-31 | FedSM: Robust Semantics-Guided Feature Mixup for Bias Reduction in Federated Learning with Long-Tail Data | Jingrui Zhang et.al. | 2510.27240 | null |
| 2025-10-31 | A Survey on Generative Recommendation: Data, Model, and Tasks | Min Hou et.al. | 2510.27157 | null |
| 2025-10-30 | Dataset Creation and Baseline Models for Sexism Detection in Hausa | Fatima Adam Muhammad et.al. | 2510.27038 | null |
| 2025-10-30 | SYNAPSE-Net: A Unified Framework with Lesion-Aware Hierarchical Gating for Robust Segmentation of Heterogeneous Brain Lesions | Md. Mehedi Hassan et.al. | 2510.26961 | null |
| 2025-10-31 | Offline Clustering of Preference Learning with Active-data Augmentation | Jingyuan Liu et.al. | 2510.26301 | null |
| 2025-10-29 | An Analysis of Causal Effect Estimation using Outcome Invariant Data Augmentation | Uzair Akbar et.al. | 2510.25128 | null |
| 2025-10-15 | Comparative Analysis of Data Augmentation for Clinical ECG Classification with STAR | Nader Nemati et.al. | 2510.24740 | null |
| 2025-10-28 | SPARTA: Evaluating Reasoning Segmentation Robustness through Black-Box Adversarial Paraphrasing in Text Autoencoder Latent Space | Viktoriia Zinkovich et.al. | 2510.24446 | null |
| 2025-10-28 | UtilGen: Utility-Centric Generative Data Augmentation with Dual-Level Task Adaptation | Jiyu Guo et.al. | 2510.24262 | null |
| 2025-10-27 | Learning Linearity in Audio Consistency Autoencoders via Implicit Regularization | Bernardo Torres et.al. | 2510.23530 | null |
| 2025-10-27 | DPGLA: Bridging the Gap between Synthetic and Real Data for Unsupervised Domain Adaptation in 3D LiDAR Semantic Segmentation | Wanmeng Li et.al. | 2510.23525 | null |
| 2025-10-27 | MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding | Xin Jin et.al. | 2510.23479 | null |
| 2025-10-27 | Treble10: A high-quality dataset for far-field speech recognition, dereverberation, and enhancement | Sarabeth S. Mullins et.al. | 2510.23141 | null |
| 2025-10-27 | Tagging-Augmented Generation: Assisting Language Models in Finding Intricate Knowledge In Long Contexts | Anwesan Pal et.al. | 2510.22956 | null |
| 2025-10-26 | ConMatFormer: A Multi-attention and Transformer Integrated ConvNext based Deep Learning Model for Enhanced Diabetic Foot Ulcer Classification | Raihan Ahamed Rifat et.al. | 2510.22743 | null |
| 2025-10-26 | Learning Without Augmenting: Unsupervised Time Series Representation Learning via Frame Projections | Berken Utku Demirel et.al. | 2510.22655 | null |
| 2025-11-01 | Knowledge-guided Continual Learning for Behavioral Analytics Systems | Yasas Senarath et.al. | 2510.22405 | null |
| 2025-10-24 | AutoSciDACT: Automated Scientific Discovery through Contrastive Embedding and Hypothesis Testing | Samuel Bright-Thonney et.al. | 2510.21935 | null |
| 2025-10-24 | Foundation Models in Dermatopathology: Skin Tissue Classification | Riya Gupta et.al. | 2510.21664 | null |
| 2025-10-24 | TerraGen: A Unified Multi-Task Layout Generation Framework for Remote Sensing Data Augmentation | Datao Tang et.al. | 2510.21391 | null |
| 2025-10-24 | Generative Federated Learning for Smart Prediction and Recommendation Applications | Anwesha Mukherjee et.al. | 2510.21183 | null |
| 2025-10-24 | SafetyPairs: Isolating Safety Critical Image Features with Counterfactual Image Generation | Alec Helbling et.al. | 2510.21120 | null |
| 2025-10-24 | Bridging Language Gaps with Adaptive RAG: Improving Indonesian Language Question Answering | William Christian et.al. | 2510.21068 | null |
| 2025-10-24 | Deep learning-based automated damage detection in concrete structures using images from earthquake events | Abdullah Turer et.al. | 2510.21063 | null |
| 2025-10-23 | Information Theoretic Learning for Diffusion Models with Warm Start | Yirong Shen et.al. | 2510.20903 | null |
| 2025-10-23 | Analyticup E-commerce Product Search Competition Technical Report from Team Tredence_AICOE | Rakshith R et.al. | 2510.20674 | null |
| 2025-10-23 | LM-mixup: Text Data Augmentation via Language Model based Mixup | Zhijie Deng et.al. | 2510.20449 | null |
| 2025-10-23 | Neural Networks for Censored Expectile Regression Based on Data Augmentation | Wei Cao et.al. | 2510.20344 | null |
| 2025-10-25 | DB-FGA-Net: Dual Backbone Frequency Gated Attention Network for Multi-Class Brain Tumor Classification with Grad-CAM Interpretability | Saraf Anzum Shreya et.al. | 2510.20299 | null |
| 2025-10-21 | Cyberattack Detection in Critical Infrastructure and Supply Chains | Smita Khapre et.al. | 2510.19859 | null |
| 2025-10-22 | Curvilinear Structure-preserving Unpaired Cross-domain Medical Image Translation | Zihao Chen et.al. | 2510.19679 | null |
| 2025-10-22 | Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts | Chen Li et.al. | 2510.19487 | null |
| 2025-10-22 | KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Augmentations and Constraints | Kailin Jiang et.al. | 2510.19316 | null |
| 2025-10-21 | SO(3)-invariant PCA with application to molecular data | Michael Fraiman et.al. | 2510.18827 | null |
| 2025-10-21 | Finding the Sweet Spot: Optimal Data Augmentation Ratio for Imbalanced Credit Scoring Using ADASYN | Luis H. Chia et.al. | 2510.18252 | null |
| 2025-10-20 | Learning from Generalization Patterns: An Evaluation-Driven Approach to Enhanced Data Augmentation for Fine-Tuning Small Language Models | Huan Song et.al. | 2510.18143 | null |
| 2025-10-24 | ViBED-Net: Video Based Engagement Detection Network Using Face-Aware and Scene-Aware Spatiotemporal Cues | Prateek Gothwal et.al. | 2510.18016 | null |
| 2025-10-15 | CMIS-Net: A Cascaded Multi-Scale Individual Standardization Network for Backchannel Agreement Estimation | Yuxuan Huang et.al. | 2510.17855 | null |
| 2025-10-10 | MAT-Agent: Adaptive Multi-Agent Training Optimization | Jusheng Zhang et.al. | 2510.17845 | null |
| 2025-10-20 | PANER: A Paraphrase-Augmented Framework for Low-Resource Named Entity Recognition | Nanda Kumar Rengarajan et.al. | 2510.17720 | null |
| 2025-10-20 | Handling Extreme Class Imbalance: Using GANs in Data Augmentation for Suicide Prediction | Vaishnavi Visweswaraiah et.al. | 2510.17661 | null |
| 2025-10-20 | ZACH-ViT: A Zero-Token Vision Transformer with ShuffleStrides Data Augmentation for Robust Lung Ultrasound Classification | Athanasios Angelakis et.al. | 2510.17650 | null |
| 2025-10-24 | RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation | Yuquan Xue et.al. | 2510.17640 | null |
| 2025-10-20 | Fair and Interpretable Deepfake Detection in Videos | Akihito Yoshii et.al. | 2510.17264 | null |
| 2025-10-19 | Addressing data scarcity in structural health monitoring through generative augmentation | Sasan Farhadi et.al. | 2510.16889 | null |
| 2025-10-19 | Connecting Domains and Contrasting Samples: A Ladder for Domain Generalization | Tianxin Wei et.al. | 2510.16704 | null |
| 2025-10-19 | Zero- and One-Shot Data Augmentation for Sentence-Level Dysarthric Speech Recognition in Constrained Scenarios | Shiyao Wang et.al. | 2510.16700 | null |
| 2025-10-18 | ViT-Transformer: Self-attention mechanism based constitutive modeling for nonlinear heterogeneous materials | Yijing Zhou et.al. | 2510.16575 | null |
| 2025-10-18 | ReviewGuard: Enhancing Deficient Peer Review Detection via LLM-Driven Data Augmentation | Haoxuan Zhang et.al. | 2510.16549 | null |
| 2025-10-17 | Data-Centric AI for Tropical Agricultural Mapping: Challenges, Strategies and Scalable Solutions | Mateus Pinto da Silva et.al. | 2510.16207 | null |
| 2025-11-03 | Bridging Symmetry and Robustness: On the Role of Equivariance in Enhancing Adversarial Robustness | Longwei Wang et.al. | 2510.16171 | null |
| 2025-10-17 | Learning density ratios in causal inference using Bregman-Riesz regression | Oliver J. Hines et.al. | 2510.16127 | null |
| 2025-10-17 | Data-Driven Analysis of Intersectional Bias in Image Classification: A Framework with Bias-Weighted Augmentation | Farjana Yesmin et.al. | 2510.16072 | null |
| 2025-10-13 | Bolster Hallucination Detection via Prompt-Guided Data Augmentation | Wenyun Li et.al. | 2510.15977 | null |
| 2025-10-17 | ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection | Haowei Zhu et.al. | 2510.15783 | null |
| 2025-10-17 | SAMix: Calibrated and Accurate Continual Learning via Sphere-Adaptive Mixup and Neural Collapse | Trung-Anh Dang et.al. | 2510.15751 | null |
| 2025-10-17 | Improving Micro-Expression Recognition with Phase-Aware Temporal Augmentation | Vu Tram Anh Khuong et.al. | 2510.15466 | null |
| 2025-10-17 | Robust Optimization in Causal Models and G-Causal Normalizing Flows | Gabriele Visentin et.al. | 2510.15458 | null |
| 2025-10-17 | Towards In-Situ Failure Assessment: Deep Learning on DIC Results for Laminated Composites | Amir Mohammad Mirzaei et.al. | 2510.15424 | null |
| 2025-10-16 | Salient Concept-Aware Generative Data Augmentation | Tianchen Zhao et.al. | 2510.15194 | null |
| 2025-10-16 | Automated Snippet-Alignment Data Augmentation for Code Translation | Zhiming Zhang et.al. | 2510.15004 | null |
| 2025-10-16 | What is missing from this picture? Persistent homology and mixup barcodes as a means of investigating negative embedding space | Himanshu Yadav et.al. | 2510.14327 | null |
| 2025-10-15 | Do Slides Help? Multi-modal Context for Automatic Transcription of Conference Talks | Supriti Sinhamahapatra et.al. | 2510.13979 | null |
| 2025-10-15 | OralGPT: A Two-Stage Vision-Language Model for Oral Mucosal Disease Diagnosis and Description | Jia Zhang et.al. | 2510.13911 | null |
| 2025-10-15 | A fully automated and scalable Parallel Data Augmentation for Low Resource Languages using Image and Text Analytics | Prawaal Sharma et.al. | 2510.13211 | null |
| 2025-10-15 | LLM-Guided Synthetic Augmentation (LGSA) for Mitigating Bias in AI Systems | Sai Suhruth Reddy Karri et.al. | 2510.13202 | null |
| 2025-10-15 | GRACE: Globally-Seeded Representation-Aware Cluster-Specific Evolution for Compiler Auto-Tuning | Haolin Pan et.al. | 2510.13176 | null |
| 2025-10-13 | Data-Augmented Machine Learning for Predicting Biomass-Derived Hard Carbon Anode Performance in Sodium-Ion Batteries | Gang Chen et.al. | 2510.12833 | null |
| 2025-10-14 | A Text-Image Fusion Method with Data Augmentation Capabilities for Referring Medical Image Segmentation | Shurong Chai et.al. | 2510.12482 | null |
| 2025-10-14 | A Function Centric Perspective On Flat and Sharp Minima | Israel Mason-Williams et.al. | 2510.12451 | null |
| 2025-10-14 | APGNet: Adaptive Prior-Guided for Underwater Camouflaged Object Detection | Xinxin Huang et.al. | 2510.12056 | null |
| 2025-10-13 | MammoDINO: Anatomically Aware Self-Supervision for Mammographic Images | Sicheng Zhou et.al. | 2510.11883 | null |
| 2025-10-13 | MS-Mix: Unveiling the Power of Mixup for Multimodal Sentiment Analysis | Hongyu Zhu et.al. | 2510.11579 | null |
| 2025-10-13 | DiffStyleTS: Diffusion Model for Style Transfer in Time Series | Mayank Nagda et.al. | 2510.11335 | null |
| 2025-10-13 | LightPneumoNet: Lightweight Pneumonia Classifier | Neilansh Chauhan et.al. | 2510.11232 | null |
| 2025-10-13 | Mixup Helps Understanding Multimodal Video Better | Xiaoyu Ma et.al. | 2510.10986 | null |
| 2025-10-12 | From Detection to Mitigation: Addressing Bias in Deep Learning Models for Chest X-Ray Diagnosis | Clemence Mottez et.al. | 2510.10822 | null |
| 2025-10-12 | Proficiency-Aware Adaptation and Data Augmentation for Robust L2 ASR | Ling Sun et.al. | 2510.10738 | null |
| 2025-10-11 | A Survey of Inductive Reasoning for Large Language Models | Kedi Chen et.al. | 2510.10182 | null |
| 2025-10-11 | Diversity Augmentation of Dynamic User Preference Data for Boosting Personalized Text Summarizers | Parthiv Chatterjee et.al. | 2510.10082 | null |
| 2025-10-11 | Improving Speech Emotion Recognition with Mutual Information Regularized Generative Model | Chung-Soo Ahn et.al. | 2510.10078 | null |
| 2025-10-10 | Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs | Xu Pan et.al. | 2510.09885 | null |
| 2025-10-10 | Accent-Invariant Automatic Speech Recognition via Saliency-Driven Spectrogram Masking | Mohammad Hossein Sameti et.al. | 2510.09528 | null |
| 2025-10-10 | Cattle-CLIP: A Multimodal Framework for Cattle Behaviour Recognition | Huimin Liu et.al. | 2510.09203 | null |
| 2025-10-10 | Augmented data and neural networks for robust epidemic forecasting: application to COVID-19 in Italy | Giacomo Dimarco et.al. | 2510.09192 | null |
| 2025-10-10 | Generative Data Augmentation in Graph Contrastive Learning for Recommendation | Yansong Wang et.al. | 2510.09129 | null |
| 2025-10-14 | Denoised Diffusion for Object-Focused Image Augmentation | Nisha Pillai et.al. | 2510.08955 | null |
| 2025-10-09 | SAFER-AiD: Saccade-Assisted Foveal-peripheral vision Enhanced Reconstruction for Adversarial Defense | Jiayang Liu et.al. | 2510.08761 | null |
| 2025-10-08 | Reproducible Evaluation of Data Augmentation and Loss Functions for Brain Tumor Segmentation | Saumya B et.al. | 2510.08617 | null |
| 2025-10-09 | Hyperspectral data augmentation with transformer-based diffusion models | Mattia Ferrari et.al. | 2510.08363 | null |
| 2025-10-10 | A Multimodal Depth-Aware Method For Embodied Reference Understanding | Fevziye Irem Eyiokur et.al. | 2510.08278 | null |
| 2025-10-09 | Robust Canonicalization through Bootstrapped Data Re-Alignment | Johann Schmidt et.al. | 2510.08178 | null |
| 2025-10-09 | Long-tailed Recognition with Model Rebalancing | Jiaan Luo et.al. | 2510.08177 | null |
| 2025-10-09 | Self-Improving LLM Agents at Test-Time | Emre Can Acikgoz et.al. | 2510.07841 | null |
| 2025-10-09 | Enhancing Visual Prompting through Expanded Transformation Space and Overfitting Mitigation | Shohei Enomoto et.al. | 2510.07823 | null |
| 2025-10-07 | Enhancing Maritime Object Detection in Real-Time with RT-DETR and Data Augmentation | Nader Nemati et.al. | 2510.07346 | null |
| 2025-10-08 | Vision-Language-Action Models for Robotics: A Review Towards Real-World Applications | Kento Kawaharazuka et.al. | 2510.07077 | null |
| 2025-10-08 | Lung Infection Severity Prediction Using Transformers with Conditional TransMix Augmentation and Cross-Attention | Bouthaina Slika et.al. | 2510.06887 | null |
| 2025-10-08 | PTEB: Towards Robust Text Embedding Evaluation via Stochastic Paraphrasing at Evaluation Time with LLMs | Manuel Frank et.al. | 2510.06730 | null |
| 2025-10-07 | Data Factory with Minimal Human Effort Using VLMs | Jiaojiao Ye et.al. | 2510.05722 | null |
| 2025-10-07 | Transfer Learning on Edge Connecting Probability Estimation under Graphon Model | Yuyao Wang et.al. | 2510.05527 | null |
| 2025-10-06 | NASP-T: A Fuzzy Neuro-Symbolic Transformer for Logic-Constrained Aviation Safety Report Classification | Fadi Al Machot et.al. | 2510.05451 | null |
| 2025-09-30 | CARE: Cognitive-reasoning Augmented Reinforcement for Emotional Support Conversation | Jie Zhu et.al. | 2510.05122 | null |
| 2025-10-06 | How does the optimizer implicitly bias the model merging loss landscape? | Chenxiang Zhang et.al. | 2510.04686 | null |
| 2025-10-05 | RAP: 3D Rasterization Augmented End-to-End Planning | Lan Feng et.al. | 2510.04333 | null |
| 2025-10-05 | PABSA: Hybrid Framework for Persian Aspect-Based Sentiment Analysis | Mehrzad Tareh et.al. | 2510.04291 | null |
| 2025-10-05 | Enhancing Fake News Video Detection via LLM-Driven Creative Process Simulation | Yuyan Bu et.al. | 2510.04024 | null |
| 2025-10-04 | Exploring the Challenge and Value of Deep Learning in Automated Skin Disease Diagnosis | Runhao Liu et.al. | 2510.03869 | null |
| 2025-10-04 | Cellular Learning: Scattered Data Regression in High Dimensions via Voronoi Cells | Shankar Prasad Sastry et.al. | 2510.03810 | null |
| 2025-10-09 | From Moments to Models: Graphon Mixture-Aware Mixup and Contrastive Learning | Ali Azizpour et.al. | 2510.03690 | null |
| 2025-10-15 | Diffusion-Classifier Synergy: Reward-Aligned Learning via Mutual Boosting Loop for FSCIL | Ruitao Wu et.al. | 2510.03608 | null |
| 2025-10-04 | Exploring the Hierarchical Reasoning Model for Small Natural-Image Classification Without Augmentation | Alexander V. Mantzaris et.al. | 2510.03598 | null |
| 2025-10-09 | How We Won BraTS-SSA 2025: Brain Tumor Segmentation in the Sub-Saharan African Population Using Segmentation-Aware Data Augmentation and Model Ensembling | Claudia Takyi Ankomah et.al. | 2510.03568 | null |
| 2025-10-03 | InsideOut: An EfficientNetV2-S Based Deep Learning Framework for Robust Multi-Class Facial Emotion Recognition | Ahsan Farabi et.al. | 2510.03066 | null |
| 2025-10-03 | Denoising and Augmentation: A Dual Use of Diffusion Model for Enhanced CSI Recovery | Yupeng Li et.al. | 2510.02744 | null |
| 2025-10-03 | Hybrid-Collaborative Augmentation and Contrastive Sample Adaptive-Differential Awareness for Robust Attributed Graph Clustering | Tianxiang Zhao et.al. | 2510.02731 | null |
| 2025-10-03 | Mind the Gap: Linguistic Divergence and Adaptation Strategies in Human-LLM Assistant vs. Human-Human Interactions | Fulei Zhang et.al. | 2510.02645 | null |
| 2025-10-02 | Extreme value forecasting using relevance-based data augmentation with deep learning models | Junru Hua et.al. | 2510.02407 | null |
| 2025-10-02 | Non-Asymptotic Analysis of Data Augmentation for Precision Matrix Estimation | Lucas Morisset et.al. | 2510.02119 | null |
| 2025-10-02 | Mapping Historic Urban Footprints in France: Balancing Quality, Scalability and AI Techniques | Walid Rabehi et.al. | 2510.02097 | null |
| 2025-10-02 | Explicit Discovery of Nonlinear Symmetries from Dynamic Data | Lexiang Hu et.al. | 2510.01855 | null |
| 2025-10-02 | NGGAN: Noise Generation GAN Based on the Practical Measurement Dataset for Narrowband Powerline Communications | Ying-Ren Chien et.al. | 2510.01850 | null |
| 2025-10-01 | RealClass: A Framework for Classroom Speech Simulation with Public Datasets and Game Engines | Ahmed Adel Attia et.al. | 2510.01462 | null |
| 2025-10-01 | Diffusion Modeling of the Three-Dimensional Magnetic Field in the Sun’s Corona | Daniel E. da Silva et.al. | 2510.01441 | null |
| 2025-10-01 | To Augment or Not to Augment? Diagnosing Distributional Symmetry Breaking | Hannah Lawrence et.al. | 2510.01349 | null |
| 2025-10-01 | Towards Adversarial Training under Hyperspectral Images | Weihua Zhang et.al. | 2510.01014 | null |
| 2025-10-01 | EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty | Yuchen Tian et.al. | 2510.00732 | null |
| 2025-10-01 | Disentangling Foreground and Background for vision-Language Navigation via Online Augmentation | Yunbo Xu et.al. | 2510.00604 | null |
| 2025-10-01 | SAGE-LD: Towards Scalable and Generalizable End-to-End Language Diarization via Simulated Data Augmentation | Sangmin Lee et.al. | 2510.00582 | null |
| 2025-10-01 | On-the-Fly Data Augmentation via Gradient-Guided and Sample-Aware Influence Estimation | Suorong Yang et.al. | 2510.00434 | null |
| 2025-09-30 | Subjective quality evaluation of personalized own voice reconstruction systems | Mattes Ohlenbusch et.al. | 2510.00256 | null |
| 2025-09-26 | Deep Learning-Based Pneumonia Detection from Chest X-ray Images: A CNN Approach with Performance Analysis and Clinical Implications | P K Dutta et.al. | 2510.00035 | null |
| 2025-09-25 | Learning Inter-Atomic Potentials without Explicit Equivariance | Ahmed A. Elhag et.al. | 2510.00027 | null |
| 2025-10-08 | OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction | Lujie Yang et.al. | 2509.26633 | null |
| 2025-09-30 | Source Separation for A Cappella Music | Luca A. Lanzendörfer et.al. | 2509.26580 | null |
| 2025-09-30 | GastroViT: A Vision Transformer Based Ensemble Learning Approach for Gastrointestinal Disease Classification with Grad CAM & SHAP Visualization | Sumaiya Tabassum et.al. | 2509.26502 | null |
| 2025-10-14 | Limited Preference Data? Learning Better Reward Model with Latent Space Synthesis | Leitian Tao et.al. | 2509.26074 | null |
| 2025-09-30 | Geometric Learning of Canonical Parameterizations of $2D$ -curves | Ioana Ciuclea et.al. | 2509.26070 | null |
| 2025-09-30 | ASR Under Noise: Exploring Robustness for Sundanese and Javanese | Salsabila Zahirah Pranida et.al. | 2509.25878 | null |
| 2025-09-30 | MIDAS: Misalignment-based Data Augmentation Strategy for Imbalanced Multimodal Learning | Seong-Hyeon Hwang et.al. | 2509.25831 | null |
| 2025-09-30 | Less is More: Towards Simple Graph Contrastive Learning | Yanan Zhao et.al. | 2509.25742 | null |
| 2025-10-03 | YOLO-Based Defect Detection for Metal Sheets | Po-Heng Chou et.al. | 2509.25659 | null |
| 2025-09-29 | On-the-Fly Data Augmentation for Brain Tumor Segmentation | Ishika Jain et.al. | 2509.24973 | null |
| 2025-09-29 | Adaptive Canonicalization with Application to Invariant Anisotropic Geometric Networks | Ya-Wei Eileen Lin et.al. | 2509.24886 | null |
| 2025-09-29 | Intelligent Optimization of Wireless Access Point Deployment for Communication-Based Train Control Systems Using Deep Reinforcement Learning | Kunyu Wu et.al. | 2509.24819 | null |
| 2025-09-29 | Fidelity-Aware Data Composition for Robust Robot Generalization | Zizhao Tong et.al. | 2509.24797 | null |
| 2025-09-29 | Toward a Vision-Language Foundation Model for Medical Data: Multimodal Dataset and Benchmarks for Vietnamese PET/CT Report Generation | Huu Tien Nguyen et.al. | 2509.24739 | null |
| 2025-09-29 | Circuit-Aware Reward Training: A Mechanistic Framework for Longtail Robustness in RLHF | Jing Liu et.al. | 2509.24713 | null |
| 2025-09-29 | LEAF: A Robust Expert-Based Framework for Few-Shot Continual Event Detection | Bao-Ngoc Dao et.al. | 2509.24547 | null |
| 2025-09-29 | Beyond Repetition: Text Simplification and Curriculum Learning for Data-Constrained Pretraining | Matthew Theodore Roque et.al. | 2509.24356 | null |
| 2025-09-29 | Cycle Diffusion Model for Counterfactual Image Generation | Fangrui Huang et.al. | 2509.24267 | null |
| 2025-09-28 | Clebsch-Gordan Transformer: Fast and Global Equivariant Attention | Owen Lewis Howell et.al. | 2509.24093 | null |
| 2025-09-28 | DexFlyWheel: A Scalable and Self-improving Data Generation Framework for Dexterous Manipulation | Kefei Zhu et.al. | 2509.23829 | null |
| 2025-09-30 | VioPTT: Violin Technique-Aware Transcription from Synthetic Data Augmentation | Ting-Kang Wang et.al. | 2509.23759 | null |
| 2025-09-28 | A Hierarchical Structure-Enhanced Personalized Recommendation Model for Traditional Chinese Medicine Formulas Based on KG Diffusion Guidance | ChaoBo Zhang et.al. | 2509.23560 | null |
| 2025-09-26 | FishAI 2.0: Marine Fish Image Classification with Multi-modal Few-shot Learning | Chenghan Yang et.al. | 2509.22930 | null |
| 2025-09-24 | Achieving Fair Skin Lesion Detection through Skin Tone Normalization and Channel Pruning | Zihan Wei et.al. | 2509.22712 | null |
| 2025-09-26 | FreqDebias: Towards Generalizable Deepfake Detection via Consistency-Driven Frequency Debiasing | Hossein Kashiani et.al. | 2509.22412 | null |
| 2025-09-26 | Deep Learning-Based Cross-Anatomy CT Synthesis Using Adapted nnResU-Net with Anatomical Feature Prioritized Loss | Javier Sequeiro González et.al. | 2509.22394 | null |
| 2025-09-26 | Cross-Dialect Bird Species Recognition with Dialect-Calibrated Augmentation | Jiani Ding et.al. | 2509.22317 | null |
| 2025-09-26 | Debiasing Large Language Models in Thai Political Stance Detection via Counterfactual Calibration | Kasidit Sermsri et.al. | 2509.21946 | null |
| 2025-09-25 | Expert-guided Clinical Text Augmentation via Query-Based Model Collaboration | Dongkyu Cho et.al. | 2509.21530 | null |
| 2025-09-25 | Contrastive Mutual Information Learning: Toward Robust Representations without Positive-Pair Augmentations | Micha Livne et.al. | 2509.21511 | null |
| 2025-09-25 | Filtering with Confidence: When Data Augmentation Meets Conformal Prediction | Zixuan Wu et.al. | 2509.21479 | null |
| 2025-09-25 | Dense Semantic Matching with VGGT Prior | Songlin Yang et.al. | 2509.21263 | null |
| 2025-09-25 | From Physics to Machine Learning and Back: Part II - Learning and Observational Bias in PHM | Olga Fink et.al. | 2509.21207 | null |
| 2025-09-25 | An Improved Quantum Software Challenges Classification Approach using Transfer Learning and Explainable AI | Nek Dil Khan et.al. | 2509.21068 | null |
| 2025-09-25 | A Real-Time On-Device Defect Detection Framework for Laser Power-Meter Sensors via Unsupervised Learning | Dongqi Zheng et.al. | 2509.20946 | null |
| 2025-09-25 | LiLAW: Lightweight Learnable Adaptive Weighting to Meta-Learn Sample Difficulty and Improve Noisy Training | Abhishek Moturu et.al. | 2509.20786 | null |
| 2025-09-25 | Addressing Gradient Misalignment in Data-Augmented Training for Robust Speech Deepfake Detection | Duc-Tuan Truong et.al. | 2509.20682 | null |
| 2025-10-05 | Region-of-Interest Augmentation for Mammography Classification under Patient-Level Cross-Validation | Farbod Bigdeli et.al. | 2509.20585 | null |
| 2025-10-02 | Feature Dynamics as Implicit Data Augmentation: A Depth-Decomposed View on Deep Neural Network Generalization | Tianyu Ruan et.al. | 2509.20334 | null |
| 2025-09-24 | Z-Scores: A Metric for Linguistically Assessing Disfluency Removal | Maria Teleki et.al. | 2509.20319 | null |
| 2025-09-24 | Enhancing Requirement Traceability through Data Augmentation Using Large Language Models | Jianzhang Zhang et.al. | 2509.20149 | null |
| 2025-09-24 | A Simple Data Augmentation Strategy for Text-in-Image Scientific VQA | Belal Shoer et.al. | 2509.20119 | null |
| 2025-09-25 | Diffusion-Augmented Contrastive Learning: A Noise-Robust Encoder for Biosignal Representations | Rami Zewail et.al. | 2509.20048 | null |
| 2025-09-24 | Rectified Decoupled Dataset Distillation: A Closer Look for Fair and Comprehensive Evaluation | Xinhao Zhong et.al. | 2509.19743 | null |
| 2025-09-23 | Quantum Harmonic Analysis and the Structure in Data: Augmentation | Monika Doerfler et.al. | 2509.19474 | null |
| 2025-09-23 | ROPA: Synthetic Robot Pose Generation for RGB-D Bimanual Data Augmentation | Jason Chen et.al. | 2509.19454 | null |
| 2025-09-23 | Improving Outdoor Multi-cell Fingerprinting-based Positioning via Mobile Data Augmentation | Tony Chahoud et.al. | 2509.19405 | null |
| 2025-09-16 | Fine-Grained AI Model Caching and Downloading With Coordinated Multipoint Broadcasting in Multi-Cell Edge Networks | Yang Fu et.al. | 2509.19341 | null |
| 2025-09-23 | Generative data augmentation for biliary tract detection on intraoperative images | Cristina Iacono et.al. | 2509.18958 | null |
| 2025-09-23 | PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving | Chengran Yuan et.al. | 2509.18609 | null |
| 2025-09-24 | SynSonic: Augmenting Sound Event Detection through Text-to-Audio Diffusion ControlNet and Effective Sample Filtering | Jiarui Hai et.al. | 2509.18603 | null |
| 2025-09-23 | Efficient Breast and Ovarian Cancer Classification via ViT-Based Preprocessing and Transfer Learning | Richa Rawat et.al. | 2509.18553 | null |
| 2025-09-23 | Reverse-Complement Consistency for DNA Language Models | Mingqian Ma et.al. | 2509.18529 | null |
| 2025-09-21 | Automatic Classification of Magnetic Chirality of Solar Filaments from H-Alpha Observations | Alexis Chalmers et.al. | 2509.18214 | null |
| 2025-09-22 | Intra-Cluster Mixup: An Effective Data Augmentation Technique for Complementary-Label Learning | Tan-Ha Mai et.al. | 2509.17971 | null |
| 2025-09-22 | SeqUDA-Rec: Sequential User Behavior Enhanced Recommendation via Global Unsupervised Data Augmentation for Personalized Content Marketing | Ruihan Luo et.al. | 2509.17361 | null |
| 2025-09-21 | Enhanced Detection of Tiny Objects in Aerial Images | Kihyun Kim et.al. | 2509.17078 | null |
| 2025-09-23 | Penalizing Boundary Activation for Object Completeness in Diffusion Models | Haoyang Xu et.al. | 2509.16968 | null |
| 2025-09-20 | IPF-RDA: An Information-Preserving Framework for Robust Data Augmentation | Suorong Yang et.al. | 2509.16678 | null |
| 2025-09-20 | MedCutMix: A Data-Centric Approach to Improve Radiology Vision-Language Pre-training with Disease Awareness | Sinuo Wang et.al. | 2509.16673 | null |
| 2025-09-20 | AISTAT lab system for DCASE2025 Task6: Language-based audio retrieval | Hyun Jun Kim et.al. | 2509.16649 | null |
| 2025-09-19 | Intrinsic Meets Extrinsic Fairness: Assessing the Downstream Impact of Bias Mitigation in Large Language Models | ‘Mina Arzaghi’ et.al. | 2509.16462 | null |
| 2025-09-19 | Evaluating the Effectiveness and Scalability of LLM-Based Data Augmentation for Retrieval | Pranjal A. Chitale et.al. | 2509.16442 | null |
| 2025-09-19 | DistillMatch: Leveraging Knowledge Distillation from Vision Foundation Model for Multimodal Image Matching | Meng Yang et.al. | 2509.16017 | null |
| 2025-09-19 | Chunk Based Speech Pre-training with High Resolution Finite Scalar Quantization | Yun Tang et.al. | 2509.15579 | null |
| 2025-09-19 | Contrastive Learning with Spectrum Information Augmentation in Abnormal Sound Detection | Xinxin Meng et.al. | 2509.15570 | null |
| 2025-09-18 | Generative AI Meets Wireless Sensing: Towards Wireless Foundation Model | Zheng Yang et.al. | 2509.15258 | null |
| 2025-09-17 | GenCAD-3D: CAD Program Generation using Multimodal Latent Space Alignment and Synthetic Dataset Balancing | Nomi Yu et.al. | 2509.15246 | null |
| 2025-09-18 | Synthetic-to-Real Object Detection using YOLOv11 and Domain Randomization Strategies | Luisa Torquato Niño et.al. | 2509.15045 | null |
| 2025-09-18 | Data Augmentation via Latent Diffusion Models for Detecting Smell-Related Objects in Historical Artworks | Ahmed Sheta et.al. | 2509.14755 | null |
| 2025-09-18 | SpeechMLC: Speech Multi-label Classification | Miseul Kim et.al. | 2509.14677 | null |
| 2025-09-18 | How Does Instrumental Music Help SingFake Detection? | Xuanjun Chen et.al. | 2509.14675 | null |
| 2025-09-18 | SWE-QA: Can Language Models Answer Repository-level Code Questions? | Weihan Peng et.al. | 2509.14635 | null |
| 2025-09-18 | Mitigating Intra-Speaker Variability in Diarization with Style-Controllable Speech Augmentation | Miseul Kim et.al. | 2509.14632 | null |
| 2025-09-18 | LSTC-MDA: A Unified Framework for Long-Short Term Temporal Convolution and Mixed Data Augmentation in Skeleton-Based Action Recognition | Feng Ding et.al. | 2509.14619 | null |
| 2025-09-18 | Leveraging IndoBERT and DistilBERT for Indonesian Emotion Classification in E-Commerce Reviews | William Christian et.al. | 2509.14611 | null |
| 2025-09-18 | VisMoDAl: Visual Analytics for Evaluating and Improving Corruption Robustness of Vision-Language Models | Huanchen Wang et.al. | 2509.14571 | null |
| 2025-09-18 | Learning to Retrieve for Environmental Knowledge Discovery: An Augmentation-Adaptive Self-Supervised Learning Framework | Shiyuan Luo et.al. | 2509.14563 | null |
| 2025-09-18 | Data coarse graining can improve model performance | Alex Nguyen et.al. | 2509.14498 | null |
| 2025-09-17 | Sequential Data Augmentation for Generative Recommendation | Geon Lee et.al. | 2509.13648 | null |
| 2025-09-17 | Multimodal signal fusion for stress detection using deep neural networks: a novel approach for converting 1D signals to unified 2D images | Yasin Hasanpoor et.al. | 2509.13636 | null |
| 2025-09-16 | Adversarial Appearance Learning in Augmented Cityscapes for Pedestrian Recognition in Autonomous Driving | Artem Savkin et.al. | 2509.13507 | null |
| 2025-09-16 | Contrastive timbre representations for musical instrument and synthesizer retrieval | Gwendal Le Vaillant et.al. | 2509.13285 | null |
| 2025-09-16 | Time-step Mixup for Efficient Spiking Knowledge Transfer from Appearance to Event Domain | Yuqi Xie et.al. | 2509.12959 | null |
| 2025-09-16 | Synthetic Protein-Ligand Complex Generation for Deep Molecular Docking | Sofiene Khiari et.al. | 2509.12915 | null |
| 2025-09-16 | Cumulative Consensus Score: Label-Free and Model-Agnostic Evaluation of Object Detectors in Deployment | Avinaash Manoharan et.al. | 2509.12871 | null |
| 2025-09-20 | Data Augmentation for Maltese NLP using Transliterated and Machine Translated Arabic Data | Kurt Micallef et.al. | 2509.12853 | null |
| 2025-09-16 | Double Helix Diffusion for Cross-Domain Anomaly Image Generation | Linchun Wu et.al. | 2509.12787 | null |
| 2025-09-15 | Robust Fetal Pose Estimation across Gestational Ages via Cross-Population Augmentation | Sebastian Diaz et.al. | 2509.12062 | null |
| 2025-09-15 | Learning to Generate 4D LiDAR Sequences | Ao Liang et.al. | 2509.11959 | null |
| 2025-09-15 | Automated training of neural-network interatomic potentials | Davide Bidoggia et.al. | 2509.11703 | null |
| 2025-09-15 | DTGen: Generative Diffusion-Based Few-Shot Data Augmentation for Fine-Grained Dirty Tableware Recognition | Lifei Hao et.al. | 2509.11661 | null |
| 2025-09-15 | Task Decoding based on Eye Movements using Synthetic Data Augmentation | Shanmuka Sadhu et.al. | 2509.11547 | null |
| 2025-09-14 | An Entropy-Guided Curriculum Learning Strategy for Data-Efficient Acoustic Scene Classification under Domain Shift | Peihong Zhang et.al. | 2509.11168 | null |
| 2025-09-14 | An Advanced Convolutional Neural Network for Bearing Fault Diagnosis under Limited Data | Shengke Sun et.al. | 2509.11053 | null |
| 2025-09-13 | Point-Plane Projections for Accurate LiDAR Semantic Segmentation in Small Data Scenarios | Simone Mosco et.al. | 2509.10841 | null |
| 2025-09-01 | MIDOG 2025 Track 2: A Deep Learning Model for Classification of Atypical and Normal Mitotic Figures under Class and Hardness Imbalances | Sujatha Kotte et.al. | 2509.10502 | null |
| 2025-09-12 | Improving Audio Event Recognition with Consistency Regularization | Shanmuka Sadhu et.al. | 2509.10391 | null |
| 2025-09-12 | Scaling Arabic Medical Chatbots Using Synthetic Data: Enhancing Generative AI with Synthetic Patient Records | Abdulrahman Allam et.al. | 2509.10108 | null |
| 2025-09-11 | Combining Textual and Spectral Features for Robust Classification of Pilot Communications | Abdullah All Tanvir et.al. | 2509.09752 | null |
| 2025-09-24 | Structure Matters: Brain Graph Augmentation via Learnable Edge Masking for Data-efficient Psychiatric Diagnosis | Mujie Liu et.al. | 2509.09744 | null |
| 2025-09-11 | Virtual staining for 3D X-ray histology of bone implants | Sarah C. Irvine et.al. | 2509.09235 | null |
| 2025-09-11 | Target-oriented Multimodal Sentiment Classification with Counterfactual-enhanced Debiasing | Zhiyue Liu et.al. | 2509.09160 | null |
| 2025-09-10 | Handling Open-Vocabulary Constructs in Formalizing Specifications: Retrieval-Augmented Parsing with Expert Knowledge | Mohammad Saqib Hasan et.al. | 2509.08808 | null |
| 2025-09-10 | ADHDeepNet From Raw EEG to Diagnosis: Improving ADHD Diagnosis through Temporal-Spatial Processing, Adaptive Attention Mechanisms, and Explainability in Raw EEG Signals | Ali Amini et.al. | 2509.08779 | null |
| 2025-09-10 | Ensemble Distribution Distillation for Self-Supervised Human Activity Recognition | Matthew Nolan et.al. | 2509.08225 | null |
| 2025-09-09 | Transformer-Based Approach to Optimal Sensor Placement for Structural Health Monitoring of Probe Cards | Mehdi Bejani et.al. | 2509.07603 | null |
| 2025-10-21 | From Scarcity to Efficiency: Investigating the Effects of Data Augmentation on African Machine Translation | Mardiyyah Oduwole et.al. | 2509.07471 | null |
| 2025-09-08 | Breast Cancer Detection in Thermographic Images via Diffusion-Based Augmentation and Nonlinear Feature Fusion | Sepehr Salem et.al. | 2509.07277 | null |
| 2025-09-08 | Pothole Detection and Recognition based on Transfer Learning | Mang Hu et.al. | 2509.06750 | null |
| 2025-09-08 | Contrastive Self-Supervised Network Intrusion Detection using Augmented Negative Pairs | Jack Wilkie et.al. | 2509.06550 | null |
| 2025-09-08 | IGAff: Benchmarking Adversarial Iterative and Genetic Affine Algorithms on Deep Neural Networks | Sebastian-Vasile Echim et.al. | 2509.06459 | null |
| 2025-09-08 | CAPMix: Robust Time Series Anomaly Detection Based on Abnormal Assumptions with Dual-Space Mixup | Xudong Mou et.al. | 2509.06419 | null |
| 2025-09-08 | PL-CA: A Parametric Legal Case Augmentation Framework | Ao Chang et.al. | 2509.06356 | null |
| 2025-09-07 | Exploring Light-Weight Object Recognition for Real-Time Document Detection | Lucas Wojcik et.al. | 2509.06246 | null |
| 2025-09-07 | Learning in ImaginationLand: Omnidirectional Policies through 3D Generative Models (OP-Gen) | Yifei Ren et.al. | 2509.06191 | null |
| 2025-09-06 | CardiacFlow: 3D+t Four-Chamber Cardiac Shape Completion and Generation via Flow Matching | Qiang Ma et.al. | 2509.05754 | null |
| 2025-09-05 | DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation | Haitao Tian et.al. | 2509.05543 | null |
| 2025-09-05 | Handling Data Gaps for the Next Generation of Gravitational-Wave Observatories | Noah Pearson et.al. | 2509.05479 | null |
| 2025-09-01 | Handling imbalance and few-sample size in ML based Onion disease classification | Abhijeet Manoj Pal et.al. | 2509.05341 | null |
| 2025-08-30 | A Dataset Generation Scheme Based on Video2EEG-SPGN-Diffusion for SEED-VD | Yunfei Guo et.al. | 2509.05321 | null |
| 2025-09-05 | Uncertain but Useful: Leveraging CNN Variability into Data Augmentation | Inés Gonzalez-Pepe et.al. | 2509.05238 | null |
| 2025-09-05 | SL-SLR: Self-Supervised Representation Learning for Sign Language Recognition | Ariel Basso Madjoukeng et.al. | 2509.05188 | null |
| 2025-09-05 | Hybrid Matrix Factorization Based Graph Contrastive Learning for Recommendation System | Hao Chen et.al. | 2509.05115 | null |
| 2025-09-05 | Leveraging Transfer Learning and Mobile-enabled Convolutional Neural Networks for Improved Arabic Handwritten Character Recognition | Mohsine El Khayati et.al. | 2509.05019 | null |
| 2025-09-05 | Optimizing Small Transformer-Based Language Models for Multi-Label Sentiment Analysis in Short Texts | Julius Neumann et.al. | 2509.04982 | null |
| 2025-09-05 | DeGuV: Depth-Guided Visual Reinforcement Learning for Generalization and Interpretability in Manipulation | Tien Pham et.al. | 2509.04970 | null |
| 2025-09-05 | A transformer-BiGRU-based framework with data augmentation and confident learning for network intrusion detection | Jiale Zhang et.al. | 2509.04925 | null |
| 2025-09-05 | Evaluating Multiple Instance Learning Strategies for Automated Sebocyte Droplet Counting | Maryam Adelipour et.al. | 2509.04895 | null |
| 2025-08-29 | MOSAIC: A Multilingual, Taxonomy-Agnostic, and Computationally Efficient Approach for Radiological Report Classification | Alice Schiavone et.al. | 2509.04471 | null |
| 2025-09-04 | TauGenNet: Plasma-Driven Tau PET Image Synthesis via Text-Guided 3D Diffusion Models | Yuxin Gong et.al. | 2509.04269 | null |
| 2025-09-04 | How many patients could we save with LLM priors? | Shota Arai et.al. | 2509.04250 | null |
| 2025-09-04 | Explicit and Implicit Data Augmentation for Social Event Detection | Congbo Ma et.al. | 2509.04202 | null |
| 2025-09-04 | Chest X-ray Pneumothorax Segmentation Using EfficientNet-B4 Transfer Learning in a U-Net Architecture | Alvaro Aranibar Roque et.al. | 2509.03950 | null |
| 2025-09-04 | A Generative Foundation Model for Chest Radiography | Yuanfeng Ji et.al. | 2509.03903 | null |
| 2025-09-04 | Data-Augmented Quantization-Aware Knowledge Distillation | Justin Kur et.al. | 2509.03850 | null |
| 2025-09-03 | Lightweight image segmentation for echocardiography | Anders Kjelsrud et.al. | 2509.03631 | null |
| 2025-09-04 | Invariant Features for Global Crop Type Classification | Xin-Yi Tong et.al. | 2509.03497 | null |
| 2025-09-03 | Joint Training of Image Generator and Detector for Road Defect Detection | Kuan-Chuan Peng et.al. | 2509.03465 | null |
| 2025-09-02 | Enhancing Machine Learning for Imbalanced Medical Data: A Quantum-Inspired Approach to Synthetic Oversampling (QI-SMOTE) | Vikas Kashtriya et.al. | 2509.02863 | null |
| 2025-08-29 | Foundation Model-Driven Classification of Atypical Mitotic Figures with Domain-Aware Training Strategies | Piotr Giedziun et.al. | 2509.02601 | null |
| 2025-09-02 | PalmX 2025: The First Shared Task on Benchmarking LLMs on Arabic and Islamic Culture | Fakhraddin Alwajih et.al. | 2509.02550 | null |
| 2025-09-02 | EmoPerso: Enhancing Personality Detection with Self-Supervised Emotion-Aware Modelling | Lingzhi Shen et.al. | 2509.02450 | null |
| 2025-09-02 | Improving Electroencephalogram-Based Deception Detection in Concealed Information Test under Low Stimulus Heterogeneity | Suhye Kim et.al. | 2509.02234 | null |
| 2025-09-02 | Enhancing Zero-Shot Pedestrian Attribute Recognition with Synthetic Data Generation: A Comparative Study with Image-To-Image Diffusion Models | Pablo Ayuso-Albizu et.al. | 2509.02161 | null |
| 2025-09-02 | A Data-Centric Approach to Pedestrian Attribute Recognition: Synthetic Augmentation via Prompt-driven Diffusion Models | Alejandro Alonso et.al. | 2509.02099 | null |
| 2025-09-16 | Abex-rat: Synergizing Abstractive Augmentation and Adversarial Training for Classification of Occupational Accident Reports | Jian Chen et.al. | 2509.02072 | null |
| 2025-09-01 | CabinSep: IR-Augmented Mask-Based MVDR for Real-Time In-Car Speech Separation with Distributed Heterogeneous Arrays | Runduo Han et.al. | 2509.01399 | null |
| 2025-09-01 | MARS: Modality-Aligned Retrieval for Sequence Augmented CTR Prediction | Yutian Xiao et.al. | 2509.01184 | null |
| 2025-08-31 | A Unified Denoising and Adaptation Framework for Self-Supervised Bengali Dialectal ASR | Swadhin Biswas et.al. | 2509.00988 | null |
| 2025-09-05 | Semi-Supervised Bayesian GANs with Log-Signatures for Uncertainty-Aware Credit Card Fraud Detection | David Hirnschall et.al. | 2509.00931 | null |
| 2025-08-30 | NoiseCutMix: A Novel Data Augmentation Approach by Mixing Estimated Noise in Diffusion Models | Shumpei Takezaki et.al. | 2509.00378 | null |
| 2025-08-26 | Amplifying Emotional Signals: Data-Efficient Deep Learning for Robust Speech Emotion Recognition | Tai Vu et.al. | 2509.00077 | null |
| 2025-08-29 | A Multi-Stage Fine-Tuning and Ensembling Strategy for Pancreatic Tumor Segmentation in Diagnostic and Therapeutic MRI | Omer Faruk Durugol et.al. | 2508.21775 | null |
| 2025-08-29 | QZhou-Embedding Technical Report | Peng Yu et.al. | 2508.21632 | null |
| 2025-08-29 | Towards On-Device Personalization: Cloud-device Collaborative Data Augmentation for Efficient On-device Language Model | Zhaofeng Zhong et.al. | 2508.21313 | null |
| 2025-08-28 | Reverse Imaging for Wide-spectrum Generalization of Cardiac MRI Segmentation | Yidong Zhao et.al. | 2508.21254 | null |
| 2025-08-26 | CoBA: Counterbias Text Augmentation for Mitigating Various Spurious Correlations via Semantic Triples | Kyohoon Jin et.al. | 2508.21083 | null |
| 2025-08-28 | Improved photometric redshift estimations through self-organising map-based data augmentation | Yun-Hao Zhang et.al. | 2508.20903 | null |
| 2025-08-28 | Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision | Ao Cheng et.al. | 2508.20729 | null |
| 2025-08-28 | Compositionality in Time Series: A Proof of Concept using Symbolic Dynamics and Compositional Data Augmentation | Michael Hagmann et.al. | 2508.20656 | null |
| 2025-08-28 | Mask-Guided Multi-Channel SwinUNETR Framework for Robust MRI Classification | Smriti Joshi et.al. | 2508.20621 | null |
| 2025-08-28 | KCS: Diversify Multi-hop Question Generation with Knowledge Composition Sampling | Yangfan Wang et.al. | 2508.20567 | null |
| 2025-08-28 | Enhancing Health Fact-Checking with LLM-Generated Synthetic Data | Jingze Zhang et.al. | 2508.20525 | null |
| 2025-08-27 | IELDG: Suppressing Domain-Specific Noise with Inverse Evolution Layers for Domain Generalized Semantic Segmentation | Qizhe Fan et.al. | 2508.19604 | null |
| 2025-08-27 | Improving Recommendation Fairness via Graph Structure and Representation Augmentation | Tongxin Xu et.al. | 2508.19547 | null |
| 2025-08-26 | Database Entity Recognition with Data Augmentation and Deep Learning | Zikun Fu et.al. | 2508.19372 | null |
| 2025-08-26 | HuBE: Cross-Embodiment Human-like Behavior Execution for Humanoid Robots | Shipeng Lyu et.al. | 2508.19002 | null |
| 2025-08-26 | Enhancing compact convolutional transformers with super attention | Simpenzwe Honore Leandre et.al. | 2508.18960 | null |
| 2025-08-26 | SegReConcat: A Data Augmentation Method for Voice Anonymization Attack | Ridwan Arefeen et.al. | 2508.18907 | null |
| 2025-08-26 | Enhancing Video-Based Robot Failure Detection Using Task Knowledge | Santosh Thoduka et.al. | 2508.18705 | null |
| 2025-08-26 | Auditing Approximate Machine Unlearning for Differentially Private Models | Yuechun Gu et.al. | 2508.18671 | null |
| 2025-08-25 | Analise de Desaprendizado de Maquina em Modelos de Classificacao de Imagens Medicas | Andreza M. C. Falcao et.al. | 2508.18509 | null |
| 2025-08-25 | Data Augmentation Improves Machine Unlearning | Andreza M. C. Falcao et.al. | 2508.18502 | null |
| 2025-08-29 | German4All – A Dataset and Model for Readability-Controlled Paraphrasing in German | Miriam Anschütz et.al. | 2508.17973 | null |
| 2025-08-25 | Diffusion-Based Data Augmentation for Medical Image Segmentation | Maham Nazir et.al. | 2508.17844 | null |
| 2025-08-25 | LLMulator: Generalizable Cost Modeling for Dataflow Accelerators with Input-Adaptive Control Flow | Kaiyan Chang et.al. | 2508.17826 | null |
| 2025-08-24 | LodeStar: Long-horizon Dexterity via Synthetic Data Augmentation from Human Demonstrations | Weikang Wan et.al. | 2508.17547 | null |
| 2025-07-28 | Data Augmentation for Spoken Grammatical Error Correction | Penny Karanasou et.al. | 2507.19374 | null |
| 2025-07-08 | Music Boomerang: Reusing Diffusion Models for Data Augmentation and Audio Manipulation | Alexander Fichtinger et.al. | 2507.04864 | null |
| 2025-04-07 | Mind the Prompt: Prompting Strategies in Audio Generations for Improving Sound Classification | Francesca Ronchini et.al. | 2504.03329 | null |
| 2025-03-25 | Multimodal Large Language Models for Image, Text, and Speech Data Augmentation: A Survey | Ranjan Sapkota et.al. | 2501.18648 | null |
| 2025-01-24 | Generative Data Augmentation Challenge: Synthesis of Room Acoustics for Speaker Distance Estimation | Jackie Lin et.al. | 2501.13250 | null |
| 2024-12-03 | Sample adaptive data augmentation with progressive scheduling | Hongxuan Lu et.al. | 2412.00415 | null |
| 2024-10-15 | SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs | Wenxi Chen et.al. | 2410.09503 | null |
| 2025-02-05 | Exploring Empty Spaces: Human-in-the-Loop Data Augmentation | Catherine Yeh et.al. | 2410.01088 | null |
| 2024-06-28 | Leveraging Synthetic Audio Data for End-to-End Low-Resource Speech Translation | Yasmin Moslem et.al. | 2406.17363 | null |
| 2024-06-25 | Revisiting Interpolation Augmentation for Speech-to-Text Generation | Chen Xu et.al. | 2406.15846 | null |
| 2024-01-18 | On the Effect of Data-Augmentation on Local Embedding Properties in the Contrastive Learning of Music Audio Representations | Matthew C. McCallum et.al. | 2401.08889 | null |
| 2024-01-17 | Maximum-Entropy Adversarial Audio Augmentation for Keyword Spotting | Zuzhao Ye et.al. | 2401.06897 | null |
| 2023-12-27 | Exploring data augmentation in bias mitigation against non-native-accented speech | Yuanyuan Zhang et.al. | 2312.15499 | null |
| 2023-12-15 | Towards Automatic Data Augmentation for Disordered Speech Recognition | Zengrui Jin et.al. | 2312.08641 | null |
| 2023-12-15 | PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition | Chengxi Lei et.al. | 2312.08571 | null |
| 2023-10-27 | Dialect Adaptation and Data Augmentation for Low-Resource ASR: TalTech Systems for the MADASR 2023 Challenge | Tanel Alumäe et.al. | 2310.17448 | null |
| 2024-01-11 | Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation | Shih-Lun Wu et.al. | 2309.17352 | null |
| 2024-02-21 | Deepfake audio as a data augmentation technique for training automatic speech to text transcription models | Alexandre R. Ferreira et.al. | 2309.12802 | null |
| 2024-07-02 | Reduce, Reuse, Recycle: Is Perturbed Data better than Other Language augmentation for Low Resource Self-Supervised Speech Models | Asad Ullah et.al. | 2309.12763 | null |
| 2024-01-10 | Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation | Emiru Tsunoo et.al. | 2309.08876 | null |
| 2023-08-01 | Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text | Eric Sun et.al. | 2307.16332 | null |
| 2023-06-08 | Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation | Massa Baali et.al. | 2306.04368 | null |
| 2023-04-26 | Selective Data Augmentation for Robust Speech Translation | Rajul Acharya et.al. | 2304.03169 | null |
| 2024-04-01 | A Comparison of Speech Data Augmentation Methods Using S3PRL Toolkit | Mina Huh et.al. | 2303.00510 | null |
| 2023-11-02 | SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations | Ioannis Tsiamas et.al. | 2212.09699 | null |
| 2023-05-24 | Exploring Train and Test-Time Augmentations for Audio-Language Learning | Eungbeom Kim et.al. | 2210.17143 | null |
| 2022-11-01 | Improving Natural-Language-based Audio Retrieval with Transfer Learning and Audio & Text Augmentations | Paul Primus et.al. | 2208.11460 | null |
| 2022-08-11 | Generative Data Augmentation Guided by Triplet Loss for Speech Emotion Recognition | Shijun Wang et.al. | 2208.04994 | null |
| 2022-07-21 | Improving Data Driven Inverse Text Normalization using Data Augmentation | Laxmi Pandey et.al. | 2207.09674 | null |
| 2022-07-15 | Data Augmentation for Low-Resource Quechua ASR Improvement | Rodolfo Zevallos et.al. | 2207.06872 | null |
| 2022-07-19 | Data Augmentation for Dementia Detection in Spoken Language | Anna Hlédiková et.al. | 2206.12879 | null |
| 2023-06-02 | Audio Data Augmentation for Acoustic-to-articulatory Speech Inversion using Bidirectional Gated RNNs | Yashish M. Siriwardena et.al. | 2205.13086 | null |
| 2025-11-05 | Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition | Zengrui Jin et.al. | 2205.06445 | null |
| 2022-04-12 | Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition | Zehai Tu et.al. | 2204.04284 | null |
| 2022-04-11 | Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning | Salah Zaiem et.al. | 2204.04170 | null |
| 2022-07-07 | SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy | Shuai Guo et.al. | 2203.17001 | null |
| 2023-06-12 | Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation | Tsz Kin Lam et.al. | 2203.08757 | null |
| 2022-09-02 | A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification | Qing Wang et.al. | 2203.04114 | null |
| 2022-02-22 | ImportantAug: a data augmentation agent for speech | Viet Anh Trinh et.al. | 2112.07156 | null |
| 2022-05-20 | Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection | Ricardo Falcon-Perez et.al. | 2110.06126 | null |
| 2021-10-14 | SpliceOut: A Simple and Efficient Audio Augmentation Method | Arjit Jain et.al. | 2110.00046 | null |
| 2021-08-17 | Data Augmentation for Scene Text Recognition | Rowel Atienza et.al. | 2108.06949 | null |
| 2021-08-09 | SpecMix : A Mixed Sample Data Augmentation method for Training withTime-Frequency Domain Features | Gwantae Kim et.al. | 2108.03020 | null |
| 2021-08-03 | Adversarial Data Augmentation for Disordered Speech Recognition | Zengrui Jin et.al. | 2108.00899 | null |
| 2021-04-27 | Semantic Data Augmentation for End-to-End Mandarin Speech Recognition | Jianwei Sun et.al. | 2104.12521 | null |
| 2021-04-16 | EnvGAN: Adversarial Synthesis of Environmental Sounds for Data Augmentation | Aswathy Madhu et.al. | 2104.07326 | null |
| 2023-06-13 | On-the-Fly Aligned Data Augmentation for Sequence-to-Sequence ASR | Tsz Kin Lam et.al. | 2104.01393 | null |
| 2021-06-16 | SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification | Helin Wang et.al. | 2103.16858 | null |
| 2021-02-26 | MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition | Linghui Meng et.al. | 2102.12664 | null |
| 2022-11-17 | Back Translation Survey for Improving Text Augmentation | Matthew Ciolino et.al. | 2102.09708 | null |
| 2021-02-19 | Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition | Gary Yeung et.al. | 2102.09106 | null |
| 2021-02-17 | Adaptive Weighting Scheme for Automatic Time-Series Data Augmentation | Elizabeth Fons et.al. | 2102.08310 | null |
| 2021-04-20 | Enhancing Audio Augmentation Methods with Consistency Learning | Turab Iqbal et.al. | 2102.05151 | null |
| 2023-03-08 | A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection | Qing Wang et.al. | 2101.02919 | null |
| 2022-02-17 | Multi-Window Data Augmentation Approach for Speech Emotion Recognition | Sarala Padi et.al. | 2010.09895 | null |
| 2020-09-01 | Data augmentation using prosody and false starts to recognize non-native children’s speech | Hemant Kathania et.al. | 2008.12914 | null |
| 2020-08-18 | StoRIR: Stochastic Room Impulse Response Generation for Audio Data Augmentation | Piotr Masztalski et.al. | 2008.07231 | null |
| 2020-09-25 | Data augmentation and loss normalization for deep noise suppression | Sebastian Braun et.al. | 2008.06412 | null |
| 2021-03-29 | Data augmentation enhanced speaker enrollment for text-dependent speaker verification | Achintya Kumar Sarkar et.al. | 2007.08004 | null |
| 2020-06-11 | Data Augmentation for Training Dialog Models Robust to Speech Recognition Errors | Longshaokan Wang et.al. | 2006.05635 | null |
| 2020-09-04 | On the Effectiveness of Neural Text Generation based Data Augmentation for Recognition of Morphologically Rich Speech | Balázs Tarján et.al. | 2006.05129 | null |
| 2020-01-16 | A Multi-cascaded Model with Data Augmentation for Enhanced Paraphrase Detection in Short Texts | Muhammad Haroon Shakeel et.al. | 1912.12068 | null |
| 2022-01-04 | Audiogmenter: a MATLAB Toolbox for Audio Data Augmentation | Gianluca Maguolo et.al. | 1912.05472 | null |
| 2020-02-04 | Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation | Thai-Son Nguyen et.al. | 1910.13296 | null |
| 2019-12-04 | SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition | Daniel S. Park et.al. | 1904.08779 | null |
| 2020-12-07 | Data Augmentation of Room Classifiers using Generative Adversarial Networks | Constantinos Papayiannis et.al. | 1901.03257 | null |
| 2018-08-14 | Sample Mixed-Based Data Augmentation for Domestic Audio Tagging | Shengyun Wei et.al. | 1808.03883 | null |
| 2018-09-13 | CNN+LSTM Architecture for Speech Emotion Recognition with Data Augmentation | Caroline Etienne et.al. | 1802.05630 | null |
| 2022-07-06 | Efficient data augmentation techniques for some classes of state space models | Linda S. L. Tan et.al. | 1712.08887 | null |
📊 1506 papers
| 📅 Publish Date | 📝 Title | 👥 Authors | 💻 Code | |
|---|---|---|---|---|
| 2026-04-01 | Bridging the Simulation-to-Experiment Gap with Generative Models using Adversarial Distribution Alignment | Kai Nelson et.al. | 2604.01169 | null |
| 2026-04-01 | Looking into a Pixel by Nonlinear Unmixing – A Generative Approach | Maofeng Tang et.al. | 2604.01141 | null |
| 2026-04-01 | Optimsyn: Influence-Guided Rubrics Optimization for Synthetic Data Generation | Zhiting Fan et.al. | 2604.00536 | null |
| 2026-03-31 | SANA I2I: A Text Free Flow Matching Framework for Paired Image to Image Translation with a Case Study in Fetal MRI Artifact Reduction | Italo Felix Santos et.al. | 2604.00298 | null |
| 2026-03-31 | SYNTHONY: A Stress-Aware, Intent-Conditioned Agent for Deep Tabular Generative Models Selection | Hochan Son et.al. | 2604.00293 | null |
| 2026-03-31 | RawGen: Learning Camera Raw Image Generation | Dongyoung Kim et.al. | 2604.00093 | null |
| 2026-03-31 | Reasoning-Driven Synthetic Data Generation and Evaluation | Tim R. Davidson et.al. | 2603.29791 | null |
| 2026-03-31 | Multi-Feature Fusion Approach for Generative AI Images Detection | Abderrezzaq Sendjasni et.al. | 2603.29788 | null |
| 2026-03-31 | Leveraging Synthetic Data for Enhancing Egocentric Hand-Object Interaction Detection | Rosario Leonardi et.al. | 2603.29733 | null |
| 2026-03-31 | 6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management | Jiao Chen et.al. | 2603.29656 | null |
| 2026-03-31 | Concept frustration: Aligning human concepts and machine representations | Enrico Parisini et.al. | 2603.29654 | null |
| 2026-03-31 | CIPHER: Counterfeit Image Pattern High-level Examination via Representation | Kyeonghun Kim et.al. | 2603.29356 | null |
| 2026-03-31 | Differentiable Normative Guidance for Nash Bargaining Solution Recovery | Moirangthem Tiken Singh et.al. | 2603.29297 | null |
| 2026-03-31 | Customer Analysis and Text Generation for Small Retail Stores Using LLM-Generated Marketing Presence | Shiori Nakamura et.al. | 2603.29273 | null |
| 2026-03-30 | Generating Humanless Environment Walkthroughs from Egocentric Walking Tour Videos | Yujin Ham et.al. | 2603.29036 | null |
| 2026-03-30 | The Ultimate Tutorial for AI-driven Scale Development in Generative Psychometrics: Releasing AIGENIE from its Bottle | Lara Russell-Lasalandra et.al. | 2603.28643 | null |
| 2026-03-30 | Unrestrained Simplex Denoising for Discrete Data. A Non-Markovian Approach Applied to Graph Generation | Yoann Boget et.al. | 2603.28572 | null |
| 2026-03-30 | A Probabilistic Generative Model for Spectral Speech Enhancement | Marco Hidalgo-Araya et.al. | 2603.28436 | null |
| 2026-03-30 | From Independent to Correlated Diffusion: Generalized Generative Modeling with Probabilistic Computers | Nihal Sanjay Singh et.al. | 2603.27996 | null |
| 2026-03-30 | Scaling Atomistic Protein Binder Design with Generative Pretraining and Test-Time Compute | Kieran Didi et.al. | 2603.27950 | null |
| 2026-03-29 | Diversity Matters: Dataset Diversification and Dual-Branch Network for Generalized AI-Generated Image Detection | Nusrat Tasnim et.al. | 2603.27800 | null |
| 2026-03-29 | Emergent Social Intelligence Risks in Generative Multi-Agent Systems | Yue Huang et.al. | 2603.27771 | null |
| 2026-03-29 | Test-Time Instance-Specific Parameter Composition: A New Paradigm for Adaptive Generative Modeling | Minh-Tuan Tran et.al. | 2603.27665 | null |
| 2026-03-29 | Understanding Semantic Perturbations on In-Processing Generative Image Watermarks | Anirudh Nakra et.al. | 2603.27513 | null |
| 2026-03-28 | Beyond Descriptions: A Generative Scene2Audio Framework for Blind and Low-Vision Users to Experience Vista Landscapes | Chitralekha Gupta et.al. | 2603.27295 | null |
| 2026-03-28 | Amalgam: Hybrid LLM-PGM Synthesis Algorithm for Accuracy and Realism | Antheas Kapenekakis et.al. | 2603.27254 | null |
| 2026-03-27 | Material Identification using Multi-Modal Intrinsic Radiation and Radiography | Khoa Nguyen et.al. | 2603.27036 | null |
| 2026-03-27 | Generative Shape Reconstruction with Geometry-Guided Langevin Dynamics | Linus Härenstam-Nielsen et.al. | 2603.27016 | null |
| 2026-03-27 | Transparency as Architecture: Structural Compliance Gaps in EU AI Act Article 50 II | Vera Schmitt et.al. | 2603.26983 | null |
| 2026-03-27 | Synthesizing the Counterfactual: A CTGAN-Augmented Causal Evaluation of Palliative Care on Spousal Depression | Pietro Grassi et.al. | 2603.26913 | null |
| 2026-03-27 | Strategic Candidacy in Generative AI Arenas | Chris Hays et.al. | 2603.26891 | null |
| 2026-03-27 | AFSS: Artifact-Focused Self-Synthesis for Mitigating Bias in Audio Deepfake Detection | Hai-Son Nguyen-Le et.al. | 2603.26856 | null |
| 2026-03-27 | Generative Modeling in Protein Design: Neural Representations, Conditional Generation, and Evaluation Standards | Senura Hansaja Wanasekara et.al. | 2603.26378 | null |
| 2026-03-27 | A Formal Framework for Uncertainty Analysis of Text Generation with Large Language Models | Steffen Herbold et.al. | 2603.26363 | null |
| 2026-03-27 | Generative Score Inference for Multimodal Data | Xinyu Tian et.al. | 2603.26349 | null |
| 2026-03-27 | Physics-Informed Neural Networks and Sequence Encoder: Application to heating and early cooling of thermo-stamping process | Mouad Elaarabi et.al. | 2603.26245 | null |
| 2026-03-27 | Cinematic Audio Source Separation Using Visual Cues | Kang Zhang et.al. | 2603.26113 | null |
| 2026-03-27 | JRM: Joint Reconstruction Model for Multiple Objects without Alignment | Qirui Wu et.al. | 2603.25985 | null |
| 2026-03-26 | Seeing Through Smoke: Surgical Desmoking for Improved Visual Perception | Jingpei Lu et.al. | 2603.25867 | null |
| 2026-03-26 | ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners? | Haonan Han et.al. | 2603.25823 | null |
| 2026-03-26 | SoftMimicGen: A Data Generation System for Scalable Robot Learning in Deformable Object Manipulation | Masoud Moghani et.al. | 2603.25725 | null |
| 2026-03-26 | Knowledge-Guided Retrieval-Augmented Generation for Zero-Shot Psychiatric Data: Privacy Preserving Synthetic Data Generation | Adam Jakobsen et.al. | 2603.25186 | null |
| 2026-03-26 | AnyDoc: Enhancing Document Generation via Large-Scale HTML/CSS Data Synthesis and Height-Aware Reinforcement Optimization | Jiawei Lin et.al. | 2603.25118 | null |
| 2026-03-26 | Discrete Causal Representation Learning | Wenjin Zhang et.al. | 2603.25017 | null |
| 2026-03-25 | Post-selection inference in generalized linear models via parametric programming | Qinyan Shen et.al. | 2603.24875 | null |
| 2026-03-25 | Synthetic Rewriting as a Quality Multiplier: Evidence from Portuguese Continued Pretraining | Thales Sales Almeida et.al. | 2603.24826 | null |
| 2026-03-25 | Synthetic Cardiac MRI Image Generation using Deep Generative Models | Ishan Kumarasinghe et.al. | 2603.24764 | null |
| 2026-03-25 | Contrastive Learning Boosts Deterministic and Generative Models for Weather Data | Nathan Bailey et.al. | 2603.24744 | null |
| 2026-03-25 | Training LLMs for Multi-Step Tool Orchestration with Constrained Data Synthesis and Graduated Rewards | Cheng Jiayang et.al. | 2603.24709 | null |
| 2026-03-25 | Saranga: MilliWatt Ultrasound for Navigation in Visually Degraded Environments on Palm-Sized Aerial Robots | Manoj Velmurugan et.al. | 2603.24699 | null |
| 2026-03-25 | SpinGQE: A Generative Quantum Eigensolver for Spin Hamiltonians | Alexander Holden et.al. | 2603.24298 | null |
| 2026-03-25 | PosterIQ: A Design Perspective Benchmark for Poster Understanding and Generation | Yuheng Feng et.al. | 2603.24078 | null |
| 2026-03-27 | CVPD at QIAS 2026: RAG-Guided LLM Reasoning for Al-Mawarith Share Computation and Heir Allocation | Wassim Swaileh et.al. | 2603.24012 | null |
| 2026-03-25 | GARP-EFM: Improving Foundation Models with Revealed Preference Structure | Victor H. Aguiar et.al. | 2603.23993 | null |
| 2026-03-25 | Argument Mining as a Text-to-Text Generation Task | Masayuki Kawarada et.al. | 2603.23949 | null |
| 2026-03-24 | CDMT-EHR: A Continuous-Time Diffusion Framework for Generating Mixed-Type Time-Series Electronic Health Records | Shaonan Liu et.al. | 2603.23719 | null |
| 2026-03-24 | GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models | Zekai Gu et.al. | 2603.23246 | null |
| 2026-03-25 | DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models | Donya Jafari et.al. | 2603.23140 | null |
| 2026-03-24 | HUydra: Full-Range Lung CT Synthesis via Multiple HU Interval Generative Modelling | António Cardoso et.al. | 2603.23041 | null |
| 2026-03-24 | Few-Shot Generative Model Adaption via Identity Injection and Preservation | Yeqi He et.al. | 2603.22965 | null |
| 2026-03-23 | MIOFlow 2.0: A unified framework for inferring cellular stochastic dynamics from single cell and spatial transcriptomics data | Xingzhi Sun et.al. | 2603.22564 | null |
| 2026-03-23 | MCLR: Improving Conditional Modeling in Visual Generative Models via Inter-Class Likelihood-Ratio Maximization and Establishing the Equivalence between Classifier-Free Guidance and Alignment Objectives | Xiang Li et.al. | 2603.22364 | null |
| 2026-03-23 | GenOpticalFlow: A Generative Approach to Unsupervised Optical Flow Learning | Yixuan Luo et.al. | 2603.22270 | null |
| 2026-03-23 | Gumbel Distillation for Parallel Text Generation | Chi Zhang et.al. | 2603.22216 | null |
| 2026-03-23 | Dual-Space Knowledge Distillation with Key-Query Matching for Large Language Models with Vocabulary Mismatch | Stella Eva Tsiapali et.al. | 2603.22056 | null |
| 2026-03-23 | DiT-Flow: Speech Enhancement Robust to Multiple Distortions based on Flow Matching in Latent Space and Diffusion Transformers | Tianyu Cao et.al. | 2603.21608 | null |
| 2026-03-23 | SynSym: A Synthetic Data Generation Framework for Psychiatric Symptom Identification | Migyeong Kang et.al. | 2603.21529 | null |
| 2026-03-22 | Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs | Mariela M. Nina et.al. | 2603.21418 | null |
| 2026-03-22 | Amortized Variational Inference for Logistic Regression with Missing Covariates | M. Cherifi et.al. | 2603.21244 | null |
| 2026-03-22 | Does Mechanistic Interpretability Transfer Across Data Modalities? A Cross-Domain Causal Circuit Analysis of Variational Autoencoders | Dip Roy et.al. | 2603.21236 | null |
| 2026-03-22 | Incentivizing Generative Zero-Shot Learning via Outcome-Reward Reinforcement Learning with Visual Cues | Wenjin Hou et.al. | 2603.21138 | null |
| 2026-03-21 | NextSense: A Semi-Synthetic Sensing Data generation Platform | David Rico Menéndez et.al. | 2603.20789 | null |
| 2026-03-21 | Generative Diffusion Model for Risk-Neutral Derivative Pricing | Nilay Tiwari et.al. | 2603.20582 | null |
| 2026-03-20 | Revenue-Sharing as Infrastructure: A Distributed Business Model for Generative AI Platforms | Ghislain Dorian Tchuente Mondjo et.al. | 2603.20533 | null |
| 2026-03-20 | Diffutron: A Masked Diffusion Language Model for Turkish Language | Şuayp Talha Kocabay et.al. | 2603.20466 | null |
| 2026-03-20 | MME-CoF-Pro: Evaluating Reasoning Coherence in Video Generative Models with Text and Visual Hints | Yu Qi et.al. | 2603.20194 | null |
| 2026-03-20 | Deterministic Mode Proposals: An Efficient Alternative to Generative Sampling for Ambiguous Segmentation | Sebastian Gerard et.al. | 2603.20191 | null |
| 2026-03-20 | Kolmogorov-Arnold causal generative models | Alejandro Almodóvar et.al. | 2603.20184 | null |
| 2026-03-20 | Audio Avatar Fingerprinting: An Approach for Authorized Use of Voice Cloning in the Era of Synthetic Audio | Candice R. Gerstner et.al. | 2603.20165 | null |
| 2026-03-20 | Var-JEPA: A Variational Formulation of the Joint-Embedding Predictive Architecture – Bridging Predictive and Generative Self-Supervised Learning | Moritz Gögl et.al. | 2603.20111 | null |
| 2026-03-20 | GO-GenZip: Goal-Oriented Generative Sampling and Hybrid Compression | Pietro Talli et.al. | 2603.20109 | null |
| 2026-03-20 | FoleyDirector: Fine-Grained Temporal Steering for Video-to-Audio Generation via Structured Scripts | You Li et.al. | 2603.19857 | null |
| 2026-03-20 | Diminishing Returns in Expanding Generative Models and Godel-Tarski-Lob Limits | Angshul Majumdar et.al. | 2603.19687 | null |
| 2026-03-24 | LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment | Shuaibang Peng et.al. | 2603.19609 | null |
| 2026-03-19 | Warm-Start Flow Matching for Guaranteed Fast Text/Image Generation | Minyoung Kim et.al. | 2603.19360 | null |
| 2026-03-19 | MIDST Challenge at SaTML 2025: Membership Inference over Diffusion-models-based Synthetic Tabular data | Masoumeh Shafieinejad et.al. | 2603.19185 | null |
| 2026-03-19 | Revisiting Autoregressive Models for Generative Image Classification | Ilia Sudakov et.al. | 2603.19122 | null |
| 2026-03-19 | Foundations of Schrödinger Bridges for Generative Modeling | Sophia Tang et.al. | 2603.18992 | null |
| 2026-03-19 | Translating MRI to PET through Conditional Diffusion Models with Enhanced Pathology Awareness | Yitong Li et.al. | 2603.18896 | null |
| 2026-03-19 | A Human-in/on-the-Loop Framework for Accessible Text Generation | Lourdes Moreno et.al. | 2603.18879 | null |
| 2026-03-19 | Seasoning Generative Models for a Generalization Aftertaste | Hisham Husain et.al. | 2603.18817 | null |
| 2026-03-19 | Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds | Andrew Choi et.al. | 2603.18532 | null |
| 2026-03-19 | From Snapshots to Symphonies: The Evolution of Protein Prediction from Static Structures to Generative Dynamics and Multimodal Interactions | Jingzhi Chen et.al. | 2603.18505 | null |
| 2026-03-18 | Synthetic Data Generation for Training Diversified Commonsense Reasoning Models | Tianhui Zhang et.al. | 2603.18361 | null |
| 2026-03-18 | Epistemic Generative Adversarial Networks | Muhammad Mubashar et.al. | 2603.18348 | null |
| 2026-03-20 | MOSS-TTS Technical Report | Yitian Gong et.al. | 2603.18090 | null |
| 2026-03-18 | Generative Replica-Exchange: A Flow-based Framework for Accelerating Replica Exchange Simulations | Shengjie Huang et.al. | 2603.18076 | null |
| 2026-03-18 | Machine Learning for Network Attacks Classification and Statistical Evaluation of Machine Learning for Network Attacks Classification and Adversarial Learning Methodologies for Synthetic Data Generation | Iakovos-Christos Zarkadis et.al. | 2603.17717 | null |
| 2026-03-18 | Cohomological Obstructions to Global Counterfactuals: A Sheaf-Theoretic Foundation for Generative Causal Models | Rui Wu et.al. | 2603.17384 | null |
| 2026-03-18 | Neuron-Level Emotion Control in Speech-Generative Large Audio-Language Models | Xiutian Zhao et.al. | 2603.17231 | null |
| 2026-03-17 | Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization | Wenhao Zhao et.al. | 2603.17052 | null |
| 2026-03-17 | SCE-LITE-HQ: Smooth visual counterfactual explanations with generative foundation models | Ahmed Zeid et.al. | 2603.17048 | null |
| 2026-03-17 | Dependence Fidelity and Downstream Inference Stability in Generative Models | Nazia Riasat et.al. | 2603.17041 | null |
| 2026-03-19 | HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning | Shenzhi Wang et.al. | 2603.17024 | null |
| 2026-03-17 | SegviGen: Repurposing 3D Generative Model for Part Segmentation | Lin Li et.al. | 2603.16869 | null |
| 2026-03-17 | A Semantic Timbre Dataset for the Electric Guitar | Joseph Cameron et.al. | 2603.16682 | null |
| 2026-03-17 | VideoMatGen: PBR Materials through Joint Generative Modeling | Jon Hasselgren et.al. | 2603.16566 | null |
| 2026-03-17 | Unlearning for One-Step Generative Models via Unbalanced Optimal Transport | Hyundo Choi et.al. | 2603.16489 | null |
| 2026-03-17 | DermaFlux: Synthetic Skin Lesion Generation with Rectified Flows for Enhanced Image Classification | Stathis Galanakis et.al. | 2603.16392 | null |
| 2026-03-17 | Out-of-Distribution Object Detection in Street Scenes via Synthetic Outlier Exposure and Transfer Learning | Sadia Ilyas et.al. | 2603.16122 | null |
| 2026-03-17 | Diffusion Models for Joint Audio-Video Generation | Alejandro Paredes La Torre et.al. | 2603.16093 | null |
| 2026-03-16 | FlatLands: Generative Floormap Completion From a Single Egocentric View | Subhransu S. Bhattacharjee et.al. | 2603.16016 | null |
| 2026-03-16 | Time-Aware Prior Fitted Networks for Zero-Shot Forecasting with Exogenous Variables | Andres Potapczynski et.al. | 2603.15802 | null |
| 2026-03-16 | AC-Foley: Reference-Audio-Guided Video-to-Audio Synthesis with Acoustic Transfer | Pengjun Fang et.al. | 2603.15597 | null |
| 2026-03-16 | Clinically Aware Synthetic Image Generation for Concept Coverage in Chest X-ray Models | Amy Rafferty et.al. | 2603.15525 | null |
| 2026-03-18 | NV-Bench: Benchmark of Nonverbal Vocalization Synthesis for Expressive Text-to-Speech Generation | Qinke Ni et.al. | 2603.15352 | null |
| 2026-03-16 | Faster Inference of Flow-Based Generative Models via Improved Data-Noise Coupling | Aram Davtyan et.al. | 2603.15279 | null |
| 2026-03-16 | Modeling Matches as Language: A Generative Transformer Approach for Counterfactual Player Valuation in Football | Miru Hong et.al. | 2603.15212 | null |
| 2026-03-16 | PhonemeDF: A Synthetic Speech Dataset for Audio Deepfake Detection and Naturalness Evaluation | Vamshi Nallaguntla et.al. | 2603.15037 | null |
| 2026-03-18 | Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods | Omer Ben Hayun et.al. | 2603.15026 | null |
| 2026-03-16 | OrgForge: A Multi-Agent Simulation Framework for Verifiable Synthetic Corporate Corpora | Jeffrey Flynt et.al. | 2603.14997 | null |
| 2026-03-16 | Preconditioned One-Step Generative Modeling for Bayesian Inverse Problems in Function Spaces | Zilan Cheng et.al. | 2603.14798 | null |
| 2026-03-16 | LiDAR-EVS: Enhance Extrapolated View Synthesis for 3D Gaussian Splatting with Pseudo-LiDAR Supervision | Yiming Huang et.al. | 2603.14763 | null |
| 2026-03-16 | Chain-of-Trajectories: Unlocking the Intrinsic Generative Optimality of Diffusion Models via Graph-Theoretic Planning | Ping Chen et.al. | 2603.14704 | null |
| 2026-03-15 | QiMeng-CodeV-SVA: Training Specialized LLMs for Hardware Assertion Generation via RTL-Grounded Bidirectional Data Synthesis | Yutong Wu et.al. | 2603.14239 | null |
| 2026-03-15 | Fair Benchmarking of Emerging One-Step Generative Models Against Multistep Diffusion and Flow Models | Advaith Ravishankar et.al. | 2603.14186 | null |
| 2026-03-14 | Sat-JEPA-Diff: Bridging Self-Supervised Learning and Generative Diffusion for Remote Sensing | Kursat Komurcu et.al. | 2603.13943 | null |
| 2026-03-14 | Discriminative Flow Matching Via Local Generative Predictors | Om Govind Jha et.al. | 2603.13928 | null |
| 2026-03-14 | Evaluating Semantic Fragility in Text-to-Audio Generation Systems Under Controlled Prompt Perturbations | Jiahui Wu et.al. | 2603.13824 | null |
| 2026-03-14 | PhysAlign: Physics-Coherent Image-to-Video Generation through Feature and 3D Representation Alignment | Zhexiao Xiong et.al. | 2603.13770 | null |
| 2026-03-14 | Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control | Grayson Lee et.al. | 2603.13733 | null |
| 2026-03-14 | Steering Generative Models for Accessibility: EasyRead Image Generation | Nicolas Dickenmann et.al. | 2603.13695 | null |
| 2026-03-13 | EmDT: Embedding Diffusion Transformer for Tabular Data Generation in Fraud Detection | En-Ya Kuo et.al. | 2603.13566 | null |
| 2026-03-13 | MIRAGE: Model-agnostic Industrial Realistic Anomaly Generation and Evaluation for Visual Anomaly Detection | Jinwei Hu et.al. | 2603.13507 | null |
| 2026-03-13 | Understanding the strengths and weaknesses of SSL models for audio deepfake model attribution | Gabriel Pîrlogeanu et.al. | 2603.13488 | null |
| 2026-03-13 | A Generative Model of Conspicuous Consumption and Status Signaling | Logan Cross et.al. | 2603.13220 | null |
| 2026-03-13 | V-Bridge: Bridging Video Generative Priors to Versatile Few-shot Image Restoration | Shenghe Zheng et.al. | 2603.13089 | null |
| 2026-03-16 | DS $^2$ -Instruct: Domain-Specific Data Synthesis for Large Language Models Instruction Tuning | Ruiyao Xu et.al. | 2603.12932 | null |
| 2026-03-13 | HaltNav: Reactive Visual Halting over Lightweight Topological Priors for Robust Vision-Language Navigation | Pingcong Li et.al. | 2603.12696 | null |
| 2026-03-12 | RAW-Domain Degradation Models for Realistic Smartphone Super-Resolution | Ali Mosleh et.al. | 2603.12493 | null |
| 2026-03-12 | Sinkhorn-Drifting Generative Models | Ping He et.al. | 2603.12366 | null |
| 2026-03-11 | Synthetic Data Generation for Brain-Computer Interfaces: Overview, Benchmarking, and Future Directions | Ziwei Wang et.al. | 2603.12296 | null |
| 2026-03-12 | DVD: Deterministic Video Depth Estimation with Generative Priors | Hongfei Zhang et.al. | 2603.12250 | null |
| 2026-03-15 | QAQ: Bidirectional Semantic Coherence for Selecting High-Quality Synthetic Code Instructions | Jiayin Lei et.al. | 2603.12165 | null |
| 2026-03-16 | Structure Selection for Fairness-Constrained Differentially Private Data Synthesis | Naeim Ghahramanpour et.al. | 2603.12112 | null |
| 2026-03-12 | Efficient Generative Modeling with Unitary Matrix Product States Using Riemannian Optimization | Haotong Duan et.al. | 2603.12026 | null |
| 2026-03-12 | AS-Bridge: A Bidirectional Generative Framework Bridging Next-Generation Astronomical Surveys | Dichang Zhang et.al. | 2603.11928 | null |
| 2026-03-12 | Language Generation with Replay: A Learning-Theoretic View of Model Collapse | Giorgio Racca et.al. | 2603.11784 | null |
| 2026-03-12 | Anomaly detection in time-series via inductive biases in the latent space of conditional normalizing flows | David Baumgartner et.al. | 2603.11756 | null |
| 2026-03-12 | Gender Bias in Generative AI-assisted Recruitment Processes | Martina Ullasci et.al. | 2603.11736 | null |
| 2026-03-12 | Resonate: Reinforcing Text-to-Audio Generation via Online Feedback from Large Audio Language Models | Xiquan Li et.al. | 2603.11661 | null |
| 2026-03-12 | Personalized Federated Learning via Gaussian Generative Modeling | Peng Hu et.al. | 2603.11620 | null |
| 2026-03-12 | Gen-Fab: A Variation-Aware Generative Model for Predicting Fabrication Variations in Nanophotonic Devices | Rambod Azimi et.al. | 2603.11505 | null |
| 2026-03-12 | Reproducible Synthetic Clinical Letters for Seizure Frequency Information Extraction | Yujian Gan et.al. | 2603.11407 | null |
| 2026-03-11 | A Standardized Framework For Evaluating Gene Expression Generative Models | Andrea Rubbi et.al. | 2603.11244 | null |
| 2026-03-11 | Generative modeling with Gaussian Boson Sampling: classically trainable Bosonic Born Machines | Zoltán Kolarovszki et.al. | 2603.11195 | null |
| 2026-03-11 | Interventional Time Series Priors for Causal Foundation Models | Dennis Thumm et.al. | 2603.11090 | null |
| 2026-03-11 | V2A-DPO: Omni-Preference Optimization for Video-to-Audio Generation | Nolan Chan et.al. | 2603.11089 | null |
| 2026-03-11 | Universality of Classically Trainable, Quantum-Deployed Boson-Sampling Generative Models | Andrii Kurkin et.al. | 2603.11014 | null |
| 2026-03-11 | Quantifying Membership Disclosure Risk for Tabular Synthetic Data Using Kernel Density Estimators | Rajdeep Pathak et.al. | 2603.10937 | null |
| 2026-03-11 | SNPgen: Phenotype-Supervised Genotype Representation and Synthetic Data Generation via Latent Diffusion | Andrea Lampis et.al. | 2603.10873 | null |
| 2026-03-11 | ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning | Xiaofeng Lin et.al. | 2603.10823 | null |
| 2026-03-11 | Semantic Satellite Communications for Synchronized Audiovisual Reconstruction | Fangyu Liu et.al. | 2603.10791 | null |
| 2026-03-12 | Probabilistic Verification of Voice Anti-Spoofing Models | Evgeny Kushnir et.al. | 2603.10713 | null |
| 2026-03-11 | AlphaFlowTSE: One-Step Generative Target Speaker Extraction via Conditional AlphaFlow | Duojia Li et.al. | 2603.10701 | null |
| 2026-03-11 | Learning Bimanual Cloth Manipulation with Vision-based Tactile Sensing via Single Robotic Arm | Dongmyoung Lee et.al. | 2603.10609 | null |
| 2026-03-11 | HyPER-GAN: Hybrid Patch-Based Image-to-Image Translation for Real-Time Photorealism Enhancement | Stefanos Pasios et.al. | 2603.10604 | null |
| 2026-03-11 | Layer Consistency Matters: Elegant Latent Transition Discrepancy for Generalizable Synthetic Image Detection | Yawen Yang et.al. | 2603.10598 | null |
| 2026-03-11 | Gradient Flow Drifting: Generative Modeling via Wasserstein Gradient Flows of KDE-Approximated Divergences | Jiarui Cao et.al. | 2603.10592 | null |
| 2026-03-10 | Improving TabPFN’s Synthetic Data Generation by Integrating Causal Structure | Davide Tugnoli et.al. | 2603.10254 | null |
| 2026-03-10 | Generative Drifting is Secretly Score Matching: a Spectral and Variational Perspective | Erkan Turan et.al. | 2603.09936 | null |
| 2026-03-10 | You Didn’t Have to Say It like That: Subliminal Learning from Faithful Paraphrases | Isaia Gisler et.al. | 2603.09517 | null |
| 2026-03-10 | Cognitively Layered Data Synthesis for Domain Adaptation of LLMs to Space Situational Awareness | Ding Linghu et.al. | 2603.09231 | null |
| 2026-03-10 | Differentiable Stochastic Traffic Dynamics: Physics-Informed Generative Modelling in Transportation | Wuping Xin et.al. | 2603.09174 | null |
| 2026-03-09 | Statistical Inference via Generative Models: Flow Matching and Causal Inference | Shinto Eguchi et.al. | 2603.09009 | null |
| 2026-03-09 | VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMs | Hezhao Zhang et.al. | 2603.08936 | null |
| 2026-03-09 | HeteroFedSyn: Differentially Private Tabular Data Synthesis for Heterogeneous Federated Settings | Xiaochen Li et.al. | 2603.08832 | null |
| 2026-03-09 | Efficient training of photonic quantum generative models | Felix Gottlieb et.al. | 2603.08793 | null |
| 2026-03-09 | Generative Adversarial Regression (GAR): Learning Conditional Risk Scenarios | Saeed Asadi et.al. | 2603.08553 | null |
| 2026-03-09 | Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows | Shentong Mo et.al. | 2603.08126 | null |
| 2026-03-12 | Evaluating Generative Models via One-Dimensional Code Distributions | Zexi Jia et.al. | 2603.08064 | null |
| 2026-03-08 | Uncertainty-Gated Generative Modeling | Xingrui Gu et.al. | 2603.07753 | null |
| 2026-03-08 | Evaluating Synthetic Data for Baggage Trolley Detection in Airport Logistics | Abdeldjalil Taibi et.al. | 2603.07645 | null |
| 2026-03-08 | Targeted Speaker Poisoning Framework in Zero-Shot Text-to-Speech | Thanapat Trachu et.al. | 2603.07551 | null |
| 2026-03-08 | Learning-free L2-Accented Speech Generation using Phonological Rules | Thanathai Lertpetchpun et.al. | 2603.07550 | null |
| 2026-03-08 | Contact-Guided 3D Genome Structure Generation of E. coli via Diffusion Transformers | Mingxin Zhang et.al. | 2603.07472 | null |
| 2026-03-07 | DiffSIM: Unconditional and conditional facies simulation based on denoising diffusion generative models | Minghui Xu et.al. | 2603.07383 | null |
| 2026-03-07 | ConfHit: Conformal Generative Design with Oracle Free Guarantees | Siddhartha Laghuvarapu et.al. | 2603.07371 | null |
| 2026-03-10 | Latent Generative Models with Tunable Complexity for Compressed Sensing and other Inverse Problems | Sean Gunn et.al. | 2603.07357 | null |
| 2026-03-07 | Agentic Planning with Reasoning for Image Styling via Offline RL | Subhojyoti Mukherjee et.al. | 2603.07148 | null |
| 2026-03-07 | Resource-Adaptive Federated Text Generation with Differential Privacy | Jiayi Wang et.al. | 2603.07027 | null |
| 2026-03-07 | Conditional Unbalanced Optimal Transport Maps: An Outlier-Robust Framework for Conditional Generative Modeling | Jiwoo Yoon et.al. | 2603.06972 | null |
| 2026-03-06 | Stability-Guided Exploration for Diverse Motion Generation | Eckart Cobo-Briesewitz et.al. | 2603.06773 | null |
| 2026-03-06 | Improved Constrained Generation by Bridging Pretrained Generative Models | Xiaoxuan Liang et.al. | 2603.06742 | null |
| 2026-03-06 | From Statistical Fidelity to Clinical Consistency: Scalable Generation and Auditing of Synthetic Patient Trajectories | Guanglin Zhou et.al. | 2603.06720 | null |
| 2026-03-06 | Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis | Hila Chefer et.al. | 2603.06507 | null |
| 2026-03-06 | Training Flow Matching: The Role of Weighting and Parameterization | Anne Gagneux et.al. | 2603.06454 | null |
| 2026-03-06 | Toward Generative Quantum Utility via Correlation-Complexity Map | Chen-Yu Liu et.al. | 2603.06440 | null |
| 2026-03-10 | Making Training-Free Diffusion Segmentors Scale with the Generative Power | Benyuan Meng et.al. | 2603.06178 | null |
| 2026-03-06 | Longitudinal NSCLC Treatment Progression via Multimodal Generative Models | Massimiliano Mantegna et.al. | 2603.06147 | null |
| 2026-03-06 | A Hazard-Informed Data Pipeline for Robotics Physical Safety | Alexei Odinokov et.al. | 2603.06130 | null |
| 2026-03-06 | PixARMesh: Autoregressive Mesh-Native Single-View Scene Reconstruction | Xiang Zhang et.al. | 2603.05888 | null |
| 2026-03-06 | StreamWise: Serving Multi-Modal Generation in Real-Time at Scale | Haoran Qiu et.al. | 2603.05800 | null |
| 2026-03-06 | CBCT-Based Synthetic CT Generation Using Conditional Flow Matching Model | Junbo Peng et.al. | 2603.05796 | null |
| 2026-03-05 | EigenData: A Self-Evolving Multi-Agent Platform for Function-Calling Data Synthesis, Auditing, and Repair | Jiaao Chen et.al. | 2603.05553 | null |
| 2026-03-05 | Building Enterprise Realtime Voice Agents from Scratch: A Technical Tutorial | Jielin Qiu et.al. | 2603.05413 | null |
| 2026-03-05 | Harnessing Synthetic Data from Generative AI for Statistical Inference | Ahmad Abdel-Azim et.al. | 2603.05396 | null |
| 2026-03-05 | WavSLM: Single-Stream Speech Language Modeling via WavLM Distillation | Luca Della Libera et.al. | 2603.05299 | null |
| 2026-03-05 | How far have we gone in Generative Image Restoration? A study on its capability, limitations and evaluation practices | Xiang Yin et.al. | 2603.05010 | link |
| 2026-03-05 | HiFlow: Hierarchical Feedback-Driven Optimization for Constrained Long-Form Text Generation | Yifan Zhu et.al. | 2603.04996 | null |
| 2026-03-05 | Free Lunch for Pass@ $k$ ? Low Cost Diverse Sampling for Diffusion Language Models | Sean Lamont et.al. | 2603.04893 | null |
| 2026-03-04 | Semi-Supervised Generative Learning via Latent Space Distribution Matching | Kwong Yu Chong et.al. | 2603.04223 | null |
| 2026-03-05 | TumorFlow: Physics-Guided Longitudinal MRI Synthesis of Glioblastoma Growth | Valentin Biller et.al. | 2603.04058 | null |
| 2026-03-04 | Towards Generalized Multimodal Homography Estimation | Jinkun You et.al. | 2603.03956 | null |
| 2026-03-04 | Harnessing Selective State Space Models to Enhance Semianalytical Design of Fabrication-Ready Multilayered Huygens’ Metasurfaces: Part II - Generative Inverse Design (MetaMamba) | Natanel Nissan et.al. | 2603.03877 | null |
| 2026-03-04 | Relational In-Context Learning via Synthetic Pre-training with Structural Prior | Yanbo Wang et.al. | 2603.03805 | null |
| 2026-03-04 | JANUS: Structured Bidirectional Generation for Guaranteed Constraints and Analytical Uncertainty | Taha Racicot et.al. | 2603.03748 | null |
| 2026-03-03 | PRIVATEEDIT: A Privacy-Preserving Pipeline for Face-Centric Generative Image Editing | Dipesh Tamboli et.al. | 2603.03412 | null |
| 2026-03-03 | Infinite dimensional generative sensing | Paolo Angella et.al. | 2603.03196 | null |
| 2026-03-04 | QFlowNet: Fast, Diverse, and Efficient Unitary Synthesis with Generative Flow Networks | Inhoe Koo et.al. | 2603.03045 | null |
| 2026-03-03 | Integrating Homomorphic Encryption and Synthetic Data in FL for Privacy and Learning Quality | Yenan Wang et.al. | 2603.02969 | null |
| 2026-03-03 | On Discriminative vs. Generative classifiers: Rethinking MLLMs for Action Understanding | Zhanzhong Pang et.al. | 2603.02546 | null |
| 2026-03-02 | A Directed Graph Model and Experimental Framework for Design and Study of Time-Dependent Text Visualisation | Songhai Fan et.al. | 2603.02422 | null |
| 2026-03-02 | RO-N3WS: Enhancing Generalization in Low-Resource ASR with Diverse Romanian Speech Benchmarks | Alexandra Diaconu et.al. | 2603.02368 | null |
| 2026-03-02 | CausalWrap: Model-Agnostic Causal Constraint Wrappers for Tabular Synthetic Data | Amir Asiaee et.al. | 2603.02015 | null |
| 2026-03-02 | Noise-Calibrated Inference from Differentially Private Sufficient Statistics in Exponential Families | Amir Asiaee et.al. | 2603.02010 | null |
| 2026-03-02 | CoVAE: correlated multimodal generative modeling | Federico Caretti et.al. | 2603.01965 | null |
| 2026-03-02 | Phase-Type Variational Autoencoders for Heavy-Tailed Data | Abdelhakim Ziani et.al. | 2603.01800 | null |
| 2026-03-02 | A Diffusion-Driven Fine-Grained Nodule Synthesis Framework for Enhanced Lung Nodule Detection from Chest Radiographs | Aryan Goyal et.al. | 2603.01659 | null |
| 2026-03-02 | Transform-Invariant Generative Ray Path Sampling for Efficient Radio Propagation Modeling | Jérome Eertmans et.al. | 2603.01655 | null |
| 2026-03-02 | RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry | Xinchang Wang et.al. | 2603.01544 | null |
| 2026-03-02 | PhysFormer: A Physics-Embedded Generative Model for Physically Self-Consistent Spectral Synthesis | Siqi Wang et.al. | 2603.01459 | null |
| 2026-03-02 | Autoregressive Synthesis of Sparse and Semi-Structured Mixed-Type Data | Thomas Rückstieß et.al. | 2603.01444 | null |
| 2026-03-02 | LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval | Jiajie Jin et.al. | 2603.01425 | null |
| 2026-03-01 | Velocity Model Building and Editing with Guided Denoising Diffusion Implicit Models | Francesco Brandolin et.al. | 2603.01231 | null |
| 2026-03-01 | Generative AI & Fictionality: How Novels Power Large Language Models | Edwin Roland et.al. | 2603.01220 | null |
| 2026-02-28 | Constitutional Black-Box Monitoring for Scheming in LLM Agents | Simon Storf et.al. | 2603.00829 | null |
| 2026-02-28 | Designing the Haystack: Programmable Chemical Space for Generative Molecular Discovery | Yuchen Zhu et.al. | 2603.00614 | null |
| 2026-02-28 | SesaHand: Enhancing 3D Hand Reconstruction via Controllable Generation with Semantic and Structural Alignment | Zhuoran Zhao et.al. | 2603.00443 | null |
| 2026-02-28 | Mamba-CAD: State Space Model For 3D Computer-Aided Design Generative Modeling | Xueyang Li et.al. | 2603.00439 | null |
| 2026-02-27 | SKeDA: A Generative Watermarking Framework for Text-to-video Diffusion Models | Yang Yang et.al. | 2603.00194 | null |
| 2026-02-27 | TradeFM: A Generative Foundation Model for Trade-flow and Market Microstructure | Maxime Kawawa-Beaudan et.al. | 2602.23784 | null |
| 2026-02-27 | DashengTokenizer: One layer is enough for unified audio understanding and generation | Heinrich Dinkel et.al. | 2602.23765 | null |
| 2026-02-27 | MMKG-RDS: Reasoning Data Synthesis via Deep Mining of Multimodal Knowledge Graphs | Lun Zhan et.al. | 2602.23632 | null |
| 2026-02-27 | Synthetic Data Powers Product Retrieval for Long-tail Knowledge-Intensive Queries in E-commerce Search | Gui Ling et.al. | 2602.23620 | null |
| 2026-02-27 | Flowette: Flow Matching with Graphette Priors for Graph Generation | Asiri Wijesinghe et.al. | 2602.23566 | null |
| 2026-03-02 | Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos | Ziqi Gao et.al. | 2602.23543 | null |
| 2026-02-26 | Uncovering Physical Drivers of Dark Matter Halo Structures with Auxiliary-Variable-Guided Generative Models | Arkaprabha Ganguli et.al. | 2602.23518 | null |
| 2026-02-26 | SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation | Vaibhav Agrawal et.al. | 2602.23359 | null |
| 2026-02-26 | SemanticVocoder: Bridging Audio Generation and Audio Understanding via Semantic Latents | Zeyu Xie et.al. | 2602.23333 | null |
| 2026-02-26 | Data-Efficient Generative Modeling of Non-Gaussian Global Climate Fields via Scalable Composite Transformations | Johannes Brachem et.al. | 2602.23311 | null |
| 2026-02-26 | Efficient training of generative models from multireference simulations and its application to the design of Dy complexes with large magnetic anisotropy | Zahra Khatibi et.al. | 2602.23230 | null |
| 2026-02-26 | Q-Tag: Watermarking Quantum Circuit Generative Models | Yang Yang et.al. | 2602.23085 | null |
| 2026-02-28 | Deepfake Word Detection by Next-token Prediction using Fine-tuned Whisper | Hoan My Tran et.al. | 2602.22658 | null |
| 2026-02-26 | CRAG: Can 3D Generative Models Help 3D Assembly? | Zeyu Jiang et.al. | 2602.22629 | null |
| 2026-02-26 | BetterScene: 3D Scene Synthesis with Representation-Aligned Generative Model | Yuci Han et.al. | 2602.22596 | null |
| 2026-02-26 | Where Relevance Emerges: A Layer-Wise Study of Internal Attention for Zero-Shot Re-Ranking | Haodong Chen et.al. | 2602.22591 | null |
| 2026-02-25 | Bayesian Generative Adversarial Networks via Gaussian Approximation for Tabular Data Synthesis | Bahrul Ilmi Nasution et.al. | 2602.21948 | null |
| 2026-02-25 | Joint Shadow Generation and Relighting via Light-Geometry Interaction Maps | Shan Wang et.al. | 2602.21820 | null |
| 2026-02-26 | SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model | Guibin Chen et.al. | 2602.21818 | null |
| 2026-02-25 | Inverse prediction of capacitor multiphysics dynamic parameters using deep generative model | Kart-Leong Lim et.al. | 2602.21606 | null |
| 2026-02-27 | Provably Safe Generative Sampling with Constricting Barrier Functions | Darshan Gadginmath et.al. | 2602.21429 | null |
| 2026-02-24 | Archetypal Graph Generative Models: Explainable and Identifiable Communities via Anchor-Dominant Convex Hulls | Nikolaos Nakis et.al. | 2602.21342 | null |
| 2026-02-24 | SOM-VQ: Topology-Aware Tokenization for Interactive Generative Models | Alessandro Londei et.al. | 2602.21133 | null |
| 2026-02-25 | Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models | Christian Simon et.al. | 2602.20981 | null |
| 2026-02-24 | See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis | Jaehyun Park et.al. | 2602.20951 | null |
| 2026-02-24 | BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity | Juil Koo et.al. | 2602.20666 | null |
| 2026-02-24 | CAD-Prompted SAM3: Geometry-Conditioned Instance Segmentation for Industrial Objects | Zhenran Tang et.al. | 2602.20551 | null |
| 2026-02-23 | gQIR: Generative Quanta Image Reconstruction | Aryan Garg et.al. | 2602.20417 | null |
| 2026-02-23 | CaDrift: A Time-dependent Causal Generator of Drifting Data Streams | Eduardo V. L. Barboza et.al. | 2602.20329 | null |
| 2026-02-27 | Discrete Diffusion with Sample-Efficient Estimators for Conditionals | Karthik Elamvazhuthi et.al. | 2602.20293 | null |
| 2026-02-22 | OrgFlow: Generative Modeling of Organic Crystal Structures from Molecular Graphs | Mohammadmahdi Vahediahmar et.al. | 2602.20195 | null |
| 2026-02-22 | FedAvg-Based CTMC Hazard Model for Federated Bridge Deterioration Assessment | Takato Yasuno et.al. | 2602.20194 | null |
| 2026-02-23 | ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models | Andre He et.al. | 2602.20117 | null |
| 2026-02-25 | Training-Free Generative Modeling via Kernelized Stochastic Interpolants | Florentin Coeurdoux et.al. | 2602.20070 | null |
| 2026-02-23 | Schrödinger bridges with jumps for time series generation | Stefano De Marco et.al. | 2602.20011 | null |
| 2026-02-23 | RL-RIG: A Generative Spatial Reasoner via Intrinsic Reflection | Tianyu Wang et.al. | 2602.19974 | null |
| 2026-02-23 | Make Some Noise: Unsupervised Remote Sensing Change Detection Using Latent Space Perturbations | Blaž Rolih et.al. | 2602.19881 | null |
| 2026-02-27 | Multimodal Dataset Distillation Made Simple by Prototype-Guided Data Synthesis | Junhyeok Choi et.al. | 2602.19756 | null |
| 2026-02-23 | Hardware-Accelerated Geometrical Simulation of Biological and Engineered In-Air Ultrasonic Systems | Wouter Jansen et.al. | 2602.19652 | null |
| 2026-02-23 | Manifold-Aligned Generative Transport | Xinyu Tian et.al. | 2602.19600 | null |
| 2026-02-26 | DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces | Li Zhang et.al. | 2602.19565 | null |
| 2026-02-23 | Agentic AI as a Cybersecurity Attack Surface: Threats, Exploits, and Defenses in Runtime Supply Chains | Xiaochong Jiang et.al. | 2602.19555 | null |
| 2026-02-23 | Laplacian Multi-scale Flow Matching for Generative Modeling | Zelin Zhao et.al. | 2602.19461 | null |
| 2026-02-22 | IDLM: Inverse-distilled Diffusion Language Models | David Li et.al. | 2602.19066 | null |
| 2026-02-22 | A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse | Vibhas Kumar Vats et.al. | 2602.19033 | null |
| 2026-02-21 | DeepInterestGR: Mining Deep Multi-Interest Using Multi-Modal LLMs for Generative Recommendation | Yangchen Zeng et.al. | 2602.18907 | null |
| 2026-02-21 | IDperturb: Enhancing Variation in Synthetic Face Generation via Angular Perturbation | Fadi Boutros et.al. | 2602.18831 | null |
| 2026-02-21 | RadioGen3D: 3D Radio Map Generation via Adversarial Learning on Large-Scale Synthetic Data | Junshen Chen et.al. | 2602.18744 | null |
| 2026-02-21 | RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning | Seungku Kim et.al. | 2602.18742 | null |
| 2026-02-20 | DP-RFT: Learning to Generate Synthetic Text via Differentially Private Reinforcement Fine-Tuning | Fangyuan Xu et.al. | 2602.18633 | null |
| 2026-02-20 | Generative Model via Quantile Assignment | Georgi Hrusanov et.al. | 2602.18216 | null |
| 2026-02-19 | Market Games for Generative Models: Equilibria, Welfare, and Strategic Entry | Xiukun Wei et.al. | 2602.17787 | null |
| 2026-02-24 | A Theoretical Framework for Modular Learning of Robust Generative Models | Corinna Cortes et.al. | 2602.17554 | null |
| 2026-02-19 | QuPAINT: Physics-Aware Instruction Tuning Approach to Quantum Material Discovery | Xuan-Bac Nguyen et.al. | 2602.17478 | null |
| 2026-02-19 | From Labor to Collaboration: A Methodological Experiment Using AI Agents to Augment Research Perspectives in Taiwan’s Humanities and Social Sciences | Yi-Chih Huang et.al. | 2602.17221 | null |
| 2026-02-19 | HybridPrompt: Bridging Generative Priors and Traditional Codecs for Mobile Streaming | Liming Liu et.al. | 2602.17120 | null |
| 2026-02-19 | Epistemology of Generative AI: The Geometry of Knowing | Ilya Levin et.al. | 2602.17116 | null |
| 2026-02-19 | Synergizing Transport-Based Generative Models and Latent Geometry for Stochastic Closure Modeling | Xinghao Dong et.al. | 2602.17089 | null |
| 2026-02-19 | Generative modeling for the bootstrap | Leon Tran et.al. | 2602.17052 | null |
| 2026-02-18 | Synthetic-Powered Multiple Testing with FDR Control | Yonghoon Lee et.al. | 2602.16690 | null |
| 2026-02-19 | Style-Aware Gloss Control for Generative Non-Photorealistic Rendering | Santiago Jimenez-Navarro et.al. | 2602.16611 | null |
| 2026-02-18 | GICDM: Mitigating Hubness for Reliable Distance-Based Generative Model Evaluation | Nicolas Salvy et.al. | 2602.16449 | null |
| 2026-02-17 | Can Generative Artificial Intelligence Survive Data Contamination? Theoretical Guarantees under Contaminated Recursive Training | Kevin Wang et.al. | 2602.16065 | null |
| 2026-02-17 | VideoSketcher: Video Models Prior Enable Versatile Sequential Sketch Generation | Hui Ren et.al. | 2602.15819 | null |
| 2026-02-17 | Developing AI Agents with Simulated Data: Why, what, and how? | Xiaoran Liu et.al. | 2602.15816 | null |
| 2026-02-19 | A Generative-First Neural Audio Autoencoder | Jonah Casebeer et.al. | 2602.15749 | null |
| 2026-02-17 | LLM-to-Speech: A Synthetic Data Pipeline for Training Dialectal Text-to-Speech Models | Ahmed Khaled Khamis et.al. | 2602.15675 | null |
| 2026-02-17 | Molecular Design beyond Training Data with Novel Extended Objective Functionals of Generative AI Models Driven by Quantum Annealing Computer | Hayato Kunugi et.al. | 2602.15451 | null |
| 2026-02-17 | Efficient Generative Modeling beyond Memoryless Diffusion via Adjoint Schrödinger Bridge Matching | Jeongwoo Shin et.al. | 2602.15396 | null |
| 2026-02-17 | Making Large Language Models Speak Tulu: Structured Prompting for an Extremely Low-Resource Language | Prathamesh Devadiga et.al. | 2602.15378 | null |
| 2026-02-17 | GMAIL: Generative Modality Alignment for generated Image Learning | Shentong Mo et.al. | 2602.15368 | null |
| 2026-02-20 | Non-Stationary Covariance Functions for Spatial Data on Linear Networks | Alfredo Alegría et.al. | 2602.15328 | null |
| 2026-02-17 | Enhancing Diversity and Feasibility: Joint Population Synthesis from Multi-source Data Using Generative Models | Farbod Abbasi et.al. | 2602.15270 | null |
| 2026-02-16 | Exposing Diversity Bias in Deep Generative Models: Statistical Origins and Correction of Diversity Error | Farzan Farnia et.al. | 2602.14682 | null |
| 2026-02-15 | BitDance: Scaling Autoregressive Generative Models with Binary Tokens | Yuang Ai et.al. | 2602.14041 | null |
| 2026-02-14 | GSRM: Generative Speech Reward Model for Speech RLHF | Maohao Shen et.al. | 2602.13891 | null |
| 2026-02-14 | Generative Latent Representations of 3D Brain MRI for Multi-Task Downstream Analysis in Down Syndrome | Jordi Malé et.al. | 2602.13731 | null |
| 2026-02-14 | A WDLoRA-Based Multimodal Generative Framework for Clinically Guided Corneal Confocal Microscopy Image Synthesis in Diabetic Neuropathy | Xin Zhang et.al. | 2602.13693 | null |
| 2026-02-10 | Situation Graph Prediction: Structured Perspective Inference for User Modeling | Jisung Shin et.al. | 2602.13319 | null |
| 2026-02-13 | A Calibrated Memorization Index (MI) for Detecting Training Data Leakage in Generative MRI Models | Yash Deo et.al. | 2602.13066 | null |
| 2026-02-13 | QTabGAN: A Hybrid Quantum-Classical GAN for Tabular Data Synthesis | Subhangi Kumari et.al. | 2602.12704 | null |
| 2026-02-13 | Formalizing the Sampling Design Space of Diffusion-Based Generative Models via Adaptive Solvers and Wasserstein-Bounded Timesteps | Sangwoo Jo et.al. | 2602.12624 | null |
| 2026-02-13 | Generative Site-Specific Beamforming via Information-Maximizing Codebook | Cheng-Jie Zhao et.al. | 2602.12552 | null |
| 2026-02-12 | Synthetic Interaction Data for Scalable Personalization in Large Language Models | Yuchen Ma et.al. | 2602.12394 | null |
| 2026-02-12 | Synthetic Image Detection with CLIP: Understanding and Assessing Predictive Cues | Marco Willi et.al. | 2602.12381 | null |
| 2026-02-13 | T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization | Tunyu Zhang et.al. | 2602.12262 | null |
| 2026-02-16 | “Sorry, I Didn’t Catch That”: How Speech Models Miss What Matters Most | Kaitlyn Zhou et.al. | 2602.12249 | null |
| 2026-02-12 | Pedagogically-Inspired Data Synthesis for Language Model Knowledge Distillation | Bowei He et.al. | 2602.12172 | null |
| 2026-02-12 | Affordance-Graphed Task Worlds: Self-Evolving Task Generation for Scalable Embodied Learning | Xiang Liu et.al. | 2602.12065 | null |
| 2026-02-15 | VLAW: Iterative Co-Improvement of Vision-Language-Action Policy and World Model | Yanjiang Guo et.al. | 2602.12063 | null |
| 2026-02-12 | Fourier Transformers for Latent Crystallographic Diffusion and Generative Modeling | Jed A. Duersch et.al. | 2602.12045 | null |
| 2026-02-13 | When Should LLMs Be Less Specific? Selective Abstraction for Reliable Long-Form Text Generation | Shani Goren et.al. | 2602.11908 | null |
| 2026-02-12 | How to Sample High Quality 3D Fractals for Action Recognition Pre-Training? | Marko Putak et.al. | 2602.11810 | null |
| 2026-02-12 | RELATE: A Reinforcement Learning-Enhanced LLM Framework for Advertising Text Generation | Jinfang Wang et.al. | 2602.11780 | link |
| 2026-02-13 | Bizarre Love Triangle: Generative AI, Art, and Kitsch | Dejan Grba et.al. | 2602.11353 | null |
| 2026-02-11 | TabICLv2: A better, faster, scalable, and open tabular foundation model | Jingang Qu et.al. | 2602.11139 | null |
| 2026-02-11 | Beyond Confidence: The Rhythms of Reasoning in Generative Models | Deyuan Liu et.al. | 2602.10816 | null |
| 2026-02-11 | A Diffusion-Based Generative Prior Approach to Sparse-view Computed Tomography | Davide Evangelista et.al. | 2602.10722 | null |
| 2026-02-11 | Evaluation metrics for temporal preservation in synthetic longitudinal patient data | Katariina Perkonoja et.al. | 2602.10643 | null |
| 2026-02-11 | Generative clinical time series models trained on moderate amounts of patient data are privacy preserving | Rustam Zhumagambetov et.al. | 2602.10631 | null |
| 2026-02-11 | Flow of Spans: Generalizing Language Models to Dynamic Span-Vocabulary via GFlowNets | Bo Xue et.al. | 2602.10583 | null |
| 2026-02-11 | From Collapse to Improvement: Statistical Perspectives on the Evolutionary Dynamics of Iterative Training on Contaminated Sources | Soham Bakshi et.al. | 2602.10531 | null |
| 2026-02-12 | Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs | Zhongzhi Li et.al. | 2602.10388 | null |
| 2026-02-10 | Temper-Then-Tilt: Principled Unlearning for Generative Models through Tempering and Classifier Guidance | Jacob L. Block et.al. | 2602.10217 | null |
| 2026-02-10 | Anatomy-Preserving Latent Diffusion for Generation of Brain Segmentation Masks with Ischemic Infarct | Lucia Borrego et.al. | 2602.10167 | null |
| 2026-02-10 | CAPID: Context-Aware PII Detection for Question-Answering Systems | Mariia Ponomarenko et.al. | 2602.10074 | null |
| 2026-02-10 | Preventing Barren Plateaus in Continuous Quantum Generative Models | Olli Hirviniemi et.al. | 2602.10049 | null |
| 2026-02-11 | Monocular Normal Estimation via Shading Sequence Estimation | Zongrui Li et.al. | 2602.09929 | null |
| 2026-02-10 | AmharicIR+Instr: A Two-Dataset Resource for Neural Retrieval and Instruction Tuning | Tilahun Yeshambel et.al. | 2602.09914 | null |
| 2026-02-10 | Efficient Unsupervised Environment Design through Hierarchical Policy Representation Learning | Dexun Li et.al. | 2602.09813 | null |
| 2026-02-10 | Allure of Craquelure: A Variational-Generative Approach to Crack Detection in Paintings | Laura Paul et.al. | 2602.09730 | null |
| 2026-02-10 | Why the Counterintuitive Phenomenon of Likelihood Rarely Appears in Tabular Anomaly Detection with Deep Generative Models? | Donghwan Kim et.al. | 2602.09593 | null |
| 2026-02-10 | MieDB-100k: A Comprehensive Dataset for Medical Image Editing | Yongfan Lai et.al. | 2602.09587 | null |
| 2026-02-10 | Smaller is Better: Generative Models Can Power Short Video Preloading | Liming Liu et.al. | 2602.09484 | null |
| 2026-02-10 | The Wisdom of Many Queries: Complexity-Diversity Principle for Dense Retriever Training | Xincan Feng et.al. | 2602.09448 | null |
| 2026-02-10 | AgentSkiller: Scaling Generalist Agent Intelligence through Semantically Integrated Cross-Domain Data Synthesis | Zexu Sun et.al. | 2602.09372 | null |
| 2026-02-10 | How Far Can You Grow? Characterizing the Extrapolation Frontier of Graph Generative Models for Materials Science | Can Polat et.al. | 2602.09309 | null |
| 2026-02-10 | Measuring Privacy Risks and Tradeoffs in Financial Synthetic Data Generation | Michael Zuo et.al. | 2602.09288 | null |
| 2026-02-09 | RAPID: Risk of Attribute Prediction-Induced Disclosure in Synthetic Microdata | Matthias Templ et.al. | 2602.09235 | null |
| 2026-02-09 | What do Geometric Hallucination Detection Metrics Actually Measure? | Eric Yeats et.al. | 2602.09158 | null |
| 2026-02-09 | Distributionally Robust Optimization via Generative Ambiguity Modeling | Jiaqi Wen et.al. | 2602.08976 | null |
| 2026-02-09 | How University Disability Services Professionals Write Image Descriptions for HCI Figures Using Generative AI | Muhammad Raees et.al. | 2602.08937 | null |
| 2026-02-10 | MOVA: Towards Scalable and Synchronized Video-Audio Generation | SII-OpenMOSS Team et.al. | 2602.08794 | null |
| 2026-02-09 | Equalized Generative Treatment: Matching f-divergences for Fairness in Generative Models | Alexandre Verine et.al. | 2602.08660 | null |
| 2026-02-09 | Projected Gradient Ascent for Efficient Reward-Guided Updates with One-Step Generative Models | Jisung Hwang et.al. | 2602.08646 | null |
| 2026-02-09 | Modeling Protein Evolution via Generative Inference From Monte Carlo Chains to Population Genetics | Leonardo Di Bari et.al. | 2602.08641 | null |
| 2026-02-09 | Inspiration Seeds: Learning Non-Literal Visual Combinations for Generative Exploration | Kfir Goldberg et.al. | 2602.08615 | null |
| 2026-02-09 | CoTZero: Annotation-Free Human-Like Vision Reasoning via Hierarchical Synthetic CoT | Chengyi Du et.al. | 2602.08339 | null |
| 2026-02-09 | An Attention-over-Attention Generative Model for Joint Multiple Intent Detection and Slot Filling | Wei Zhu et.al. | 2602.08322 | null |
| 2026-02-09 | Cyclic Adaptive Private Synthesis for Sharing Real-World Data in Education | Hibiki Ito et.al. | 2602.08299 | null |
| 2026-02-09 | Nansde-net: A neural sde framework for generating time series with memory | Hiromu Ozai et.al. | 2602.08182 | null |
| 2026-02-08 | Cross-Linguistic Persona-Driven Data Synthesis for Robust Multimodal Cognitive Decline Detection | Rui Feng et.al. | 2602.07978 | null |
| 2026-02-08 | Time Series Reasoning via Process-Verifiable Thinking Data Synthesis and Scheduling for Tailored LLM Reasoning | Jiahui Zhou et.al. | 2602.07830 | null |
| 2026-02-07 | Automated rock joint trace mapping using a supervised learning model trained on synthetic data generated by parametric modelling | Jessica Ka Yi Chiu et.al. | 2602.07590 | null |
| 2026-02-07 | Capturing the Topological Phase Transition and Thermodynamics of the 2D XY Model via Manifold-Aware Score-Based Generative Modeling | Pratyush Jha et.al. | 2602.07548 | null |
| 2026-02-06 | VideoNeuMat: Neural Material Extraction from Generative Video Models | Bowen Xue et.al. | 2602.07272 | null |
| 2026-02-06 | Discrete Adjoint Matching | Oswin So et.al. | 2602.07132 | null |
| 2026-02-06 | Finding Connections: Membership Inference Attacks for the Multi-Table Synthetic Data Setting | Joshua Ward et.al. | 2602.07126 | null |
| 2026-02-05 | MRI Cross-Modal Synthesis: A Comparative Study of Generative Models for T1-to-T2 Reconstruction | Ali Alqutayfi et.al. | 2602.07068 | null |
| 2026-02-06 | Learning a Generative Meta-Model of LLM Activations | Grace Luo et.al. | 2602.06964 | null |
| 2026-02-09 | Improved Sampling Schedules for Discrete Diffusion Models | Alberto Foresti et.al. | 2602.06849 | null |
| 2026-02-06 | RAIGen: Rare Attribute Identification in Text-to-Image Generative Models | Silpa Vadakkeeveetil Sreelatha et.al. | 2602.06806 | null |
| 2026-02-06 | Force Generative Imitation Learning: Bridging Position Trajectory and Force Commands through Control Technique | Hiroshi Sato et.al. | 2602.06620 | null |
| 2026-02-06 | Generating High-quality Privacy-preserving Synthetic Data | David Yavo et.al. | 2602.06390 | null |
| 2026-02-06 | Misophonia Trigger Sound Detection on Synthetic Soundscapes Using a Hybrid Model with a Frozen Pre-Trained CNN and a Time-Series Module | Kurumi Sashida et.al. | 2602.06271 | null |
| 2026-02-05 | From Blurry to Believable: Enhancing Low-quality Talking Heads with 3D Generative Priors | Ding-Jiun Huang et.al. | 2602.06122 | null |
| 2026-02-05 | Discrete diffusion samplers and bridges: Off-policy algorithms and applications in latent spaces | Arran Carter et.al. | 2602.05961 | null |
| 2026-02-05 | Verification of the Implicit World Model in a Generative Model via Adversarial Sequences | András Balogh et.al. | 2602.05903 | null |
| 2026-02-05 | FHAIM: Fully Homomorphic AIM For Private Synthetic Data Generation | Mayank Kumar et.al. | 2602.05838 | null |
| 2026-02-05 | Synthesizing Realistic Test Data without Breaking Privacy | Laura Plein et.al. | 2602.05833 | null |
| 2026-02-05 | Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech Generation from SSL features | Hien Ohnaka et.al. | 2602.05443 | null |
| 2026-02-05 | Synthetic Defect Geometries of Cast Metal Objects Modeled via 2d Voronoi Tessellations | Natascha Jeziorski et.al. | 2602.05440 | null |
| 2026-02-05 | GAS: Enhancing Reward-Cost Balance of Generative Model-assisted Offline Safe RL | Zifan Liu et.al. | 2602.05323 | null |
| 2026-02-05 | GT-SVJ: Generative-Transformer-Based Self-Supervised Video Judge For Efficient Video Reward Modeling | Shivanshu Shekhar et.al. | 2602.05202 | null |
| 2026-02-04 | Data Kernel Perspective Space Performance Guarantees for Synthetic Data from Transformer Models | Michael Browder et.al. | 2602.05106 | null |
| 2026-02-04 | Private PoEtry: Private In-Context Learning via Product of Experts | Rob Romijnders et.al. | 2602.05012 | null |
| 2026-02-03 | Privacy Amplification Persists under Unlimited Synthetic Data Release | Clément Pierquin et.al. | 2602.04895 | null |
| 2026-02-06 | Generative Modeling via Drifting | Mingyang Deng et.al. | 2602.04770 | null |
| 2026-02-04 | Audio ControlNet for Fine-Grained Audio Generation and Editing | Haina Zhu et.al. | 2602.04680 | null |
| 2026-02-04 | PFluxTTS: Hybrid Flow-Matching TTS with Robust Cross-Lingual Voice Cloning and Inference-Time Model Fusion | Vikentii Pankov et.al. | 2602.04160 | null |
| 2026-02-06 | PromptSplit: Revealing Prompt-Level Disagreement in Generative Models | Mehdi Lotfian et.al. | 2602.04009 | null |
| 2026-02-03 | pop-cosmos: Forward modeling KiDS-1000 redshift distributions using realistic galaxy populations | Boris Leistedt et.al. | 2602.03935 | null |
| 2026-02-03 | HY3D-Bench: Generation of 3D Assets | Team Hunyuan3D et.al. | 2602.03907 | null |
| 2026-02-03 | DiffLOB: Diffusion Models for Counterfactual Generation in Limit Order Books | Zhuohan Wang et.al. | 2602.03776 | null |
| 2026-02-03 | Efficient Variance-reduced Estimation from Generative EHR Models: The SCOPE and REACH Estimators | Luke Solo et.al. | 2602.03730 | null |
| 2026-02-03 | CTTVAE: Latent Space Structuring for Conditional Tabular Data Generation on Imbalanced Datasets | Milosh Devic et.al. | 2602.03641 | null |
| 2026-02-03 | Generator-based Graph Generation via Heat Diffusion | Anthony Stephenson et.al. | 2602.03612 | null |
| 2026-02-03 | Riemannian Neural Optimal Transport | Alessandro Micheli et.al. | 2602.03566 | null |
| 2026-02-03 | R1-SyntheticVL: Is Synthetic Data from Generative Models Ready for Multimodal Large Language Model? | Jingyi Zhang et.al. | 2602.03300 | null |
| 2026-02-03 | Beyond Quantity: Trajectory Diversity Scaling for Code Agents | Guhong Chen et.al. | 2602.03219 | null |
| 2026-02-03 | Consensus Group Relative Policy Optimization for Text Generation | Yuki Ichihara et.al. | 2602.03102 | null |
| 2026-02-03 | Distance Marching for Generative Modeling | Zimo Wang et.al. | 2602.02928 | null |
| 2026-02-02 | Beyond Content: Behavioral Policies Reveal Actors in Information Operations | Philipp J. Schneider et.al. | 2602.02838 | null |
| 2026-02-04 | daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently | Mohan Jiang et.al. | 2602.02619 | null |
| 2026-02-01 | VividVoice: A Unified Framework for Scene-Aware Visually-Driven Speech Synthesis | Chengyuan Ma et.al. | 2602.02591 | null |
| 2026-02-02 | MIRROR: Manifold Ideal Reference ReconstructOR for Generalizable AI-Generated Image Detection | Ruiqi Liu et.al. | 2602.02222 | null |
| 2026-02-02 | The Verification Crisis: Expert Perceptions of GenAI Disinformation and the Case for Reproducible Provenance | Alexander Loth et.al. | 2602.02100 | null |
| 2026-02-02 | Logic-Guided Vector Fields for Constrained Generative Modeling | Ali Baheri et.al. | 2602.02009 | null |
| 2026-02-02 | Synesthesia of Vehicles: Tactile Data Synthesis from Visual Inputs | Rui Wang et.al. | 2602.01832 | null |
| 2026-02-02 | Reconstruction of instantaneous flow fields from transient velocity snapshots using physics-informed neural networks: Applications to pulsatile blood flow behind a stenosis | Kakeru Ueda et.al. | 2602.01542 | null |
| 2026-02-01 | Addressing Explainability of Generative AI using SMILE (Statistical Model-agnostic Interpretability with Local Explanations) | Zeinab Dehghani et.al. | 2602.01206 | null |
| 2026-01-31 | Factuality on Demand: Controlling the Factuality-Informativeness Trade-off in Text Generation | Ziwei Gong et.al. | 2602.00848 | null |
| 2026-01-31 | Scalable Generative Game Engine: Breaking the Resolution Wall via Hardware-Algorithm Co-Design | Wei Zeng et.al. | 2602.00608 | null |
| 2026-01-31 | RVCBench: Benchmarking the Robustness of Voice Cloning Across Modern Audio Generation Models | Xinting Liao et.al. | 2602.00443 | null |
| 2026-01-31 | Toward Autonomous Laboratory Safety Monitoring with Vision Language Models: Learning to See Hazards Through Scene Structure | Trishna Chakraborty et.al. | 2602.00414 | null |
| 2026-01-30 | Planning with Language and Generative Models: Toward General Reward-Guided Wireless Network Design | Chenyang Yuan et.al. | 2602.00357 | null |
| 2026-01-30 | Reducing Memorisation in Generative Models via Riemannian Bayesian Inference | Johanna Marie Gegenfurtner et.al. | 2602.00199 | null |
| 2026-01-30 | How well do generative models solve inverse problems? A benchmark study | Patrick Krüger et.al. | 2601.23238 | null |
| 2026-01-30 | JobResQA: A Benchmark for LLM Machine Reading Comprehension on Multilingual Résumés and JDs | Casimiro Pio Carrino et.al. | 2601.23183 | null |
| 2026-01-30 | Behemoth: Benchmarking Unlearning in LLMs Using Fully Synthetic Data | Eugenia Iofinova et.al. | 2601.23153 | null |
| 2026-01-30 | Manifold-Aware Perturbations for Constrained Generative Modeling | Katherine Keegan et.al. | 2601.23151 | null |
| 2026-01-30 | ExplainerPFN: Towards tabular foundation models for model-free zero-shot feature importance estimations | Joao Fonseca et.al. | 2601.23068 | null |
| 2026-01-30 | MoVE: Mixture of Value Embeddings – A New Axis for Scaling Parametric Memory in Autoregressive Models | Yangyan Li et.al. | 2601.22887 | null |
| 2026-01-30 | Generative and Nonparametric Approaches for Conditional Distribution Estimation: Methods, Perspectives, and Comparative Evaluations | Yen-Shiu Chin et.al. | 2601.22650 | null |
| 2026-01-30 | Beyond Medical Chatbots: Meddollina and the Rise of Continuous Clinical Intelligence | Vaibhav Ram S. V. N. S et.al. | 2601.22645 | null |
| 2026-01-30 | VocBulwark: Towards Practical Generative Speech Watermarking via Additional-Parameter Injection | Weizhi Liu et.al. | 2601.22556 | null |
| 2026-01-30 | Towards the Holographic Characteristic of LLMs for Efficient Short-text Generation | Shun Qian et.al. | 2601.22546 | null |
| 2026-01-30 | DNA: Uncovering Universal Latent Forgery Knowledge | Jingtong Dou et.al. | 2601.22515 | null |
| 2026-01-30 | ScribbleSense: Generative Scribble-Based Texture Editing with Intent Prediction | Yudi Zhang et.al. | 2601.22455 | null |
| 2026-01-30 | Rethinking Anonymity Claims in Synthetic Data Generation: A Model-Centric Privacy Attack Perspective | Georgi Ganev et.al. | 2601.22434 | null |
| 2026-01-29 | Conformal Prediction for Generative Models via Adaptive Cluster-Based Density Estimation | Qidong Yang et.al. | 2601.22298 | null |
| 2026-01-29 | Investigating Associational Biases in Inter-Model Communication of Large Generative Models | Fethiye Irmak Dogan et.al. | 2601.22093 | null |
| 2026-01-29 | Holographic generative flows with AdS/CFT | Ehsan Mirafzali et.al. | 2601.22033 | null |
| 2026-01-29 | The Ensemble Inverse Problem: Applications and Methods | Zhengyan Huan et.al. | 2601.22029 | null |
| 2026-01-30 | From Tokens to Blocks: A Block-Diffusion Perspective on Molecular Generation | Qianwei Yang et.al. | 2601.21964 | null |
| 2026-01-29 | From Generative Modeling to Clinical Classification: A GPT-Based Architecture for EHR Notes | Fariba Afrin Irany et.al. | 2601.21955 | null |
| 2026-01-29 | On Forgetting and Stability of Score-based Generative models | Stanislas Strasman et.al. | 2601.21868 | null |
| 2026-01-29 | Generative Modeling of Discrete Data Using Geometric Latent Subspaces | Daniel Gonzalez-Alvarado et.al. | 2601.21831 | null |
| 2026-01-29 | DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning | Mingshuang Luo et.al. | 2601.21716 | null |
| 2026-01-30 | SmartMeterFM: Unifying Smart Meter Data Generative Tasks Using Flow Matching Models | Nan Lin et.al. | 2601.21706 | null |
| 2026-01-30 | Bi-Anchor Interpolation Solver for Accelerating Generative Modeling | Hongxu Chen et.al. | 2601.21542 | null |
| 2026-01-29 | HERS: Hidden-Pattern Expert Learning for Risk-Specific Vehicle Damage Adaptation in Diffusion Models | Teerapong Panboonyuen et.al. | 2601.21517 | null |
| 2026-01-29 | Nimbus: A Unified Embodied Synthetic Data Generation Framework | Zeyu He et.al. | 2601.21449 | null |
| 2026-01-29 | SemanticAudio: Audio Generation and Editing in Semantic Space | Zheqi Dai et.al. | 2601.21402 | null |
| 2026-01-29 | Understanding Frechet Speech Distance for Synthetic Speech Quality Evaluation | June-Woo Kim et.al. | 2601.21386 | null |
| 2026-01-29 | Conditional Generative Framework with Peak-Aware Attention for Robust Chemical Detection under Interferences | Namkyung Yoon et.al. | 2601.21246 | null |
| 2026-01-29 | Rethinking Refinement: Correcting Generative Bias without Noise Injection | Xin Peng et.al. | 2601.21182 | null |
| 2026-01-29 | WheelArm-Sim: A Manipulation and Navigation Combined Multimodal Synthetic Data Generation Simulator for Unified Control in Assistive Robotics | Guangping Liu et.al. | 2601.21129 | null |
| 2026-01-28 | MapPFN: Learning Causal Perturbation Maps in Context | Marvin Sextro et.al. | 2601.21092 | null |
| 2026-01-28 | Accelerated Inorganic Electrides Discovery by Generative Models and Hierarchical Screening | Shuo Tao et.al. | 2601.21077 | null |
| 2026-01-28 | Signal from Structure: Exploiting Submodular Upper Bounds in Generative Flow Networks | Alexandre Larouche et.al. | 2601.21061 | null |
| 2026-01-28 | Privatization of Synthetic Gaze: Attenuating State Signatures in Diffusion-Generated Eye Movements | Kamrul Hasan et.al. | 2601.21057 | null |
| 2026-01-28 | A Diffusive Classification Loss for Learning Energy-based Generative Models | Louis Grenioux et.al. | 2601.21025 | null |
| 2026-01-28 | Low performing pixel correction in computed tomography with unrolled network and synthetic data training | Hongxu Yang et.al. | 2601.20995 | null |
| 2026-01-28 | Gen-SER: When the generative model meets speech emotion recognition | Taihui Wang et.al. | 2601.20573 | null |
| 2026-01-28 | Audio Deepfake Detection in the Age of Advanced Text-to-Speech models | Robin Singh et.al. | 2601.20510 | null |
| 2026-01-28 | StormDiT: A generative AI model bridges the 2-6 hour ‘gray zone’ in precipitation nowcasting | Haofei Sun et.al. | 2601.20342 | null |
| 2026-01-28 | BLENDER: Blended Text Embeddings and Diffusion Residuals for Intra-Class Image Synthesis in Deep Metric Learning | Jan Niklas Kolf et.al. | 2601.20246 | null |
| 2026-01-28 | Quantum statistics from classical simulations via generative Gibbs sampling | Weizhou Wang et.al. | 2601.20228 | null |
| 2026-01-28 | Parametric and Generative Forecasts of Day-Ahead Market Curves for Storage Optimization | Julian Gutierrez et.al. | 2601.20226 | null |
| 2026-01-27 | GenCP: Towards Generative Modeling Paradigm of Coupled Physics | Tianrun Gao et.al. | 2601.19541 | null |
| 2026-01-27 | Cortex-Grounded Diffusion Models for Brain Image Generation | Fabian Bongratz et.al. | 2601.19498 | null |
| 2026-01-27 | Cross-Examination Framework: A Task-Agnostic Diagnostic for Information Fidelity in Text-to-Text Generation | Tathagata Raha et.al. | 2601.19350 | null |
| 2026-01-27 | Handcrafted Feature Fusion for Reliable Detection of AI-Generated Images | Syed Mehedi Hasan Nirob et.al. | 2601.19262 | null |
| 2026-01-27 | E-QRGMM: Efficient Generative Metamodeling for Covariate-Dependent Uncertainty Quantification | Zhiyang Liang et.al. | 2601.19256 | null |
| 2026-01-27 | EnzyPGM: Pocket-conditioned Generative Model for Substrate-specific Enzyme Design | Zefeng Lin et.al. | 2601.19205 | null |
| 2026-01-27 | A Hybrid Discriminative and Generative System for Universal Speech Enhancement | Yinghao Liu et.al. | 2601.19113 | null |
| 2026-01-27 | Proactive Hardening of LLM Defenses with HASTE | Henry Chen et.al. | 2601.19051 | null |
| 2026-01-26 | Advances in Diffusion-Based Generative Compression | Yibo Yang et.al. | 2601.18932 | null |
| 2026-01-26 | OptiGAN for Crystal Arrays: Physics-Informed Generative Modeling of Optical Photon Transport in PET Detector Arrays | Stephan Naunheim et.al. | 2601.18780 | null |
| 2026-01-26 | Riemannian AmbientFlow: Towards Simultaneous Manifold Learning and Generative Modeling from Corrupted Data | Willem Diepeveen et.al. | 2601.18728 | null |
| 2026-01-26 | Conditioned Generative Modeling of Molecular Glues: A Realistic AI Approach for Synthesizable Drug-like Molecules | Naeyma N. Islam et.al. | 2601.18716 | null |
| 2026-01-26 | Neural Multi-Speaker Voice Cloning for Nepali in Low-Resource Settings | Aayush M. Shrestha et.al. | 2601.18694 | null |
| 2026-01-26 | Quasi Monte Carlo methods enable extremely low-dimensional deep generative models | Miles Martinez et.al. | 2601.18676 | null |
| 2026-01-26 | GCFX: Generative Counterfactual Explanations for Deep Graph Models at the Model Level | Jinlong Hu et.al. | 2601.18447 | null |
| 2026-01-26 | GenCI: Generative Modeling of User Interest Shift via Cohort-based Intent Learning for CTR Prediction | Kesha Ou et.al. | 2601.18251 | null |
| 2026-01-25 | Feature-Space Generative Models for One-Shot Class-Incremental Learning | Jack Foster et.al. | 2601.17905 | null |
| 2026-01-25 | Controlling Reading Ease with Gaze-Guided Text Generation | Andreas Säuberli et.al. | 2601.17781 | null |
| 2026-01-24 | Correct-by-Construction Vision-based Pose Estimation using Geometric Generative Models | Ulices Santa Cruz et.al. | 2601.17556 | null |
| 2026-01-24 | Error Analysis of Bayesian Inverse Problems with Generative Priors | Bamdad Hosseini et.al. | 2601.17374 | null |
| 2026-01-24 | TheoremForge: Scaling up Formal Data Synthesis with Low-Budget Agentic Workflow | Yicheng Tao et.al. | 2601.17332 | null |
| 2026-01-23 | HapticMatch: An Exploration for Generative Material Haptic Simulation and Interaction | Mingxin Zhang et.al. | 2601.16639 | null |
| 2026-01-23 | SCHIGAND: A Synthetic Facial Generation Mode Pipeline | Ananya Kadali et.al. | 2601.16627 | null |
| 2026-01-23 | MultiLexNorm++: A Unified Benchmark and a Generative Model for Lexical Normalization for Asian Languages | Weerayut Buaphet et.al. | 2601.16623 | null |
| 2026-01-23 | LLM-based Semantic Search for Conversational Queries in E-commerce | Emad Siddiqui et.al. | 2601.16492 | null |
| 2026-01-23 | Beyond the Training Domain: Robust Generative Transition State Models for Unseen Chemistry | Samir Darouich et.al. | 2601.16469 | null |
| 2026-01-22 | Better as Generators Than Classifiers: Leveraging LLMs and Synthetic Data for Low-Resource Multilingual Classification | Branislav Pecher et.al. | 2601.16278 | null |
| 2026-01-24 | Point Bridge: 3D Representations for Cross Domain Policy Learning | Siddhant Haldar et.al. | 2601.16212 | null |
| 2026-01-24 | PAL*M: Property Attestation for Large Generative Models | Prach Chantasantitam et.al. | 2601.16199 | null |
| 2026-01-22 | Learning to Watermark in the Latent Space of Generative Models | Sylvestre-Alvise Rebuffi et.al. | 2601.16140 | null |
| 2026-01-23 | Recursive Flow: A Generative Framework for MIMO Channel Estimation | Zehua Jiang et.al. | 2601.15767 | null |
| 2026-01-22 | Communication-efficient Federated Graph Classification via Generative Diffusion Modeling | Xiuling Wang et.al. | 2601.15722 | null |
| 2026-01-22 | Explainable Deepfake Detection with RL Enhanced Self-Blended Images | Ning Jiang et.al. | 2601.15624 | null |
| 2026-01-22 | DeepASMR: LLM-Based Zero-Shot ASMR Speech Generation for Anyone of Any Voice | Leying Zhang et.al. | 2601.15596 | null |
| 2026-01-21 | Reliability by design: quantifying and eliminating fabrication risk in LLMs. From generative to consultative AI: a comparative analysis in the legal domain and lessons for high-stakes knowledge bases | Alex Dantart et.al. | 2601.15476 | null |
| 2026-01-21 | Ambient Dataloops: Generative Models for Dataset Refinement | Adrián Rodríguez-Muñoz et.al. | 2601.15417 | null |
| 2026-01-21 | GeMM-GAN: A Multimodal Generative Model Conditioned on Histopathology Images and Clinical Descriptions for Gene Expression Profile Generation | Francesca Pia Panaccione et.al. | 2601.15392 | null |
| 2026-01-21 | SpooFL: Spoofing Federated Learning | Isaac Baglin et.al. | 2601.15055 | null |
| 2026-01-21 | SpatialV2A: Visual-Guided High-fidelity Spatial Audio Generation | Yanan Wang et.al. | 2601.15017 | null |
| 2026-01-21 | AQAScore: Evaluating Semantic Alignment in Text-to-Audio Generation via Audio Question Answering | Chun-Yi Kuan et.al. | 2601.14728 | null |
| 2026-01-20 | Business Logic-Driven Text-to-SQL Data Synthesis for Business Intelligence | Jinhui Liu et.al. | 2601.14518 | null |
| 2026-01-20 | Self-Supervised Score-Based Despeckling for SAR Imagery via Log-Domain Transformation | Junhyuk Heo et.al. | 2601.14334 | null |
| 2026-01-18 | Guided by the Plan: Enhancing Faithful Autoregressive Text-to-Audio Generation with Guided Decoding | Juncheng Wang et.al. | 2601.14304 | null |
| 2026-01-16 | Guardrails for trust, safety, and ethical development and deployment of Large Language Models (LLM) | Anjanava Biswas et.al. | 2601.14298 | null |
| 2026-01-20 | Domain-Adaptation through Synthetic Data: Fine-Tuning Large Language Models for German Law | Ali Hamza Bashir et.al. | 2601.14160 | null |
| 2026-01-20 | Style Transfer as Bias Mitigation: Diffusion Models for Synthetic Mental Health Text for Arabic | Saad Mankarious et.al. | 2601.14124 | null |
| 2026-01-20 | GOMPSNR: Reflourish the Signal-to-Noise Ratio Metric for Audio Generation Tasks | Lingling Dai et.al. | 2601.13758 | null |
| 2026-01-20 | Beyond Known Facts: Generating Unseen Temporal Knowledge to Address Data Contamination in LLM Evaluation | Arthur Amalvy et.al. | 2601.13658 | null |
| 2026-01-19 | BladeSDF : Unconditional and Conditional Generative Modeling of Representative Blade Geometries Using Signed Distance Functions | Ashish S. Nair et.al. | 2601.13445 | null |
| 2026-01-19 | CausationEntropy: Pythonic Optimal Causation Entropy | Kevin Slote et.al. | 2601.13365 | null |
| 2026-01-19 | OFA-MAS: One-for-All Multi-Agent System Topology Design based on Mixture-of-Experts Graph Generative Models | Shiyuan Li et.al. | 2601.12996 | null |
| 2026-01-19 | Beyond Visual Realism: Toward Reliable Financial Time Series Generation | Fan Zhang et.al. | 2601.12990 | null |
| 2026-01-19 | ImmersiveFlow: Stereo-to-7.1.4 spatial audio generation with flow matching | Zining Liang et.al. | 2601.12950 | null |
| 2026-01-21 | AI-generated data contamination erodes pathological variability and diagnostic reliability | Hongyu He et.al. | 2601.12946 | null |
| 2026-01-19 | SciCoQA: Quality Assurance for Scientific Paper–Code Alignment | Tim Baumgärtner et.al. | 2601.12910 | null |
| 2026-01-19 | Text2Structure3D: Graph-Based Generative Modeling of Equilibrium Structures with Diffusion Transformers | Lazlo Bleker et.al. | 2601.12870 | null |
| 2026-01-18 | A Unified Neural Codec Language Model for Selective Editable Text to Speech Generation | Hanchen Pei et.al. | 2601.12480 | null |
| 2026-01-18 | S^2F-Net:A Robust Spatial-Spectral Fusion Framework for Cross-Model AIGC Detection | Xiangyu Hu et.al. | 2601.12313 | null |
| 2026-01-18 | ParaMETA: Towards Learning Disentangled Paralinguistic Speaking Styles Representations from Speech | Haowei Lou et.al. | 2601.12289 | null |
| 2026-01-17 | SynQP: A Framework and Metrics for Evaluating the Quality and Privacy Risk of Synthetic Data | Bing Hu et.al. | 2601.12124 | null |
| 2026-01-16 | Cleansing the Artificial Mind: A Self-Reflective Detoxification Framework for Large Language Models | Kaituo Zhang et.al. | 2601.11776 | null |
| 2026-01-16 | Generative Scenario Rollouts for End-to-End Autonomous Driving | Rajeev Yasarla et.al. | 2601.11475 | null |
| 2026-01-16 | FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning | Tanyu Chen et.al. | 2601.11141 | null |
| 2026-01-16 | PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models | Qiyuan Zhang et.al. | 2601.11087 | null |
| 2026-01-16 | Your One-Stop Solution for AI-Generated Video Detection | Long Ma et.al. | 2601.11035 | null |
| 2026-01-15 | BYOL: Bring Your Own Language Into LLMs | Syed Waqas Zamir et.al. | 2601.10804 | null |
| 2026-01-15 | Inference-time Physics Alignment of Video Generative Models with Latent World Models | Jianhao Yuan et.al. | 2601.10553 | null |
| 2026-01-15 | TF3-RO-50M: Training Compact Romanian Language Models from Scratch on Synthetic Moral Microfiction | Mihai Dan Nadas et.al. | 2601.10410 | null |
| 2026-01-15 | Joint Bayesian inference of Earth’s magnetic field and core surface flow on millennial timescales | Andreas Nilsson et.al. | 2601.10344 | null |
| 2026-01-15 | Boundary-Aware NL2SQL: Integrating Reliability through Hybrid Reward and Data Synthesis | Songsong Tian et.al. | 2601.10318 | null |
| 2026-01-15 | ADVOSYNTH: A Synthetic Multi-Advocate Dataset for Speaker Identification in Courtroom Scenarios | Aniket Deroy et.al. | 2601.10315 | null |
| 2026-01-15 | In-Context Operator Learning on the Space of Probability Measures | Frank Cole et.al. | 2601.09979 | null |
| 2026-01-14 | Terminally constrained flow-based generative models from an optimal control perspective | Weiguo Gao et.al. | 2601.09474 | null |
| 2026-01-14 | Long-term Task-oriented Agent: Proactive Long-term Intent Maintenance in Dynamic Environments | Qinglong Shi et.al. | 2601.09382 | null |
| 2026-01-14 | Honesty-Aware Multi-Agent Framework for High-Fidelity Synthetic Data Generation in Digital Psychiatric Intake Doctor-Patient Interactions | Xinyuan Zhang et.al. | 2601.09216 | null |
| 2026-01-14 | Seeking Human Security Consensus: A Unified Value Scale for Generative AI Value Safety | Ying He et.al. | 2601.09112 | null |
| 2026-01-14 | Mi:dm 2.0 Korea-centric Bilingual Language Models | Donghoon Shin et.al. | 2601.09066 | null |
| 2026-01-13 | Spectral Generative Flow Models: A Physics-Inspired Replacement for Vectorized Large Language Models | Andrew Kiruluta et.al. | 2601.08893 | null |
| 2026-01-13 | RAGShaper: Eliciting Sophisticated Agentic RAG Skills via Automated Data Synthesis | Zhengwei Tao et.al. | 2601.08699 | null |
| 2026-01-13 | Creativity in AI as Emergence from Domain-Limited Generative Models | Corina Chutaux et.al. | 2601.08388 | null |
| 2026-01-13 | Training-Free Distribution Adaptation for Diffusion Models via Maximum Mean Discrepancy Guidance | Matina Mahdizadeh Sani et.al. | 2601.08379 | null |
| 2026-01-13 | Intra-tree Column Subsampling Hinders XGBoost Learning of Ratio-like Interactions | Mykola Pinchuk et.al. | 2601.08121 | null |
| 2026-01-12 | Studying the Role of Synthetic Data for Machine Learning-based Wireless Networks Traffic Forecasting | José Pulido et.al. | 2601.07646 | null |
| 2026-01-12 | Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning | Sijia li et.al. | 2601.07463 | null |
| 2026-01-12 | SceneNAT: Masked Generative Modeling for Language-Guided Indoor Scene Synthesis | Jeongjun Choi et.al. | 2601.07218 | null |
| 2026-01-12 | Agents of Diffusion: Enhancing Diffusion Language Models with Multi-Agent Reinforcement Learning for Structured Data Generation (Extended Version) | Aja Khanal et.al. | 2601.07152 | null |
| 2026-01-11 | Bridging Attribution and Open-Set Detection using Graph-Augmented Instance Learning in Synthetic Speech | Mohd Mujtaba Akhtar et.al. | 2601.07064 | null |
| 2026-01-11 | Codified Foreshadowing-Payoff Text Generation | Longfei Yun et.al. | 2601.07033 | null |
| 2026-01-11 | Continuous Energy Landscape Model for Analyzing Brain State Transitions | Triet M. Tran et.al. | 2601.06991 | null |
| 2026-01-11 | X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests | Jie Wu et.al. | 2601.06953 | null |
| 2026-01-11 | Generative Modeling of Human-Computer Interfaces with Diffusion Processes and Conditional Control | Rui Liu et.al. | 2601.06823 | null |
| 2026-01-11 | Cross-Modal Computational Model of Brain-Heart Interactions via HRV and EEG Feature | Malavika Pradeep et.al. | 2601.06792 | null |
| 2026-01-11 | CyberLLM-FINDS 2025: Instruction-Tuned Fine-tuning of Domain-Specific LLMs with Retrieval-Augmented Generation and Graph Integration for MITRE Evaluation | Vasanth Iyer et.al. | 2601.06779 | null |
| 2026-01-11 | When Humans Judge Irises: Pupil Size Normalization as an Aid and Synthetic Irises as a Challenge | Mahsa Mitcheff et.al. | 2601.06725 | null |
| 2026-01-10 | Characterising Toxicity in Generative Large Language Models | Zhiyao Zhang et.al. | 2601.06700 | null |
| 2026-01-10 | From Easy to Hard++: Promoting Differentially Private Image Synthesis Through Spatial-Frequency Curriculum | Chen Gong et.al. | 2601.06368 | null |
| 2026-01-09 | CARD: Cluster-level Adaptation with Reward-guided Decoding for Personalized Text Generation | Yutong Song et.al. | 2601.06352 | null |
| 2026-01-09 | Multi-Agent Framework for Controllable and Protected Generative Content Creation: Addressing Copyright and Provenance in AI-Generated Media | Haris Khan et.al. | 2601.06232 | null |
| 2026-01-09 | Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models | Bang Zeng et.al. | 2601.06006 | null |
| 2026-01-09 | GenCtrl – A Formal Controllability Toolkit for Generative Models | Emily Cheng et.al. | 2601.05637 | null |
| 2026-01-12 | Continual Pretraining on Encrypted Synthetic Data for Privacy-Preserving LLMs | Honghao Liu et.al. | 2601.05635 | null |
| 2026-01-09 | Research Integrity and Academic Authority in the Age of Artificial Intelligence: From Discovery to Curation? | Simon Chesterman et.al. | 2601.05574 | null |
| 2026-01-08 | A Bayesian Generative Modeling Approach for Arbitrary Conditional Inference | Qiao Liu et.al. | 2601.05355 | null |
| 2026-01-08 | Generate, Transfer, Adapt: Learning Functional Dexterous Grasping from a Single Human Demonstration | Xingyi He et.al. | 2601.05243 | null |
| 2026-01-08 | DocDancer: Towards Agentic Document-Grounded Information Seeking | Qintong Zhang et.al. | 2601.05163 | null |
| 2026-01-08 | EvolSQL: Structure-Aware Evolution for Scalable Text-to-SQL Data Synthesis | Xuanguang Pan et.al. | 2601.04875 | null |
| 2026-01-08 | PROMISE: Process Reward Models Unlock Test-Time Scaling Laws in Generative Recommendations | Chengcheng Guo et.al. | 2601.04674 | null |
| 2026-01-08 | Know Thy Enemy: Securing LLMs Against Prompt Injection via Diverse Data Synthesis and Instruction-Level Chain-of-Thought Learning | Zhiyuan Chang et.al. | 2601.04666 | null |
| 2026-01-08 | LLMs-Integrated Automatic Hate Speech Recognition Using Controllable Text Generation Models | Ryutaro Oshima et.al. | 2601.04654 | null |
| 2026-01-08 | 3D Conditional Image Synthesis of Left Atrial LGE MRI from Composite Semantic Masks | Yusri Al-Sanaani et.al. | 2601.04588 | null |
| 2026-01-08 | BanglaLorica: Design and Evaluation of a Robust Watermarking Algorithm for Large Language Models in Bangla Text Generation | Amit Bin Tariqul et.al. | 2601.04534 | null |
| 2026-01-07 | From Domains to Instances: Dual-Granularity Data Synthesis for LLM Unlearning | Xiaoyu Xu et.al. | 2601.04278 | null |
| 2026-01-04 | LEMAS: Large A 150K-Hour Large-scale Extensible Multilingual Audio Suite with Generative Speech Models | Zhiyuan Zhao et.al. | 2601.04233 | null |
| 2026-01-07 | SpeakerSleuth: Evaluating Large Audio-Language Models as Judges for Multi-turn Speaker Consistency | Jonggeun Lee et.al. | 2601.04029 | null |
| 2026-01-12 | Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control | Changhao Jiang et.al. | 2601.03973 | null |
| 2026-01-07 | Local Interpolation via Low-Rank Tensor Trains | Siddhartha E. Guzman et.al. | 2601.03885 | null |
| 2026-01-07 | Logic Tensor Network-Enhanced Generative Adversarial Network | Nijesh Upreti et.al. | 2601.03839 | null |
| 2026-01-07 | Prompt Tuning without Labeled Samples for Zero-Shot Node Classification in Text-Attributed Graphs | Sethupathy Parameswaran et.al. | 2601.03793 | null |
| 2026-01-07 | VietMed-MCQ: A Consistency-Filtered Data Synthesis Framework for Vietnamese Traditional Medicine Evaluation | Huynh Trung Kiet et.al. | 2601.03792 | null |
| 2026-01-07 | A Comparative Study of 3D Model Acquisition Methods for Synthetic Data Generation of Agricultural Products | Steven Moonen et.al. | 2601.03784 | null |
| 2026-01-07 | Evaluation of Multilingual LLMs Personalized Text Generation Capabilities Targeting Groups and Social-Media Platforms | Dominik Macko et.al. | 2601.03752 | null |
| 2026-01-07 | Domain Adaptation of the Pyannote Diarization Pipeline for Conversational Indonesian Audio | Muhammad Daffa’i Rafi Prasetyo et.al. | 2601.03684 | null |
| 2026-01-07 | Towards Compositional Generalization of LLMs via Skill Taxonomy Guided Data Synthesis | Yifan Wei et.al. | 2601.03676 | null |
| 2026-01-07 | eTracer: Towards Traceable Text Generation via Claim-Level Grounding | Bohao Chu et.al. | 2601.03669 | null |
| 2026-01-06 | Fine-tuning Small Language Models as Efficient Enterprise Search Relevance Labelers | Yue Kang et.al. | 2601.03211 | null |
| 2026-01-06 | UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward | Yile Liu et.al. | 2601.03205 | null |
| 2026-01-06 | Quality Degradation Attack in Synthetic Data | Qinyi Liu et.al. | 2601.02947 | null |
| 2026-01-06 | Vulnerabilities of Audio-Based Biometric Authentication Systems Against Deepfake Speech Synthesis | Mengze Hong et.al. | 2601.02914 | null |
| 2026-01-06 | Q-Regularized Generative Auto-Bidding: From Suboptimal Trajectories to Optimal Policies | Mingming Zhang et.al. | 2601.02754 | null |
| 2026-01-06 | Omni2Sound: Towards Unified Video-Text-to-Audio Generation | Yusheng Dai et.al. | 2601.02731 | null |
| 2026-01-06 | GRRE: Leveraging G-Channel Removed Reconstruction Error for Robust Detection of AI-Generated Images | Shuman He et.al. | 2601.02709 | null |
| 2026-01-05 | Generative Site-Specific Beamforming for Next-Generation Spatial Intelligence | Zhaolin Wang et.al. | 2601.02301 | null |
| 2026-01-05 | HeadLighter: Disentangling Illumination in Generative 3D Gaussian Heads via Lightstage Captures | Yating Wang et.al. | 2601.02103 | null |
| 2026-01-05 | SerpentFlow: Generative Unpaired Domain Alignment via Shared-Structure Decomposition | Julie Keisler et.al. | 2601.01979 | null |
| 2026-01-05 | Forget Less by Learning from Parents Through Hierarchical Relationships | Arjun Ramesh Kaushik et.al. | 2601.01892 | null |
| 2026-01-04 | Deep Linear Discriminant Analysis Revisited | Maxat Tezekbayev et.al. | 2601.01619 | null |
| 2026-01-08 | MM-Sonate: Multimodal Controllable Audio-Video Generation with Zero-Shot Voice Cloning | Chunyu Qiang et.al. | 2601.01568 | null |
| 2026-01-08 | Logics-STEM: Empowering LLM Reasoning via Failure-Driven Post-Training and Document Knowledge Enhancement | Mingyu Xu et.al. | 2601.01562 | null |
| 2026-01-04 | DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving | Yang Zhou et.al. | 2601.01528 | null |
| 2026-01-03 | GenCAMO: Scene-Graph Contextual Decoupling for Environment-aware and Mask-free Camouflage Image-Dense Annotation Generation | Chenglizhao Chen et.al. | 2601.01181 | null |
| 2026-01-03 | Histogram Assisted Quality Aware Generative Model for Resolution Invariant NIR Image Colorization | Abhinav Attri et.al. | 2601.01103 | null |
| 2026-01-03 | Luminark: Training-free, Probabilistically-Certified Watermarking for General Vision Generative Models | Jiayi Xu et.al. | 2601.01085 | null |
| 2026-01-06 | Coarse-Grained Kullback–Leibler Control of Diffusion-Based Generative AI | Tatsuaki Tsuruyama et.al. | 2601.01045 | null |
| 2025-12-31 | A Chemically Grounded Evaluation Framework for Generative Models in Materials Discovery | Elohan Veillon et.al. | 2601.00886 | null |
| 2025-12-30 | Path Integral Solution for Dissipative Generative Dynamics | Xidi Wang et.al. | 2601.00860 | null |
| 2026-01-02 | FedHypeVAE: Federated Learning with Hypernetwork Generated Conditional VAEs for Differentially Private Embedding Sharing | Sunny Gupta et.al. | 2601.00785 | null |
| 2026-01-02 | Gradient-free ensemble transform methods for generalized Bayesian inference in generative models | Diksha Bhandari et.al. | 2601.00760 | null |
| 2026-01-02 | Peak-Nadir Encoding for Efficient CGM Data Compression and High-Fidelity Reconstruction | Clara Bender et.al. | 2601.00608 | null |
| 2026-01-01 | Unknown Aware AI-Generated Content Attribution | Ellie Thieu et.al. | 2601.00218 | null |
| 2025-12-31 | Generative Classifiers Avoid Shortcut Solutions | Alexander C. Li et.al. | 2512.25034 | null |
| 2025-12-31 | ShowUI- $π$ : Flow-based Generative Models as GUI Dexterous Hands | Siyuan Hu et.al. | 2512.24965 | null |
| 2025-12-31 | Limits of quantum generative models with classical sampling hardness | Sabrina Herbst et.al. | 2512.24801 | null |
| 2025-12-31 | HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment | Yunsheng Pang et.al. | 2512.24787 | null |
| 2025-12-30 | Generative forecasting with joint probability models | Patrick Wyrod et.al. | 2512.24446 | null |
| 2025-12-30 | GPT-like transformer model for silicon tracking detector simulation | Tadej Novak et.al. | 2512.24254 | null |
| 2025-12-30 | Assured Autonomy: How Operations Research Powers and Orchestrates Generative AI Systems | Tinglong Dai et.al. | 2512.23978 | null |
| 2025-12-30 | Assessing generative modeling approaches for free energy estimates in condensed matter | Maximilian Schebek et.al. | 2512.23930 | null |
| 2025-12-29 | Flow Matching Neural Processes | Hussen Abu Hamad et.al. | 2512.23853 | null |
| 2025-12-29 | Exploiting the Prior of Generative Time Series Imputation | YuYang Miao et.al. | 2512.23832 | null |
| 2025-12-26 | State-of-the-art Small Language Coder Model: Mify-Coder | Abhinav Parmar et.al. | 2512.23747 | null |
| 2025-12-29 | Diffusion priors enhanced velocity model building from time-lag images using a neural operator | Xiao Ma et.al. | 2512.23375 | null |
| 2025-12-29 | AGRO-SQL: Agentic Group-Relative Optimization with High-Fidelity Data Synthesis | Cehua Yang et.al. | 2512.23366 | null |
| 2025-12-29 | Flow2GAN: Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-step High-Fidelity Audio Generation | Zengwei Yao et.al. | 2512.23278 | null |
| 2025-12-29 | Anomaly Detection by Effectively Leveraging Synthetic Images | Sungho Kang et.al. | 2512.23227 | null |
| 2025-12-29 | PathoSyn: Imaging-Pathology MRI Synthesis via Disentangled Deviation Diffusion | Jian Wang et.al. | 2512.23130 | null |
| 2025-12-27 | Quantum Generative Models for Computational Fluid Dynamics: A First Exploration of Latent Space Learning in Lattice Boltzmann Simulations | Achraf Hsain et.al. | 2512.22672 | null |
| 2025-12-27 | Visual Autoregressive Modelling for Monocular Depth Estimation | Amir El-Ghoussani et.al. | 2512.22653 | null |
| 2025-12-26 | LLA: Enhancing Security and Privacy for Generative Models with Logic-Locked Accelerators | You Li et.al. | 2512.22307 | null |
| 2025-12-25 | Human-Aligned Generative Perception: Bridging Psychophysics and Generative Models | Antara Titikhsha et.al. | 2512.22272 | null |
| 2025-12-26 | From In Silico to In Vitro: Evaluating Molecule Generative Models for Hit Generation | Nagham Osman et.al. | 2512.22031 | null |
| 2025-12-29 | Deep Generative Models for Synthetic Financial Data: Applications to Portfolio and Risk Modeling | Christophe D. Hounwanou et.al. | 2512.21798 | null |
| 2025-12-25 | Synthetic Financial Data Generation for Enhanced Financial Modelling | Christophe D. Hounwanou et.al. | 2512.21791 | null |
| 2025-12-25 | BeHGAN: Bengali Handwritten Word Generation from Plain Text Using Generative Adversarial Networks | Md. Rakibul Islam et.al. | 2512.21694 | null |
| 2025-12-25 | Dictionary-Transform Generative Adversarial Networks | Angshul Majumdar et.al. | 2512.21677 | null |
| 2025-12-25 | Residual Prior Diffusion: A Probabilistic Framework Integrating Coarse Latent Priors with Diffusion Models | Takuro Kutsuna et.al. | 2512.21593 | null |
| 2025-12-25 | Generative Actor Critic | Aoyang Qin et.al. | 2512.21527 | null |
| 2025-12-24 | A Reinforcement Learning Approach to Synthetic Data Generation | Natalia Espinosa-Dice et.al. | 2512.21395 | null |
| 2025-12-24 | A Turn Toward Better Alignment: Few-Shot Generative Adaptation with Equivariant Feature Rotation | Chenghao Xu et.al. | 2512.21174 | null |
| 2025-12-24 | Active inference and artificial reasoning | Karl Friston et.al. | 2512.21129 | null |
| 2025-12-24 | PUFM++: Point Cloud Upsampling via Enhanced Flow Matching | Zhi-Song Liu et.al. | 2512.20988 | null |
| 2025-12-24 | X-ray Insights Unleashed: Pioneering the Enhancement of Multi-Label Long-Tail Data | Xinquan Yang et.al. | 2512.20980 | null |
| 2025-12-24 | GenTSE: Enhancing Target Speaker Extraction via a Coarse-to-Fine Generative Language Model | Haoyang Li et.al. | 2512.20978 | null |
| 2025-12-24 | Beyond Artifacts: Real-Centric Envelope Modeling for Reliable AI-Generated Image Detection | Ruiqi Liu et.al. | 2512.20937 | null |
| 2025-12-23 | Improving Matrix Exponential for Generative AI Flows: A Taylor-Based Approach Beyond Paterson–Stockmeyer | Jorge Sastre et.al. | 2512.20777 | null |
| 2025-12-23 | UTDesign: A Unified Framework for Stylized Text Editing and Generation in Graphic Design Images | Yiming Zhao et.al. | 2512.20479 | null |
| 2025-12-23 | Enriching Earth Observation labeled data with Quantum Conditioned Diffusion Models | Francesco Mauro et.al. | 2512.20448 | null |
| 2025-12-23 | Structured Visualization Design Knowledge for Grounding Generative Reasoning and Situated Feedback | Péter Ferenc Gyarmati et.al. | 2512.20306 | null |
| 2025-12-23 | HGAN-SDEs: Learning Neural Stochastic Differential Equations with Hermite-Guided Adversarial Training | Yuanjian Xu et.al. | 2512.20272 | null |
| 2025-12-23 | Automated Training of Learned Database Components with Generative AI | Angjela Davitkova et.al. | 2512.20271 | null |
| 2025-12-23 | Aliasing-Free Neural Audio Synthesis | Yicheng Gu et.al. | 2512.20211 | null |
| 2025-12-23 | QuarkAudio Technical Report | Chengwei Liu et.al. | 2512.20151 | null |
| 2025-12-22 | Modeling Non-Ergodic Path Effects Using Conditional Generative Model for Fourier Amplitude Spectra | Maxime Lacour et.al. | 2512.19909 | null |
| 2025-12-22 | Generative diffusion models for agricultural AI: plant image generation, indoor-to-outdoor translation, and expert preference alignment | Da Tan et.al. | 2512.19632 | null |
| 2025-12-22 | MapTrace: Scalable Data Generation for Route Tracing on Maps | Artemis Panagopoulou et.al. | 2512.19609 | null |
| 2025-12-22 | GLUE: Generative Latent Unification of Expertise-Informed Engineering Models | Tim Aebersold et.al. | 2512.19469 | null |
| 2025-12-23 | SiamGPT: Quality-First Fine-Tuning for Stable Thai Text Generation | Thittipat Pairatsuppawat et.al. | 2512.19455 | null |
| 2025-12-22 | A Rate-Distortion Perspective on the Emergence of Number Sense in Unsupervised Generative Models | Leo D’Amato et.al. | 2512.19450 | null |
| 2025-12-26 | Real-Time Streamable Generative Speech Restoration with Flow Matching | Simon Welker et.al. | 2512.19442 | null |
| 2025-12-22 | Generative Krylov Subspace Representations for Scalable Quantum Eigensolvers | Changwon Lee et.al. | 2512.19420 | null |
| 2025-12-22 | Interpretable Hybrid Deep Q-Learning Framework for IoT-Based Food Spoilage Prediction with Synthetic Data Generation and Hardware Validation | Isshaan Singh et.al. | 2512.19361 | null |
| 2025-12-22 | VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis | Meng Chu et.al. | 2512.19243 | null |
| 2025-12-22 | JoyVoice: Long-Context Conditioning for Anthropomorphic Multi-Speaker Conversational Synthesis | Fan Yu et.al. | 2512.19090 | null |
| 2025-12-22 | Efficient Personalization of Generative Models via Optimal Experimental Design | Guy Schacht et.al. | 2512.19057 | null |
| 2025-12-22 | Decoupled Generative Modeling for Human-Object Interaction Synthesis | Hwanhee Jung et.al. | 2512.19049 | null |
| 2025-12-22 | On Conditional Stochastic Interpolation for Generative Nonlinear Sufficient Dimension Reduction | Shuntuo Xu et.al. | 2512.18971 | null |
| 2025-12-22 | Symmetrization of 3D Generative Models | Nicolas Caytuiro et.al. | 2512.18953 | null |
| 2025-12-21 | Generative Modeling through Spectral Analysis of Koopman Operator | Yuanchao Xu et.al. | 2512.18837 | null |
| 2025-12-23 | Social Comparison without Explicit Inference of Others’ Reward Values: A Constructive Approach Using a Probabilistic Generative Model | Yosuke Taniuchi et.al. | 2512.18687 | null |
| 2025-12-20 | Feature-Enhanced Graph Neural Networks for Classification of Synthetic Graph Generative Models: A Benchmarking Study | Janek Dyer et.al. | 2512.18524 | null |
| 2025-12-20 | Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models | Chao Wen et.al. | 2512.18388 | null |
| 2025-12-19 | Generative Multi-Objective Bayesian Optimization with Scalable Batch Evaluations for Sample-Efficient De Novo Molecular Design | Madhav R. Muthyala et.al. | 2512.17659 | null |
| 2025-12-19 | Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs | Zhaolin Cai et.al. | 2512.17640 | null |
| 2025-12-19 | InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion | Hoiyeong Jin et.al. | 2512.17504 | null |
| 2025-12-19 | 3D-RE-GEN: 3D Reconstruction of Indoor Scenes with a Generative Framework | Tobias Sautter et.al. | 2512.17459 | null |
| 2025-12-22 | Generative modeling of conditional probability distributions on the level-sets of collective variables | Fatima-Zahrae Akhyar et.al. | 2512.17374 | null |
| 2025-12-18 | Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection | Kaixin Ding et.al. | 2512.16905 | null |
| 2025-12-18 | Sceniris: A Fast Procedural Scene Generation Framework | Jinghuan Shang et.al. | 2512.16896 | null |
| 2025-12-18 | Task-Oriented Data Synthesis and Control-Rectify Sampling for Remote Sensing Semantic Segmentation | Yunkai Yang et.al. | 2512.16740 | null |
| 2025-12-18 | Empirical Evaluation of Structured Synthetic Data Privacy Metrics: Novel experimental framework | Milton Nicolás Plasencia Palacios et.al. | 2512.16284 | null |
| 2025-12-18 | Scaling Spatial Reasoning in MLLMs through Programmatic Data Synthesis | Zhi Helu et.al. | 2512.16237 | null |
| 2025-12-18 | ToolForge: A Data Synthesis Pipeline for Multi-Hop Search without Real-World APIs | Hao Chen et.al. | 2512.16149 | null |
| 2025-12-18 | Evaluation of Generative Models for Emotional 3D Animation Generation in VR | Kiran Chhatre et.al. | 2512.16081 | null |
| 2025-12-17 | SoFlow: Solution Flow Models for One-Step Generative Modeling | Tianze Luo et.al. | 2512.15657 | null |
| 2025-12-17 | On Assessing the Relevance of Code Reviews Authored by Generative Models | Robert Heumüller et.al. | 2512.15466 | null |
| 2025-12-17 | Expand and Prune: Maximizing Trajectory Diversity for Effective GRPO in Generative Models | Shiran Ge et.al. | 2512.15347 | null |
| 2025-12-17 | SynthSeg-Agents: Multi-Agent Synthetic Data Generation for Zero-Shot Weakly Supervised Semantic Segmentation | Wangyu Wu et.al. | 2512.15310 | null |
| 2025-12-16 | Polypersona: Persona-Grounded LLM for Synthetic Survey Responses | Tejaswani Dash et.al. | 2512.14562 | null |
| 2025-12-16 | C-ing Clearly: Enhanced Binary Code Explanations using C code | Teodor Poncu et.al. | 2512.14500 | null |
| 2025-12-16 | A data-physics hybrid generative model for patient-specific post-stroke motor rehabilitation using wearable sensor data | Yanning Dai et.al. | 2512.14329 | null |
| 2025-12-16 | SS4D: Native 4D Generative Model via Structured Spacetime Latents | Zhibing Li et.al. | 2512.14284 | null |
| 2025-12-16 | Beyond MMD: Evaluating Graph Generative Models with Geometric Deep Learning | Salvatore Romano et.al. | 2512.14241 | null |
| 2025-12-16 | Estimating problem difficulty without ground truth using Large Language Model comparisons | Marthe Ballon et.al. | 2512.14220 | null |
| 2025-12-16 | Random-Bridges as Stochastic Transports for Generative Models | Stefano Goria et.al. | 2512.14190 | null |
| 2025-12-16 | Quantum-Inspired Approach to Analyzing Complex System Dynamics | Parsa Kafashi et.al. | 2512.14169 | null |
| 2025-12-16 | An intercomparison of generative machine learning methods for downscaling precipitation at fine spatial scales | Bryn Ward-Leikis et.al. | 2512.13987 | null |
| 2025-12-15 | An evaluation of SVBRDF Prediction from Generative Image Models for Appearance Modeling of 3D Scenes | Alban Gauthier et.al. | 2512.13950 | null |
| 2025-12-15 | Deepfakes in the 2025 Canadian Election: Prevalence, Partisanship, and Platform Dynamics | Victor Livernoche et.al. | 2512.13915 | null |
| 2025-12-15 | SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning | Jitesh Jain et.al. | 2512.13874 | null |
| 2025-12-15 | Improving the Plausibility of Pressure Distributions Synthesized from Depth through Generative Modeling | Neevkumar Manavar et.al. | 2512.13757 | null |
| 2025-12-16 | Lyra: A Hardware-Accelerated RISC-V Verification Framework with Generative Model-Based Processor Fuzzing | Juncheng Huo et.al. | 2512.13686 | null |
| 2025-12-15 | JoVA: Unified Multimodal Learning for Joint Video-Audio Generation | Xiaohu Huang et.al. | 2512.13677 | null |
| 2025-12-15 | PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation | Hour Kaing et.al. | 2512.13552 | null |
| 2025-12-19 | Non-Resolution Reasoning (NRR): A Computational Framework for Contextual Identity and Ambiguity Preservation | Kei Saito et.al. | 2512.13478 | null |
| 2025-12-15 | ALIGN-FL: Architecture-independent Learning through Invariant Generative component sharing in Federated Learning | Mayank Gulati et.al. | 2512.13316 | null |
| 2025-12-18 | DisCo-Speech: Controllable Zero-Shot Speech Generation with A Disentangled Speech Codec | Tao Li et.al. | 2512.13251 | null |
| 2025-12-16 | POLAR: A Portrait OLAT Dataset and Generative Framework for Illumination-Aware Face Modeling | Zhuo Chen et.al. | 2512.13192 | null |
| 2025-12-16 | A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis | Xianchao Guan et.al. | 2512.13164 | null |
| 2025-12-14 | NagaNLP: Bootstrapping NLP for Low-Resource Nagamese Creole with Human-in-the-Loop Synthetic Data | Agniva Maiti et.al. | 2512.12537 | null |
| 2025-12-13 | Bayesian Full-waveform Monitoring of CO2 Storage with Fluid-flow Priors via Generative Modeling | Haipeng Li et.al. | 2512.12482 | null |
| 2025-12-13 | ArtGen: Conditional Generative Modeling of Articulated Objects in Arbitrary Part-Level States | Haowen Wang et.al. | 2512.12395 | null |
| 2025-12-13 | Quantum-Aware Generative AI for Materials Discovery: A Framework for Robust Exploration Beyond DFT Biases | Mahule Roy et.al. | 2512.12288 | null |
| 2025-12-13 | Rethinking Label Consistency of In-Context Learning: An Implicit Transductive Label Propagation Perspective | Haoyang Chen et.al. | 2512.12175 | null |
| 2025-12-13 | A comparative study of generative models for child voice conversion | Protima Nomo Sudro et.al. | 2512.12129 | null |
| 2025-12-12 | AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis | Junjie Ye et.al. | 2512.11797 | null |
| 2025-12-12 | Super Suffixes: Bypassing Text Generation Alignment and Guard Models Simultaneously | Andrew Adiletta et.al. | 2512.11783 | null |
| 2025-12-12 | Referring Change Detection in Remote Sensing Imagery | Yilmaz Korkmaz et.al. | 2512.11719 | null |
| 2025-12-12 | Emergence of Nonequilibrium Latent Cycles in Unsupervised Generative Modeling | Marco Baiesi et.al. | 2512.11415 | null |
| 2025-12-12 | Iterative Compositional Data Generation for Robot Control | Anh-Quan Pham et.al. | 2512.10891 | null |
| 2025-12-11 | Generative Modeling from Black-box Corruptions via Self-Consistent Stochastic Interpolants | Chirag Modi et.al. | 2512.10857 | null |
| 2025-12-11 | Beyond the Black Box: Identifiable Interpretation and Control in Generative Models via Causal Minimality | Lingjing Kong et.al. | 2512.10720 | null |
| 2025-12-11 | TriDF: Evaluating Perception, Detection, and Hallucination for Interpretable DeepFake Detection | Jian-Yu Jiang-Lin et.al. | 2512.10652 | null |
| 2025-12-11 | AgriGPT-Omni: A Unified Speech-Vision-Text Framework for Multilingual Agricultural Intelligence | Bo Yang et.al. | 2512.10624 | null |
| 2025-12-11 | Breaking the Vicious Cycle: Coherent 3D Gaussian Splatting from Sparse and Motion-Blurred Views | Zhankuo Xu et.al. | 2512.10369 | null |
| 2025-12-10 | Generative Modeling of Entangled Polymers with a Distance-Based Variational Autoencoder | Pietro Chiarantoni et.al. | 2512.10131 | null |
| 2025-12-10 | Workflow is All You Need: Escaping the “Statistical Smoothing Trap” via High-Entropy Information Foraging and Adversarial Pacing | Zhongjie Jiang et.al. | 2512.10121 | null |
| 2025-12-10 | PathCo-LatticE: Pathology-Constrained Lattice-Of Experts Framework for Fully-supervised Few-Shot Cardiac MRI Segmentation | Mohamed Elbayumi et.al. | 2512.09779 | null |
| 2025-12-10 | Membership and Dataset Inference Attacks on Large Audio Generative Models | Jakub Proboszcz et.al. | 2512.09654 | null |
| 2025-12-10 | ImageTalk: Designing a Multimodal AAC Text Generation System Driven by Image Recognition and Natural Language Generation | Boyin Yang et.al. | 2512.09610 | null |
| 2025-12-10 | Lazy Diffusion: Mitigating spectral collapse in generative diffusion-based stable autoregressive emulation of turbulent flows | Anish Sambamurthy et.al. | 2512.09572 | null |
| 2025-12-10 | Toward Closed-loop Molecular Discovery via Language Model, Property Alignment and Strategic Search | Junkai Ji et.al. | 2512.09566 | null |
| 2025-12-10 | Transport Novelty Distance: A Distributional Metric for Evaluating Material Generative Models | Paul Hagemann et.al. | 2512.09514 | null |
| 2025-12-10 | Color encoding in Latent Space of Stable Diffusion Models | Guillem Arias et.al. | 2512.09477 | null |
| 2025-12-10 | Generative Point Cloud Registration | Haobo Jiang et.al. | 2512.09407 | null |
| 2025-12-10 | ASSIST-3D: Adapted Scene Synthesis for Class-Agnostic 3D Instance Segmentation | Shengchao Zhou et.al. | 2512.09364 | null |
| 2025-12-12 | SDialog: A Python Toolkit for End-to-End Agent Building, User Simulation, Dialog Generation, and Evaluation | Sergio Burdisso et.al. | 2512.09142 | null |
| 2025-12-09 | Contrast transfer functions help quantify neural network out-of-distribution generalization in HRTEM | Luis Rangel DaCosta et.al. | 2512.09067 | null |
| 2025-12-09 | A Survey of Body and Face Motion: Datasets, Performance Evaluation Metrics and Generative Techniques | Lownish Rai Sookha et.al. | 2512.09005 | null |
| 2025-12-08 | Demo: Generative AI helps Radiotherapy Planning with User Preference | Riqiang Gao et.al. | 2512.08996 | null |
| 2025-12-09 | When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation | Joshua Ward et.al. | 2512.08875 | null |
| 2025-12-09 | Differentially Private Synthetic Data Generation Using Context-Aware GANs | Anantaa Kotal et.al. | 2512.08869 | null |
| 2025-12-09 | Democratizing ML for Enterprise Security: A Self-Sustained Attack Detection Framework | Sadegh Momeni et.al. | 2512.08802 | null |
| 2025-12-09 | LoFA: Learning to Predict Personalized Priors for Fast Adaptation of Visual Generative Models | Yiming Hao et.al. | 2512.08785 | null |
| 2025-12-09 | Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank | Shaofeng Zhang et.al. | 2512.08648 | null |
| 2025-12-09 | HealthcareNLP: where are we and what is next? | Lifeng Han et.al. | 2512.08617 | null |
| 2025-12-09 | A Novel Wasserstein Quaternion Generative Adversarial Network for Color Image Generation | Zhigang Jia et.al. | 2512.08542 | null |
| 2025-12-09 | PAVAS: Physics-Aware Video-to-Audio Synthesis | Oh Hyun-Bin et.al. | 2512.08282 | null |
| 2025-12-09 | Worst-case generation via minimax optimization in Wasserstein space | Xiuyuan Cheng et.al. | 2512.08176 | null |
| 2025-12-08 | SwissGov-RSD: A Human-annotated, Cross-lingual Benchmark for Token-level Recognition of Semantic Differences Between Related Documents | Michelle Wastl et.al. | 2512.07538 | null |
| 2025-12-07 | Progress Ratio Embeddings: An Impatience Signal for Robust Length Control in Neural Text Generation | Ivanhoé Botcazou et.al. | 2512.06938 | null |
| 2025-12-07 | RunawayEvil: Jailbreaking the Image-to-Video Generative Models | Songping Wang et.al. | 2512.06674 | null |
| 2025-12-06 | Generic visuality of war? How image-generative AI models (mis)represent Russia’s war against Ukraine | Mykola Makhortykh et.al. | 2512.06570 | null |
| 2025-12-06 | SUGAR: A Sweeter Spot for Generative Unlearning of Many Identities | Dung Thuy Nguyen et.al. | 2512.06562 | null |
| 2025-12-06 | LOCUS: A System and Method for Low-Cost Customization for Universal Specialization | Dhanasekar Sundararaman et.al. | 2512.06239 | null |
| 2025-12-05 | When Privacy Isn’t Synthetic: Hidden Data Leakage in Generative AI Models | S. M. Mustaqim et.al. | 2512.06062 | null |
| 2025-12-04 | DreamFoley: Scalable VLMs for High-Fidelity Video-to-Audio Generation | Fu Li et.al. | 2512.06022 | null |
| 2025-12-05 | MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution | Sara Patel et.al. | 2512.05958 | null |
| 2025-12-05 | Impugan: Learning Conditional Generative Models for Robust Data Imputation | Zalish Mahmud et.al. | 2512.05950 | null |
| 2025-12-05 | Measuring the Effect of Background on Classification and Feature Importance in Deep Learning for AV Perception | Anne Sielemann et.al. | 2512.05937 | null |
| 2025-12-05 | Synset Signset Germany: a Synthetic Dataset for German Traffic Sign Recognition | Anne Sielemann et.al. | 2512.05936 | null |
| 2025-12-05 | A Comparative Study on Synthetic Facial Data Generation Techniques for Face Recognition | Pedro Vidal et.al. | 2512.05928 | null |
| 2025-12-05 | 3D Path Planning for Robot-assisted Vertebroplasty from Arbitrary Bi-plane X-ray via Differentiable Rendering | Blanca Inigo et.al. | 2512.05803 | null |
| 2025-12-08 | General and Domain-Specific Zero-shot Detection of Generated Images via Conditional Likelihood | Roy Betser et.al. | 2512.05590 | null |
| 2025-12-05 | SSDLabeler: Realistic semi-synthetic data generation for multi-label artifact classification in EEG | Taketo Akama et.al. | 2512.05500 | null |
| 2025-12-05 | SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling | Elisabetta Fedele et.al. | 2512.05343 | null |
| 2025-12-04 | Light-X: Generative 4D Video Rendering with Camera and Illumination Control | Tianqi Liu et.al. | 2512.05115 | null |
| 2025-12-04 | ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning | Shengyuan Ding et.al. | 2512.05111 | null |
| 2025-12-04 | OMTRA: A Multi-Task Generative Model for Structure-Based Drug Design | Ian Dunn et.al. | 2512.05080 | null |
| 2025-12-04 | Object Reconstruction under Occlusion with Generative Priors and Contact-induced Constraints | Minghan Zhu et.al. | 2512.05079 | null |
| 2025-12-04 | HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition | Pham Thach Thanh Truc et.al. | 2512.05021 | null |
| 2025-12-04 | Generative Neural Video Compression via Video Diffusion Prior | Qi Mao et.al. | 2512.05016 | null |
| 2025-12-04 | Reflection Removal through Efficient Adaptation of Diffusion Transformers | Daniyar Zakarin et.al. | 2512.05000 | null |
| 2025-12-04 | Efficient Generative Transformer Operators For Million-Point PDEs | Armand Kassaï Koupaï et.al. | 2512.04974 | null |
| 2025-12-04 | LatentFM: A Latent Flow Matching Approach for Generative Medical Image Segmentation | Huynh Trinh Ngoc et.al. | 2512.04821 | null |
| 2025-12-04 | LaFiTe: A Generative Latent Field for 3D Native Texturing | Chia-Hao Chen et.al. | 2512.04786 | null |
| 2025-12-04 | Complementary Characterization of Agent-Based Models via Computational Mechanics and Diffusion Models | Roberto Garrone et.al. | 2512.04771 | null |
| 2025-12-04 | LeMat-GenBench: A Unified Evaluation Framework for Crystal Generative Models | Siddharth Betala et.al. | 2512.04562 | null |
| 2025-12-04 | One-Step Generative Channel Estimation via Average Velocity Field | Zehua Jiang et.al. | 2512.04501 | null |
| 2025-12-04 | UniTS: Unified Time Series Generative Model for Remote Sensing | Yuxiang Zhang et.al. | 2512.04461 | null |
| 2025-12-03 | ActVAE: Modelling human activity schedules with a deep conditional generative approach | Fred Shone et.al. | 2512.04223 | null |
| 2025-12-03 | ReasonX: MLLM-Guided Intrinsic Image Decomposition | Alara Dirik et.al. | 2512.04222 | null |
| 2025-12-03 | Stable Signer: Hierarchical Sign Language Generative Model | Sen Fang et.al. | 2512.04048 | null |
| 2025-12-03 | Fast & Efficient Normalizing Flows and Applications of Image Generative Models | Sandeep Nagar et.al. | 2512.04039 | null |
| 2025-12-03 | Towards Privacy-Preserving Range Queries with Secure Learned Spatial Index over Encrypted Data | Zuan Wang et.al. | 2512.03669 | null |
| 2025-12-03 | AdaPower: Specializing World Foundation Models for Predictive Manipulation | Yuhang Huang et.al. | 2512.03538 | null |
| 2025-12-02 | SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning | Salman Rahman et.al. | 2512.03244 | null |
| 2025-12-02 | InvertiTune: High-Quality Data Synthesis for Cost-Effective Single-Shot Text-to-Knowledge Graph Generation | Faezeh Faez et.al. | 2512.03197 | null |
| 2025-12-02 | ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation | Mengchen Zhang et.al. | 2512.03036 | null |
| 2025-12-04 | LORE: A Large Generative Model for Search Relevance | Chenji Lu et.al. | 2512.03025 | null |
| 2025-12-02 | In Silico Development of Psychometric Scales: Feasibility of Representative Population Data Simulation with LLMs | Enrico Cipriani et.al. | 2512.02910 | null |
| 2025-12-02 | Leveraging generative adversarial networks with spatially adaptive denormalization for multivariate stochastic seismic data inversion | Roberto Miele et.al. | 2512.02863 | null |
| 2025-12-02 | Making Dialogue Grounding Data Rich: A Three-Tier Data Synthesis Framework for Generalized Referring Expression Comprehension | Juexi Shao et.al. | 2512.02791 | null |
| 2025-12-02 | Self-Improving AI Agents through Self-Play | Przemyslaw Chojecki et.al. | 2512.02731 | null |
| 2025-12-02 | Generative modeling using evolved quantum Boltzmann machines | Mark M. Wilde et.al. | 2512.02721 | null |
| 2025-12-02 | ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data | Yuxing Liu et.al. | 2512.02686 | null |
| 2025-12-02 | Generative Multi-modal Feedback for Singing Voice Synthesis Evaluation | Xueyan Li et.al. | 2512.02523 | null |
| 2025-12-02 | Data Curation Through the Lens of Spectral Dynamics: Static Limits, Dynamic Acceleration, and Practical Oracles | Yizhou Zhang et.al. | 2512.02409 | null |
| 2025-12-01 | InstructLR: A Scalable Approach to Create Instruction Dataset for Under-Resourced Languages | Mamadou K. Keita et.al. | 2512.02213 | null |
| 2025-12-01 | Generative Video Motion Editing with 3D Point Tracks | Yao-Chih Lee et.al. | 2512.02015 | null |
| 2025-12-01 | Improved Mean Flows: On the Challenges of Fastforward Generative Models | Zhengyang Geng et.al. | 2512.02012 | null |
| 2025-12-02 | Consistent Synthetic Sequences Unlock Structural Diversity in Fully Atomistic De Novo Protein Design | Danny Reidenbach et.al. | 2512.01976 | null |
| 2025-12-01 | From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning | Sitao Cheng et.al. | 2512.01970 | null |
| 2025-12-01 | Exploring Human Perceptions of AI Responses: Insights from a Mixed-Methods Study on Risk Mitigation in Generative Models | Heloisa Candello et.al. | 2512.01892 | null |
| 2025-12-01 | Tahr: The Generative Attribute Grammar Framework | Matteo Ciccaglione et.al. | 2512.01872 | null |
| 2025-12-01 | Deconstructing Generative Diversity: An Information Bottleneck Analysis of Discrete Latent Generative Models | Yudi Wu et.al. | 2512.01831 | null |
| 2025-12-01 | Dimension-free error estimate for diffusion model and optimal scheduling | Valentin de Bortoli et.al. | 2512.01820 | null |
| 2025-12-01 | Much Ado About Noising: Dispelling the Myths of Generative Robotic Control | Chaoyi Pan et.al. | 2512.01809 | null |
| 2025-12-01 | Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos | Xavier Thomas et.al. | 2512.01803 | null |
| 2025-11-28 | Object-Centric Data Synthesis for Category-level Object Detection | Vikhyat Agarwal et.al. | 2511.23450 | null |
| 2025-11-28 | MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation | Mahdi Rahmani et.al. | 2511.23397 | null |
| 2025-11-28 | Identifying bars in galaxies using machine learning | Rajit Shrivastava et.al. | 2511.23383 | null |
| 2025-11-28 | Flow Straighter and Faster: Efficient One-Step Generative Modeling via MeanFlow on Rectified Trajectories | Xinxi Zhang et.al. | 2511.23342 | null |
| 2025-11-28 | Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach | Shuqi Liu et.al. | 2511.23335 | null |
| 2025-11-28 | Synthetic Industrial Object Detection: GenAI vs. Feature-Based Methods | Jose Moises Araya-Martinez et.al. | 2511.23241 | null |
| 2025-11-28 | db-SP: Accelerating Sparse Attention for Visual Generative Models with Dual-Balanced Sequence Parallelism | Siqi Chen et.al. | 2511.23113 | null |
| 2025-11-28 | Evaluating the Clinical Impact of Generative Inpainting on Bone Age Estimation | Felipe Akio Matsuoka et.al. | 2511.23066 | null |
| 2025-11-26 | Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework | Dong Wang et.al. | 2511.21686 | null |
| 2025-11-26 | TAB-DRW: A DFT-based Robust Watermark for Generative Tabular Data | Yizhou Zhao et.al. | 2511.21600 | null |
| 2025-11-26 | Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy | Teng Hu et.al. | 2511.21579 | null |
| 2025-11-26 | Guiding Generative Models for Protein Design: Prompting, Steering and Aligning | Filippo Stocco et.al. | 2511.21476 | null |
| 2025-11-26 | Ensemble Performance Through the Lens of Linear Independence of Classifier Votes in Data Streams | Enes Bektas et.al. | 2511.21465 | null |
| 2025-11-26 | TSGM: Regular and Irregular Time-series Generation using Score-based Generative Models | Haksoo Lim et.al. | 2511.21335 | null |
| 2025-11-25 | MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models | Chieh-Yun Chen et.al. | 2511.20629 | null |
| 2025-11-25 | Copyright Detection in Large Language Models: An Ethical Approach to Generative AI Development | David Szczecina et.al. | 2511.20623 | null |
| 2025-11-25 | Anatomica: Localized Control over Geometric and Topological Properties for Anatomical Diffusion Models | Karim Kadry et.al. | 2511.20587 | null |
| 2025-11-25 | Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward | Yuwei Niu et.al. | 2511.20561 | null |
| 2025-11-25 | AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs | Kuniaki Saito et.al. | 2511.20515 | null |
| 2025-11-25 | FRAGMENTA: End-to-end Fragmentation-based Generative Model with Agentic Tuning for Drug Lead Optimization | Yuto Suzuki et.al. | 2511.20510 | null |
| 2025-11-25 | Generative Modeling with Manifold Percolation | Rui Tong et.al. | 2511.20503 | null |
| 2025-11-25 | Quantifying the Privacy Implications of High-Fidelity Synthetic Network Traffic | Van Tran et.al. | 2511.20497 | null |
| 2025-11-25 | Efficient and Fast Generative-Based Singing Voice Separation using a Latent Diffusion Model | Genís Plaja-Roglans et.al. | 2511.20470 | null |
| 2025-11-25 | STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow | Jiatao Gu et.al. | 2511.20462 | null |
| 2025-11-25 | Diffusion for Fusion: Designing Stellarators with Generative AI | Misha Padidar et.al. | 2511.20445 | null |
| 2025-11-24 | In-Video Instructions: Visual Signals as Generative Control | Gongfan Fang et.al. | 2511.19401 | null |
| 2025-11-24 | Historical Reconstruction of Solar Surface Magnetism from Cycle 1-24 Using the Synthetic Active Region Generator (SARG) and the Advective Flux Transport (AFT) Model | Bibhuti Kumar Jha et.al. | 2511.19371 | null |
| 2025-11-24 | Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning | Qihan Huang et.al. | 2511.19343 | null |
| 2025-11-24 | Targeted Manipulation: Slope-Based Attacks on Financial Time-Series Data | Dominik Luszczynski et.al. | 2511.19330 | null |
| 2025-11-24 | Automated RF Phase Adjustment for Beam Stabilization in the Fermilab Linac | R. R. Chichili et.al. | 2511.19141 | null |
| 2025-11-21 | Designing and Generating Diverse, Equitable Face Image Datasets for Face Verification Tasks | Georgia Baltsou et.al. | 2511.17393 | null |
| 2025-11-21 | A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback | Bulat Khaertdinov et.al. | 2511.17255 | null |
| 2025-11-21 | Dual-domain Adaptation Networks for Realistic Image Super-resolution | Chaowei Fang et.al. | 2511.17217 | null |
| 2025-11-21 | PostCam: Camera-Controllable Novel-View Video Generation with Query-Shared Cross-Attention | Yipeng Chen et.al. | 2511.17185 | null |
| 2025-11-21 | Toward Sustainable Generative AI: A Scoping Review of Carbon Footprint and Environmental Impacts Across Training and Inference Stages | Min-Kyu Kim et.al. | 2511.17179 | null |
| 2025-11-21 | Towards Generative Design Using Optimal Transport for Shape Exploration and Solution Field Interpolation | Sergio Torregrosa et.al. | 2511.17111 | null |
| 2025-11-21 | Modeling memory in time-respecting paths on temporal networks | Silvia Guerrini et.al. | 2511.17108 | null |
| 2025-11-20 | Dataset Distillation for Pre-Trained Self-Supervised Vision Models | George Cazenavette et.al. | 2511.16674 | null |
| 2025-11-20 | V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models | Yang Luo et.al. | 2511.16668 | null |
| 2025-11-20 | InternData-A1: Pioneering High-Fidelity Synthetic Data for Pre-training Generalist Policy | Yang Tian et.al. | 2511.16651 | null |
| 2025-11-20 | SAM 3D: 3Dfy Anything in Images | SAM 3D Team et.al. | 2511.16624 | null |
| 2025-11-20 | gfnx: Fast and Scalable Library for Generative Flow Networks in JAX | Daniil Tiapkin et.al. | 2511.16592 | null |
| 2025-11-20 | The Oracle and The Prism: A Decoupled and Efficient Framework for Generative Recommendation Explanation | Jiaheng Zhang et.al. | 2511.16543 | null |
| 2025-11-20 | Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution | Jaime Álvarez Urueña et.al. | 2511.16541 | null |
| 2025-11-20 | From generative AI to the brain: five takeaways | Claudius Gros et.al. | 2511.16432 | null |
| 2025-11-20 | Generative Modeling of Clinical Time Series via Latent Stochastic Differential Equations | Muhammad Aslanimoghanloo et.al. | 2511.16427 | null |
| 2025-11-20 | Denoising weak lensing mass maps with diffusion model and generative adversarial network | Shohei D. Aoyama et.al. | 2511.16415 | null |
| 2025-11-20 | Reducing Instability in Synthetic Data Evaluation with a Super-Metric in MalDataGen | Anna Luiza Gomes da Silva et.al. | 2511.16373 | null |
| 2025-11-20 | Beyond Generative AI: World Models for Clinical Prediction, Counterfactuals, and Planning | Mohammad Areeb Qazi et.al. | 2511.16333 | null |
| 2025-11-19 | Computer-Use Agents as Judges for Generative User Interface | Kevin Qinghong Lin et.al. | 2511.15567 | null |
| 2025-11-19 | FunnyNodules: A Customizable Medical Dataset Tailored for Evaluating Explainable AI | Luisa Gallée et.al. | 2511.15481 | null |
| 2025-11-19 | Taming Generative Synthetic Data for X-ray Prohibited Item Detection | Jialong Sun et.al. | 2511.15299 | null |
| 2025-11-19 | Insert In Style: A Zero-Shot Generative Framework for Harmonious Cross-Domain Object Composition | Raghu Vamsi Chittersu et.al. | 2511.15197 | null |
| 2025-11-18 | Zero-shot Synthetic Video Realism Enhancement via Structure-aware Denoising | Yifan Wang et.al. | 2511.14719 | null |
| 2025-11-18 | Ground Truth Generation for Multilingual Historical NLP using LLMs | Clovis Gladstone et.al. | 2511.14688 | null |
| 2025-11-18 | Streamlining Industrial Contract Management with Retrieval-Augmented LLMs | Kristi Topollai et.al. | 2511.14671 | null |
| 2025-11-18 | A Controllable Perceptual Feature Generative Model for Melody Harmonization via Conditional Variational Autoencoder | Dengyun Huang et.al. | 2511.14600 | null |
| 2025-11-18 | LiveRAG: A diverse Q&A dataset with varying difficulty level for RAG evaluation | David Carmel et.al. | 2511.14531 | null |
| 2025-11-18 | A Generative Data Framework with Authentic Supervision for Underwater Image Restoration and Enhancement | Yufeng Tian et.al. | 2511.14521 | null |
| 2025-11-18 | Nonparametric estimation of conditional probability distributions using a generative approach based on conditional push-forward neural networks | Nicola Rares Franco et.al. | 2511.14455 | null |
| 2025-11-18 | Infer As You Train: A Symmetric Paradigm of Masked Generative for Click-Through Rate Prediction | Moyu Zhang et.al. | 2511.14403 | null |
| 2025-11-17 | Back to Basics: Let Denoising Generative Models Denoise | Tianhong Li et.al. | 2511.13720 | null |
| 2025-11-17 | TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models | Harold Haodong Chen et.al. | 2511.13704 | null |
| 2025-11-17 | Data Value in the Age of Scaling: Understanding LLM Scaling Dynamics Under Real-Synthetic Data Mixtures | Haohui Wang et.al. | 2511.13640 | null |
| 2025-11-17 | Statistically Accurate and Robust Generative Prediction of Rock Discontinuities with A Tabular Foundation Model | Han Meng et.al. | 2511.13339 | null |
| 2025-11-17 | AutoMalDesc: Large-Scale Script Analysis for Cyber Threat Research | Alexandru-Mihai Apostu et.al. | 2511.13333 | null |
| 2025-11-17 | TacEleven: generative tactic discovery for football open play | Siyao Zhao et.al. | 2511.13326 | null |
| 2025-11-17 | PASE: Leveraging the Phonological Prior of WavLM for Low-Hallucination Generative Speech Enhancement | Xiaobin Rong et.al. | 2511.13300 | null |
| 2025-11-17 | Grounded by Experience: Generative Healthcare Prediction Augmented with Hierarchical Agentic Retrieval | Chuang Zhao et.al. | 2511.13293 | null |
| 2025-11-17 | Examining the Usage of Generative AI Models in Student Learning Activities for Software Programming | Rufeng Chen et.al. | 2511.13271 | null |
| 2025-11-14 | Terrain Costmap Generation via Scaled Preference Conditioning | Luisa Mao et.al. | 2511.11529 | null |
| 2025-11-14 | SynthSoM-Twin: A Multi-Modal Sensing-Communication Digital-Twin Dataset for Sim2Real Transfer via Synesthesia of Machines | Junlong Chen et.al. | 2511.11503 | null |
| 2025-11-14 | From Synthetic Scenes to Real Performance: Enhancing Spatial Reasoning in VLMs | Massimo Rizzoli et.al. | 2511.11440 | null |
| 2025-11-14 | YCB-Ev SD: Synthetic event-vision dataset for 6DoF object pose estimation | Pavel Rojtberg et.al. | 2511.11344 | null |
| 2025-11-14 | How Physics Professors Use and Frame Generative AI Tools | Vidar Skogvoll et.al. | 2511.11317 | null |
| 2025-11-14 | 6D Strawberry Pose Estimation: Real-time and Edge AI Solutions Using Purely Synthetic Training Data | Saptarshi Neil Sinha et.al. | 2511.11307 | null |
| 2025-11-14 | Improving conditional generative adversarial networks for inverse design of plasmonic structures | Petter Persson et.al. | 2511.11279 | null |
| 2025-11-14 | Prompt Engineering vs. Fine-Tuning for LLM-Based Vulnerability Detection in Solana and Algorand Smart Contracts | Biagio Boi et.al. | 2511.11250 | null |
| 2025-11-14 | Arcee: Differentiable Recurrent State Chain for Generative Vision Modeling with Mamba SSMs | Jitesh Chavan et.al. | 2511.11243 | null |
| 2025-11-13 | Pretrained Joint Predictions for Scalable Batch Bayesian Optimization of Molecular Designs | Miles Wang-Henderson et.al. | 2511.10590 | null |
| 2025-11-13 | Don’t Waste It: Guiding Generative Recommenders with Structured Human Priors via Multi-head Decoding | Yunkai Zhang et.al. | 2511.10492 | null |
| 2025-11-13 | BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages | Guduru Manoj et.al. | 2511.10338 | null |
| 2025-11-13 | Bridging Synthetic and Real Routing Problems via LLM-Guided Instance Generation and Progressive Adaptation | Jianghan Zhu et.al. | 2511.10233 | null |
| 2025-11-10 | AdaRec: Adaptive Recommendation with LLMs via Narrative Profiling and Dual-Channel Reasoning | Meiyun Wang et.al. | 2511.07166 | null |
| 2025-11-10 | Guiding Generative Models to Uncover Diverse and Novel Crystals via Reinforcement Learning | Hyunsoo Park et.al. | 2511.07158 | null |
| 2025-11-10 | Conditional Diffusion as Latent Constraints for Controllable Symbolic Music Generation | Matteo Pettenó et.al. | 2511.07156 | null |
| 2025-11-10 | On the Joint Minimization of Regularization Loss Functions in Deep Variational Bayesian Methods for Attribute-Controlled Symbolic Music Generation | Matteo Pettenó et.al. | 2511.07118 | null |
| 2025-11-10 | Llama-Embed-Nemotron-8B: A Universal Text Embedding Model for Multilingual and Cross-Lingual Tasks | Yauhen Babakhin et.al. | 2511.07025 | null |
| 2025-11-10 | Image Restoration via Primal Dual Hybrid Gradient and Flow Generative Model | Ji Li et.al. | 2511.06748 | null |
| 2025-11-10 | Relative Energy Learning for LiDAR Out-of-Distribution Detection | Zizhao Li et.al. | 2511.06720 | null |
| 2025-11-10 | F2GAN: A Feature-Feedback Generative Framework for Reliable AI-Based Fault Diagnosis in Inverter-Dominated Microgrids | Swetha Rani Kasimalla et.al. | 2511.06677 | null |
| 2025-11-10 | Non-Rival Data as Rival Products: An Encapsulation-Forging Approach for Data Synthesis | Kaidong Wang et.al. | 2511.06610 | null |
| 2025-11-09 | Decomate: Leveraging Generative Models for Co-Creative SVG Animation | Jihyeon Park et.al. | 2511.06297 | null |
| 2025-11-09 | Synthetic Data-Driven Prompt Tuning for Financial QA over Tables and Documents | Yaoning Yu et.al. | 2511.06292 | null |
| 2025-11-09 | Breaking the Modality Barrier: Generative Modeling for Accurate Molecule Retrieval from Mass Spectra | Yiwen Zhang et.al. | 2511.06259 | null |
| 2025-11-09 | Gait Recognition via Collaborating Discriminative and Generative Diffusion Models | Haijun Xiong et.al. | 2511.06245 | null |
| 2025-11-08 | Adapting Web Agents with Synthetic Supervision | Zhaoyang Wang et.al. | 2511.06101 | null |
| 2025-11-08 | Identity Card Presentation Attack Detection: A Systematic Review | Esteban M. Ruiz et.al. | 2511.06056 | null |
| 2025-11-08 | CGCE: Classifier-Guided Concept Erasure in Generative Models | Viet Nguyen et.al. | 2511.05865 | null |
| 2025-11-07 | Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale | David Acuna et.al. | 2511.05705 | null |
| 2025-11-07 | Associative Poisoning to Generative Machine Learning | Mathias Lundteigen Mohus et.al. | 2511.05177 | null |
| 2025-11-07 | Role-SynthCLIP: A Role Play Driven Diverse Synthetic Data Approach | Yuanxiang Huangfu et.al. | 2511.05057 | null |
| 2025-11-07 | Challenges in 3D Data Synthesis for Training Neural Networks on Topological Features | Dylan Peek et.al. | 2511.04972 | null |
| 2025-11-06 | Generate, Evaluate, Iterate: Synthetic Data for Human-in-the-Loop Refinement of LLM Judges | Hyo Jin Do et.al. | 2511.04478 | null |
| 2025-11-06 | Towards Causal Market Simulators | Dennis Thumm et.al. | 2511.04469 | null |
| 2025-11-06 | Deep learning-based object detection of offshore platforms on Sentinel-1 Imagery and the impact of synthetic training data | Robin Spanier et.al. | 2511.04304 | null |
| 2025-11-06 | Proto-LeakNet: Towards Signal-Leak Aware Attribution in Synthetic Human Face Imagery | Claudio Giusti et.al. | 2511.04260 | null |
| 2025-11-06 | Learning to Land Anywhere: Transferable Generative Models for Aircraft Trajectories | Olav Finne Praesteng Larsen et.al. | 2511.04155 | null |
| 2025-11-06 | Room Envelopes: A Synthetic Dataset for Indoor Layout Reconstruction from Images | Sam Bahrami et.al. | 2511.03970 | null |
| 2025-11-05 | Integrating Score-Based Generative Modeling and Neural ODEs for Accurate Representation of Multiscale Chaotic Dynamics | Giulio Del Felice et.al. | 2511.03862 | null |
| 2025-11-05 | Human Mesh Modeling for Anny Body | Romain Brégier et.al. | 2511.03589 | null |
| 2025-11-05 | Generative Artificial Intelligence in Bioinformatics: A Systematic Review of Models, Applications, and Methodological Advances | Riasad Alvi et.al. | 2511.03354 | null |
| 2025-11-05 | From Insight to Exploit: Leveraging LLM Collaboration for Adaptive Adversarial Text Generation | Najrin Sultana et.al. | 2511.03128 | null |
| 2025-11-04 | Discrete Bayesian Sample Inference for Graph Generation | Ole Petersen et.al. | 2511.03015 | null |
| 2025-11-04 | A Non-Adversarial Approach to Idempotent Generative Modelling | Mohammed Al-Jaff et.al. | 2511.02614 | null |
| 2025-11-04 | DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire Understanding | Zixuan Liu et.al. | 2511.02495 | null |
| 2025-11-04 | A New Perspective on Precision and Recall for Generative Models | Benjamin Sykes et.al. | 2511.02414 | null |
| 2025-11-04 | From Models to Operators: Rethinking Autoscaling Granularity for Large Generative Models | Xingqi Cui et.al. | 2511.02248 | null |
| 2025-11-04 | Language-Enhanced Generative Modeling for PET Synthesis from MRI and Blood Biomarkers | Zhengjie Zhang et.al. | 2511.02206 | null |
| 2025-11-04 | DoFlow: Causal Generative Flows for Interventional and Counterfactual Time-Series Prediction | Dongze Wu et.al. | 2511.02137 | null |
| 2025-11-03 | Quantum-Enhanced Generative Models for Rare Event Prediction | M. Z. Haider et.al. | 2511.02042 | null |
| 2025-11-03 | The Born Ultimatum: Conditions for Classical Surrogation of Quantum Generative Models with Correlators | Mario Herrero-Gonzalez et.al. | 2511.01845 | null |
| 2025-11-03 | GenDexHand: Generative Simulation for Dexterous Hands | Feng Chen et.al. | 2511.01791 | null |
| 2025-11-03 | Game-theoretic distributed learning of generative models for heterogeneous data collections | Dmitrij Schlesinger et.al. | 2511.01740 | null |
| 2025-11-03 | Generative Adversarial Synthesis and Deep Feature Discrimination of Brain Tumor MRI Images | Md Sumon Ali et.al. | 2511.01574 | null |
| 2025-11-03 | NSYNC: Negative Synthetic Image Generation for Contrastive Training to Improve Stylized Text-To-Image Translation | Serkan Ozturk et.al. | 2511.01517 | null |
| 2025-11-03 | UniREditBench: A Unified Reasoning-based Image Editing Benchmark | Feng Han et.al. | 2511.01295 | null |
| 2025-11-03 | Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play | Jiatong Shi et.al. | 2511.01261 | null |
| 2025-11-02 | Feedback-driven Retrieval-augmented Audio Generation with Large Audio Language Models | Junqi Zhao et.al. | 2511.01091 | null |
| 2025-11-02 | SliceVision-F2I: A Synthetic Feature-to-Image Dataset for Visual Pattern Representation on Network Slices | Md. Abid Hasan Rafi et.al. | 2511.01087 | null |
| 2025-11-02 | Using Synthetic Data to estimate the True Error is theoretically and practically doable | Hai Hoang Thanh et.al. | 2511.00964 | null |
| 2025-11-04 | Deep Generative Models for Enhanced Vitreous OCT Imaging | Simone Sarrocco et.al. | 2511.00881 | null |
| 2025-11-01 | Sensitivity Analysis for Climate Science with Generative Flow Models | Alex Dobra et.al. | 2511.00663 | null |
| 2025-10-31 | A Technical Exploration of Causal Inference with Hybrid LLM Synthetic Data | Dana Kim et.al. | 2511.00318 | null |
| 2025-10-31 | Generative Modeling Enables Molecular Structure Retrieval from Coulomb Explosion Imaging | Xiang Li et.al. | 2511.00179 | null |
| 2025-10-31 | GeoFM: Enhancing Geometric Reasoning of MLLMs via Synthetic Data Generation through Formal Language | Yuhao Zhang et.al. | 2510.27448 | null |
| 2025-10-31 | Generative Semantic Coding for Ultra-Low Bitrate Visual Communication and Analysis | Weiming Chen et.al. | 2510.27324 | null |
| 2025-10-31 | Disrupting Networks: Amplifying Social Dissensus via Opinion Perturbation and Large Language Models | Erica Coppolillo et.al. | 2510.27152 | null |
| 2025-11-03 | Generative diffusion modeling protocols for improving the Kikuchi pattern indexing in electron back-scatter diffraction | Meghraj Prajapat et.al. | 2510.26907 | null |
| 2025-10-30 | BI-DCGAN: A Theoretically Grounded Bayesian Framework for Efficient and Diverse GANs | Mahsa Valizadeh et.al. | 2510.26892 | null |
| 2025-10-30 | Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench | Fenfen Lin et.al. | 2510.26865 | null |
| 2025-10-30 | OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes | Yukun Huang et.al. | 2510.26800 | null |
| 2025-10-30 | FlowQ-Net: A Generative Framework for Automated Quantum Circuit Design | Jun Dai et.al. | 2510.26688 | null |
| 2025-10-30 | Generative sampling with physics-informed kernels | Friederike Ihssen et.al. | 2510.26678 | null |
| 2025-10-30 | Metacognition and Confidence Dynamics in Advice Taking from Generative AI | Clara Colombatto et.al. | 2510.26508 | null |
| 2025-10-30 | UniTok-Audio: A Unified Audio Generation Framework via Generative Modeling on Discrete Codec Tokens | Chengwei Liu et.al. | 2510.26372 | null |
| 2025-10-30 | MisSynth: Improving MISSCI Logical Fallacies Classification with Synthetic Data | Mykhailo Poliakov et.al. | 2510.26345 | null |
| 2025-10-30 | Likely Interpolants of Generative Models | Frederik Möbius Rygaard et.al. | 2510.26266 | null |
| 2025-10-30 | New Money: A Systematic Review of Synthetic Data Generation for Finance | James Meldrum et.al. | 2510.26076 | null |
| 2025-10-30 | Bias-Corrected Data Synthesis for Imbalanced Learning | Pengfei Lyu et.al. | 2510.26046 | null |
| 2025-10-29 | Generative Image Restoration and Super-Resolution using Physics-Informed Synthetic Data for Scanning Tunneling Microscopy | Nikola L. Kolev et.al. | 2510.25921 | null |
| 2025-10-29 | A Survey on Efficient Large Language Model Training: From Data-centric Perspectives | Junyu Luo et.al. | 2510.25817 | null |
| 2025-10-28 | SHA-256 Infused Embedding-Driven Generative Modeling of High-Energy Molecules in Low-Data Regimes | Siddharth Verma et.al. | 2510.25788 | null |
| 2025-10-29 | E-Scores for (In)Correctness Assessment of Generative Model Outputs | Guneet S. Dhillon et.al. | 2510.25770 | null |
| 2025-10-29 | Distributional Evaluation of Generative Models via Relative Density Ratio | Yuliang Xu et.al. | 2510.25507 | null |
| 2025-10-31 | TempoPFN: Synthetic Pre-training of Linear RNNs for Zero-shot Time Series Forecasting | Vladyslav Moroshan et.al. | 2510.25502 | null |
| 2025-10-29 | Generative Bayesian Optimization: Generative Models as Acquisition Functions | Rafael Oliveira et.al. | 2510.25240 | null |
| 2025-10-29 | Scaling Cultural Resources for Improving Generative Models | Hayk Stepanyan et.al. | 2510.25167 | null |
| 2025-10-29 | Target-Guided Bayesian Flow Networks for Quantitatively Constrained CAD Generation | Wenhao Zheng et.al. | 2510.25163 | null |
| 2025-10-28 | VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos | Qiucheng Wu et.al. | 2510.24904 | null |
| 2025-10-28 | A Parameter-Efficient Multi-Scale Convolutional Adapter for Synthetic Speech Detection | Yassine El Kheir et.al. | 2510.24852 | null |
| 2025-10-28 | AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis | Xuanzhong Chen et.al. | 2510.24695 | null |
| 2025-10-28 | OrchDAG: Complex Tool Orchestration in Multi-Turn Interactions with Plan DAGs | Yifu Lu et.al. | 2510.24663 | null |
| 2025-10-28 | A Comprehensive Evaluation Framework for Synthetic Trip Data Generation in Public Transport | Yuanyuan Wu et.al. | 2510.24375 | null |
| 2025-10-28 | Bayesian Speech synthesizers Can Learn from Multiple Teachers | Ziyang Zhang et.al. | 2510.24372 | null |
| 2025-10-28 | PRIVET: Privacy Metric Based on Extreme Value Theory | Antoine Szatkownik et.al. | 2510.24233 | null |
| 2025-10-28 | Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio Generation | Kang Zhang et.al. | 2510.24103 | null |
| 2025-10-28 | Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification | William Yang et.al. | 2510.24078 | null |
| 2025-10-28 | Score-based constrained generative modeling via Langevin diffusions with boundary conditions | Adam Nordenhög et.al. | 2510.23985 | null |
| 2025-10-27 | Learning Interpretable Features in Audio Latent Spaces via Sparse Autoencoders | Nathan Paek et.al. | 2510.23802 | null |
| 2025-10-27 | Robust and Generalizable Background Subtraction on Images of Calorimeter Jets using Unsupervised Generative Learning | Yeonju Go et.al. | 2510.23717 | null |
| 2025-10-27 | Variational Masked Diffusion Models | Yichi Zhang et.al. | 2510.23606 | null |
| 2025-10-27 | RobotArena $\infty$ : Scalable Robot Benchmarking via Real-to-Sim Translation | Yash Jangir et.al. | 2510.23571 | null |
| 2025-10-27 | An Efficient Remote Sensing Super Resolution Method Exploring Diffusion Priors and Multi-Modal Constraints for Crop Type Mapping | Songxi Yang et.al. | 2510.23382 | null |
| 2025-10-27 | Model-Behavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isn’t the Right One | Itamar Avitan et.al. | 2510.23321 | null |
| 2025-10-27 | Increasing LLM Coding Capabilities through Diverse Synthetic Coding Tasks | Amal Abed et.al. | 2510.23208 | null |
| 2025-10-27 | On the Anisotropy of Score-Based Generative Models | Andreas Floros et.al. | 2510.22899 | null |
| 2025-10-26 | A Comprehensive Dataset for Human vs. AI Generated Text Detection | Rajarshi Roy et.al. | 2510.22874 | null |
| 2025-10-26 | SAO-Instruct: Free-form Audio Editing using Natural Language Instructions | Michael Ungersböck et.al. | 2510.22795 | null |
| 2025-10-26 | Semi-Supervised Learning under General Causal Models | Archer Moore et.al. | 2510.22567 | null |
| 2025-10-25 | GigaEmbeddings: Efficient Russian Language Embedding Model | Egor Kolodin et.al. | 2510.22369 | null |
| 2025-10-24 | Linearized Optimal Transport for Analysis of High-Dimensional Point-Cloud and Single-Cell Data | Tianxiang Wang et.al. | 2510.22033 | null |
| 2025-10-24 | Embedding Trust: Semantic Isotropy Predicts Nonfactuality in Long-Form Text Generation | Dhrupad Bhardwaj et.al. | 2510.21891 | null |
| 2025-10-23 | Generative AI in Depth: A Survey of Recent Advances, Model Variants, and Real-World Applications | Shamim Yazdani et.al. | 2510.21887 | null |
| 2025-10-24 | Generative Correlation Manifolds: Generating Synthetic Data with Preserved Higher-Order Correlations | Jens E. d’Hondt et.al. | 2510.21610 | null |
| 2025-10-24 | Generalised Flow Maps for Few-Step Generative Modelling on Riemannian Manifolds | Oscar Davis et.al. | 2510.21608 | null |
| 2025-10-24 | S3OD: Towards Generalizable Salient Object Detection with Synthetic Data | Orest Kupyn et.al. | 2510.21605 | null |
| 2025-10-24 | Are These Even Words? Quantifying the Gibberishness of Generative Speech Models | Danilo de Oliveira et.al. | 2510.21317 | null |
| 2025-10-24 | Text-Guided Diffusion Model-based Generative Communication for Wireless Image Transmission | Shengkang Chen et.al. | 2510.21299 | null |
| 2025-10-24 | Robust Distortion-Free Watermark for Autoregressive Audio Generation Models | Yihan Wu et.al. | 2510.21115 | null |
| 2025-10-23 | Amortized Active Generation of Pareto Sets | Daniel M. Steinberg et.al. | 2510.21052 | null |
| 2025-10-23 | Can Current Detectors Catch Face-to-Voice Deepfake Attacks? | Nguyen Linh Bao Nguyen et.al. | 2510.21004 | null |
| 2025-10-23 | CUPID: Pose-Grounded Generative 3D Reconstruction from a Single Image | Binbin Huang et.al. | 2510.20776 | null |
| 2025-10-24 | EditInfinity: Image Editing with Binary-Quantized Generative Models | Jiahuan Wang et.al. | 2510.20217 | null |
| 2025-10-22 | Learning and Simulating Building Evacuation Patterns for Enhanced Safety Design Using Generative Models | Jin Han et.al. | 2510.19623 | null |
| 2025-10-22 | The Intricate Dance of Prompt Complexity, Quality, Diversity, and Consistency in T2I Models | Xiaofeng Zhang et.al. | 2510.19557 | link |
| 2025-10-22 | Predicting before Reconstruction: A generative prior framework for MRI acceleration | Juhyung Park et.al. | 2510.19472 | null |
| 2025-10-22 | Unified Reinforcement and Imitation Learning for Vision-Language Models | Byung-Kwan Lee et.al. | 2510.19307 | null |
| 2025-10-22 | Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall | Mingyu Jo et.al. | 2510.19304 | null |
| 2025-10-24 | Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks | Kai Zeng et.al. | 2510.19195 | null |
| 2025-10-21 | Improving the Generation and Evaluation of Synthetic Data for Downstream Medical Causal Inference | Harry Amad et.al. | 2510.18768 | null |
| 2025-10-21 | ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization | Yuanhe Guo et.al. | 2510.18433 | null |
| 2025-10-21 | GPTFace: Generative Pre-training of Facial-Linguistic Transformer by Span Masking and Weakly Correlated Text-image Data | Yudong Li et.al. | 2510.18345 | null |
| 2025-10-21 | Towards Identifiability of Hierarchical Temporal Causal Representation Learning | Zijian Li et.al. | 2510.18310 | null |
| 2025-10-21 | ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive Text-to-Speech Generation | Haowei Lou et.al. | 2510.18308 | null |
| 2025-10-21 | Efficient Few-shot Identity Preserving Attribute Editing for 3D-aware Deep Generative Models | Vishal Vinod et.al. | 2510.18287 | null |
| 2025-10-21 | ACTG-ARL: Differentially Private Conditional Text Generation with RL-Boosted Control | Yuzheng Hu et.al. | 2510.18232 | null |
| 2025-10-20 | Gradient Variance Reveals Failure Modes in Flow-Based Generative Models | Teodora Reu et.al. | 2510.18118 | null |
| 2025-10-20 | Fine-tuning Flow Matching Generative Models with Intermediate Feedback | Jiajun Fan et.al. | 2510.18072 | null |
| 2025-10-20 | Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models | Jiajun Fan et.al. | 2510.18053 | null |
| 2025-10-20 | EvoSyn: Generalizable Evolutionary Data Synthesis for Verifiable Learning | He Du et.al. | 2510.17928 | null |
| 2025-10-20 | QueST: Incentivizing LLMs to Generate Difficult Problems | Hanxu Hu et.al. | 2510.17715 | null |
| 2025-10-20 | Quantum Synthetic Data Generation for Industrial Bioprocess Monitoring | Shawn M. Gibford et.al. | 2510.17688 | null |
| 2025-10-20 | Integrating BIM and UAV-based photogrammetry for Automated 3D Structure Model Segmentation | Siqi Chen et.al. | 2510.17609 | null |
| 2025-10-20 | Towards geological inference with process-based and deep generative modeling, part 2: inversion of fluvial deposits and latent-space disentanglement | Guillaume Rongier et.al. | 2510.17478 | null |
| 2025-10-20 | Optimal transport by a Lagrangian dynamics of population distribution | Babak Benam et.al. | 2510.17193 | null |
| 2025-10-19 | Hephaestus: Mixture Generative Modeling with Energy Guidance for Large-scale QoS Degradation | Nguyen Do et.al. | 2510.17036 | null |
| 2025-10-19 | Conditional Synthetic Live and Spoof Fingerprint Generation | Syed Konain Abbas et.al. | 2510.17035 | null |
| 2025-10-19 | Towards Real-Time Generative Speech Restoration with Flow-Matching | Tsun-An Hsieh et.al. | 2510.16997 | null |
| 2025-10-19 | Differentially Private Linear Regression and Synthetic Data Generation with Statistical Guarantees | Shurong Lin et.al. | 2510.16974 | null |
| 2025-10-19 | Class-N-Diff: Classification-Induced Diffusion Model Can Make Fair Skin Cancer Diagnosis | Nusrat Munia et.al. | 2510.16887 | null |
| 2025-10-19 | U-Codec: Ultra Low Frame-rate Neural Speech Codec for Fast High-fidelity Speech Generation | Xusheng Yang et.al. | 2510.16718 | null |
| 2025-10-18 | Escaping Model Collapse via Synthetic Data Verification: Near-term Improvements and Long-term Convergence | Bingji Yi et.al. | 2510.16657 | null |
| 2025-10-18 | Accelerated Learning on Large Scale Screens using Generative Library Models | Eli N. Weinstein et.al. | 2510.16612 | null |
| 2025-10-17 | AtomBench: A Benchmark for Generative Atomic Structure Models using GPT, Diffusion, and Flow Architectures | Charles Rhys Campbell et.al. | 2510.16165 | null |
| 2025-10-16 | Membership Inference over Diffusion-models-based Synthetic Tabular Data | Peini Cheng et.al. | 2510.16037 | null |
| 2025-10-17 | GENESIS: A Generative Model of Episodic-Semantic Interaction | Marco D’Alessandro et.al. | 2510.15828 | null |
| 2025-10-16 | Deep generative priors for 3D brain analysis | Ana Lawry Aguila et.al. | 2510.15119 | null |
| 2025-10-16 | AlignFlow: Improving Flow-based Generative Models with Semi-Discrete Optimal Transport | Lingkai Kong et.al. | 2510.15038 | null |
| 2025-10-16 | Harmonizing Diverse Models: A Layer-wise Merging Strategy for Consistent Generation | Xujun Peng et.al. | 2510.14915 | null |
| 2025-10-16 | FraQAT: Quantization Aware Training with Fractional bits | Luca Morreale et.al. | 2510.14823 | null |
| 2025-10-16 | SpeechLLM-as-Judges: Towards General and Interpretable Speech Quality Evaluation | Hui Wang et.al. | 2510.14664 | null |
| 2025-10-16 | Generative Models From and For Sampling-Based MPC: A Bootstrapped Approach For Adaptive Contact-Rich Manipulation | Lara Brudermüller et.al. | 2510.14643 | null |
| 2025-10-16 | Unsupervised Deep Generative Models for Anomaly Detection in Neuroimaging: A Systematic Scoping Review | Youwan Mahé et.al. | 2510.14462 | null |
| 2025-10-16 | Towards geological inference with process-based and deep generative modeling, part 1: training on fluvial deposits | Guillaume Rongier et.al. | 2510.14445 | null |
| 2025-10-16 | Qwen3Guard Technical Report | Haiquan Zhao et.al. | 2510.14276 | null |
| 2025-10-15 | Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms | Shrey Pandit et.al. | 2510.13913 | null |
| 2025-10-13 | Joint Discriminative-Generative Modeling via Dual Adversarial Training | Xuwang Yin et.al. | 2510.13872 | null |
| 2025-10-15 | Assessing the Geographic Generalization and Physical Consistency of Generative Models for Climate Downscaling | Carlo Saccardi et.al. | 2510.13722 | null |
| 2025-10-15 | Manifold Decoders: A Framework for Generative Modeling from Nonlinear Embeddings | Riddhish Thakare et.al. | 2510.13622 | null |
| 2025-10-15 | FreshTab: Sourcing Fresh Data for Table-to-Text Generation Evaluation | Kristýna Onderková et.al. | 2510.13598 | null |
| 2025-10-15 | Reinforcement Learning Meets Masked Generative Models: Mask-GRPO for Text-to-Image Generation | Yifu Luo et.al. | 2510.13418 | null |
| 2025-10-15 | UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE | Zhenyu Liu et.al. | 2510.13344 | null |
| 2025-10-15 | Federated Conditional Conformal Prediction via Generative Models | Rui Xu et.al. | 2510.13297 | null |
| 2025-10-15 | Generative model for information metamaterial design | Jun Ming Hou et.al. | 2510.13264 | null |
| 2025-10-15 | NeuroRVQ: Multi-Scale EEG Tokenization for Generative Large Brainwave Models | Konstantinos Barmpas et.al. | 2510.13068 | null |
| 2025-10-16 | Adapting Noise to Data: Generative Flows from 1D Processes | Jannis Chemseddine et.al. | 2510.12636 | null |
| 2025-10-14 | Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training | Jiachen Lei et.al. | 2510.12586 | null |
| 2025-10-14 | Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance | Jincheng Zhong et.al. | 2510.12497 | null |
| 2025-10-14 | Continuous Uniqueness and Novelty Metrics for Generative Modeling of Inorganic Crystals | Masahiro Negishi et.al. | 2510.12405 | null |
| 2025-10-14 | Generative Diffusion Model DiffCrysGen Discovers Rare Earth-Free Magnetic Materials | Sourav Mal et.al. | 2510.12329 | null |
| 2025-10-14 | Beating Harmful Stereotypes Through Facts: RAG-based Counter-speech Generation | Greta Damo et.al. | 2510.12316 | null |
| 2025-10-14 | GOAT: A Training Framework for Goal-Oriented Agent with Tools | Hyunji Min et.al. | 2510.12218 | null |
| 2025-10-15 | DiSTAR: Diffusion over a Scalable Token Autoregressive Representation for Speech Generation | Yakun Song et.al. | 2510.12210 | null |
| 2025-10-14 | The Impact of Synthetic Data on Object Detection Model Performance: A Comparative Analysis with Real-World Data | Muammer Bay et.al. | 2510.12208 | null |
| 2025-10-14 | Audio Palette: A Diffusion Transformer with Multi-Signal Conditioning for Controllable Foley Synthesis | Junnuo Wang et.al. | 2510.12175 | null |
| 2025-10-14 | Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing | Rongzhi Zhang et.al. | 2510.12121 | null |
| 2025-10-14 | G4Splat: Geometry-Guided Gaussian Splatting with Generative Prior | Junfeng Ni et.al. | 2510.12099 | null |
| 2025-10-14 | Your VAR Model is Secretly an Efficient and Explainable Generative Classifier | Yi-Chung Chen et.al. | 2510.12060 | null |
| 2025-10-13 | UALM: Unified Audio Language Model for Understanding, Generation and Reasoning | Jinchuan Tian et.al. | 2510.12000 | null |
| 2025-10-15 | Y-shaped Generative Flows | Arip Asadulaev et.al. | 2510.11955 | null |
| 2025-10-13 | GRAVITY: A Framework for Personalized Text Generation via Profile-Grounded Synthetic Preferences | Priyanka Dey et.al. | 2510.11952 | null |
| 2025-10-13 | LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens | Armel Zebaze et.al. | 2510.11919 | null |
| 2025-10-13 | Balancing Synthetic Data and Replay for Enhancing Task-Specific Capabilities | Urs Spiegelhalter et.al. | 2510.11842 | null |
| 2025-10-13 | Schrödinger bridge for generative AI: Soft-constrained formulation and convergence analysis | Jin Ma et.al. | 2510.11829 | null |
| 2025-10-13 | OneRec-Think: In-Text Reasoning for Generative Recommendation | Zhanyu Liu et.al. | 2510.11639 | null |
| 2025-10-13 | A Framework for Low-Effort Training Data Generation for Urban Semantic Segmentation | Denis Zavadski et.al. | 2510.11567 | null |
| 2025-10-13 | Offline Reinforcement Learning with Generative Trajectory Policies | Xinsong Feng et.al. | 2510.11499 | null |
| 2025-10-13 | Into the Unknown: Towards using Generative Models for Sampling Priors of Environment Uncertainty for Planning in Configuration Spaces | Subhransu S. Bhattacharjee et.al. | 2510.11014 | null |
| 2025-10-13 | Secret-Protected Evolution for Differentially Private Synthetic Text Generation | Tianze Wang et.al. | 2510.10990 | null |
| 2025-10-13 | IUT-Plug: A Plug-in tool for Interleaved Image-Text Generation | Zeteng Lin et.al. | 2510.10969 | null |
| 2025-10-13 | Comparative Evaluation of Neural Network Architectures for Generalizable Human Spatial Preference Prediction in Unseen Built Environments | Maral Doctorarastoo et.al. | 2510.10954 | null |
| 2025-10-13 | Towards Distribution-Shift Uncertainty Estimation for Inverse Problems with Generative Priors | Namhoon Kim et.al. | 2510.10947 | null |
| 2025-10-13 | Find Your Optimal Teacher: Personalized Data Synthesis via Router-Guided Multi-Teacher Distillation | Hengyuan Zhang et.al. | 2510.10925 | null |
| 2025-10-12 | DISC-GAN: Disentangling Style and Content for Cluster-Specific Synthetic Underwater Image Generation | Sneha Varur et.al. | 2510.10782 | null |
| 2025-10-12 | Controllable Generative Trajectory Prediction via Weak Preference Alignment | Yongxi Cao et.al. | 2510.10731 | null |
| 2025-10-12 | Designing ReLU Generative Networks to Enumerate Trees with a Given Tree Edit Distance | Mamoona Ghafoor et.al. | 2510.10706 | null |
| 2025-10-15 | Trustworthy Retrosynthesis: Eliminating Hallucinations with a Diverse Ensemble of Reaction Scorers | Michal Sadowski et.al. | 2510.10645 | null |
| 2025-10-12 | GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search | Heng Zhang et.al. | 2510.10581 | null |
| 2025-10-12 | Reverse Supervision at Scale: Exponential Search Meets the Economics of Annotation | Masoud Makrehchi et.al. | 2510.10446 | null |
| 2025-10-11 | Generative Modeling of Aerosol State Representations | Ehsan Saleh et.al. | 2510.10361 | null |
| 2025-10-11 | LLM-Friendly Knowledge Representation for Customer Support | Hanchen Su et.al. | 2510.10331 | null |
| 2025-10-11 | Calibrating Generative Models | Henry D. Smith et.al. | 2510.10020 | link |
| 2025-10-11 | Generative Latent Video Compression | Zongyu Guo et.al. | 2510.09987 | null |
| 2025-10-10 | Augmenting generative models with biomedical knowledge graphs improves targeted drug discovery | Aditya Malusare et.al. | 2510.09914 | null |
| 2025-10-10 | Domain Knowledge Infused Generative Models for Drug Discovery Synthetic Data | Bing Hu et.al. | 2510.09837 | null |
| 2025-10-10 | BaNEL: Exploration Posteriors for Generative Modeling Using Only Negative Rewards | Sangyun Lee et.al. | 2510.09596 | null |
| 2025-10-10 | Efficient Autoregressive Inference for Transformer Probabilistic Models | Conor Hassan et.al. | 2510.09477 | null |
| 2025-10-13 | Failure Prediction at Runtime for Generative Robot Policies | Ralf Römer et.al. | 2510.09459 | null |
| 2025-10-10 | A Biophysically-Conditioned Generative Framework for 3D Brain Tumor MRI Synthesis | Valentin Biller et.al. | 2510.09365 | null |
| 2025-10-10 | SOS: Synthetic Object Segments Improve Detection, Segmentation, and Grounding | Weikai Huang et.al. | 2510.09110 | null |
| 2025-10-10 | MCMC: Bridging Rendering, Optimization and Generative AI | Gurprit Singh et.al. | 2510.09078 | null |
| 2025-10-10 | MMAudioSep: Taming Video-to-Audio Generative Model Towards Video/Text-Queried Sound Separation | Akira Takahashi et.al. | 2510.09065 | null |
| 2025-10-10 | O_O-VC: Synthetic Data-Driven One-to-One Alignment for Any-to-Any Voice Conversion | Huu Tuong Tu et.al. | 2510.09061 | null |
| 2025-10-10 | Mirror Flow Matching with Heavy-Tailed Priors for Generative Modeling on Convex Domains | Yunrui Guan et.al. | 2510.08929 | null |
| 2025-10-10 | ControlAudio: Tackling Text-Guided, Timing-Indicated and Intelligible Audio Generation via Progressive Diffusion Modeling | Yuxuan Jiang et.al. | 2510.08878 | null |
| 2025-10-08 | Next Semantic Scale Prediction via Hierarchical Diffusion Language Models | Cai Zhou et.al. | 2510.08632 | null |
| 2025-10-11 | MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization | Xiangyu Zhao et.al. | 2510.08540 | null |
| 2025-10-09 | Universality and kernel-adaptive training for classically trained, quantum-deployed generative models | Andrii Kurkin et.al. | 2510.08476 | null |
| 2025-10-09 | SummDiff: Generative Modeling of Video Summarization with Diffusion | Kwanseok Kim et.al. | 2510.08458 | null |
| 2025-10-09 | ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval | Jianlyu Chen et.al. | 2510.08252 | null |
| 2025-10-09 | Contrastive Decoding for Synthetic Data Generation in Low-Resource Language Modeling | Jannek Ulm et.al. | 2510.08245 | null |
| 2025-10-09 | Detecting and Mitigating Insertion Hallucination in Video-to-Audio Generation | Liyang Chen et.al. | 2510.08078 | null |
| 2025-10-09 | IntMeanFlow: Few-step Speech Generation with Integral Velocity Distillation | Wei Wang et.al. | 2510.07979 | null |
| 2025-10-09 | Comprehensiveness Metrics for Automatic Evaluation of Factual Recall in Text Generation | Adam Dejl et.al. | 2510.07926 | null |
| 2025-10-09 | GeoGen: A Two-stage Coarse-to-Fine Framework for Fine-grained Synthetic Location-based Social Network Trajectory Generation | Rongchao Xu et.al. | 2510.07735 | null |
| 2025-10-09 | SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction | Wenyue Chen et.al. | 2510.07723 | null |
| 2025-10-08 | Transferable Generative Models Bridge Femtosecond to Nanosecond Time-Step Molecular Dynamics | Juan Viguera Diez et.al. | 2510.07589 | null |
| 2025-10-07 | Mitigating Surgical Data Imbalance with Dual-Prediction Video Diffusion Model | Danush Kumar Venkatesh et.al. | 2510.07345 | null |
| 2025-10-08 | A Digital Twin Framework for Metamorphic Testing of Autonomous Driving Systems Using Generative Model | Tony Zhang et.al. | 2510.07133 | null |
| 2025-10-08 | Sharpness-Aware Data Generation for Zero-shot Quantization | Dung Hoang-Anh et.al. | 2510.07018 | null |
| 2025-10-08 | Differentially Private Synthetic Text Generation for Retrieval-Augmented Generation (RAG) | Junki Mori et.al. | 2510.06719 | null |
| 2025-10-08 | XLSR-Kanformer: A KAN-Intergrated model for Synthetic Speech Detection | Phuong Tuan Dat et.al. | 2510.06706 | null |
| 2025-10-08 | Three Forms of Stochastic Injection for Improved Distribution-to-Distribution Generative Modeling | Shiye Su et.al. | 2510.06634 | null |
| 2025-10-08 | AIM 2025 Challenge on Real-World RAW Image Denoising | Feiran Li et.al. | 2510.06601 | null |
| 2025-10-08 | SDQM: Synthetic Data Quality Metric for Object Detection Dataset Evaluation | Ayush Zenith et.al. | 2510.06596 | null |
| 2025-10-07 | Deep Generative Model for Human Mobility Behavior | Ye Hong et.al. | 2510.06473 | null |
| 2025-10-07 | FinLFQA: Evaluating Attributed Text Generation of LLMs in Financial Long-Form Question Answering | Yitao Long et.al. | 2510.06426 | null |
| 2025-10-07 | Controllable Stylistic Text Generation with Train-Time Attribute-Regularized Diffusion | Fan Zhou et.al. | 2510.06386 | null |
| 2025-10-07 | Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models | Jiahao Wang et.al. | 2510.06209 | null |
| 2025-10-07 | Thermodynamic Performance Limits for Score-Based Diffusion Models | Nathan X. Kodama et.al. | 2510.06174 | null |
| 2025-10-07 | Towards Data-Efficient Medical Imaging: A Generative and Semi-Supervised Framework | Mosong Ma et.al. | 2510.06123 | null |
| 2025-10-07 | Carré du champ flow matching: better quality-generalisation tradeoff in generative models | Jacob Bamberger et.al. | 2510.05930 | null |
| 2025-10-07 | FoleyGRAM: Video-to-Audio Generation with GRAM-Aligned Multimodal Encoders | Riccardo Fosco Gramaccioni et.al. | 2510.05829 | null |
| 2025-10-07 | StereoSync: Spatially-Aware Stereo Audio Generation from Video | Christian Marinoni et.al. | 2510.05828 | null |
| 2025-10-07 | Physicochemically Informed Dual-Conditioned Generative Model of T-Cell Receptor Variable Regions for Cellular Therapy | Jiahao Ma et.al. | 2510.05747 | null |
| 2025-10-07 | Redefining Generalization in Visual Domains: A Two-Axis Framework for Fake Image Detection with FusionDetect | Amirtaha Amanzadi et.al. | 2510.05740 | null |
| 2025-10-08 | Scalable In-context Ranking with Generative Models | Nilesh Gupta et.al. | 2510.05396 | null |
| 2025-10-06 | Watch and Learn: Learning to Use Computers from Online Videos | Chan Hee Song et.al. | 2510.04673 | null |
| 2025-10-06 | Forecasting-Based Biomedical Time-series Data Synthesis for Open Data and Robust AI | Youngjoon Lee et.al. | 2510.04622 | null |
| 2025-10-06 | Language Model Based Text-to-Audio Generation: Anti-Causally Aligned Collaborative Residual Transformers | Juncheng Wang et.al. | 2510.04577 | null |
| 2025-10-06 | Quantum generative model on bicycle-sharing system and an application | Fumio Nemoto et.al. | 2510.04512 | null |
| 2025-10-05 | Score-based generative emulation of impact-relevant Earth system model outputs | Shahine Bouabid et.al. | 2510.04358 | null |
| 2025-10-05 | Pitch-Conditioned Instrument Sound Synthesis From an Interactive Timbre Latent Space | Christian Limberg et.al. | 2510.04339 | null |
| 2025-10-05 | Scaling Sequence-to-Sequence Generative Neural Rendering | Shikun Liu et.al. | 2510.04236 | null |
| 2025-10-05 | BLADE: Bias-Linked Adaptive DEbiasing | Piyush Arora et.al. | 2510.04174 | null |
| 2025-10-05 | A Multilingual Framework for Dysarthria: Detection, Severity Classification, Speech-to-Text, and Clean Speech Generation | Ananya Raghu et.al. | 2510.03986 | null |
| 2025-10-04 | Mirage: Unveiling Hidden Artifacts in Synthetic Images with Large Vision-Language Models | Pranav Sharma et.al. | 2510.03840 | null |
| 2025-10-07 | Neon: Negative Extrapolation From Self-Training Improves Image Generation | Sina Alemohammad et.al. | 2510.03597 | null |
| 2025-10-07 | Longitudinal Flow Matching for Trajectory Modeling | Mohammad Mohaiminul Islam et.al. | 2510.03569 | null |
| 2025-10-07 | Synthetic Audio Forensics Evaluation (SAFE) Challenge | Kirill Trapeznikov et.al. | 2510.03387 | null |
| 2025-10-03 | Predicting cell-specific gene expression profile and knockout impact through deep learning | Yongjian He et.al. | 2510.03359 | null |
| 2025-10-03 | Wave-GMS: Lightweight Multi-Scale Generative Model for Medical Image Segmentation | Talha Ahmed et.al. | 2510.03216 | null |
| 2025-10-06 | What Drives Compositional Generalization in Visual Generative Models? | Karim Farid et.al. | 2510.03075 | null |
| 2025-10-03 | SALSA-V: Shortcut-Augmented Long-form Synchronized Audio from Videos | Amir Dellali et.al. | 2510.02916 | null |
| 2025-10-03 | Neural Jump ODEs as Generative Models | Robert A. Crowell et.al. | 2510.02757 | null |
| 2025-10-03 | Automated Constraint Specification for Job Scheduling by Regulating Generative Model with Domain-Specific Representation | Yu-Zhe Shi et.al. | 2510.02679 | null |
| 2025-10-03 | Deep Generative Continual Learning using Functional LoRA: FunLoRA | Victor Enescu et.al. | 2510.02631 | null |
| 2025-10-02 | Beyond Linear Diffusions: Improved Representations for Rare Conditional Generative Modeling | Kulunu Dharmakeerthi et.al. | 2510.02499 | null |
| 2025-10-02 | Orthogonal Procrustes problem preserves correlations in synthetic data | Oussama Ounissi et.al. | 2510.02405 | null |
| 2025-10-02 | Equilibrium Matching: Generative Modeling with Implicit Energy-Based Models | Runqian Wang et.al. | 2510.02300 | null |
| 2025-10-02 | Study on LLMs for Promptagator-Style Dense Retriever Training | Daniel Gwon et.al. | 2510.02241 | null |
| 2025-10-02 | FlexDoc: Parameterized Sampling for Diverse Multilingual Synthetic Documents for Training Document Understanding Models | Karan Dua et.al. | 2510.02133 | null |
| 2025-10-02 | SoundReactor: Frame-level Online Video-to-Audio Generation | Koichi Saito et.al. | 2510.02110 | null |
| 2025-10-04 | NGGAN: Noise Generation GAN Based on the Practical Measurement Dataset for Narrowband Powerline Communications | Ying-Ren Chien et.al. | 2510.01850 | null |
| 2025-10-02 | Sensitivity, Specificity, and Consistency: A Tripartite Evaluation of Privacy Filters for Synthetic Data Generation | Adil Koeken et.al. | 2510.01793 | null |
| 2025-10-02 | A Locally Executable AI System for Improving Preoperative Patient Communication: A Multi-Domain Clinical Evaluation | Motoki Sato et.al. | 2510.01671 | null |
| 2025-10-02 | Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls | Feiyang Kang et.al. | 2510.01631 | null |
| 2025-10-02 | Posterior Collapse as a Phase Transition in Variational Autoencoders | Zhen Li et.al. | 2510.01621 | null |
| 2025-10-02 | TimeGazer: Temporal Modeling of Predictive Gaze Stabilization for AR Interaction | Yaozheng Xia et.al. | 2510.01561 | null |
| 2025-10-01 | Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling | Huangjie Zheng et.al. | 2510.01329 | null |
| 2025-10-01 | MorphGen: Controllable and Morphologically Plausible Generative Cell-Imaging | Berker Demirel et.al. | 2510.01298 | null |
| 2025-10-01 | Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity | Jiayi Zhang et.al. | 2510.01171 | null |
| 2025-10-01 | Fiaingen: A financial time series generative method matching real-world data quality | Jože M. Rožanec et.al. | 2510.01169 | null |
| 2025-10-01 | GRAD: Generative Retrieval-Aligned Demonstration Sampler for Efficient Few-Shot Reasoning | Oussama Gabouj et.al. | 2510.01165 | null |
| 2025-10-01 | Apriel-1.5-15b-Thinker | Shruthan Radhakrishna et.al. | 2510.01141 | null |
| 2025-10-01 | Authentic Discrete Diffusion Model | Xiao Li et.al. | 2510.01047 | null |
| 2025-10-01 | Making, not Taking, the Best of N | Ammar Khairi et.al. | 2510.00931 | null |
| 2025-10-01 | Population Synthesis using Incomplete Information | Tanay Rastogi et.al. | 2510.00859 | null |
| 2025-10-01 | From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling | Yifei Cao et.al. | 2510.00743 | null |
| 2025-10-01 | Inclusive Easy-to-Read Generation for Individuals with Cognitive Impairments | François Ledoyen et.al. | 2510.00691 | null |
| 2025-10-01 | A Geometric Unification of Generative AI with Manifold-Probabilistic Projection Models | Leah Bar et.al. | 2510.00666 | null |
| 2025-10-01 | Facilitating Cognitive Accessibility with LLMs: A Multi-Task Approach to Easy-to-Read Text Generation | François Ledoyen et.al. | 2510.00662 | null |
| 2025-10-01 | MCM-DPO: Multifaceted Cross-Modal Direct Preference Optimization for Alt-text Generation | Jinlan Fu et.al. | 2510.00647 | null |
| 2025-10-01 | PodEval: A Multimodal Evaluation Framework for Podcast Audio Generation | Yujia Xiao et.al. | 2510.00485 | null |
| 2025-09-30 | Nonparametric Identification of Latent Concepts | Yujia Zheng et.al. | 2510.00136 | null |
| 2025-09-30 | Video Object Segmentation-Aware Audio Generation | Ilpo Viertola et.al. | 2509.26604 | null |
| 2025-09-30 | Learning from Hallucinating Critical Points for Navigation in Dynamic Environments | Saad Abdul Ghani et.al. | 2509.26513 | null |
| 2025-09-30 | Data-to-Energy Stochastic Dynamics | Kirill Tamogashev et.al. | 2509.26364 | null |
| 2025-09-30 | Reframing Generative Models for Physical Systems using Stochastic Interpolants | Anthony Zhou et.al. | 2509.26282 | null |
| 2025-09-30 | EnScale: Temporally-consistent multivariate generative downscaling via proper scoring rules | Maybritt Schillinger et.al. | 2509.26258 | null |
| 2025-09-30 | MARS: Audio Generation via Multi-Channel Autoregression on Spectrograms | Eleonora Ristori et.al. | 2509.26007 | null |
| 2025-09-30 | Think Less, Label Better: Multi-Stage Domain-Grounded Synthetic Data Generation for Fine-Tuning Large Language Models in Telecommunications | Chenhua Shi et.al. | 2509.25736 | null |
| 2025-09-30 | CATCH: A Novel Data Synthesis Framework for High Therapy Fidelity and Memory-Driven Planning Chain of Thought in AI Counseling | Mingyu Chen et.al. | 2509.25733 | null |
| 2025-09-30 | Controlled Generation for Private Synthetic Text | Zihao Zhao et.al. | 2509.25729 | null |
| 2025-09-30 | OmniDFA: A Unified Framework for Open Set Synthesis Image Detection and Few-Shot Attribution | Shiyu Wu et.al. | 2509.25682 | null |
| 2025-09-30 | SING-SQL: A Synthetic Data Generation Framework for In-Domain Text-to-SQL Translation | Hasan Alp Caferoğlu et.al. | 2509.25672 | null |
| 2025-09-29 | Coupling Generative Modeling and an Autoencoder with the Causal Bridge | Ruolin Meng et.al. | 2509.25599 | null |
| 2025-09-29 | Understanding Generative Recommendation with Semantic IDs from a Model-scaling View | Jingzhe Liu et.al. | 2509.25522 | null |
| 2025-09-29 | Uncertainty-Aware Generative Oversampling Using an Entropy-Guided Conditional Variational Autoencoder | Amirhossein Zare et.al. | 2509.25334 | null |
| 2025-09-29 | Paired by the Teacher: Turning Unpaired Data into High-Fidelity Pairs for Low-Resource Text Generation | Yen-Ju Lu et.al. | 2509.25144 | null |
| 2025-09-29 | MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech | Chengyao Wang et.al. | 2509.25131 | null |
| 2025-09-29 | Meta-Learning Theory-Informed Inductive Biases using Deep Kernel Gaussian Processes | Bahti Zakirov et.al. | 2509.24919 | null |
| 2025-09-29 | VAGUEGAN: Stealthy Poisoning and Backdoor Attacks on Image Generative Pipelines | Mostafa Mohaimen Akand Faisal et.al. | 2509.24891 | null |
| 2025-09-29 | ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation | Jiuhong Xiao et.al. | 2509.24878 | null |
| 2025-09-29 | Cell2Text: Multimodal LLM for Generating Single-Cell Descriptions from RNA-Seq Data | Oussama Kharouiche et.al. | 2509.24840 | null |
| 2025-09-30 | MarS-FM: Generative Modeling of Molecular Dynamics via Markov State Models | Kacper Kapuśniak et.al. | 2509.24779 | null |
| 2025-09-30 | VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning | Xin Cheng et.al. | 2509.24773 | null |
| 2025-09-29 | Socratic-Zero : Bootstrapping Reasoning via Data-Free Agent Co-evolution | Shaobo Wang et.al. | 2509.24726 | null |
| 2025-09-29 | VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning | Yixuan Zhou et.al. | 2509.24650 | null |
| 2025-09-29 | When Audio Generators Become Good Listeners: Generative Features for Understanding Tasks | Zeyu Xie et.al. | 2509.24635 | null |
| 2025-09-29 | Training-Free Multimodal Guidance for Video to Audio Generation | Eleonora Grassucci et.al. | 2509.24550 | null |
| 2025-09-29 | Alternatives To Next Token Prediction In Text Generation – A Survey | Charlie Wyatt et.al. | 2509.24435 | null |
| 2025-09-29 | RapidMV: Leveraging Spatio-Angular Representations for Efficient and Consistent Text-to-Multi-View Synthesis | Seungwook Kim et.al. | 2509.24410 | null |
| 2025-09-29 | Unsupervised Single-Channel Speech Separation with a Diffusion Prior under Speaker-Embedding Guidance | Runwu Shi et.al. | 2509.24395 | null |
| 2025-09-29 | UniFlow-Audio: Unified Flow Matching for Audio Generation from Omni-Modalities | Xuenan Xu et.al. | 2509.24391 | null |
| 2025-09-29 | Towards Foundation Models for Cryo-ET Subtomogram Analysis | Runmin Jiang et.al. | 2509.24311 | null |
| 2025-09-28 | Define latent spaces by example: optimisation over the outputs of generative models | Samuel Willis et.al. | 2509.23800 | null |
| 2025-09-28 | AudioMoG: Guiding Audio Generation with Mixture-of-Guidance | Junyou Wang et.al. | 2509.23727 | null |
| 2025-09-28 | ReWatch-R1: Boosting Complex Video Reasoning in Large Vision-Language Models through Agentic Data Synthesis | Congzhi Zhang et.al. | 2509.23652 | null |
| 2025-09-28 | From Past To Path: Masked History Learning for Next-Item Prediction in Generative Recommendation | KaiWen Wei et.al. | 2509.23649 | null |
| 2025-09-28 | Disentanglement of Variations with Multimodal Generative Modeling | Yijie Zhang et.al. | 2509.23548 | null |
| 2025-09-27 | Generative Modeling of Shape-Dependent Self-Contact Human Poses | Takehiko Ohkawa et.al. | 2509.23393 | null |
| 2025-09-27 | SynDoc: A Hybrid Discriminative-Generative Framework for Enhancing Synthetic Domain-Adaptive Document Key Information Extraction | Yihao Ding et.al. | 2509.23273 | null |
| 2025-09-27 | OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting | Atakan Topaloglu et.al. | 2509.23258 | null |
| 2025-09-27 | A Generative Model for Controllable Feature Heterophily in Graphs | Haoyu Wang et.al. | 2509.23230 | null |
| 2025-09-27 | Sparse2Dense: A Keypoint-driven Generative Framework for Human Video Compression and Vertex Prediction | Bolin Chen et.al. | 2509.23169 | null |
| 2025-09-27 | Dense associative memory on the Bures-Wasserstein space | Chandan Tankala et.al. | 2509.23162 | null |
| 2025-09-26 | GDR-learners: Orthogonal Learning of Generative Models for Potential Outcomes | Valentyn Melnychuk et.al. | 2509.22953 | null |
| 2025-09-26 | Extract-0: A Specialized Language Model for Document Information Extraction | Henrique Godoy et.al. | 2509.22906 | null |
| 2025-09-26 | ArFake: A Multi-Dialect Benchmark and Baselines for Arabic Spoof-Speech Detection | Mohamed Maged et.al. | 2509.22808 | null |
| 2025-09-26 | Generative Modeling and Decision Fusion for Unknown Event Detection and Classification Using Synchrophasor Data | Yi Hu et.al. | 2509.22795 | null |
| 2025-09-26 | Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance | Luc Boudier et.al. | 2509.22635 | null |
| 2025-09-26 | A Theoretical Analysis of Discrete Flow Matching Generative Models | Maojiang Su et.al. | 2509.22623 | null |
| 2025-09-26 | Transport Based Mean Flows for Generative Modeling | Elaheh Akbari et.al. | 2509.22592 | null |
| 2025-09-26 | ConQuER: Modular Architectures for Control and Bias Mitigation in IQP Quantum Generative Models | Xiaocheng Zou et.al. | 2509.22551 | null |
| 2025-09-26 | Overclocking Electrostatic Generative Models | Daniil Shlenskii et.al. | 2509.22454 | null |
| 2025-09-26 | SurvDiff: A Diffusion Model for Generating Synthetic Data in Survival Analysis | Marie Brockschmidt et.al. | 2509.22352 | null |
| 2025-09-26 | Preventing Model Collapse Under Overparametrization: Optimal Mixing Ratios for Interpolation Learning and Ridge Regression | Anvit Garg et.al. | 2509.22341 | null |
| 2025-09-26 | Accuracy-First Rényi Differential Privacy and Post-Processing Immunity | Ossi Räisä et.al. | 2509.22213 | null |
| 2025-09-26 | High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling | Chao Huang et.al. | 2509.22063 | null |
| 2025-09-26 | Comparative Analysis of GAN and Diffusion for MRI-to-CT translation | Emily Honey et.al. | 2509.22049 | null |
| 2025-09-26 | Text2Move: Text-to-moving sound generation via trajectory prediction and temporal alignment | Yunyi Liu et.al. | 2509.21919 | null |
| 2025-09-26 | UISim: An Interactive Image-Based UI Simulator for Dynamic Mobile Environments | Jiannan Xiang et.al. | 2509.21733 | null |
| 2025-09-25 | HuLA: Prosody-Aware Anti-Spoofing with Multi-Task Learning for Expressive and Emotional Synthetic Speech | Aurosweta Mahapatra et.al. | 2509.21676 | null |
| 2025-09-25 | Guiding Audio Editing with Audio Language Model | Zitong Lan et.al. | 2509.21625 | null |
| 2025-09-25 | QMill: Representative Quantum Data Generation for Quantum Machine Learning Utility | Jason Ludmir et.al. | 2509.21622 | null |
| 2025-09-25 | Federated Flow Matching | Zifan Wang et.al. | 2509.21250 | null |
| 2025-09-25 | MeanSE: Efficient Generative Speech Enhancement with Mean Flows | Jiahe Wang et.al. | 2509.21214 | null |
| 2025-09-25 | Super-resolution of 4D flow MRI through inverse problem explicit solving | Aurélien de Turenne et.al. | 2509.21071 | null |
| 2025-09-25 | Conditionally Whitened Generative Models for Probabilistic Time Series Forecasting | Yanfeng Yang et.al. | 2509.20928 | null |
| 2025-09-25 | FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies | Shuqiao Liang et.al. | 2509.20890 | null |
| 2025-09-25 | Verification Limits Code LLM Training | Srishti Gureja et.al. | 2509.20837 | null |
| 2025-09-25 | Measuring LLM Sensitivity in Transformer-based Tabular Data Synthesis | Maria F. Davila R et.al. | 2509.20768 | null |
| 2025-09-26 | Neptune-X: Active X-to-Maritime Generation for Universal Maritime Object Detection | Yu Guo et.al. | 2509.20745 | null |
| 2025-09-24 | FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models | Amin Karimi Monsefi et.al. | 2509.20624 | null |
| 2025-09-24 | pop-cosmos: Star formation over 12 Gyr from generative modelling of a deep infrared-selected galaxy catalogue | Sinan Deger et.al. | 2509.20430 | null |
| 2025-09-24 | Quasi-Synthetic Riemannian Data Generation for Writer-Independent Offline Signature Verification | Elias N. Zois et.al. | 2509.20420 | null |
| 2025-09-24 | PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation | Chen Wang et.al. | 2509.20358 | null |
| 2025-09-24 | Generative Model Inversion Through the Lens of the Manifold Hypothesis | Xiong Peng et.al. | 2509.20177 | null |
| 2025-09-24 | Discrete Diffusion for Generative Modeling of Text-Aligned Speech Tokens | Pin-Jui Ku et.al. | 2509.20060 | null |
| 2025-09-24 | MultiSoundGen: Video-to-Audio Generation for Multi-Event Scenarios via SlowFast Contrastive Audio-Visual Pretraining and Direct Preference Optimization | Jianxuan Yang et.al. | 2509.19999 | null |
| 2025-09-24 | Learnable Sampler Distillation for Discrete Diffusion Models | Feiyang Fu et.al. | 2509.19962 | null |
| 2025-09-24 | When Words Can’t Capture It All: Towards Video-Based User Complaint Text Generation with Multimodal Video Complaint Dataset | Sarmistha Das et.al. | 2509.19952 | null |
| 2025-09-24 | TABFAIRGDT: A Fast Fair Tabular Data Generator using Autoregressive Decision Trees | Emmanouil Panagiotou et.al. | 2509.19927 | null |
| 2025-09-25 | MAGE: A Coarse-to-Fine Speech Enhancer with Masked Generative Model | The Hieu Pham et.al. | 2509.19881 | null |
| 2025-09-24 | SCORE: Scaling audio generation using Standardized COmposite REwards | Jaemin Jung et.al. | 2509.19831 | null |
| 2025-09-24 | Efficient Speech Watermarking for Speech Synthesis via Progressive Knowledge Distillation | Yang Cui et.al. | 2509.19812 | null |
| 2025-09-25 | StrCGAN: A Generative Framework for Stellar Image Restoration | Shantanusinh Parmar et.al. | 2509.19805 | null |
| 2025-09-24 | EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation | Sen Yang et.al. | 2509.19770 | null |
| 2025-09-24 | SMILES-Inspired Transfer Learning for Quantum Operators in Generative Quantum Eigensolver | Zhi Yin et.al. | 2509.19715 | null |
| 2025-09-24 | Towards Robust In-Context Learning for Medical Image Segmentation via Data Synthesis | Jiesi Hu et.al. | 2509.19711 | null |
| 2025-09-24 | Long-Range Dependence in Financial Markets: Empirical Evidence and Generative Modeling Challenges | Yifan He et.al. | 2509.19663 | null |
| 2025-09-24 | Statistical Parameter Calibration with the Generalized Fluctuation Dissipation Theorem and Generative Modeling | Ludovico T. Giorgini et.al. | 2509.19660 | null |
| 2025-09-23 | TIMED: Adversarial and Autoregressive Refinement of Diffusion-Based Time Series Generation | MohammadReza EskandariNasab et.al. | 2509.19638 | null |
| 2025-09-23 | Frame-Stacked Local Transformers For Efficient Multi-Codebook Speech Generation | Roy Fejgin et.al. | 2509.19592 | null |
| 2025-09-23 | Synthesizing Artifact Dataset for Pixel-level Detection | Dennis Menn et.al. | 2509.19589 | null |
| 2025-09-23 | CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching | Chen Chen et.al. | 2509.19300 | null |
| 2025-09-23 | Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation | Sherwin Bahmani et.al. | 2509.19296 | null |
| 2025-09-23 | Enabling Plant Phenotyping in Weedy Environments using Multi-Modal Imagery via Synthetic and Generated Training Data | Earl Ranario et.al. | 2509.19208 | null |
| 2025-09-23 | GSTM-HMU: Generative Spatio-Temporal Modeling for Human Mobility Understanding | Wenying Luo et.al. | 2509.19135 | null |
| 2025-09-23 | Extractive Fact Decomposition for Interpretable Natural Language Inference in one Forward Pass | Nicholas Popovič et.al. | 2509.18901 | null |
| 2025-09-24 | Scalable Evaluation for Audio Identification via Synthetic Latent Fingerprint Generation | Aditya Bhattacharjee et.al. | 2509.18620 | null |
| 2025-09-22 | Hierarchical Semi-Markov Models with Duration-Aware Dynamics for Activity Sequences | Rohit Dube et.al. | 2509.18414 | null |
| 2025-09-22 | Evaluating the Creativity of LLMs in Persian Literary Text Generation | Armin Tourajmehr et.al. | 2509.18401 | null |
| 2025-10-07 | StereoFoley: Object-Aware Stereo Audio Generation from Video | Tornike Karchkhadze et.al. | 2509.18272 | null |
| 2025-09-22 | Synth-MIA: A Testbed for Auditing Privacy Leakage in Tabular Data Synthesis | Joshua Ward et.al. | 2509.18014 | null |
| 2025-09-22 | Autoregressive-Gaussian Mixture Models: Efficient Generative Modeling of WSS Signals | Kathrin Klein et.al. | 2509.17953 | null |
| 2025-09-22 | Unsupervised Learning and Representation of Mandarin Tonal Categories by a Generative CNN | Kai Schenck et.al. | 2509.17859 | null |
| 2025-09-22 | Semantic and Visual Crop-Guided Diffusion Models for Heterogeneous Tissue Synthesis in Histopathology | Saghir Alfasly et.al. | 2509.17847 | null |
| 2025-09-22 | GEM-T: Generative Tabular Data via Fitting Moments | Miao Li et.al. | 2509.17752 | null |
| 2025-09-23 | A Generative Framework for Personalized Sticker Retrieval | Changjiang Zhou et.al. | 2509.17749 | null |
| 2025-09-22 | PG-CE: A Progressive Generation Dataset with Constraint Enhancement for Controllable Text Generation | Yan Zhuang et.al. | 2509.17669 | null |
| 2025-09-22 | Is It Certainly a Deepfake? Reliability Analysis in Detection & Generation Ecosystem | Neslihan Kose et.al. | 2509.17550 | null |
| 2025-09-22 | Audiobook-CC: Controllable Long-context Speech Generation for Multicast Audiobook | Min Liu et.al. | 2509.17516 | null |
| 2025-09-21 | Echo-Path: Pathology-Conditioned Echo Video Generation | Kabir Hamzah Muhammad et.al. | 2509.17190 | null |
| 2025-09-23 | STAR: Speech-to-Audio Generation via Representation Learning | Zeyu Xie et.al. | 2509.17164 | null |
| 2025-09-21 | ScenGAN: Attention-Intensive Generative Model for Uncertainty-Aware Renewable Scenario Forecasting | Yifei Wu et.al. | 2509.17119 | null |
| 2025-09-21 | Deep Synthetic Cross-Project Approaches for Software Reliability Growth Modeling | Taehyoun Kim et.al. | 2509.16939 | null |
| 2025-09-21 | PRISM: Precision-Recall Informed Data-Free Knowledge Distillation via Generative Diffusion | Xuewan He et.al. | 2509.16897 | null |
| 2025-09-20 | DoubleGen: Debiased Generative Modeling of Counterfactuals | Alex Luedtke et.al. | 2509.16842 | null |
| 2025-09-23 | Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment | Xin Lei Lin et.al. | 2509.16727 | null |
| 2025-09-20 | Semi-Supervised Synthetic Data Generation with Fine-Grained Relevance Control for Short Video Search Relevance Modeling | Haoran Li et.al. | 2509.16717 | null |
| 2025-09-20 | An Octave-based Multi-Resolution CQT Architecture for Diffusion-based Audio Generation | Maurício do V. M. da Costa et.al. | 2509.16603 | null |
| 2025-09-20 | A Novel Metric for Detecting Memorization in Generative Models for Brain MRI Synthesis | Antonio Scardace et.al. | 2509.16582 | null |
| 2025-09-20 | SCAN: Self-Denoising Monte Carlo Annotation for Robust Process Reward Learning | Yuyang Ding et.al. | 2509.16548 | null |
| 2025-09-20 | ChemOrch: Empowering LLMs with Chemical Intelligence via Synthetic Instructions | Yue Huang et.al. | 2509.16543 | link |
| 2025-09-20 | mmExpert: Integrating Large Language Models for Comprehensive mmWave Data Synthesis and Understanding | Yifan Yan et.al. | 2509.16521 | null |
| 2025-09-20 | RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation | Tianyi Yan et.al. | 2509.16500 | null |
| 2025-09-19 | SynthIPD: assumption-lean synthetic individual patient data generation | Zixuan Zhao et.al. | 2509.16466 | null |
| 2025-09-19 | Entropic Causal Inference: Graph Identifiability | Spencer Compton et.al. | 2509.16463 | null |
| 2025-09-19 | Introducing Resizable Region Packing Problem in Image Generation, with a Heuristic Solution | Hrishikesh Sharma et.al. | 2509.16363 | null |
| 2025-09-19 | Guided Sequence-Structure Generative Modeling for Iterative Antibody Optimization | Aniruddh Raghu et.al. | 2509.16357 | null |
| 2025-09-19 | Rethinking Molecule Synthesizability with Chain-of-Reaction | Seul Lee et.al. | 2509.16084 | null |
| 2025-09-19 | Sampling String Vacua Using Generative Models | Moritz Walden et.al. | 2509.16029 | null |
| 2025-09-19 | Fed-PISA: Federated Voice Cloning via Personalized Identity-Style Adaptation | Qi Wang et.al. | 2509.16010 | null |
| 2025-09-19 | On Optimal Steering to Achieve Exact Fairness | Mohit Sharma et.al. | 2509.15759 | null |
| 2025-09-19 | TrueMoE: Dual-Routing Mixture of Discriminative Experts for Synthetic Image Detection | Laixin Zhang et.al. | 2509.15741 | null |
| 2025-09-19 | Toward Medical Deepfake Detection: A Comprehensive Dataset and Novel Method | Shuaibo Li et.al. | 2509.15711 | null |
| 2025-09-19 | Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification | Zinan Lin et.al. | 2509.15591 | null |
| 2025-09-19 | LiteLong: Resource-Efficient Long-Context Data Synthesis for LLMs | Junlong Jia et.al. | 2509.15568 | null |
| 2025-09-19 | Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech | Xinlei Niu et.al. | 2509.15492 | null |
| 2025-09-18 | Discrete Flow-Based Generative Models for Measurement Optimization in Quantum Computing | Isaac L. Huidobro-Meezs et.al. | 2509.15486 | null |
| 2025-09-18 | Efficient Multimodal Dataset Distillation via Generative Models | Zhenghao Zhao et.al. | 2509.15472 | null |
| 2025-09-18 | PILOT: Steering Synthetic Data Generation with Psychological & Linguistic Output Targeting | Caitlin Cisar et.al. | 2509.15447 | null |
| 2025-09-18 | Causal Fingerprints of AI Generative Models | Hui Xu et.al. | 2509.15406 | null |
| 2025-09-18 | Autoguided Online Data Curation for Diffusion Model Training | Valeria Pais et.al. | 2509.15267 | null |
| 2025-09-18 | Emotion-Aware Speech Generation with Character-Specific Voices for Comics | Zhiwen Qian et.al. | 2509.15253 | null |
| 2025-09-18 | Fair-GPTQ: Bias-Aware Quantization for Large Language Models | Irina Proskurina et.al. | 2509.15206 | null |
| 2025-09-18 | Learning Mechanistic Subtypes of Neurodegeneration with a Physics-Informed Variational Autoencoder Mixture Model | Sanduni Pinnawala et.al. | 2509.15124 | null |
| 2025-09-19 | Sea-ing Through Scattered Rays: Revisiting the Image Formation Model for Realistic Underwater Image Generation | Vasiliki Ismiroglou et.al. | 2509.15011 | null |
| 2025-09-20 | SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding | Bingsong Bai et.al. | 2509.14946 | null |
| 2025-09-18 | Mitigating data replication in text-to-audio generative diffusion models through anti-memorization guidance | Francisco Messina et.al. | 2509.14934 | null |
| 2025-09-19 | MeanFlowSE: one-step generative speech enhancement via conditional mean flow | Duojia Li et.al. | 2509.14858 | null |
| 2025-09-18 | SynBench: A Benchmark for Differentially Private Text Generation | Yidan Sun et.al. | 2509.14594 | null |
| 2025-09-18 | Cross-Lingual F5-TTS: Towards Language-Agnostic Voice Cloning and Speech Synthesis | Qingyu Liu et.al. | 2509.14579 | null |
| 2025-09-17 | A generative model of function growth explains hidden self-similarities across biological and social systems | James Holehouse et.al. | 2509.14468 | null |
| 2025-10-03 | SpeechWeave: Diverse Multilingual Synthetic Text & Audio Data Generation Pipeline for Training Text to Speech Models | Karan Dua et.al. | 2509.14270 | null |
| 2025-09-17 | Quantum Reinforcement Learning-Guided Diffusion Model for Image Synthesis via Hybrid Quantum-Classical Generative Model Architectures | Chi-Sheng Chen et.al. | 2509.14163 | null |
| 2025-09-19 | FlightDiffusion: Revolutionising Autonomous Drone Training with Diffusion Models Generating FPV Video | Valerii Serpiva et.al. | 2509.14082 | null |
| 2025-09-17 | Lightweight Implicit Neural Network for Binaural Audio Synthesis | Xikun Lu et.al. | 2509.14069 | null |
| 2025-09-17 | Enhancing Time Awareness in Generative Recommendation | Sunkyung Lee et.al. | 2509.13957 | null |
| 2025-09-17 | Synthetic Data Generation for Screen Time and App Usage | Gustavo Kruger et.al. | 2509.13892 | null |
| 2025-09-17 | EDITS: Enhancing Dataset Distillation with Implicit Textual Semantics | Qianxin Xia et.al. | 2509.13858 | null |
| 2025-09-17 | CraftMesh: High-Fidelity Generative Mesh Manipulation via Poisson Seamless Fusion | James Jincheng et.al. | 2509.13688 | null |
| 2025-09-17 | AgentCTG: Harnessing Multi-Agent Collaboration for Fine-Grained Precise Control in Text Generation | Xinxu Zhou et.al. | 2509.13677 | null |
| 2025-09-17 | LLM-I: LLMs are Naturally Interleaved Multimodal Creators | Zirun Guo et.al. | 2509.13642 | null |
| 2025-09-17 | Privacy-Aware In-Context Learning for Large Language Models | Bishnu Bhusal et.al. | 2509.13625 | null |
| 2025-09-14 | Synthetic Data and the Shifting Ground of Truth | Dietmar Offenhuber et.al. | 2509.13355 | null |
| 2025-09-16 | SURGIN: SURrogate-guided Generative INversion for subsurface multiphase flow with quantified uncertainty | Zhao Feng et.al. | 2509.13189 | null |
| 2025-09-17 | TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving | Jiawei Wang et.al. | 2509.13164 | null |
| 2025-09-16 | A Synthetic Data Pipeline for Supporting Manufacturing SMEs in Visual Assembly Control | Jonas Werheid et.al. | 2509.13089 | null |
| 2025-09-16 | MSR-Codec: A Low-Bitrate Multi-Stream Residual Codec for High-Fidelity Speech Generation with Information Disentanglement | Jingyu Li et.al. | 2509.13068 | null |
| 2025-09-16 | MIA-EPT: Membership Inference Attack via Error Prediction for Tabular Data | Eyal German et.al. | 2509.13046 | null |
| 2025-09-16 | A Lightweight Pipeline for Noisy Speech Voice Cloning and Accurate Lip Sync Synthesis | Javeria Amir et.al. | 2509.12831 | null |
| 2025-09-16 | ConvergeWriter: Data-Driven Bottom-Up Article Construction | Binquan Ji et.al. | 2509.12811 | null |
| 2025-09-16 | Toward Ownership Understanding of Objects: Active Question Generation with Large Language Model and Probabilistic Generative Model | Saki Hashimoto et.al. | 2509.12754 | null |
| 2025-09-16 | Chat-Driven Text Generation and Interaction for Person Retrieval | Zequn Xie et.al. | 2509.12662 | null |
| 2025-09-15 | MTEB-NL and E5-NL: Embedding Benchmark and Models for Dutch | Nikolay Banar et.al. | 2509.12340 | null |
| 2025-09-15 | VADER: A Variational Autoencoder to Infer Planetary Masses and Gas-Dust Disk Properties Around Young Stars | Sayed Shafaat Mahmud et.al. | 2509.12324 | null |
| 2025-09-14 | Prediction of Stocks Index Price using Quantum GANs | Sangram Deshpande et.al. | 2509.12286 | null |
| 2025-09-15 | OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling | Yang Zhou et.al. | 2509.12201 | null |
| 2025-09-15 | Learning Majority-to-Minority Transformations with MMD and Triplet Loss for Imbalanced Classification | Suman Cha et.al. | 2509.11511 | null |
| 2025-09-14 | Scaling Up Forest Vision with Synthetic Data | Yihang She et.al. | 2509.11201 | null |
| 2025-09-14 | Differentially-private text generation degrades output language quality | Erion Çano et.al. | 2509.11176 | null |
| 2025-09-14 | STASE: A spatialized text-to-audio synthesis engine for music generation | Tutti Chi et.al. | 2509.11124 | null |
| 2025-09-14 | Filling the Gaps: A Multitask Hybrid Multiscale Generative Framework for Missing Modality in Remote Sensing Semantic Segmentation | Nhi Kieu et.al. | 2509.11102 | null |
| 2025-09-14 | Patient-Zero: A Unified Framework for Real-Record-Free Patient Agent Generation | Yunghwei Lai et.al. | 2509.11078 | null |
| 2025-09-13 | Term2Note: Synthesising Differentially Private Clinical Notes from Medical Terms | Yuping Wu et.al. | 2509.10882 | null |
| 2025-09-13 | CogGNN: Cognitive Graph Neural Networks in Generative Connectomics | Mayssa Soussia et.al. | 2509.10864 | null |
| 2025-09-12 | Struct-Bench: A Benchmark for Differentially Private Structured Text Generation | Shuaiqi Wang et.al. | 2509.10696 | null |
| 2025-09-12 | Humanizing Automated Programming Feedback: Fine-Tuning Generative Models with Student-Written Feedback | Victor-Alexandru Pădurean et.al. | 2509.10647 | null |
| 2025-09-11 | The Coding Limits of Robust Watermarking for Generative Models | Danilo Francati et.al. | 2509.10577 | null |
| 2025-09-12 | Differentially Private Decentralized Dataset Synthesis Through Randomized Mixing with Correlated Noise | Utsab Saha et.al. | 2509.10385 | null |
| 2025-09-12 | Merging Physics-Based Synthetic Data and Machine Learning for Thermal Monitoring of Lithium-ion Batteries: The Role of Data Fidelity | Yusheng Zheng et.al. | 2509.10380 | null |
| 2025-09-12 | Arabic Large Language Models for Medical Text Generation | Abdulrahman Allam et.al. | 2509.10095 | null |
| 2025-09-11 | A Modular and Multimodal Generative AI Framework for Urban Building Energy Data: Generating Synthetic Homes | Jackson Eshbaugh et.al. | 2509.09794 | null |
| 2025-09-11 | OpenFake: An Open Dataset and Platform Toward Large-Scale Deepfake Detection | Victor Livernoche et.al. | 2509.09495 | null |
| 2025-09-11 | Diabatic quantum annealing for training energy-based generative models | Gilhan Kim et.al. | 2509.09374 | null |
| 2025-09-11 | HISPASpoof: A New Dataset For Spanish Speech Forensics | Maria Risques et.al. | 2509.09155 | null |
| 2025-09-10 | Generative quantum advantage for classical and quantum problems | Hsin-Yuan Huang et.al. | 2509.09033 | null |
| 2025-09-12 | ForTIFAI: Fending Off Recursive Training Induced Failure for AI Models | Soheil Zibakhsh Shabgahi et.al. | 2509.08972 | null |
| 2025-09-10 | PromptGuard: An Orchestrated Prompting Framework for Principled Synthetic Text Generation for Vulnerable Populations using LLMs with Enhanced Safety, Fairness, and Controllability | Tung Vu et.al. | 2509.08910 | null |
| 2025-09-10 | GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts | Jenna Kang et.al. | 2509.08818 | null |
| 2025-09-10 | Learning Turbulent Flows with Generative Models: Super-resolution, Forecasting, and Sparse Flow Reconstruction | Vivek Oommen et.al. | 2509.08752 | null |
| 2025-09-10 | Design-GenNO: A Physics-Informed Generative Model with Neural Operators for Inverse Microstructure Design | Yaohua Zang et.al. | 2509.08749 | null |
| 2025-09-11 | Generative Data Refinement: Just Ask for Better Data | Minqi Jiang et.al. | 2509.08653 | null |
| 2025-09-10 | Variational Rank Reduction Autoencoders for Generative Thermal Design | Alicia Tierz et.al. | 2509.08515 | null |
| 2025-09-10 | A Structured Review of Underwater Object Detection Challenges and Solutions: From Traditional to Large Vision Language Models | Edwine Nabahirwa et.al. | 2509.08490 | null |
| 2025-09-10 | Joint Learning using Mixture-of-Expert-Based Representation for Enhanced Speech Generation and Robust Emotion Recognition | Jing-Tong Tzeng et.al. | 2509.08470 | null |
| 2025-09-10 | LLM-Guided Ansätze Design for Quantum Circuit Born Machines in Financial Generative Modeling | Yaswitha Gujju et.al. | 2509.08385 | null |
| 2025-09-10 | Persistent-DPO: A novel loss function and hybrid learning for generative quantum eigensolver | Junya Nakamura et.al. | 2509.08351 | null |
| 2025-09-09 | Performance Assessment Strategies for Generative AI Applications in Healthcare | Victor Garcia et.al. | 2509.08087 | null |
| 2025-09-09 | One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation | Zheng Geng et.al. | 2509.07978 | null |
| 2025-09-09 | Enhancements in Score-based Channel Estimation for Real-Time Wireless Systems | Florian Strasser et.al. | 2509.07839 | null |
| 2025-09-09 | A Generalisable Generative Model for Multi-Detector Calorimeter Simulation | Piyush Raikwar et.al. | 2509.07700 | null |
| 2025-09-09 | Spectral Masking and Interpolation Attack (SMIA): A Black-box Adversarial Attack against Voice Authentication and Anti-Spoofing Systems | Kamel Kamel et.al. | 2509.07677 | null |
| 2025-09-09 | Target matching based generative model for speech enhancement | Taihui Wang et.al. | 2509.07521 | null |
| 2025-09-09 | Synthetic Data Generation with Lorenzetti for Time Series Anomaly Detection in High-Energy Physics Calorimeters | Laura Boggia et.al. | 2509.07451 | null |
| 2025-09-09 | When Fine-Tuning is Not Enough: Lessons from HSAD on Hybrid and Adversarial Audio Spoof Detection | Bin Hu et.al. | 2509.07323 | null |
| 2025-09-08 | A transformer-based generative model for planetary systems | Yann Alibert et.al. | 2509.07226 | null |
| 2025-09-08 | Neurocognitive Modeling for Text Generation: Deep Learning Architecture for EEG Data | Khushiyant et.al. | 2509.07202 | null |
| 2025-09-04 | K-Syn: K-space Data Synthesis in Ultra Low-data Regimes | Guan Yu et.al. | 2509.06997 | null |
| 2025-09-08 | SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis | Zhengqing Chen et.al. | 2509.06798 | null |
| 2025-09-15 | A Statistical 3D Stomach Shape Model for Anatomical Analysis | Erez Posner et.al. | 2509.06464 | null |
| 2025-09-08 | MeanFlow-Accelerated Multimodal Video-to-Audio Synthesis via One-Step Generation | Xiaoran Yang et.al. | 2509.06389 | null |
| 2025-09-08 | Text4Seg++: Advancing Image Segmentation via Generative Language Modeling | Mengcheng Lan et.al. | 2509.06321 | null |
| 2025-09-07 | If generative AI is the answer, what is the question? | Ambuj Tewari et.al. | 2509.06120 | null |
| 2025-09-07 | DreamAudio: Customized Text-to-Audio Generation with Diffusion Models | Yi Yuan et.al. | 2509.06027 | null |
| 2025-09-06 | GUIDe: Generative and Uncertainty-Informed Inverse Design for On-Demand Nonlinear Functional Responses | Haoxuan Dylan Mu et.al. | 2509.05641 | null |
| 2025-09-04 | SasAgent: Multi-Agent AI System for Small-Angle Scattering Data Analysis | Lijie Ding et.al. | 2509.05363 | null |
| 2025-09-02 | Ensembling Membership Inference Attacks Against Tabular Generative Models | Joshua Ward et.al. | 2509.05350 | null |
| 2025-09-04 | Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control | Haruo Fujiwara et.al. | 2509.05285 | null |
| 2025-09-05 | Recomposer: Event-roll-guided generative audio editing | Daniel P. W. Ellis et.al. | 2509.05256 | null |
| 2025-09-08 | Probabilistic operator learning: generative modeling and uncertainty quantification for foundation models of differential equations | Benjamin J. Zhang et.al. | 2509.05186 | null |
| 2025-09-05 | Painting the market: generative diffusion models for financial limit order book simulation and forecasting | Alfred Backhouse et.al. | 2509.05107 | null |
| 2025-09-05 | QCA-MolGAN: Quantum Circuit Associative Molecular GAN with Multi-Agent Reinforcement Learning | Aaron Mark Thomas et.al. | 2509.05051 | null |
| 2025-09-05 | Efficient Video-to-Audio Generation via Multiple Foundation Models Mapper | Gehui Chen et.al. | 2509.04957 | null |
| 2025-09-05 | SynGen-Vision: Synthetic Data Generation for training industrial vision models | Alpana Dubey et.al. | 2509.04894 | null |
| 2025-09-04 | Transition Models: Rethinking the Generative Learning Objective | Zidong Wang et.al. | 2509.04394 | null |
| 2025-09-04 | AUDETER: A Large-scale Dataset for Deepfake Audio Detection in Open Worlds | Qizhou Wang et.al. | 2509.04345 | null |
| 2025-09-04 | Synthetic Survival Data Generation for Heart Failure Prognosis Using Deep Generative Models | Chanon Puttanawarut et.al. | 2509.04245 | null |
| 2025-09-04 | Synthesizing Sheet Music Problems for Evaluation and Reinforcement Learning | Zhilin Wang et.al. | 2509.04059 | null |
| 2025-09-04 | An invertible generative model for forward and inverse problems | Tristan van Leeuwen et.al. | 2509.03910 | null |
| 2025-09-04 | Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series | Zhengyi Guo et.al. | 2509.03898 | null |
| 2025-09-03 | LuxDiT: Lighting Estimation with Video Diffusion Transformer | Ruofan Liang et.al. | 2509.03680 | null |
| 2025-09-05 | CEHR-XGPT: A Scalable Multi-Task Foundation Model for Electronic Health Records | Chao Pang et.al. | 2509.03643 | null |
| 2025-09-03 | Multi-level SSL Feature Gating for Audio Deepfake Detection | Hoan My Tran et.al. | 2509.03409 | null |
| 2025-09-03 | Generative Auto-Bidding in Large-Scale Competitive Auctions via Diffusion Completer-Aligner | Yewen Li et.al. | 2509.03348 | null |
| 2025-09-03 | A Comprehensive Guide to Differential Privacy: From Theory to User Expectations | Napsu Karmitsa et.al. | 2509.03294 | null |
| 2025-09-03 | Improving Perceptual Audio Aesthetic Assessment via Triplet Loss and Self-Supervised Embeddings | Dyah A. M. G. Wisnu et.al. | 2509.03292 | null |
| 2025-09-03 | RTGMFF: Enhanced fMRI-based Brain Disorder Diagnosis via ROI-driven Text Generation and Multimodal Feature Fusion | Junhao Jia et.al. | 2509.03214 | null |
| 2025-09-03 | Eigendecompositions of temporal networks | Lucas Lacasa et.al. | 2509.03135 | null |
| 2025-09-03 | Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers | Xingyue Huang et.al. | 2509.03059 | null |
| 2025-09-03 | Scale-Adaptive Generative Flows for Multiscale Scientific Data | Yifan Chen et.al. | 2509.02971 | null |
| 2025-09-02 | Generative AI for Crystal Structures: A Review | Pierre-Paul De Breuck et.al. | 2509.02723 | null |
| 2025-09-02 | Top-H Decoding: Adapting the Creativity and Coherence with Bounded Entropy in Text Generation | Erfan Baghaei Potraghloo et.al. | 2509.02510 | null |
| 2025-09-02 | Exploring Variational Graph Autoencoders for Distribution Grid Data Generation | Syed Zain Abbas et.al. | 2509.02469 | null |
| 2025-09-02 | Exploring Diffusion Models for Generative Forecasting of Financial Charts | Taegyeong Lee et.al. | 2509.02308 | null |
| 2025-09-01 | Towards Improved Speech Recognition through Optimized Synthetic Data Generation | Yanis Perrin et.al. | 2508.21631 | null |
| 2025-08-11 | Large Language Model Data Generation for Enhanced Intent Recognition in German Speech | Theresa Pekarek Rosin et.al. | 2508.06277 | null |
| 2025-07-25 | Synthetic Data Generation for Phrase Break Prediction with Large Language Model | Hoyeon Lee et.al. | 2507.18044 | null |
| 2025-07-15 | DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis | Wenjie Tian et.al. | 2507.10109 | null |
| 2025-06-13 | Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models | Sridhar S et.al. | 2506.10005 | null |
| 2025-06-11 | A Review on Score-based Generative Models for Audio Applications | Ge Zhu et.al. | 2506.08457 | link |
| 2025-06-24 | Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection | Jinming Zhang et.al. | 2505.22029 | null |
| 2025-07-01 | From Alignment to Advancement: Bootstrapping Audio-Language Alignment with Synthetic Data | Chun-Yi Kuan et.al. | 2505.20166 | null |
| 2025-05-15 | DPN-GAN: Inducing Periodic Activations in Generative Adversarial Networks for High-Fidelity Audio Synthesis | Zeeshan Ahmad et.al. | 2505.09091 | null |
| 2025-03-04 | Voice Cloning for Dysarthric Speech Synthesis: Addressing Data Scarcity in Speech-Language Pathology | Birger Moell et.al. | 2503.01266 | null |
| 2025-06-09 | DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model | Lei Zhao et.al. | 2502.18952 | null |
| 2025-05-23 | ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts | Ashi Garg et.al. | 2502.05674 | null |
| 2025-07-24 | Koel-TTS: Enhancing LLM based Speech Generation with Preference Alignment and Classifier Free Guidance | Shehzeen Hussain et.al. | 2502.05236 | null |
| 2025-01-29 | CosyAudio: Improving Audio Generation with Confidence Scores and Synthetic Captions | Xinfa Zhu et.al. | 2501.16761 | null |
| 2025-09-05 | Exposing Synthetic Speech: Model Attribution and Detection of AI-generated Speech via Audio Fingerprints | Matías Pizarro et.al. | 2411.14013 | null |
| 2024-12-20 | Exploring the Landscape for Generative Sequence Models for Specialized Data Synthesis | Mohammad Zbeeb et.al. | 2411.01929 | link |
| 2024-10-24 | Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation | Junwon Lee et.al. | 2410.17589 | null |
| 2025-03-25 | Where are we in audio deepfake detection? A systematic analysis over generative and detection models | Xiang Li et.al. | 2410.04324 | null |
| 2025-07-08 | A Framework for Synthetic Audio Conversations Generation using Large Language Models | Kaung Myat Kyaw et.al. | 2409.00946 | null |
| 2024-08-20 | Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition | Samuele Cornell et.al. | 2408.09215 | null |
| 2024-08-01 | On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition | Nick Rossenbach et.al. | 2407.21476 | null |
| 2024-06-27 | SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond | Marco Comunità et.al. | 2406.17672 | null |
| 2024-06-21 | Instruction Data Generation and Unsupervised Adaptation for Speech Language Models | Vahid Noroozi et.al. | 2406.12946 | null |
| 2024-06-13 | LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation | Wenhao Guan et.al. | 2406.08203 | null |
| 2024-07-10 | AudioLCM: Text-to-Audio Generation with Latent Consistency Models | Huadai Liu et.al. | 2406.00356 | null |
| 2024-06-04 | Creative Text-to-Audio Generation via Synthesizer Programming | Manuel Cherep et.al. | 2406.00294 | null |
| 2024-05-01 | Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis | Shivam Mehta et.al. | 2404.19622 | null |
| 2024-02-09 | Listening Between the Lines: Synthetic Speech Detection Disregarding Verbal Content | Davide Salvi et.al. | 2402.05567 | null |
| 2024-02-19 | Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis | Vinotha R et.al. | 2401.11771 | null |
| 2024-01-08 | Pheme: Efficient and Conversational Speech Generation | Paweł Budzianowski et.al. | 2401.02839 | null |
| 2023-11-21 | EDMSound: Spectrogram Based Diffusion Models for Efficient and High-Quality Audio Synthesis | Ge Zhu et.al. | 2311.08667 | null |
| 2024-03-27 | Generative Pre-training for Speech with Flow Matching | Alexander H. Liu et.al. | 2310.16338 | null |
| 2024-01-24 | Low-latency Speech Enhancement via Speech Token Generation | Huaying Xue et.al. | 2310.08981 | null |
| 2024-05-14 | AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining | Haohe Liu et.al. | 2308.05734 | null |
| 2023-07-04 | FFPDG: Fast, Fair and Private Data Generation | Weijie Xu et.al. | 2307.00161 | null |
| 2023-05-31 | Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation | Jiawei Huang et.al. | 2305.18474 | null |
| 2023-03-28 | Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis | Karren Yang et.al. | 2303.14885 | null |
| 2023-04-04 | A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI | Chenshuang Zhang et.al. | 2303.13336 | null |
| 2024-07-18 | Configurable EBEN: Extreme Bandwidth Extension Network to enhance body-conducted speech capture | Julien Hauret et.al. | 2303.10008 | null |
| 2023-01-31 | Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models | Rongjie Huang et.al. | 2301.12661 | null |
| 2023-05-26 | Evaluating and reducing the distance between synthetic and real speech distributions | Christoph Minixhofer et.al. | 2211.16049 | null |
| 2023-07-27 | AudioLM: a Language Modeling Approach to Audio Generation | Zalán Borsos et.al. | 2209.03143 | null |
| 2022-07-05 | Computer-assisted Pronunciation Training – Speech synthesis is almost all you need | Daniel Korzekwa et.al. | 2207.00774 | null |
| 2022-06-22 | Adversarial Audio Synthesis with Complex-valued Polynomial Networks | Yongtao Wu et.al. | 2206.06811 | null |
| 2024-06-06 | Parallel Synthesis for Autoregressive Speech Generation | Po-chun Hsu et.al. | 2204.11806 | null |
| 2022-03-30 | Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise | Tuomo Raitio et.al. | 2203.10637 | null |
| 2022-03-16 | Attributable-Watermarking of Speech Generative Models | Yongbaek Cho et.al. | 2202.08900 | null |
| 2021-06-15 | CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis | Simon Rouard et.al. | 2106.07431 | null |
| 2022-02-01 | ItôTTS and ItôWave: Linear Stochastic Differential Equation Is All You Need For Audio Generation | Shoule Wu et.al. | 2105.07583 | null |
| 2021-08-02 | VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding | Javier Nistal et.al. | 2105.01531 | null |
| 2022-02-25 | Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis | Erica Cooper et.al. | 2104.12292 | null |
| 2021-04-01 | DiffWave: A Versatile Diffusion Model for Audio Synthesis | Zhifeng Kong et.al. | 2009.09761 | null |
| 2022-06-29 | DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks | J. Nistal et.al. | 2008.12073 | null |
| 2019-06-05 | MelNet: A Generative Model for Audio in the Frequency Domain | Sean Vasquez et.al. | 1906.01083 | null |
| 2019-05-22 | Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems | Ohsung Kwon et.al. | 1905.08486 | null |
| 2019-03-15 | Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis | Bajibabu Bollepalli et.al. | 1903.05955 | null |
| 2019-02-20 | Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks | Hafiz Malik et.al. | 1902.06782 | null |
| 2018-10-31 | Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks | Lauri Juvela et.al. | 1810.12598 | null |
| 2017-09-26 | Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks | Yuki Saito et.al. | 1709.08041 | null |
Contributions are welcome! Please feel free to submit issues or pull requests.
If you find this repository useful, please consider giving it a star!