ASR-TTS-paper-daily

🎯 ASR-TTS Paper Daily

Automatically curated collection of the latest research papers in Speech & Language Technology

📅 Updated on 2026.04.02

🌟 About This Repository

This repository provides a daily-updated collection of the latest speech and language technology research papers from arXiv.

📖 Usage instructions: here

🌐 Web version: GitHub Pages

💡 This page is inspired by cv-arxiv-daily

🎤 ASR

📊 647 papers

📅 Publish Date 📝 Title 👥 Authors 📄 PDF 💻 Code
2026-04-01 VisG AV-HuBERT: Viseme-Guided AV-HuBERT Aristeidis Papadopoulos et.al. 2604.00982 link
2026-04-01 English to Central Kurdish Speech Translation: Corpus Creation, Evaluation, and Orthographic Standardization Mohammad Mohammadamini et.al. 2604.00613 null
2026-04-01 Speech LLMs are Contextual Reasoning Transcribers Keqi Deng et.al. 2604.00610 null
2026-04-01 Adapting Text LLMs to Speech via Multimodal Depth Up-Scaling Kazuki Yano et.al. 2604.00489 null
2026-03-31 FLEURS-Kobani: Extending the FLEURS Dataset for Northern Kurdish Daban Q. Jaff et.al. 2603.29892 null
2026-03-31 Can LLM Agents Identify Spoken Dialects like a Linguist? Tobias Bystrich et.al. 2603.29541 null
2026-03-31 LLM Probe: Evaluating LLMs for Low-Resource Languages Hailay Kidu Teklehaymanot et.al. 2603.29517 null
2026-03-31 Spoken Digit Recognition and Speaker Classification by Nonlinear Interfered Spin Wave-Based Physical Reservoir Computing Sota Hikasa et.al. 2603.29311 null
2026-03-31 Advancing LLM-based phoneme-to-grapheme for multilingual speech recognition Lukuang Dong et.al. 2603.29217 null
2026-03-30 EBuddy: a workflow orchestrator for industrial human-machine collaboration Michele Banfi et.al. 2603.28579 null
2026-03-30 Users and Wizards in Conversations: How WoZ Interface Choices Define Human-Robot Interactions Ekaterina Torubarova et.al. 2603.28338 null
2026-03-30 Voice-Controlled Scratch for Children with (Motor) Disabilities Elias Goller et.al. 2603.28246 null
2026-03-30 On the Role of Encoder Depth: Pruning Whisper and LoRA Fine-Tuning in SLAM-ASR Ganesh Pavan Kartikeya Bharadwaj Kolluri et.al. 2603.27981 null
2026-03-29 Investigation on the Robustness of Acoustic Foundation Models on Post Exercise Speech Xiangyuan Xue et.al. 2603.27508 null
2026-03-28 Two-Stage Acoustic Adaptation with Gated Cross-Attention Adapters for LLM-Based Multi-Talker Speech Recognition Hao Shi et.al. 2603.27205 null
2026-03-27 JAL-Turn: Joint Acoustic-Linguistic Modeling for Real-Time and Robust Turn-Taking Detection in Full-Duplex Spoken Dialogue Systems Guangzhao Yang et.al. 2603.26515 null
2026-03-27 Automatic Speech Recognition for Documenting Endangered Languages: Case Study of Ikema Miyakoan Chihiro Taguchi et.al. 2603.26248 null
2026-03-27 Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR Shashi Kumar et.al. 2603.26246 null
2026-03-30 Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models Kyudan Jung et.al. 2603.25750 null
2026-03-26 Back to Basics: Revisiting ASR in the Age of Voice Agents Geeyang Tay et.al. 2603.25727 null
2026-03-26 CLAR: CIF-Localized Alignment for Retrieval-Augmented Speech LLM-Based Contextual ASR Shangkun Huang et.al. 2603.25460 null
2026-03-26 Goodness-of-pronunciation without phoneme time alignment Jeremy H. M. Wong et.al. 2603.25150 null
2026-03-25 A Sociolinguistic Analysis of Automatic Speech Recognition Bias in Newcastle English Dana Serditova et.al. 2603.24549 null
2026-03-25 When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools Xingming Li et.al. 2603.24389 null
2026-03-25 Bridging Biological Hearing and Neuromorphic Computing: End-to-End Time-Domain Audio Signal Processing with Reservoir Computing Rinku Sebastian et.al. 2603.24283 null
2026-03-25 From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs Xiaoyong Guo et.al. 2603.24034 null
2026-03-24 Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages Badr M. Abdullah et.al. 2603.23654 null
2026-03-24 Evaluating a Multi-Agent Voice-Enabled Smart Speaker for Care Homes: A Safety-Focused Framework Zeinab Dehghani et.al. 2603.23625 null
2026-03-05 Berta: an open-source, modular tool for AI-enabled clinical documentation Samridhi Vaid et.al. 2603.23513 null
2026-03-24 MSR-HuBERT: Self-supervised Pre-training for Adaptation to Multiple Sampling Rates Zikang Huang et.al. 2603.23048 null
2026-03-24 Who Spoke What When? Evaluating Spoken Language Models for Conversational ASR with Semantic and Overlap-Aware Metrics Naohiro Tawara et.al. 2603.22709 null
2026-03-23 Precision-Varying Prediction (PVP): Robustifying ASR systems against adversarial attacks Matías Pizarro et.al. 2603.22590 null
2026-03-23 SLURP-TN: Resource for Tunisian Dialect Spoken Language Understanding Haroun Elleuch et.al. 2603.21940 null
2026-03-23 Ara-Best-RQ: Multi Dialectal Arabic SSL Haroun Elleuch et.al. 2603.21900 null
2026-03-23 Cascade-Free Mandarin Visual Speech Recognition via Semantic-Guided Cross-Representation Alignment Lei Yang et.al. 2603.21808 null
2026-03-30 RESPOND: Responsive Engagement Strategy for Predictive Orchestration and Dialogue Meng-Chen Lee et.al. 2603.21682 null
2026-03-20 Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech Niclas Pokel et.al. 2603.20112 null
2026-03-20 LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families Jianan Chen et.al. 2603.20042 null
2026-03-18 Impact of automatic speech recognition quality on Alzheimer’s disease detection from spontaneous speech: a reproducible benchmark study with lexical modeling and statistical validation Himadri Samanta et.al. 2603.18239 null
2026-03-27 LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation Mohammad Robaitul Islam Bhuiyan et.al. 2603.17576 null
2026-03-19 Zipper-LoRA: Dynamic Parameter Decoupling for Speech-LLM based Multilingual Speech Recognition Yuxiang Mei et.al. 2603.17558 null
2026-03-17 Over-the-air White-box Attack on the Wav2Vec Speech Recognition Neural Network Protopopov Alexey et.al. 2603.16972 null
2026-03-18 Omnilingual SONAR: Cross-Lingual and Cross-Modal Sentence Embeddings Bridging Massively Multilingual Text and Speech Omnilingual SONAR Team et.al. 2603.16606 null
2026-03-17 RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery Abhishek Kumar et.al. 2603.16411 null
2026-03-17 Fanar 2.0: Arabic Generative AI Stack FANAR TEAM et.al. 2603.16397 null
2026-03-18 Attention-guided Evidence Grounding for Spoken Question Answering Ke Yang et.al. 2603.16292 null
2026-03-17 Is Semi-Automatic Transcription Useful in Corpus Creation? Preliminary Considerations on the KIParla Corpus Martina Simonotti et.al. 2603.16258 null
2026-03-17 Polyglot-Lion: Efficient Multilingual ASR for Singapore via Balanced Fine-Tuning of Qwen3-ASR Quy-Anh Dang et.al. 2603.16184 null
2026-03-16 Lost in Transcription: Subtitle Errors in Automatic Speech Recognition Reduce Speaker and Content Evaluations Kowe Kadoma et.al. 2603.15807 null
2026-03-16 Two-Stage Adaptation for Non-Normative Speech Recognition: Revisiting Speaker-Independent Initialization for Personalization Shan Jiang et.al. 2603.15261 null
2026-03-16 LLMs and Speech: Integration vs. Combination Robin Schmitt et.al. 2603.15045 null
2026-03-16 SoulX-Duplug: Plug-and-Play Streaming State Prediction Module for Realtime Full-Duplex Speech Conversation Ruiqi Yan et.al. 2603.14877 null
2026-03-16 Vietnamese Automatic Speech Recognition: A Revisit Thi Vu et.al. 2603.14779 null
2026-03-04 BrainWhisperer: Leveraging Large-Scale ASR Models for Neural Speech Decoding Tommaso Boccato et.al. 2603.13321 null
2026-03-12 TASTE-Streaming: Towards Streamable Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling Liang-Hsuan Tseng et.al. 2603.12350 null
2026-03-12 Dr. SHAP-AV: Decoding Relative Modality Contributions via Shapley Attribution in Audio-Visual Speech Recognition Umberto Cappellazzo et.al. 2603.12046 null
2026-03-11 Continued Pretraining for Low-Resource Swahili ASR: Achieving State-of-the-Art Performance with Minimal Labeled Data Hillary Mutisya et.al. 2603.11378 null
2026-03-11 Duration Aware Scheduling for ASR Serving Under Workload Drift Darshan Makwana et.al. 2603.11273 null
2026-03-11 Self-Speculative Decoding for LLM-based ASR with CTC Encoder Drafts George Saon et.al. 2603.11243 null
2026-03-11 Huntington Disease Automatic Speech Recognition with Biomarker Supervision Charles L. Wang et.al. 2603.11168 null
2026-03-11 Uni-ASR: Unified LLM-Based Architecture for Non-Streaming and Streaming Automatic Speech Recognition Yinfeng Xia et.al. 2603.11123 null
2026-03-11 AlphaFlowTSE: One-Step Generative Target Speaker Extraction via Conditional AlphaFlow Duojia Li et.al. 2603.10701 null
2026-03-11 Distilling LLM Semantic Priors into Encoder-Only Multi-Talker ASR with Talker-Count Routing Hao Shi et.al. 2603.10587 null
2026-03-11 G-STAR: End-to-End Global Speaker-Tracking Attributed Recognition Jing Peng et.al. 2603.10468 null
2026-03-11 FireRedASR2S: A State-of-the-Art Industrial-Grade All-in-One Automatic Speech Recognition System Kaituo Xu et.al. 2603.10420 null
2026-03-10 SCENEBench: An Audio Understanding Benchmark Grounded in Assistive and Industrial Use Cases Laya Iyer et.al. 2603.09853 null
2026-03-10 Finetuning a Text-to-Audio Model for Room Impulse Response Generation Kirak Kim et.al. 2603.09708 null
2026-03-12 Logics-Parsing-Omni Technical Report Xin An et.al. 2603.09677 null
2026-03-10 Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models Dehua Tao et.al. 2603.09627 null
2026-03-10 SPAR-K: Scheduled Periodic Alternating Early Exit for Spoken Language Models Hsiao-Ying Huang et.al. 2603.09215 null
2026-03-10 Trade-offs Between Capacity and Robustness in Neural Audio Codecs for Adversarially Robust Speech Recognition Jordan Prescott et.al. 2603.09034 null
2026-03-09 NLE: Non-autoregressive LLM-based ASR by Transcript Editing Avihu Dekel et.al. 2603.08397 null
2026-03-09 Bootstrapping Audiovisual Speech Recognition in Zero-AV-Resource Scenarios with Synthetic Visual Data Pol Buitrago et.al. 2603.08249 null
2026-03-09 PathBench: Speech Intelligibility Benchmark for Automatic Pathological Speech Assessment Bence Mark Halpern et.al. 2603.08097 null
2026-03-09 Listening with the Eyes: Benchmarking Egocentric Co-Speech Grounding across Space and Time Weijie Zhou et.al. 2603.07966 null
2026-03-08 Nwāchā Munā: A Devanagari Speech Corpus and Proximal Transfer Benchmark for Nepal Bhasha ASR Rishikesh Kumar Sharma et.al. 2603.07554 null
2026-03-07 Seeing the Context: Rich Visual Context-Aware Speech Recognition via Multimodal Reasoning Wenjie Tian et.al. 2603.07263 null
2026-03-06 Speak in Context: Multilingual ASR with Speech Context Alignment via Contrastive Learning Yuchen Zhang et.al. 2603.06505 null
2026-03-06 Doctor or Patient? Synergizing Diarization and ASR for Code-Switched Hinglish Medical Conditions Extraction Séverin Baroudi et.al. 2603.06373 null
2026-03-06 Continual Adaptation for Pacific Indigenous Speech Recognition Yang Xiao et.al. 2603.06310 null
2026-03-06 Whisper-CD: Accurate Long-Form Speech Recognition using Multi-Negative Contrastive Decoding Hoseong Ahn et.al. 2603.06193 null
2026-03-12 Which Data Matter? Embedding-Based Data Selection for Speech Recognition Zakaria Aldeneh et.al. 2603.05819 null
2026-03-06 Activation Steering for Accent Adaptation in Speech Foundation Models Jinuo Sun et.al. 2603.05813 null
2026-03-05 Exploring the potential and limitations of Model Merging for Multi-Domain Adaptation in ASR Carlos Carvalho et.al. 2603.05354 null
2026-03-05 PersianPunc: A Large-Scale Dataset and BERT-Based Approach for Persian Punctuation Restoration Mohammad Javad Ranjbar Kalahroodi et.al. 2603.05314 null
2026-03-05 Beyond Word Error Rate: Auditing the Diversity Tax in Speech Recognition through Dataset Cartography Ting-Hui Cheng et.al. 2603.05267 null
2026-03-05 Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards Linghan Fang et.al. 2603.05231 null
2026-03-05 Measuring the Redundancy of Decoder Layers in SpeechLLMs Adel Moumen et.al. 2603.05121 null
2026-03-05 TW-Sound580K: A Regional Audio-Text Dataset with Verification-Guided Curation for Localized Audio-Language Modeling Hao-Hui Xie et.al. 2603.05094 null
2026-03-05 Federated Heterogeneous Language Model Optimization for Hybrid Automatic Speech Recognition Mengze Hong et.al. 2603.04945 null
2026-03-05 Spectral dynamics reservoir computing for high-speed hardware-efficient neuromorphic processing Jiaxuan Chen et.al. 2603.04901 null
2026-02-16 Generating Realistic, Protocol-Compliant Maritime Radio Dialogues using Self-Instruct and Low-Rank Adaptation Gürsel Akdeniz et.al. 2603.04423 null
2026-03-04 FlowW2N: Whispered-to-Normal Speech Conversion via Flow-Matching Fabian Ritter-Gutierrez et.al. 2603.04296 null
2026-03-04 Robust LLM-based Audio-Visual Speech Recognition with Sparse Modality Alignment and Visual Unit-Guided Refinement Fei Su et.al. 2603.03811 null
2026-03-05 The PARLO Dementia Corpus: A German Multi-Center Resource for Alzheimer’s Disease Franziska Braun et.al. 2603.03471 null
2026-03-07 ACES: Accent Subspaces for Coupling, Explanations, and Stress-Testing in Automatic Speech Recognition Swapnil Parekh et.al. 2603.03359 null
2026-03-03 Speech recognition assisted by large language models to command software orally – Application to an augmented and virtual reality web app for immersive molecular graphics Fabio Cortes Rodriguez et.al. 2603.02901 null
2026-03-04 SilentWear: an Ultra-Low Power Wearable System for EMG-based Silent Speech Recognition Giusy Spacone et.al. 2603.02847 null
2026-03-02 GLoRIA: Gated Low-Rank Interpretable Adaptation for Dialectal ASR Pouya Mehralian et.al. 2603.02464 null
2026-03-02 Sequence-Level Unsupervised Training in Speech Recognition: A Theoretical Study Zijian Yang et.al. 2603.02285 null
2026-03-15 Whisper-RIR-Mega: A Paired Clean-Reverberant Speech Benchmark for ASR Robustness to Room Acoustics Mandip Goswami et.al. 2603.02252 null
2026-02-25 Quality of Automatic Speech Recognition – Polish Language case study – from Wav2Vec to Scribe ElevenLabs Marcin Pietroń et.al. 2603.02246 null
2026-03-02 VietSuperSpeech: A Large-Scale Vietnamese Conversational Speech Dataset for ASR Fine-Tuning in Chatbot, Customer Support, and Call Center Applications Loan Do et.al. 2603.01894 null
2026-03-02 The USTC-NERCSLIP Systems for the CHiME-9 MCoRec Challenge Ya Jiang et.al. 2603.01415 null
2026-03-07 Using Songs to Improve Kazakh Automatic Speech Recognition Rustem Yeshpanov et.al. 2603.00961 null
2026-03-01 Towards Orthographically-Informed Evaluation of Speech Recognition Systems for Indian Languages Kaushal Santosh Bhogale et.al. 2603.00941 null
2026-02-28 Polynomial Mixing for Efficient Self-supervised Speech Encoders Eva Feillet et.al. 2603.00683 null
2026-02-28 Whisper-MLA: Reducing GPU Memory Consumption of ASR Models based on MHA2MLA Conversion Sen Zhang et.al. 2603.00563 null
2026-02-27 Chunk-wise Attention Transducers for Fast and Accurate Streaming Speech-to-Text Hainan Xu et.al. 2602.24245 null
2026-02-27 Dialect and Gender Bias in YouTube’s Spanish Captioning System Iris Dania Jimenez et.al. 2602.24002 null
2026-02-26 Challenges in Automatic Speech Recognition for Adults with Cognitive Impairment Michelle Cohn et.al. 2602.23436 null
2026-02-16 Hello-Chat: Towards Realistic Social Audio Interactions Yueran Hou et.al. 2602.23387 null
2026-02-26 Align-Consistency: Improving Non-autoregressive and Semi-supervised ASR with Consistency Regularization Wanting Huang et.al. 2602.23171 null
2026-02-26 Efficient Dialect-Aware Modeling and Conditioning for Low-Resource Taiwanese Hakka Speech Processing An-Ci Peng et.al. 2602.22522 null
2026-02-25 TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition Cheng-Yeh Yang et.al. 2602.22039 null
2026-03-02 Mitigating Structural Noise in Low-Resource S2TT: An Optimized Cascaded Nepali-English Pipeline with Punctuation Restoration Tangsang Chongbang et.al. 2602.21647 null
2026-02-23 An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction Guanting Shen et.al. 2602.20219 null
2026-02-23 Cross-lingual Matryoshka Representation Learning across Speech and Text Yaya Sy et.al. 2602.19991 null
2026-02-22 Pay Attention to CTC: Fast and Robust Pseudo-Labelling for Unified Speech Recognition Alexandros Haliassos et.al. 2602.19316 null
2026-02-21 Whisper: Courtside Edition Enhancing ASR Performance Through LLM-Driven Context Generation Yonathan Ron et.al. 2602.18966 null
2026-02-24 MDM-ASR: Bridging Accuracy and Efficiency in ASR with Diffusion-Based Non-Autoregressive Decoding Hao Yen et.al. 2602.18952 null
2026-02-21 ReHear: Iterative Pseudo-Label Refinement for Semi-Supervised Speech Recognition via Audio Large Language Models Zefang Liu et.al. 2602.18721 null
2026-02-18 Fine-Pruning: A Biologically Inspired Algorithm for Personalization of Machine Learning Models Joseph Bingham et.al. 2602.18507 null
2026-03-05 The Cascade Equivalence Hypothesis: When Do Speech LLMs Behave Like ASR → LLM Pipelines? Jayadev Billa et.al. 2602.17598 null
2026-02-17 Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits Gilad Nurko et.al. 2602.15405 null
2026-02-16 CLAP-Based Automatic Word Naming Recognition in Post-Stroke Aphasia Yacouba Kaloga et.al. 2602.14584 null
2026-02-15 From Scarcity to Scale: A Release-Level Analysis of the Pashto Common Voice Dataset Jandad Jahani et.al. 2602.14062 null
2026-02-15 Eureka-Audio: Triggering Audio Intelligence in Compact Language Models Dan Zhang et.al. 2602.13954 null
2026-02-14 voice2mode: Phonation Mode Classification in Singing using Self-Supervised Speech Models Aju Ani Justus et.al. 2602.13928 null
2026-02-03 Multimodal Consistency-Guided Reference-Free Data Selection for ASR Accent Adaptation Ligong Lei et.al. 2602.13263 null
2026-02-13 Can we trust AI to detect healthy multilingual English speakers among the cognitively impaired cohort in the UK? An investigation using real-world conversational speech Madhurananda Pahar et.al. 2602.13047 null
2026-02-13 ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark Tung X. Nguyen et.al. 2602.12911 null
2026-02-13 Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting Jing Xu et.al. 2602.12746 null
2026-02-16 Towards explainable reference-free speech intelligibility evaluation of people with pathological speech Bence Mark Halpern et.al. 2602.12723 null
2026-02-13 Decoder-only Conformer with Modality-aware Sparse Mixtures of Experts for ASR Jaeyoung Lee et.al. 2602.12546 null
2026-02-12 Moonshine v2: Ergodic Streaming Encoder ASR for Latency-Critical Speech Applications Manjunath Kudlur et.al. 2602.12241 null
2026-02-12 On the Sensitivity of Firing Rate-Based Federated Spiking Neural Networks to Differential Privacy Luiz Pereira et.al. 2602.12009 null
2026-02-28 TC-BiMamba: Trans-Chunk bidirectionally within BiMamba for unified streaming and non-streaming ASR Qingshun She et.al. 2602.11546 null
2026-02-21 Voxtral Realtime Alexander H. Liu et.al. 2602.11298 null
2026-02-10 When Less Is More? Diagnosing ASR Predictions in Sardinian via Layer-Wise Decoding Domenico De Cristofaro et.al. 2602.10350 null
2026-02-10 ViSpeechFormer: A Phonemic Approach for Vietnamese Automatic Speech Recognition Khoa Anh Nguyen et.al. 2602.10003 null
2026-02-10 Where Are We At with Automatic Speech Recognition for the Bambara Language? Seydou Diallo et.al. 2602.09785 null
2026-02-04 Beyond the Utterance: An Empirical Study of Very Long Context Speech Recognition Robert Flynn et.al. 2602.09044 null
2026-02-04 Windowed SummaryMixing: An Efficient Fine-Tuning of Self-Supervised Learning Models for Low-resource Speech Recognition Aditya Srinivas Menon et.al. 2602.09043 null
2026-02-09 Cross-Modal Bottleneck Fusion For Noise Robust Audio-Visual Speech Recognition Seaone Ok et.al. 2602.08293 null
2026-02-08 D-ORCA: Dialogue-Centric Optimization for Robust Audio-Visual Captioning Changli Tang et.al. 2602.07960 null
2026-02-06 Equipping LLM with Directional Multi-Talker Speech Understanding Capabilities Ju Lin et.al. 2602.07211 null
2026-02-05 From Hallucination to Articulation: Language Model-Driven Losses for Ultra Low-Bitrate Neural Speech Coding Jayeon Yi et.al. 2602.06213 null
2026-02-05 Enabling Automatic Disordered Speech Recognition: An Impaired Speech Dataset in the Akan Language Isaac Wiafe et.al. 2602.05406 null
2026-02-11 Evaluating Kubernetes Performance for GenAI Inference: From Automatic Speech Recognition to LLM Summarization Sai Sindhur Malleni et.al. 2602.04900 null
2026-02-04 Speaker-Aware Simulation Improves Conversational Speech Recognition Máté Gedeon et.al. 2602.04776 null
2026-02-04 Linguistically Informed Evaluation of Multilingual ASR for African Languages Fei-Yueh Chen et.al. 2602.04716 null
2026-02-04 Frontend Token Enhancement for Token-Based Speech Recognition Takanori Ashihara et.al. 2602.04217 null
2026-02-03 Mići Princ – A Little Boy Teaching Speech Technologies the Chakavian Dialect Nikola Ljubešić et.al. 2602.03245 null
2026-02-02 Mixture-of-Experts with Intermediate CTC Supervision for Accented Speech Recognition Wonjun Lee et.al. 2602.01967 null
2026-02-02 BBPE16: UTF-16-based byte-level byte-pair encoding for improved multilingual speech recognition Hyunsik Kim et.al. 2602.01717 null
2026-02-01 Adapting Where It Matters: Depth-Aware Adaptation for Efficient Multilingual Speech Recognition in Low-Resource Languages Yang Xiao et.al. 2602.01008 null
2026-02-01 MedSpeak: A Knowledge Graph-Aided ASR Error Correction Framework for Spoken Medical QA Yutong Song et.al. 2602.00981 null
2026-01-30 CALM: Joint Contextual Acoustic-Linguistic Modeling for Personalization of Multi-Speaker ASR Muhammad Shakeel et.al. 2601.22792 null
2026-01-30 Streaming Speech Recognition with Decoder-Only Large Language Models and Latency Optimization Genshun Wan et.al. 2601.22779 null
2026-01-29 Towards Robust Dysarthric Speech Recognition: LLM-Agent Post-ASR Correction Beyond WER Xiuwen Zheng et.al. 2601.21347 null
2026-01-30 Qwen3-ASR Technical Report Xian Shi et.al. 2601.21337 null
2026-01-28 asr_eval: Algorithms and tools for multi-reference and streaming speech recognition evaluation Oleg Sedukhin et.al. 2601.20992 null
2026-01-30 Text-only adaptation in LLM-based ASR through text denoising Sergio Burdisso et.al. 2601.20900 null
2026-01-28 Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection Sergio Burdisso et.al. 2601.20898 null
2026-01-28 A Study of Data Selection Strategies for Pre-training Self-Supervised Speech Models Ryan Whetten et.al. 2601.20896 null
2026-01-28 SW-ASR: A Context-Aware Hybrid ASR Pipeline for Robust Single Word Speech Recognition Manali Sharma et.al. 2601.20890 null
2026-01-27 MA-LipNet: Multi-Dimensional Attention Networks for Robust Lipreading Matteo Rossi et.al. 2601.20881 null
2026-02-04 SpeechMapper: Speech-to-text Embedding Projector for LLMs Biswesh Mohapatra et.al. 2601.20417 null
2026-01-28 Mind the Shift: Using Delta SSL Embeddings to Enhance Child ASR Zilai Wang et.al. 2601.20142 null
2026-01-27 Do we really need Self-Attention for Streaming Automatic Speech Recognition? Youness Dkhissi et.al. 2601.19960 null
2026-01-23 Benchmarking von ASR-Modellen im deutschen medizinischen Kontext: Eine Leistungsanalyse anhand von Anamnesegesprächen Thomas Schuster et.al. 2601.19945 null
2026-01-08 FastWhisper: Adaptive Self-knowledge Distillation for Real-time Automatic Speech Recognition Junseok Lee et.al. 2601.19919 null
2026-01-27 Rethinking Discrete Speech Representation Tokens for Accent Generation Jinzuomu Zhong et.al. 2601.19786 null
2026-01-27 Phonological Tokenizer: Prosody-Aware Phonetic Token via Multi-Objective Fine-Tuning with Differentiable K-Means Kentaro Onda et.al. 2601.19781 null
2026-01-27 Advanced Modeling of Interlanguage Speech Intelligibility Benefit with L1-L2 Multi-Task Learning Using Differentiable K-Means for Accent-Robust Discrete Token-Based ASR Kentaro Onda et.al. 2601.19767 null
2026-01-27 SLM-SS: Speech Language Model for Generative Speech Separation Tianhua Li et.al. 2601.19533 null
2026-01-27 Dynamic Multi-Expert Projectors with Stabilized Routing for Multilingual Speech Recognition Isha Pandey et.al. 2601.19451 null
2026-02-02 Language Family Matters: Evaluating LLM-Based ASR Across Linguistic Boundaries Yuchen Zhang et.al. 2601.18899 null
2026-01-29 Unheard in the Digital Age: Rethinking AI Bias and Speech Diversity Onyedikachi Hope Amaechi-Okorie et.al. 2601.18641 null
2026-01-26 Pisets: A Robust Speech Recognition System for Lectures and Interviews Ivan Bondarenko et.al. 2601.18415 null
2026-01-26 Noise-Robust AV-ASR Using Visual Features Both in the Whisper Encoder and Decoder Zhengyang Li et.al. 2601.18396 null
2026-01-26 OCR-Enhanced Multimodal ASR Can Read While Listening Junli Chen et.al. 2601.18393 null
2026-01-26 Efficient Rehearsal for Continual Learning in ASR via Singular Value Tuning Steven Vander Eeckt et.al. 2601.18266 null
2026-01-30 LLM-ForcedAligner: A Non-Autoregressive and Accurate LLM-Based Forced Aligner for Multilingual and Long-Form Speech Bingshen Mu et.al. 2601.18220 null
2026-01-25 SpatialEmb: Extract and Encode Spatial Information for 1-Stage Multi-channel Multi-speaker ASR on Arbitrary Microphone Arrays Yiwen Shao et.al. 2601.18037 null
2026-01-25 dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition Wenjie Tian et.al. 2601.17902 null
2026-01-25 BanglaRobustNet: A Hybrid Denoising-Attention Architecture for Robust Bangla Speech Recognition Md Sazzadul Islam Ridoy et.al. 2601.17679 null
2026-01-24 Window Size Versus Accuracy Experiments in Voice Activity Detectors Max McKinnon et.al. 2601.17270 null
2026-01-22 Sink or SWIM: Tackling Real-Time ASR at Scale Federico Bruzzone et.al. 2601.17097 null
2026-01-16 AI-based System for Transforming text and sound to Educational Videos M. E. ElAlami et.al. 2601.17022 null
2026-01-20 SoundBreak: A Systematic Study of Audio-Only Adversarial Attacks on Trimodal Models Aafiya Hussain et.al. 2601.16231 null
2026-01-22 Quantum Dimension Reduction of Hidden Markov Models Rishi Sundar et.al. 2601.16126 null
2026-01-27 Distillation-based Layer Dropping (DLD): Effective End-to-end Framework for Dynamic Speech Networks Abdul Hannan et.al. 2601.16117 null
2026-01-20 Lost in Transcription: How Speech-to-Text Errors Derail Code Understanding Jayant Havare et.al. 2601.15339 null
2026-01-22 Deaf and Hard of Hearing Access to Intelligent Personal Assistants: Comparison of Voice-Based Options with an LLM-Powered Touch Interface Paige S. DeVries et.al. 2601.15209 null
2026-01-21 Inverse-Hessian Regularization for Continual Learning in ASR Steven Vander Eeckt et.al. 2601.14751 null
2026-01-20 HoverAI: An Embodied Aerial Agent for Natural Human-Drone Interaction Yuhua Jin et.al. 2601.13801 null
2026-01-20 LongSpeech: A Scalable Benchmark for Transcription, Translation and Understanding in Long Speech Fei Yang et.al. 2601.13539 null
2026-01-28 Arab Voices: Mapping Standard and Dialectal Arabic Speech Technology Peter Sullivan et.al. 2601.13319 null
2026-01-19 Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition Warit Sirichotedumrong et.al. 2601.13044 null
2026-01-18 SSVD-O: Parameter-Efficient Fine-Tuning with Structured SVD for Speech Recognition Pu Wang et.al. 2601.12600 null
2026-01-18 Harmonizing the Arabic Audio Space with Data Scheduling Hunzalah Hassan Bhatti et.al. 2601.12494 null
2026-01-18 CTC-DID: CTC-Based Arabic dialect identification for streaming applications Muhammad Umar Farooq et.al. 2601.12199 null
2025-12-23 Multi-Level Embedding Conformer Framework for Bengali Automatic Speech Recognition Md. Nazmus Sakib et.al. 2601.09710 null
2026-01-14 Linear Complexity Self-Supervised Learning for Music Understanding with Random Quantizer Petros Vavaroutsos et.al. 2601.09603 null
2026-01-14 Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception Zhen Wan et.al. 2601.09413 null
2026-01-14 SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing Ziyang Ma et.al. 2601.09385 null
2026-01-17 MCGA: A Multi-task Classical Chinese Literary Genre Audio Corpus Yexing Du et.al. 2601.09270 null
2026-01-15 DSA-Tokenizer: Disentangled Semantic-Acoustic Tokenization via Flow Matching-based Hierarchical Fusion Hanlin Zhang et.al. 2601.09239 null
2026-01-14 SITA: Learning Speaker-Invariant and Tone-Aware Speech Representations for Low-Resource Tonal Languages Tianyi Xu et.al. 2601.09050 null
2026-01-13 Robust CAPTCHA Using Audio Illusions in the Era of Large Language Models: from Evaluation to Advances Ziqi Ding et.al. 2601.08516 null
2026-01-12 HiVid-Narrator: Hierarchical Video Narrative Generation with Scene-Primed ASR-anchored Compression Haoxuan Li et.al. 2601.07366 null
2026-01-12 Towards Comprehensive Semantic Speech Embeddings for Chinese Dialects Kalvin Chang et.al. 2601.07274 null
2026-01-11 Task Arithmetic with Support Languages for Low-Resource ASR Emma Rafkin et.al. 2601.07038 null
2026-01-11 Categorize Early, Integrate Late: Divergent Processing Strategies in Automatic Speech Recognition Nathan Roll et.al. 2601.06972 null
2026-01-11 TagSpeech: End-to-End Multi-Speaker ASR and Diarization with Fine-Grained Temporal Grounding Mingyue Huo et.al. 2601.06896 null
2026-01-11 Variational decomposition autoencoding improves disentanglement of latent representations Ioannis Ziogas et.al. 2601.06844 null
2026-01-10 QMAVIS: Long Video-Audio Understanding using Fusion of Large Multimodal Models Zixing Lin et.al. 2601.06573 null
2026-01-10 Representing Sounds as Neural Amplitude Fields: A Benchmark of Coordinate-MLPs and A Fourier Kolmogorov-Arnold Framework Linfei Li et.al. 2601.06406 null
2026-01-09 An Intelligent AI glasses System with Multi-Agent Architecture for Real-Time Voice Processing and Task Execution Sheng-Kai Chen et.al. 2601.06235 null
2026-01-13 GenAITEd Ghana: A First-of-Its-Kind Context-Aware and Curriculum-Aligned Conversational AI Agent for Teacher Education Matthew Nyaaba et.al. 2601.06093 null
2025-12-31 AzeroS: Extending LLM to Speech with Self-Generated Instruction-Free Tuning Yiwen Shao et.al. 2601.06086 null
2026-01-09 Multimodal In-context Learning for ASR of Low-resource Languages Zhaolin Li et.al. 2601.05707 null
2026-01-08 WESR: Scaling and Evaluating Word-level Event-Speech Recognition Chenchen Yang et.al. 2601.04508 null
2026-01-07 Dialect Matters: Cross-Lingual ASR Transfer for Low-Resource Indic Language Varieties Akriti Dhasmana et.al. 2601.04373 null
2026-01-08 TellWhisper: Tell Whisper Who Speaks When Yifan Hu et.al. 2601.03712 null
2026-01-06 Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration Ryan Soh-Eun Shim et.al. 2601.02906 null
2026-01-06 Multi-channel multi-speaker transformer for speech recognition Guo Yifan et.al. 2601.02688 null
2026-01-05 Dynamic Quantization Error Propagation in Encoder-Decoder ASR Quantization Xinyu Wang et.al. 2601.02455 null
2026-01-14 MORE: Multi-Objective Adversarial Attacks on Speech Recognition Xiaoxue Gao et.al. 2601.01852 null
2026-01-15 Bridging the gap: A comparative exploration of Speech-LLM and end-to-end architecture for multilingual conversational ASR Yuxiang Mei et.al. 2601.01461 null
2026-01-03 IO-RAE: Information-Obfuscation Reversible Adversarial Example for Audio Privacy Protection Jiajie Zhu et.al. 2601.01239 null
2025-12-31 Index-ASR Technical Report Zheshu Song et.al. 2601.00890 null
2026-01-02 Three factor delay learning rules for spiking neural networks Luke Vassallo et.al. 2601.00668 null
2026-01-02 A Language-Agnostic Hierarchical LoRA-MoE Architecture for CTC-based Multilingual ASR Yuang Zheng et.al. 2601.00557 null
2026-01-01 ROBIN: Incremental Oblique Interleaved ECC for Reliability Improvement in STT-MRAM Caches Elham Cheshmikhani et.al. 2601.00456 null
2026-01-01 Enhancing Reliability of STT-MRAM Caches by Eliminating Read Disturbance Accumulation Elham Cheshmikhani et.al. 2601.00450 null
2026-01-01 Unseen Risks of Clinical Speech-to-Text Systems: Transparency, Privacy, and Reliability Challenges in AI-Driven Documentation Nelly Elsayed et.al. 2601.00382 null
2026-01-01 IKFST: IOO and KOO Algorithms for Accelerated and Precise WFST-based End-to-End Automatic Speech Recognition Zhuoran Zhuang et.al. 2601.00160 null
2025-12-31 SLM-TTA: A Framework for Test-Time Adaptation of Generative Spoken Language Models Yuan-Kuei Wu et.al. 2512.24739 null
2025-12-29 PROFASR-BENCH: A Benchmark for Context-Conditioned ASR in High-Stakes Professional Speech Deepak Babu Piskala et.al. 2512.23686 null
2025-12-17 Marco-ASR: A Principled and Metric-Driven Framework for Fine-Tuning Large-Scale ASR Models for Domain Adaptation Xuanfan Ni et.al. 2512.22165 null
2025-12-14 EEG-to-Voice Decoding of Spoken and Imagined speech Using Non-Invasive EEG Hanbeot Park et.al. 2512.22146 null
2025-12-26 Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning YuXiang Kong et.al. 2512.21828 null
2025-12-25 Broadband tunable microwave photonic radar for simultaneous detection of human respiration, heartbeat, and speech with deep learning-based speech recognition Lei Gao et.al. 2512.21566 null
2025-12-23 Corpus of Cross-lingual Dialogues with Minutes and Detection of Misunderstandings Marko Čechovič et.al. 2512.20204 null
2025-12-29 VALLR-Pin: Uncertainty-Factorized Visual Speech Recognition for Mandarin with Pinyin Guidance Chang Sun et.al. 2512.20032 null
2025-12-22 Kunnafonidilaw ka Cadeau: an ASR dataset of present-day Bambara Yacouba Diarra et.al. 2512.19400 null
2025-12-22 From Speech to Subtitles: Evaluating ASR Models in Subtitling Italian Television Programs Alessandro Lucca et.al. 2512.19161 null
2025-12-22 Enhancing Fully Formatted End-to-End Speech Recognition with Knowledge Distillation via Multi-Codebook Vector Quantization Jian You et.al. 2512.18967 null
2025-12-20 Phoneme-based speech recognition driven by large language models and sampling marginalization Te Ma et.al. 2512.18371 null
2025-12-20 TICL+: A Case Study On Speech In-Context Learning for Children’s Speech Recognition Haolong Zheng et.al. 2512.18263 null
2025-11-27 Supplementary Resources and Analysis for Automatic Speech Recognition Systems Trained on the Loquacious Dataset Nick Rossenbach et.al. 2512.17915 null
2025-12-19 Peeking Into The Future For Contextual Biasing Ramaneswaran Selvakumar et.al. 2512.17657 null
2025-12-19 Zero-Shot Recognition of Dysarthric Speech Using Commercial Automatic Speech Recognition and Multimodal Large Language Models Ali Alsayegh et.al. 2512.17474 null
2025-12-19 Incorporating Error Level Noise Embedding for Improving LLM-Assisted Robustness in Persian Speech Recognition Zahra Rahmani et.al. 2512.17247 null
2026-01-14 Navigating the Reality Gap: Privacy-Preserving On-Device Continual Adaptation of ASR for Clinical Telephony Darshil Chauhan et.al. 2512.16401 null
2025-12-16 ComMark: Covert and Robust Black-Box Model Watermarking with Compressed Samples Yunfei Yang et.al. 2512.15641 null
2025-12-16 Scalable Frameworks for Real-World Audio-Visual Speech Recognition Sungnyun Kim et.al. 2512.14083 null
2025-12-18 Adaptive Edge-Cloud Inference for Speech-to-Action Systems Using ASR and Large Language Models Mohammad Jalili Torkamani et.al. 2512.12769 null
2025-12-13 System X: A Mobile Voice-Based AI System for EMR Generation and Clinical Decision Support in Low-Resource Maternal Healthcare Maryam Mustafa et.al. 2512.12240 null
2025-12-12 All-in-One ASR: Unifying Encoder-Decoder Models of CTC, Attention, and Transducer in Dual-Mode ASR Takafumi Moriya et.al. 2512.11543 null
2025-12-12 The Affective Bridge: Unifying Feature Representations for Speech Deepfake Detection Yupei Li et.al. 2512.11241 null
2025-11-30 Benchmarking Automatic Speech Recognition Models for African Languages Alvin Nahabwe et.al. 2512.10968 null
2025-11-30 ASR Under the Stethoscope: Evaluating Biases in Clinical Speech Recognition across Indian Languages Subham Kumar et.al. 2512.10967 null
2025-12-11 TRIDENT: A Redundant Architecture for Caribbean-Accented Emergency Speech Triage Elroy Galbraith et.al. 2512.10741 null
2025-12-10 Robust Speech Activity Detection in the Presence of Singing Voice Philipp Grundhuber et.al. 2512.09713 null
2025-12-02 Enhancing Automatic Speech Recognition Through Integrated Noise Detection Architecture Karamvir Singh et.al. 2512.08973 null
2025-12-08 A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification Nicolas Calbucura et.al. 2512.07571 null
2025-12-08 Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data Srihari Bandarupalli et.al. 2512.07277 null
2025-12-05 Morphologically-Informed Tokenizers for Languages with Non-Concatenative Morphology: A case study of Yoloxóchitl Mixtec ASR Chris Crawford et.al. 2512.06169 null
2025-12-01 KidSpeak: A General Multi-purpose LLM for Kids’ Speech Recognition and Screening Rohan Sharma et.al. 2512.05994 null
2025-12-02 Comparing Unsupervised and Supervised Semantic Speech Tokens: A Case Study of Child ASR Mohan Shi et.al. 2512.03301 null
2025-12-02 Bangla Hate Speech Classification with Fine-tuned Transformer Models Yalda Keivan Jafari et.al. 2512.02845 null
2025-12-02 Spoken Conversational Agents with Large Language Models Chao-Han Huck Yang et.al. 2512.02593 null
2025-12-01 See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models Le Thien Phuc Nguyen et.al. 2512.02231 null
2025-12-01 Swivuriso: The South African Next Voices Multilingual Speech Dataset Vukosi Marivate et.al. 2512.02201 null
2025-11-18 On the Difficulty of Token-Level Modeling of Dysfluency and Fluency Shaping Artifacts Kashaf Gulzar et.al. 2512.02027 null
2025-12-01 MAC-SLU: Multi-Intent Automotive Cabin Spoken Language Understanding Benchmark Yuezhang Peng et.al. 2512.01603 null
2025-12-01 ZO-ASR: Zeroth-Order Fine-Tuning of Speech Foundation Models without Back-Propagation Yuezhang Peng et.al. 2512.01267 null
2025-12-11 A Low-Complexity Speech Codec Using Parametric Dithering for ASR Ellison Murray et.al. 2512.00511 null
2025-11-28 OmniFusion: Simultaneous Multilingual Multimodal Translations via Modular Fusion Sai Koneru et.al. 2512.00234 null
2025-11-28 Scaling HuBERT for African Languages: From Base to Large and XL Antoine Caubrière et.al. 2511.23370 null
2025-11-28 Group-Aware Partial Model Merging for Children’s Automatic Speech Recognition Thomas Rolland et.al. 2511.23098 null
2025-11-27 Modeling Romanized Hindi and Bengali: Dataset Creation and Multilingual LLM Integration Kanchon Gharami et.al. 2511.22769 null
2025-11-27 3RSeT: Read Disturbance Rate Reduction in STT-MRAM Caches by Selective Tag Comparison Elham Cheshmikhani et.al. 2511.22551 null
2025-11-27 Do You See What I Say? Generalizable Deepfake Detection based on Visual Speech Recognition Maheswar Bora et.al. 2511.22443 null
2025-11-16 On the Cross-lingual Transferability of Pre-trained wav2vec2-based Models Jonatas Grosman et.al. 2511.21704 null
2025-11-26 ASR Error Correction in Low-Resource Burmese with Alignment-Enhanced Transformers using Phonetic Features Ye Bhone Lin et.al. 2511.21088 null
2025-11-26 RosettaSpeech: Zero-Shot Speech-to-Speech Translation from Monolingual Data Zhisheng Zheng et.al. 2511.20974 null
2025-11-26 Towards Audio Token Compression in Large Audio Language Models Saurabhchand Bhati et.al. 2511.20973 null
2025-11-25 Bridging the Language Gap: Synthetic Voice Diversity via Latent Mixup for Equitable Speech Recognition Wesley Bian et.al. 2511.20534 null
2025-11-25 Mispronunciation Detection and Diagnosis Without Model Training: A Retrieval-Based Approach Huu Tuong Tu et.al. 2511.20107 null
2025-11-24 Neural Architecture Search for Quantum Autoencoders Hibah Agha et.al. 2511.19246 null
2025-11-24 AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization Christos Koutlis et.al. 2511.18993 null
2025-11-24 Context-Aware Whisper for Arabic ASR Under Linguistic Varieties Bashar Talafha et.al. 2511.18774 null
2025-11-21 Point of Order: Action-Aware LLM Persona Modeling for Realistic Civic Simulation Scott Merrill et.al. 2511.17813 null
2025-11-21 Enhancing Quranic Learning: A Multimodal Deep Learning Approach for Arabic Phoneme Recognition Ayhan Kucukmanisa et.al. 2511.17477 null
2025-11-21 WER is Unaware: Assessing How ASR Errors Distort Clinical Understanding in Patient Facing Dialogue Zachary Ellis et.al. 2511.16544 null
2025-12-03 NLP Datasets for Idiom and Figurative Language Tasks Blake Matheny et.al. 2511.16345 null
2025-11-19 Scriboora: Rethinking Human Pose Forecasting Daniel Bermuth et.al. 2511.15565 null
2025-11-19 Building Robust and Scalable Multilingual ASR for Indian Languages Arjun Gangwar et.al. 2511.15418 null
2025-11-18 Ground Truth Generation for Multilingual Historical NLP using LLMs Clovis Gladstone et.al. 2511.14688 null
2025-11-18 TTA: Transcribe, Translate and Alignment for Cross-lingual Speech Representation Wei Liu et.al. 2511.14410 null
2025-11-18 AfriSpeech-MultiBench: A Verticalized Multidomain Multicountry Benchmark Suite for African Accented English ASR Gabrial Zencha Ashungafac et.al. 2511.14255 null
2025-11-18 Listen Like a Teacher: Mitigating Whisper Hallucinations using Adaptive Layer Attention and Knowledge Distillation Kumud Tripathi et.al. 2511.14219 null
2025-11-17 Human-centric Maintenance Process Through Integration of AI, Speech, and AR Parul Khanna et.al. 2511.13918 null
2025-11-17 Spatial Blind Spot: Auditory Motion Perception Deficits in Audio LLMs Zhe Sun et.al. 2511.13273 null
2025-11-17 Distinguishing Repetition Disfluency from Morphological Reduplication in Bangla ASR Transcripts: A Novel Corpus and Benchmarking Analysis Zaara Zabeen Arpa et.al. 2511.13159 null
2025-11-15 How Far Do SSL Speech Models Listen for Tone? Temporal Focus of Tone Representation under Low-resource Transfer Minu Kim et.al. 2511.12285 null
2025-11-15 Fusionista2.0: Efficiency Retrieval System for Large-Scale Datasets Huy M. Le et.al. 2511.12255 null
2025-11-12 Tighter Truncated Rectangular Prism Approximation for RNN Robustness Verification Xingqi Lin et.al. 2511.11699 null
2025-11-14 Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition Yiming Rong et.al. 2511.11139 null
2025-11-13 TEDxTN: A Three-way Speech Translation Corpus for Code-Switched Tunisian Arabic - English Fethi Bougares et.al. 2511.10780 null
2025-11-09 Towards Fine-Grained Code-Switch Speech Translation with Semantic Space Alignment Yan Gao et.al. 2511.10670 null
2025-11-13 ELYADATA & LIA at NADI 2025: ASR and ADI Subtasks Haroun Elleuch et.al. 2511.10090 null
2025-11-12 Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages Omnilingual ASR team et.al. 2511.09690 null
2025-11-12 End-to-end Contrastive Language-Speech Pretraining Model For Long-form Spoken Question Answering Jiliang Hu et.al. 2511.09282 null
2025-11-12 Context-Aware Dynamic Chunking for Streaming Tibetan Speech Recognition Chao Wang et.al. 2511.09085 null
2025-11-12 Towards Effective and Efficient Non-autoregressive decoders for Conformer and LLM-based ASR using Block-based Attention Mask Tianzi Wang et.al. 2511.09084 null
2025-11-11 Unifying Model and Layer Fusion for Speech Foundation Models Yi-Jen Shih et.al. 2511.08389 null
2025-11-11 Quantizing Whisper-small: How design choices affect ASR performance Arthur Söhler et.al. 2511.08093 null
2025-11-11 Pruning as Regularization: Sensitivity-Aware One-Shot Pruning in ASR Julian Irigoyen et.al. 2511.08092 null
2025-11-13 SpikCommander: A High-performance Spiking Transformer with Multi-view Learning for Efficient Speech Command Recognition Jiaqi Wang et.al. 2511.07883 null
2025-11-11 Surgical Agent Orchestration Platform for Voice-directed Patient Data Interaction Hyeryun Park et.al. 2511.07392 null
2025-11-10 Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models Umberto Cappellazzo et.al. 2511.07253 null
2025-11-24 Privacy on the Fly: A Predictive Adversarial Transformation Network for Mobile Sensor Data Tianle Song et.al. 2511.07242 null
2025-11-10 Improving Remote Patient Monitoring Systems Using a Fog-based IoT Platform with Speech Recognition Marc Jayson Baucas et.al. 2511.07189 null
2025-11-10 CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition Hung-Yang Sung et.al. 2511.06860 null
2025-11-10 MedVoiceBias: A Controlled Study of Audio LLM Behavior in Clinical Decision-Making Zhi Rui Tam et.al. 2511.06592 null
2025-11-09 We Can Hear You with mmWave Radar! An End-to-End Eavesdropping System Dachao Han et.al. 2511.06205 null
2025-11-06 CantoASR: Prosody-Aware ASR-LALM Collaboration for Low-Resource Cantonese Dazhong Chen et.al. 2511.04139 null
2025-11-06 WST: Weakly Supervised Transducer for Automatic Speech Recognition Dongji Gao et.al. 2511.04035 null
2025-11-06 Accelerating scientific discovery with the common task framework J. Nathan Kutz et.al. 2511.04001 null
2025-11-05 Open Source State-Of-the-Art Solution for Romanian Speech Recognition Gabriel Pirlogeanu et.al. 2511.03361 null
2025-11-05 TASU: Text-Only Alignment for Speech Understanding Jing Peng et.al. 2511.03310 null
2025-11-11 How to Evaluate Speech Translation with Source-Aware Neural MT Metrics Mauro Cettolo et.al. 2511.03295 null
2025-11-04 Energy-Efficient Hardware Acceleration of Whisper ASR on a CGLA Takuto Ando et.al. 2511.02269 null
2025-10-30 Overview of the MEDIQA-OE 2025 Shared Task on Medical Order Extraction from Doctor-Patient Consultations Jean-Philippe Corbeil et.al. 2510.26974 null
2025-10-29 Multi-Representation Attention Framework for Underwater Bioacoustic Denoising and Recognition Amine Razig et.al. 2510.26838 null
2025-10-28 See the Speaker: Crafting High-Resolution Talking Faces from Speech with Prior Guidance and Region Refinement Jinting Wang et.al. 2510.26819 null
2025-10-30 HMM for short independent sequences: Multiple sequence Baum-Welch application Margarita Cabrera-Bean et.al. 2510.26532 null
2025-10-29 Learning Disentangled Speech- and Expression-Driven Blendshapes for 3D Talking Face Animation Yuxiang Mao et.al. 2510.25234 null
2025-10-29 Explainable Disentanglement on Discrete Speech Representations for Noise-Robust ASR Shreyas Gopal et.al. 2510.25150 null
2025-10-28 POWSM: A Phonetic Open Whisper-Style Speech Foundation Model Chin-Jou Li et.al. 2510.24992 null
2025-10-28 Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation Inclusion AI et.al. 2510.24821 null
2025-10-28 BEST-RQ-Based Self-Supervised Learning for Whisper Domain Adaptation Raphaël Bagat et.al. 2510.24570 null
2025-10-30 Audio Signal Processing Using Time Domain Mel-Frequency Wavelet Coefficient Rinku Sebastian et.al. 2510.24519 null
2025-10-28 V-SAT: Video Subtitle Annotation Tool Arpita Kundu et.al. 2510.24180 null
2025-10-28 RegSpeech12: A Regional Corpus of Bengali Spontaneous Speech Across Dialects Md. Rezuwan Hassan et.al. 2510.24096 null
2025-10-27 A Neural Model for Contextual Biasing Score Learning and Filtering Wanting Huang et.al. 2510.23849 null
2025-11-01 RoboOmni: Proactive Robot Manipulation in Omni-modal Context Siyin Wang et.al. 2510.23763 null
2025-10-27 Arabic Little STT: Arabic Children Speech Recognition Dataset Mouhand Alkadri et.al. 2510.23319 null
2025-10-27 A Cocktail-Party Benchmark: Multi-Modal dataset and Comparative Evaluation Results Thai-Binh Nguyen et.al. 2510.23276 null
2025-10-29 Are ASR foundation models generalized enough to capture features of regional dialects for low-resource languages? Tawsif Tashwar Dipto et.al. 2510.23252 null
2025-10-27 Adapting Speech Foundation Models with Large Language Models for Unified Speech Recognition Jing-Xuan Zhang et.al. 2510.22961 null
2025-10-26 EchoMind: An Interrelated Multi-level Benchmark for Evaluating Empathetic Speech Language Models Li Zhou et.al. 2510.22758 null
2025-10-26 LRW-Persian: Lip-reading in the Wild Dataset for Persian Language Zahra Taghizadeh et.al. 2510.22716 null
2025-11-02 Mitigating Attention Sinks and Massive Activations in Audio-Visual Speech Recognition with LLMs Anand et.al. 2510.22603 null
2025-10-26 A Sociophonetic Analysis of Racial Bias in Commercial ASR Systems Using the Pacific Northwest English Corpus Michael Scott et.al. 2510.22495 null
2025-10-26 The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR Siyu Liang et.al. 2510.22492 null
2025-10-26 The Tonogenesis Continuum in Tibetan: A Computational Investigation Siyu Liang et.al. 2510.22485 null
2025-10-25 Bridging the Perceptual-Statistical Gap in Dysarthria Assessment: Why Machine Learning Still Falls Short Krishna Gurugubelli et.al. 2510.22237 null
2025-10-25 M-CIF: Multi-Scale Alignment For CIF-Based Non-Autoregressive ASR Ruixiang Mao et.al. 2510.22172 null
2025-10-23 LSF-Animation: Label-Free Speech-Driven Facial Animation via Implicit Feature Representation Xin Lu et.al. 2510.21864 null
2025-10-24 SindBERT, the Sailor: Charting the Seas of Turkish NLP Raphael Scheible-Schmitt et.al. 2510.21364 null
2025-10-27 ReFESS-QI: Reference-Free Evaluation For Speech Separation With Joint Quality And Intelligibility Scoring Ari Frummer et.al. 2510.21014 null
2025-10-21 Can large audio language models understand child stuttering speech? speech summarization, and source separation Chibuzor Okocha et.al. 2510.20850 null
2025-10-23 Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding Xin Zhang et.al. 2510.20504 null
2025-10-22 Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition Yuu Jinnai et.al. 2510.19471 null
2025-10-23 FLASH Viterbi: Fast and Adaptive Viterbi Decoding for Modern Data Systems Ziheng Deng et.al. 2510.19301 null
2025-10-22 Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges Cheng Huang et.al. 2510.19144 null
2025-10-28 RIR-Mega: a large-scale simulated room impulse response dataset for machine learning and room acoustics modeling Mandip Goswami et.al. 2510.18917 null
2025-10-23 MLMA: Towards Multilingual ASR With Mamba-based Architectures Mohamed Nabih Ali et.al. 2510.18684 null
2025-10-21 Towards Fair ASR For Second Language Speakers Using Fairness Prompted Finetuning Monorama Swain et.al. 2510.18374 null
2025-10-19 Zero- and One-Shot Data Augmentation for Sentence-Level Dysarthric Speech Recognition in Constrained Scenarios Shiyao Wang et.al. 2510.16700 null
2025-10-18 Hallucination Benchmark for Speech Foundation Models Alkis Koudounas et.al. 2510.16567 null
2025-10-18 Probing the Hidden Talent of ASR Foundation Models for L2 English Oral Assessment Fu-An Chao et.al. 2510.16387 null
2025-10-17 SpeechLLMs for Large-scale Contextualized Zero-shot Slot Filling Kadri Hacioglu et.al. 2510.15851 null
2025-10-17 SpikeVox: Towards Energy-Efficient Speech Therapy Framework with Spike-driven Generative Language Models Rachmad Vidya Wicaksana Putra et.al. 2510.15566 null
2025-10-15 Do Slides Help? Multi-modal Context for Automatic Transcription of Conference Talks Supriti Sinhamahapatra et.al. 2510.13979 null
2025-10-15 Personal Attribute Leakage in Federated Speech Models Hamdan Al-Ali et.al. 2510.13357 null
2025-10-15 Two Heads Are Better Than One: Audio-Visual Speech Error Correction with Dual Hypotheses Sungnyun Kim et.al. 2510.13281 null
2025-10-15 STT-GS: Sample-Then-Transmit Edge Gaussian Splatting with Joint Client Selection and Power Control Zhen Li et.al. 2510.13186 null
2025-10-14 A Critical Review of the Need for Knowledge-Centric Evaluation of Quranic Recitation Mohammed Hilal Al-Kharusi et.al. 2510.12858 null
2025-10-14 Adaptive vector steering: A training-free, layer-wise intervention for hallucination mitigation in large audio and multimodal models Tsung-En Lin et.al. 2510.12851 null
2025-10-11 Automatic Speech Recognition in the Modern Era: Architectures, Training, and Evaluation Md. Nayeem et.al. 2510.12827 null
2025-10-14 Cost Analysis of Human-corrected Transcription for Predominately Oral Languages Yacouba Diarra et.al. 2510.12781 null
2025-10-14 Structured Sparsity and Weight-adaptive Pruning for Memory and Compute efficient Whisper models Prasenjit K Mudi et.al. 2510.12666 null
2025-10-12 Dual Data Scaling for Robust Two-Stage User-Defined Keyword Spotting Zhiqi Ai et.al. 2510.10740 null
2025-10-12 Proficiency-Aware Adaptation and Data Augmentation for Robust L2 ASR Ling Sun et.al. 2510.10738 null
2025-10-12 End-to-end Speech Recognition with similar length speech and text Peng Fan et.al. 2510.10453 null
2025-10-12 Knowledge-Decoupled Functionally Invariant Path with Synthetic Personal Data for Personalized ASR Yue Gu et.al. 2510.10401 null
2025-10-11 End-to-end Automatic Speech Recognition and Speech Translation: Integration of Speech Foundational Models and LLMs Nam Luu et.al. 2510.10329 null
2025-10-11 SyncLipMAE: Contrastive Masked Pretraining for Audio-Visual Talking-Face Representation Zeyu Ling et.al. 2510.10069 null
2025-10-10 Accent-Invariant Automatic Speech Recognition via Saliency-Driven Spectrogram Masking Mohammad Hossein Sameti et.al. 2510.09528 null
2025-10-10 WildElder: A Chinese Elderly Speech Dataset from the Wild with Fine-Grained Manual Annotations Hui Wang et.al. 2510.09344 null
2025-10-10 SynthVC: Leveraging Synthetic Data for End-to-End Low Latency Streaming Voice Conversion Zhao Guo et.al. 2510.09245 null
2025-10-10 Effects of automotive microphone frequency response characteristics and noise conditions on speech and ASR quality – an experimental evaluation Michele Buccoli et.al. 2510.09236 null
2025-10-10 FLToP CTC: Frame-Level Token Pruning via Relative Threshold for Efficient and Memory-Saving Decoding on Diverse Platforms Atul Shree et.al. 2510.09085 null
2025-10-08 Look before Transcription: End-to-End SlideASR with Visually-Anchored Policy Optimization Rui Hu et.al. 2510.08618 null
2025-10-01 Articulation-Informed ASR: Integrating Articulatory Features into ASR via Auxiliary Speech Inversion and Cross-Attention Fusion Ahmed Adel Attia et.al. 2510.08585 null
2025-10-09 Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition Yi-Cheng Lin et.al. 2510.08047 null
2025-10-09 Bloodroot: When Watermarking Turns Poisonous For Stealthy Backdoor Kuan-Yu Chen et.al. 2510.07909 null
2025-10-08 LASER: An LLM-based ASR Scoring and Evaluation Rubric Amruta Parulekar et.al. 2510.07437 null
2025-10-08 How much speech data is necessary for ASR in African languages? An evaluation of data scaling in Kinyarwanda and Kikuyu Benjamin Akera et.al. 2510.07221 null
2025-10-09 Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation Vaibhav Srivastav et.al. 2510.06961 null
2025-10-07 Linguistically Informed Tokenization Improves ASR for Underresourced Languages Massimo Daul et.al. 2510.06461 null
2025-10-07 BanglaTalk: Towards Real-Time Speech Assistance for Bengali Regional Dialects Jakir Hasan et.al. 2510.06188 null
2025-10-06 How I Built ASR for Endangered Languages with a Spoken Dictionary Christopher Bartley et.al. 2510.04832 null
2025-10-06 Evaluating Self-Supervised Speech Models via Text-Based LLMS Takashi Maekaku et.al. 2510.04463 null
2025-10-05 Probing Whisper for Dysarthric Speech in Detection and Assessment Zhengjun Yue et.al. 2510.04219 null
2025-10-05 Drax: Speech Recognition with Discrete Flow Matching Aviv Navon et.al. 2510.04162 null
2025-10-05 MoME: Mixture of Matryoshka Experts for Audio-Visual Speech Recognition Umberto Cappellazzo et.al. 2510.04136 null
2025-10-04 Adapting Diarization-Conditioned Whisper for End-to-End Multi-Talker Speech Recognition Martin Kocour et.al. 2510.03723 null
2025-10-04 Towards Unsupervised Speech Recognition at the Syllable-Level Liming Wang et.al. 2510.03639 null
2025-10-04 Scaling Multi-Talker ASR with Speaker-Agnostic Activity Streams Xiluo He et.al. 2510.03630 null
2025-10-03 Listening or Reading? Evaluating Speech Awareness in Chain-of-Thought Speech-to-Text Translation Jacobo Romero-Díaz et.al. 2510.03115 null
2025-10-03 Revisiting Direct Speech-to-Text Translation with Speech LLMs: Better Scaling than CoT Prompting? Oriol Pareras et.al. 2510.03093 null
2025-10-03 Sequence-Preserving Dual-FoV Defense for Traffic Sign and Light Recognition in Autonomous Vehicles Abhishek Joshi et.al. 2510.02642 null
2025-10-02 A Physical Unclonable Function Based on Variations of Write Times in STT-MRAM due to Manufacturing Defects Jacob Huber et.al. 2510.02574 null
2025-10-16 Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models Tolúlopé Ògúnrèmí et.al. 2510.02569 null
2025-10-02 EvolveCaptions: Empowering DHH Users Through Real-Time Collaborative Captioning Liang-Yuan Wu et.al. 2510.02181 null
2025-09-30 An Analysis of the New EU AI Act and A Proposed Standardization Framework for Machine Learning Fairness Mike Teodorescu et.al. 2510.01281 null
2025-10-01 Automatic Speech Recognition (ASR) for African Low-Resource Languages: A Systematic Literature Review Sukairaj Hafiz Imam et.al. 2510.01145 null
2025-10-01 Spiralformer: Low Latency Encoder for Streaming Speech Recognition with Circular Layer Skipping and Early Exiting Emiru Tsunoo et.al. 2510.00982 null
2025-10-01 EuroSpeech: A Multilingual Speech Corpus Samuel Pfisterer et.al. 2510.00514 null
2025-09-26 Temporal-Aware Iterative Speech Model for Dementia Detection Chukwuemeka Ugwu et.al. 2510.00030 null
2025-09-30 IR-UWB Radar-Based Contactless Silent Speech Recognition with Attention-Enhanced Temporal Convolutional Networks Sunghwa Lee et.al. 2509.26409 null
2025-09-30 ASR Under Noise: Exploring Robustness for Sundanese and Javanese Salsabila Zahirah Pranida et.al. 2509.25878 null
2025-09-29 Beyond WER: Probing Whisper’s Sub-token Decoder Across Diverse Language Resource Levels Siyu Liang et.al. 2509.25516 null
2025-09-29 Confidence-Guided Error Correction for Disordered Speech Recognition Abner Hernandez et.al. 2509.25048 null
2025-10-05 HiKE: Hierarchical Evaluation Framework for Korean-English Code-Switching Speech Recognition Gio Paik et.al. 2509.24613 null
2025-09-29 A Text-To-Text Alignment Algorithm for Better Evaluation of Modern Speech Recognition Systems Lasse Borgholt et.al. 2509.24478 null
2025-09-28 AISHELL6-whisper: A Chinese Mandarin Audio-visual Whisper Speech Dataset with Speech Recognition Baselines Cancan Li et.al. 2509.23833 null
2025-09-28 Automatic Speech Recognition for Greek Medical Dictation Vardis Georgilas et.al. 2509.23550 null
2025-09-26 Index-MSR: A high-efficiency multimodal fusion framework for speech recognition Jinming Chen et.al. 2509.22744 null
2025-10-10 From Coarse to Fine: Recursive Audio-Visual Semantic Enhancement for Speech Separation Ke Xue et.al. 2509.22425 null
2025-09-26 Decoding Deception: Understanding Automatic Speech Recognition Vulnerabilities in Evasion and Poisoning Attacks Aravindhan G et.al. 2509.22060 null
2025-09-26 A Parallel Ultra-Low Power Silent Speech Interface based on a Wearable, Fully-dry EMG Neckband Fiona Meier et.al. 2509.21964 null
2025-09-25 Visual Authority and the Rhetoric of Health Misinformation: A Multimodal Analysis of Social Media Videos Mohammad Reza Zarei et.al. 2509.20724 null
2025-09-23 Variational Low-Rank Adaptation for Personalized Impaired Speech Recognition Niclas Pokel et.al. 2509.20397 null
2025-09-23 Data-Efficient ASR Personalization for Non-Normative Speech Using an Uncertainty-Based Phoneme Difficulty Score for Guided Sampling Niclas Pokel et.al. 2509.20396 null
2025-09-24 DRES: Benchmarking LLMs for Disfluency Removal Maria Teleki et.al. 2509.20321 null
2025-09-25 From Text to Talk: Audio-Language Model Needs Non-Autoregressive Joint Training Tianqiao Liu et.al. 2509.20072 null
2025-09-24 Discrete Diffusion for Generative Modeling of Text-Aligned Speech Tokens Pin-Jui Ku et.al. 2509.20060 null
2025-09-24 Weakly Supervised Phonological Features for Pathological Speech Analysis Jenthe Thienpondt et.al. 2509.19879 null
2025-09-26 MMedFD: A Real-world Healthcare Benchmark for Multi-turn Full-Duplex Automatic Speech Recognition Hongzhao Chen et.al. 2509.19817 null
2025-09-23 Retrieval Augmented Generation based context discovery for ASR Dimitrios Siskos et.al. 2509.19567 null
2025-09-23 WolBanking77: Wolof Banking Speech Intent Classification Dataset Abdou Karim Kandji et.al. 2509.19271 null
2025-09-23 SloPalSpeech: A 2,800-Hour Slovak Speech Corpus from Parliamentary Data Erik Božík et.al. 2509.19270 null
2025-09-23 LOTUSDIS: A Thai far-field meeting corpus for robust conversational ASR Pattara Tipaksorn et.al. 2509.18722 null
2025-09-22 Speech Vecalign: an Embedding-based Method for Aligning Parallel Speech Documents Chutong Meng et.al. 2509.18360 null
2025-09-20 Conversational Orientation Reasoning: Egocentric-to-Allocentric Navigation with Multimodal Chain-of-Thought Yu Ti Huang et.al. 2509.18200 null
2025-09-24 MNV-17: A High-Quality Performative Mandarin Dataset for Nonverbal Vocalization Recognition in Speech Jialong Mai et.al. 2509.18196 null
2025-09-22 Transformer-Encoder Trees for Efficient Multilingual Machine Translation and Speech Translation Yiwen Guan et.al. 2509.17930 null
2025-09-22 Qwen3-Omni Technical Report Jin Xu et.al. 2509.17765 null
2025-09-22 Leveraging Audio-Visual Data to Reduce the Multilingual Gap in Self-Supervised Speech Models María Andrea Cruz Blandón et.al. 2509.17523 null
2025-09-20 Idiosyncratic Versus Normative Modeling of Atypical Speech Recognition: Dysarthric Case Studies Vishnu Raja et.al. 2509.16718 null
2025-09-20 Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing Mengqi Wang et.al. 2509.16622 null
2025-09-19 Whisper-UT: A Unified Translation Framework for Speech and Text Cihan Xiao et.al. 2509.16375 null
2025-09-26 GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition Tianyue Wang et.al. 2509.16031 null
2025-09-19 Session-Level Spoken Language Assessment with a Multimodal Foundation Model via Multi-Target Learning Hong-Yun Lin et.al. 2509.16025 null
2025-09-22 Interpreting the Role of Visemes in Audio-Visual Speech Recognition Aristeidis Papadopoulos et.al. 2509.16023 null
2025-09-19 VOX-KRIKRI: Unifying Speech and Language through Continuous Fusion Dimitrios Damianos et.al. 2509.15667 null
2025-09-19 Layer-wise Minimal Pair Probing Reveals Contextual Grammatical-Conceptual Hierarchy in Speech Representations Linyang He et.al. 2509.15655 null
2025-09-19 Thinking in cocktail party: Chain-of-Thought and reinforcement learning for target speaker automatic speech recognition Yiru Zhang et.al. 2509.15612 null
2025-09-19 Chunk Based Speech Pre-training with High Resolution Finite Scalar Quantization Yun Tang et.al. 2509.15579 null
2025-09-19 State-of-the-Art Dysarthric Speech Recognition with MetaICL for on-the-fly Personalization Dhruuv Agarwal et.al. 2509.15516 null
2025-09-18 BiRQ: Bi-Level Self-Labeling Random Quantization for Self-Supervised Speech Recognition Liuyuan Jiang et.al. 2509.15430 null
2025-09-25 Speech Language Models for Under-Represented Languages: Insights from Wolof Yaya Sy et.al. 2509.15362 null
2025-09-20 Listening, Imagining & Refining: A Heuristic Optimized ASR Correction Framework with LLMs Yutong Liu et.al. 2509.15095 null
2025-09-19 From Hype to Insight: Rethinking Large Language Model Integration in Visual Speech Recognition Rishabh Jain et.al. 2509.14880 null
2025-09-18 Towards Building Speech Large Language Models for Multitask Understanding in Low-Resource Languages Mingchen Shao et.al. 2509.14804 null
2025-09-18 UMA-Split: unimodal aggregation for both English and Mandarin non-autoregressive speech recognition Ying Fang et.al. 2509.14653 null
2025-09-17 Multi-Channel Differential ASR for Robust Wearer Speech Recognition on Smart Glasses Yufeng Yang et.al. 2509.14430 null
2025-09-13 Context-Enhanced Granular Edit Representation for Efficient and Accurate ASR Post-editing Luan Vejsiu et.al. 2509.14263 null
2025-09-25 Canary-1B-v2 & Parakeet-TDT-0.6B-v3: Efficient and High-Performance Models for Multilingual ASR and AST Monica Sekoyan et.al. 2509.14128 null
2025-09-17 Language Conditioning Improves Accuracy of Aircraft Goal Prediction in Untowered Airspace Sundhar Vinodh Sangeetha et.al. 2509.14063 null
2025-09-17 Conducting Mission-Critical Voice Experiments with Automated Speech Recognition and Crowdsourcing Jan Janak et.al. 2509.13724 null
2025-09-16 Invisible Ears at Your Fingertips: Acoustic Eavesdropping via Mouse Sensors Mohamad Fakih et.al. 2509.13581 null
2025-09-16 TICL: Text-Embedding KNN For Speech In-Context Learning Unlocks Speech Recognition Abilities of Large Multimodal Models Haolong Zheng et.al. 2509.13395 null
2025-09-22 GLAD: Global-Local Aware Dynamic Mixture-of-Experts for Multi-Talker ASR Yujie Guo et.al. 2509.13093 null
2025-09-16 PAC: Pronunciation-Aware Contextualized Large Language Model-based Automatic Speech Recognition Li Fu et.al. 2509.12647 null
2025-09-17 FunAudio-ASR Technical Report Keyu An et.al. 2509.12508 null
2025-09-15 In-domain SSL pre-training and streaming ASR Jarod Duret et.al. 2509.12101 null
2025-09-12 Improving Audio Event Recognition with Consistency Regularization Shanmuka Sadhu et.al. 2509.10391 null
2025-09-12 Data-independent Beamforming for End-to-end Multichannel Multi-speaker ASR Can Cui et.al. 2509.10234 null
2025-09-12 Prominence-aware automatic speech recognition for conversational speech Julian Linke et.al. 2509.10116 null
2025-09-12 Unified Learnable 2D Convolutional Feature Extraction for ASR Peter Vieting et.al. 2509.10031 null
2025-09-11 Combining Textual and Spectral Features for Robust Classification of Pilot Communications Abdullah All Tanvir et.al. 2509.09752 null
2025-09-11 Improving Synthetic Data Training for Contextual Biasing Models with a Keyword-Aware Cost Function Chin Yuen Kwok et.al. 2509.09197 null
2025-09-11 Efficient Trie-based Biasing using K-step Prediction for Rare Word Recognition Chin Yuen Kwok et.al. 2509.09196 null
2025-09-09 A Bottom-up Framework with Language-universal Speech Attribute Modeling for Syllable-based ASR Hao Yen et.al. 2509.08173 null
2025-09-09 EnvX: Agentize Everything with Agentic AI Linyao Chen et.al. 2509.08088 null
2025-09-08 Identifying and Calibrating Overconfidence in Noisy Speech Recognition Mingyue Huo et.al. 2509.07195 null
2025-09-08 The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties William Chen et.al. 2509.07139 null
2025-09-20 TSPC: A Two-Stage Phoneme-Centric Architecture for code-switching Vietnamese-English Speech Recognition Minh N. H. Nguyen et.al. 2509.05983 null
2025-09-07 Enhancing the Robustness of Contextual ASR to Varying Biasing Information Volumes Through Purified Semantic Correlation Joint Modeling Yue Gu et.al. 2509.05908 null
2025-09-06 New Insights into Optimal Alignment of Acoustic and Linguistic Representations for Knowledge Transfer in ASR Xugang Lu et.al. 2509.05609 null
2025-09-05 Graph Connectionist Temporal Classification for Phoneme Recognition Henry Grafé et.al. 2509.05399 null
2025-09-05 Layer-wise Analysis for Quality of Multilingual Synthesized Speech Erica Cooper et.al. 2509.04830 null
2025-09-02 From Silent Signals to Natural Language: A Dual-Stage Transformer-LLM Approach Nithyashree Sivasubramaniam et.al. 2509.04507 null
2025-09-01 Refining Transcripts With TV Subtitles by Prompt-Based Weakly Supervised Training of ASR Xinnian Zhao et.al. 2509.04491 null
2025-09-01 Serialized Output Prompting for Large Language Model-based Multi-Talker Speech Recognition Hao Shi et.al. 2509.04488 null
2025-08-29 SpeechLLM: Unified Speech and Language Model for Enhanced Multi-Task Understanding in Low Resource Settings Jaekwon Yoo et.al. 2509.04473 null
2025-09-04 Contextualized Token Discrimination for Speech Search Query Correction Junyu Lu et.al. 2509.04393 null
2025-09-04 Denoising GER: A Noise-Robust Generative Error Correction with LLM for Speech Recognition Yanyan Liu et.al. 2509.04392 null
2025-09-04 PARCO: Phoneme-Augmented Robust Contextual ASR via Contrastive Entity Disambiguation Jiajun He et.al. 2509.04357 null
2025-09-04 Enhancing Self-Supervised Speaker Verification Using Similarity-Connected Graphs and GCN Zhaorui Sun et.al. 2509.04147 null
2025-08-27 An Effective Strategy for Modeling Score Ordinality and Non-uniform Intervals in Automated Speaking Assessment Tien-Hong Lo et.al. 2509.03372 null
2025-09-05 Exploring persuasive interactions with generative social robots: An experimental framework Stephan Vonschallen et.al. 2509.03231 null
2025-09-03 Beyond Words: Interjection Classification for Improved Human-Computer Interaction Yaniv Goren et.al. 2509.03181 null
2025-09-03 A Study on Zero-Shot Non-Intrusive Speech Intelligibility for Hearing Aids Using Large Language Models Ryandhimas E. Zezario et.al. 2509.03021 null
2025-09-04 Speech Intelligibility Assessment with Uncertainty-Aware Whisper Embeddings and sLSTM Ryandhimas E. Zezario et.al. 2509.03013 null
2025-09-02 SSVD: Structured SVD for Parameter-Efficient Fine-Tuning and Benchmarking under Domain Shift in ASR Pu Wang et.al. 2509.02830 null
2025-09-02 Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices Evan King et.al. 2509.02523 null
2025-09-04 AudioCodecBench: A Comprehensive Benchmark for Audio Codec Evaluation Lu Wang et.al. 2509.02349 null
2025-09-03 NADI 2025: The First Multidialectal Arabic Speech Processing Shared Task Bashar Talafha et.al. 2509.02038 null
2025-09-02 Group Relative Policy Optimization for Speech Recognition Prashanth Gurunath Shivakumar et.al. 2509.01939 null
2025-09-02 Multilingual Speech Recognition Using Discrete Tokens with a Two-step Training Strategy Zehan Li et.al. 2509.01900 null
2025-09-01 Mic Drop or Data Flop? Evaluating the Fitness for Purpose of AI Voice Interviewers for Data Collection within Quantitative & Qualitative Research Contexts Shreyas Tirumala et.al. 2509.01814 null
2025-09-01 Characterization of Speech Similarity Between Australian Aboriginal and High-Resource Languages: A Case Study on Dharawal Ting Dang et.al. 2509.01419 null
2025-09-01 CabinSep: IR-Augmented Mask-Based MVDR for Real-Time In-Car Speech Separation with Distributed Heterogeneous Arrays Runduo Han et.al. 2509.01399 null
2025-09-01 Analysing the Language of Neural Audio Codecs Joonyong Park et.al. 2509.01390 null
2025-09-01 Noisy Disentanglement with Tri-stage Training for Noise-Robust Speech Recognition Shuangyuan Chen et.al. 2509.01087 null
2025-08-31 A Unified Denoising and Adaptation Framework for Self-Supervised Bengali Dialectal ASR Swadhin Biswas et.al. 2509.00988 null
2025-08-30 Entropy-based Coarse and Compressed Semantic Speech Representation Learning Jialong Zuo et.al. 2509.00503 null
2025-08-27 Automatic Pronunciation Error Detection and Correction of the Holy Quran’s Learners Using Deep Learning Abdullah Abdelfattah et.al. 2509.00094 null
2025-08-29 NSPDI-SNN: An efficient lightweight SNN based on nonlinear synaptic pruning and dendritic integration Wuque Cai et.al. 2508.21566 null
2025-09-02 AHELM: A Holistic Evaluation of Audio-Language Models Tony Lee et.al. 2508.21376 null
2025-08-28 Can Layer-wise SSL Features Improve Zero-Shot ASR Performance for Children’s Speech? Abhijit Sinha et.al. 2508.21225 null
2025-08-28 Benchmarking Large Pretrained Multilingual Models on Québec French Speech Recognition Coralie Serrand et.al. 2508.21193 null
2025-08-28 OLMoASR: Open Models and Data for Training Robust Speech Recognition Models Huong Ngo et.al. 2508.20869 null
2025-08-28 Generative Annotation for ASR Named Entity Correction Yuanchang Luo et.al. 2508.20700 null
2025-08-28 Towards Inclusive Communication: A Unified LLM-Based Framework for Sign Language, Lip Movements, and Audio Understanding Jeong Hun Yeo et.al. 2508.20476 null
2025-09-08 Heterogeneous Self-Supervised Acoustic Pre-Training with Local Constraints Xiaodong Cui et.al. 2508.19990 null
2025-08-27 TokenVerse++: Towards Flexible Multitask Learning with Dynamic Task Activation Shashi Kumar et.al. 2508.19856 null
2025-08-27 CAMÕES: A Comprehensive Automatic Speech Recognition Benchmark for European Portuguese Carlos Carvalho et.al. 2508.19721 null
2025-08-27 Hybrid Decoding: Rapid Pass and Selective Detailed Correction for Sequence Models Yunkyu Lim et.al. 2508.19671 null
2025-08-27 Towards stable AI systems for Evaluating Arabic Pronunciations Hadi Zaatiti et.al. 2508.19587 null
2025-08-22 Whisper based Cross-Lingual Phoneme Recognition between Vietnamese and English Nguyen Huu Nhat Minh et.al. 2508.19270 null
2025-08-26 MOSA: Mixtures of Simple Adapters Outperform Monolithic Approaches in LLM-based Multilingual ASR Junjie Li et.al. 2508.18998 null
2025-08-26 TaiBai: A fully programmable brain-inspired processor with topology-aware efficiency Qianpeng Li et.al. 2508.18961 null
2025-08-26 DESAMO: A Device for Elder-Friendly Smart Homes Powered by Embedded LLM with Audio Modality Youngwon Choi et.al. 2508.18918 null
2025-08-26 Improving Noise Robust Audio-Visual Speech Recognition via Router-Gated Cross-Modal Feature Fusion DongHoon Lim et.al. 2508.18734 null
2025-08-26 Cross-Learning Fine-Tuning Strategy for Dysarthric Speech Recognition Via CDSD database Qing Xiao et.al. 2508.18732 null
2025-08-26 Attention2Probability: Attention-Driven Terminology Probability Estimation for Robust Speech-to-Text System Yanfan Du et.al. 2508.18701 null
2025-08-22 H-PRM: A Pluggable Hotword Pre-Retrieval Module for Various Speech Recognition Systems Huangyu Dai et.al. 2508.18295 null
2025-08-20 Toward Responsible ASR for African American English Speakers: A Scoping Review of Bias and Equity in Speech Technology Jay L. Cunningham et.al. 2508.18288 null
2025-08-25 Evaluating the Representation of Vowels in Wav2Vec Feature Extractor: A Layer-Wise Analysis Using MFCCs Domenico De Cristofaro et.al. 2508.17914 null
2025-08-25 Designing Practical Models for Isolated Word Visual Speech Recognition Iason Ioannis Panagos et.al. 2508.17894 null
2025-08-25 Talking to Robots: A Practical Examination of Speech Foundation Models for HRI Applications Theresa Pekarek Rosin et.al. 2508.17753 null
2025-08-24 AI-Powered Legal Intelligence System Architecture: A Comprehensive Framework for Automated Legal Consultation and Analysis Sean Kalaycioglu et.al. 2508.17499 null
2025-08-22 Benchmarking Training Paradigms, Dataset Composition, and Model Scaling for Child ASR in ESPnet Anyu Ying et.al. 2508.16576 null
2025-08-21 Beyond Transcription: Mechanistic Interpretability in ASR Neta Glazer et.al. 2508.15882 null
2025-08-20 MGSC: A Multi-granularity Consistency Framework for Robust End-to-end ASR Xuwen Yang et.al. 2508.15853 null
2025-08-21 UniCoM: A Universal Code-Switching Speech Generator Sangmin Lee et.al. 2508.15244 null
2025-08-20 A Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy References Simon Dahl Jepsen et.al. 2508.14623 null
2025-08-18 Whispering Context: Distilling Syntax and Semantics for Long Speech Transcripts Duygu Altinok et.al. 2508.13376 null
2025-08-18 Overcoming Latency Bottlenecks in On-Device Speech Translation: A Cascaded Approach with Alignment-Based Streaming MT Zeeshan Ahmed et.al. 2508.13358 null
2025-08-18 Evaluating ASR robustness to spontaneous speech errors: A study of WhisperX using a Speech Error Database John Alderete et.al. 2508.13060 null
2025-08-18 Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-Based Models Branislav Gerazov et.al. 2508.12968 null
2025-08-17 CarelessWhisper: Turning Whisper into a Causal Streaming Model Tomer Krichli et.al. 2508.12301 null
2025-08-17 HuBERT-VIC: Improving Noise-Robust Automatic Speech Recognition of Speech Foundation Model via Variance-Invariance-Covariance Regularization Hyebin Ahn et.al. 2508.12292 null
2025-08-17 What do Speech Foundation Models Learn? Analysis and Applications Ankita Pasad et.al. 2508.12255 null
2025-11-06 Omni-Router: Sharing Routing Decisions in Sparse Mixture-of-Experts for Speech Recognition Zijin Gu et.al. 2507.05724 null
2025-08-12 Transfer Learning from Visual Speech Recognition to Mouthing Recognition in German Sign Language Dinh Nam Pham et.al. 2505.13784 null
2025-05-19 Survey of End-to-End Multi-Speaker Automatic Speech Recognition for Monaural Audio Xinlu He et.al. 2505.10975 null
2025-02-26 Exploring Gender Disparities in Automatic Speech Recognition Technology Hend ElGhazaly et.al. 2502.18434 null
2025-02-11 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Jing-Xuan Zhang et.al. 2502.05766 null
2025-02-04 Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language Turi Abu et.al. 2502.00421 null
2025-02-03 Language Bias in Self-Supervised Learning For Automatic Speech Recognition Edward Storey et.al. 2501.19321 null
2025-01-20 Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR Karl El Hajal et.al. 2501.10256 null
2024-09-25 Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs Yang Yuhang et.al. 2409.16005 null
2024-08-26 Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models Adnan Haider et.al. 2408.13008 null
2024-09-26 Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations Kunal Dhawan et.al. 2407.03495 null
2025-01-10 Towards Unsupervised Speech Recognition Without Pronunciation Models Junrui Ni et.al. 2406.08380 null
2024-09-12 Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation Muhammad Shakeel et.al. 2405.13514 null
2024-04-26 Developing Acoustic Models for Automatic Speech Recognition in Swedish Giampiero Salvi et.al. 2404.16547 null
2025-04-29 SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition Evaluation Jiayu Du et.al. 2403.08196 null
2024-03-14 Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children Taekyung Ahn et.al. 2403.08187 null
2025-11-04 Aligning Speech to Languages to Enhance Code-switching Speech Recognition Hexin Liu et.al. 2403.05887 null
2024-02-22 ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge He Wang et.al. 2401.03473 null
2024-02-12 Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification Anirudh S. Sundar et.al. 2312.14378 null
2023-11-20 Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition Qijie Shao et.al. 2311.07062 null
2024-01-29 Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting Chao-Han Huck Yang et.al. 2309.15649 null
2024-02-23 Training dynamic models using early exits for automatic speech recognition on resource-constrained devices George August Wright et.al. 2309.09546 null
2023-08-15 Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition Han Zhu et.al. 2308.06547 null
2023-08-09 Federated Representation Learning for Automatic Speech Recognition Guruprasad V Ramesh et.al. 2308.02013 null
2023-07-06 Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture Haoran Miao et.al. 2307.02351 null
2023-07-06 Boosting Norwegian Automatic Speech Recognition Javier de la Rosa et.al. 2307.01672 null
2023-04-18 A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition Ruchao Fan et.al. 2304.07611 null
2023-03-07 End-to-End Speech Recognition: A Survey Rohit Prabhavalkar et.al. 2303.03329 null
2023-03-07 A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One Lingwei Meng et.al. 2302.09908 null
2023-02-03 Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition Minglun Han et.al. 2302.01194 null
2022-09-23 Assessing ASR Model Quality on Disordered Speech using BERTScore Jimmy Tobin et.al. 2209.10591 null
2023-08-25 Automatic Speech Recognition for Speech Assessment of Persian Preschool Children Amirhossein Abaskohi et.al. 2203.12886 null
2022-03-18 Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition Mengzhe Geng et.al. 2202.10290 null
2022-02-03 Visualizing Automatic Speech Recognition – Means for a Better Understanding? Karla Markert et.al. 2202.00673 null
2022-01-31 Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition Piotr Żelasko et.al. 2201.11207 null
2022-05-10 A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition Qiu-Shi Zhu et.al. 2201.08930 null
2024-11-07 Robustifying automatic speech recognition by extracting slowly varying features Matías Pizarro et.al. 2112.07400 null
2022-05-02 Privacy attacks for automatic speech recognition acoustic models in a federated learning framework Natalia Tomashenko et.al. 2111.03777 null
2022-05-03 Speech Pattern based Black-box Model Watermarking for Automatic Speech Recognition Haozhe Chen et.al. 2110.09814 null
2021-11-05 Towards efficient end-to-end speech recognition with biologically-inspired neural networks Thomas Bohnstingl et.al. 2110.02743 null
2025-02-06 Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch Jakob Poncelet et.al. 2109.14357 null
2021-10-19 Adversarial Example Devastation and Detection on Speech Recognition System by Adding Random Noise Mingyu Dong et.al. 2108.13562 null
2021-07-06 Arabic Code-Switching Speech Recognition using Monolingual Data Ahmed Ali et.al. 2107.01573 null
2021-07-05 Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition Qiujia Li et.al. 2107.00764 null
2022-03-22 Unsupervised Automatic Speech Recognition: A Review Hanan Aldarmaki et.al. 2106.04897 link
2021-05-06 Accent Recognition with Hybrid Phonetic Features Zhan Zhang et.al. 2105.01920 null
2021-10-05 Non-autoregressive Mandarin-English Code-switching Speech Recognition Shun-Po Chuang et.al. 2104.02258 null
2021-02-23 Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model Junwei Liao et.al. 2102.11114 null
2021-11-30 Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition Aswin Shanmugam Subramanian et.al. 2102.07955 null
2021-02-16 Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition Priyabrata Karmakar et.al. 2102.07259 null
2021-02-10 Sparsification via Compressed Sensing for Automatic Speech Recognition Kai Zhen et.al. 2102.04932 null
2021-02-01 BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge Martin Kocour et.al. 2101.12729 null
2021-09-14 Multi-task Language Modeling for Improving Speech Recognition of Rare Words Chao-Han Huck Yang et.al. 2011.11715 null
2020-11-09 Multilingual Bottleneck Features for Improving ASR Performance of Code-Switched Speech in Under-Resourced Languages Trideba Padhi et.al. 2011.03118 null
2020-09-22 Far-Field Automatic Speech Recognition Reinhold Haeb-Umbach et.al. 2009.09395 null
2020-10-06 CTC-Segmentation of Large Corpora for German End-to-end Speech Recognition Ludwig Kürzinger et.al. 2007.09127 null
2020-06-04 The NTNU System at the Interspeech 2020 Non-Native Children’s Speech ASR Challenge Tien-Hong Lo et.al. 2005.08433 null
2020-03-02 A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition Erik McDermott et.al. 2002.11268 null
2021-10-11 Submodular Rank Aggregation on Score-based Permutations for Distributed Automatic Speech Recognition Jun Qi et.al. 2001.10529 null
2023-05-23 Leveraging End-to-End Speech Recognition with Neural Architecture Search Ahmed Baruwa et.al. 1912.05946 null
2019-11-21 On using 2D sequence-to-sequence models for speech recognition Parnia Bahar et.al. 1911.08888 null
2019-11-13 Recurrent Neural Network Transducer for Audio-Visual Speech Recognition Takaki Makino et.al. 1911.04890 null
2019-10-15 VAIS ASR: Building a conversational speech recognition system using language model combination Quang Minh Nguyen et.al. 1910.05603 null
2020-03-17 Advancing Speech Recognition With No Speech Or With Noisy Speech Gautam Krishna et.al. 1906.08871 null
2019-05-22 Articulatory and bottleneck features for speaker-independent ASR of dysarthric speech Emre Yılmaz et.al. 1905.06533 null
2019-04-26 Phonetically-Oriented Word Error Alignment for Speech Recognition Error Analysis in Speech Translation Nicholas Ruiz et.al. 1904.11024 null
2023-05-15 End-to-end Audiovisual Speech Activity Detection with Bimodal Recurrent Neural Models Fei Tao et.al. 1809.04553 null
2018-09-13 Isolated and Ensemble Audio Preprocessing Methods for Detecting Adversarial Examples against Automatic Speech Recognition Krishan Rajaratnam et.al. 1809.04397 null
2018-05-29 Automatic context window composition for distant speech recognition Mirco Ravanelli et.al. 1805.10498 null
2018-04-27 End-to-End Multimodal Speech Recognition Shruti Palaskar et.al. 1804.09713 link
2018-03-08 Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition Wei-Ning Hsu et.al. 1803.02551 null
2019-05-01 Unsupervised Adaptation with Domain Separation Networks for Robust Speech Recognition Zhong Meng et.al. 1711.08010 null
2018-02-23 BridgeNets: Student-Teacher Transfer Learning Based on Recursive Neural Networks and its Application to Distant Speech Recognition Jaeyoung Kim et.al. 1710.10224 null
2018-04-26 Resolution limits on visual speech recognition Helen L. Bear et.al. 1710.01073 null
2017-09-01 Leveraging Deep Neural Network Activation Entropy to cope with Unseen Data in Speech Recognition Vikramjit Mitra et.al. 1708.09516 null
2018-12-06 Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training Yanmin Qian et.al. 1707.06527 null
2017-04-27 Towards Estimating the Upper Bound of Visual-Speech Recognition: The Visual Lip-Reading Feasibility Database Adriana Fernandez-Lopez et.al. 1704.08028 null
2016-12-07 Invariant Representations for Noisy Speech Recognition Dmitriy Serdyuk et.al. 1612.01928 null
2016-11-10 Automatic recognition of child speech for robotic applications in noisy environments Samuel Fernando et.al. 1611.02695 null
2014-02-12 Modified SPLICE and its Extension to Non-Stereo Data for Noise Robust Speech Recognition D. S. Pavan Kumar et.al. 1307.4048 null

🗣️ TTS

📊 517 papers

📅 Publish Date 📝 Title 👥 Authors 📄 PDF 💻 Code
2026-04-01 OmniVoice: Towards Omnilingual Zero-Shot Text-to-Speech with Diffusion Language Models Han Zhu et.al. 2604.00688 null
2026-03-31 MambaVoiceCloning: Efficient and Expressive Text-to-Speech via State-Space Modeling and Diffusion Control Sahil Kumar et.al. 2604.00292 null
2026-03-24 Fast elementwise operations on tensor trains with alternating cross interpolation Marc K. Ritter et.al. 2604.00037 null
2026-03-31 LongCat-AudioDiT: High-Fidelity Diffusion Text-to-Speech in the Waveform Latent Space Detai Xin et.al. 2603.29339 null
2026-03-31 From Natural Alignment to Conditional Controllability in Multimodal Dialogue Zeyu Jin et.al. 2603.29162 null
2026-03-30 ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining Anuj Diwan et.al. 2603.28737 null
2026-03-29 VoxAnchor: Grounding Speech Authenticity in Throat Vibration via mmWave Radar Mingda Han et.al. 2603.27562 null
2026-03-27 LLaDA-TTS: Unifying Speech Synthesis and Zero-Shot Editing via Masked Diffusion Modeling Xiaoyu Fan et.al. 2603.26364 null
2026-03-26 Voxtral TTS Alexander H. Liu et.al. 2603.25551 null
2026-03-25 YingMusic-Singer: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance Chunbo Hao et.al. 2603.24589 null
2026-03-25 Iterate to Differentiate: Enhancing Discriminability and Reliability in Zero-Shot TTS Evaluation Shengfan Shen et.al. 2603.24430 null
2026-04-01 How Open is Open TTS? A Practical Evaluation of Open Source TTS Tools Teodora Răgman et.al. 2603.24116 null
2026-03-23 SelfTTS: cross-speaker style transfer through explicit embedding disentanglement and self-refinement using self-augmentation Lucas H. Ueda et.al. 2603.22252 null
2026-03-23 Tuning Real-World Image Restoration at Inference: A Test-Time Scaling Paradigm for Flow Matching Models Purui Bai et.al. 2603.22027 null
2026-03-22 Assessing the Ability of Neural TTS Systems to Model Consonant-Induced F0 Perturbation Tianle Yang et.al. 2603.21078 null
2026-03-21 The Binding Effect: Analyzing How Multi-Dimensional Cues Form Gender Bias in Instruction TTS Kuan-Yu Chen et.al. 2603.20743 null
2026-03-21 SNAP: Speaker Nulling for Artifact Projection in Speech Deepfake Detection Kyudan Jung et.al. 2603.20686 null
2026-03-24 Tensor Train Representation of High-Dimensional Unsteady Flamelet Manifolds Sinan Demir et.al. 2603.20240 null
2026-03-20 Audio Avatar Fingerprinting: An Approach for Authorized Use of Voice Cloning in the Era of Synthetic Audio Candice R. Gerstner et.al. 2603.20165 null
2026-03-20 Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech? Lokesh Kumar et.al. 2603.19831 null
2026-03-20 Borderless Long Speech Synthesis Xingchen Song et.al. 2603.19798 null
2026-03-20 MOSS-TTS Technical Report Yitian Gong et.al. 2603.18090 null
2026-03-03 EEG-Based Brain-LLM Interface for Human Preference Aligned Generation Junzi Zhang et.al. 2603.16897 null
2026-03-17 From the Inside Out: Progressive Distribution Refinement for Confidence Calibration Xizhong Yang et.al. 2603.16500 null
2026-03-17 On the Emotion Understanding of Synthesized Speech Yuan Ge et.al. 2603.16483 null
2026-03-17 CAST-TTS: A Simple Cross-Attention Framework for Unified Timbre Control in TTS Zihao Zheng et.al. 2603.16280 null
2026-03-16 Meta-TTRL: A Metacognitive Framework for Self-Improving Test-Time Reinforcement Learning in Unified Multimodal Models Lit Sin Tan et.al. 2603.15724 null
2026-03-18 NV-Bench: Benchmark of Nonverbal Vocalization Synthesis for Expressive Text-to-Speech Generation Qinke Ni et.al. 2603.15352 null
2026-03-16 PhonemeDF: A Synthetic Speech Dataset for Audio Deepfake Detection and Naturalness Evaluation Vamshi Nallaguntla et.al. 2603.15037 null
2026-03-16 WhispSynth: Scaling Multilingual Whisper Corpus through Real Data Curation and A Novel Pitch-free Generative Framework Tianyi Tan et.al. 2603.14853 null
2026-03-16 Investigating the Impact of Speech Enhancement on Audio Deepfake Detection in Noisy Environments Anacin et.al. 2603.14767 null
2026-03-15 Affectron: Emotional Speech Synthesis with Affective and Contextually Aligned Nonverbal Vocalizations Deok-Hyeon Cho et.al. 2603.14432 null
2026-03-15 CodecMOS-Accent: A MOS Benchmark of Resynthesized and TTS Speech from Neural Codecs Across English Accents Wen-Chin Huang et.al. 2603.14328 null
2026-03-27 DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization Ngoc-Son Nguyen et.al. 2603.14267 null
2026-03-14 Beyond Two-stage Diffusion TTS: Joint Structure and Content Refinement via Jump Diffusion Jiabao Ai et.al. 2603.14032 null
2026-03-13 VoXtream2: Full-stream TTS with dynamic speaking rate control Nikita Torgashov et.al. 2603.13518 null
2026-03-12 MamTra: A Hybrid Mamba-Transformer Backbone for Speech Synthesis Tan Dat Nguyen et.al. 2603.12342 null
2026-03-12 Linking Perception, Confidence and Accuracy in MLLMs Yuetian Du et.al. 2603.12149 null
2026-03-12 Causal Prosody Mediation for Text-to-Speech: Counterfactual Training of Duration, Pitch, and Energy in FastSpeech2 Suvendu Sekhar Mohanty et.al. 2603.11683 null
2026-03-12 RAF: Relativistic Adversarial Feedback For Universal Speech Synthesis Yongjoon Lee et.al. 2603.11678 null
2026-03-11 When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS Anupam Purwar et.al. 2603.10904 null
2026-03-12 Probabilistic Verification of Voice Anti-Spoofing Models Evgeny Kushnir et.al. 2603.10713 null
2026-03-25 MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings Anupam Purwar et.al. 2603.09643 null
2026-03-10 GeoSolver: Scaling Test-Time Reasoning in Remote Sensing with Fine-Grained Process Supervision Lang Sun et.al. 2603.09551 null
2026-03-12 Multi-tasking through quantum annealing Jargalsaikhan Artag et.al. 2603.09468 null
2026-03-09 MAPLE: Elevating Medical Reasoning from Statistical Consensus to Process-Led Alignment Kailong Fan et.al. 2603.08987 null
2026-03-11 Fish Audio S2 Technical Report Shijia Liao et.al. 2603.08823 null
2026-03-09 SWE-Fuse: Empowering Software Agents via Issue-free Trajectory Learning and Entropy-aware RLVR Training Xin-Cheng Wen et.al. 2603.07927 null
2026-03-08 Targeted Speaker Poisoning Framework in Zero-Shot Text-to-Speech Thanapat Trachu et.al. 2603.07551 null
2026-03-08 Learning-free L2-Accented Speech Generation using Phonological Rules Thanathai Lertpetchpun et.al. 2603.07550 null
2026-03-08 Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data Thanathai Lertpetchpun et.al. 2603.07534 null
2026-03-08 Bolbosh: Script-Aware Flow Matching for Kashmiri Text-to-Speech Tajamul Ashraf et.al. 2603.07513 null
2026-02-21 Advances in GRPO for Generation Models: A Survey Zexiang Liu et.al. 2603.06623 null
2026-03-06 Prosodic Boundary-Aware Streaming Generation for LLM-Based TTS with Streaming Text Input Changsong Liu et.al. 2603.06444 null
2026-03-06 Is it Me? Toward Self-Extension to AI Avatars in Virtual Reality Jieying Zhang et.al. 2603.06030 null
2026-03-06 Activation Steering for Accent-Neutralized Zero-Shot Text-To-Speech Mu Yang et.al. 2603.05977 null
2026-03-06 How Well Do Current Speech Deepfake Detection Methods Generalize to the Real World? Daixian Li et.al. 2603.05852 null
2026-03-06 StreamWise: Serving Multi-Modal Generation in Real-Time at Scale Haoran Qiu et.al. 2603.05800 null
2026-03-05 Hierarchical Decoding for Discrete Speech Synthesis with Multi-Resolution Spoof Detection Junchuan Zhao et.al. 2603.05373 null
2026-03-04 ZeSTA: Zero-Shot TTS Augmentation with Domain-Conditioned Training for Data-Efficient Personalized Speech Synthesis Youngwon Choi et.al. 2603.04219 null
2026-03-04 VietNormalizer: An Open-Source, Dependency-Free Python Library for Vietnamese Text Normalization in TTS and NLP Applications Hung Vu Nguyen et.al. 2603.04145 null
2026-03-03 DLIOS: An LLM-Augmented Real-Time Multi-Modal Interactive Enhancement Overlay System for Douyin Live Streaming Shuide Wen et.al. 2603.03060 null
2026-03-02 When Spoof Detectors Travel: Evaluation Across 66 Languages in the Low-Resource Language Spoofing Corpus Kirill Borodin et.al. 2603.02364 null
2026-03-01 MM-DeepResearch: A Simple and Effective Multimodal Agentic Search Baseline Huanjin Yao et.al. 2603.01050 null
2026-03-01 S-VoCAL: A Dataset and Evaluation Framework for Inferring Speaking Voice Character Attributes in Literature Abigail Berthe-Pardo et.al. 2603.00958 null
2026-02-27 Disentangled Mode-Specific Representations for Tensor Time Series via Contrastive Learning Kohei Obata et.al. 2602.23663 null
2026-02-26 TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment Trung Dang et.al. 2602.23068 null
2026-02-24 MIDI-Informed Singing Accompaniment Generation in a Compositional Song Pipeline Fang-Duo Tsai et.al. 2602.22029 null
2026-02-23 Can You Tell It’s AI? Human Perception of Synthetic Voices in Vishing Scenarios Zoha Hayat Bhatti et.al. 2602.20061 null
2026-02-23 CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment Hanwen Liu et.al. 2602.19574 null
2026-02-22 CosyAccent: Duration-Controllable Accent Normalization Using Source-Synthesis Training Data Qibing Bai et.al. 2602.19166 null
2026-02-20 Recursive Sketched Interpolation: Efficient Hadamard Products of Tensor Trains Zhaonan Meng et.al. 2602.17974 null
2026-02-19 Financial time series augmentation using transformer based GAN architecture Andrzej Podobiński et.al. 2602.17865 null
2026-02-18 How to Label Resynthesized Audio: The Dual Role of Neural Audio Codecs in Audio Deepfake Detection Yixuan Xiao et.al. 2602.16343 null
2026-03-03 UniTAF: A Modular Framework for Joint Text-to-Speech and Audio-to-Face Modeling Qiangong Zhou et.al. 2602.15651 null
2026-02-16 Disentangling Pitch and Creak for Speaker Identity Preservation in Speech Synthesis Frederik Rautenberg et.al. 2602.14686 null
2026-02-16 Probing Human Articulatory Constraints in End-to-End TTS with Reverse and Mismatched Speech-Text Directions Parth Khadse et.al. 2602.14664 null
2026-02-15 LogitsCoder: Towards Efficient Chain-of-Thought Path Search via Logits Preference Decoding for Code Generation Jizheng Chen et.al. 2602.14054 null
2026-02-27 Learning Vocal-Tract Area and Radiation with a Physics-Informed Webster Model Minhui Lu et.al. 2602.13834 null
2026-02-14 ELEAT-SAGA: Early & Late Integration with Evading Alternating Training for Spoof-Robust Speaker Verification Amro Asali et.al. 2602.13761 null
2026-02-12 UniT: Unified Multimodal Chain-of-Thought Test-time Scaling Leon Liangyu Chen et.al. 2602.12279 null
2026-02-12 SLD-L2S: Hierarchical Subspace Latent Diffusion for High-Fidelity Lip to Speech Synthesis Yifan Liang et.al. 2602.11477 null
2026-01-19 Synthesizing the Virtual Advocate: A Multi-Persona Speech Generation Framework for Diverse Linguistic Jurisdictions in Indic Languages Aniket Deroy et.al. 2602.11172 null
2026-02-11 Calliope: A TTS-based Narrated E-book Creator Ensuring Exact Synchronization, Privacy, and Layout Fidelity Hugo L. Hammer et.al. 2602.10735 null
2026-02-10 Emotion-Coherent Speech Data Augmentation and Self-Supervised Contrastive Style Training for Enhancing Kids’s Story Speech Synthesis Raymond Chung et.al. 2602.10164 null
2026-02-10 Covo-Audio Technical Report Wenfu Wang et.al. 2602.09823 null
2026-02-10 TVTSyn: Content-Synchronous Time-Varying Timbre for Streaming Voice Conversion and Anonymization Waris Quamer et.al. 2602.09389 null
2026-02-03 DSFlow: Dual Supervision and Step-Aware Architecture for One-Step Flow Matching Speech Synthesis Bin Lin et.al. 2602.09041 null
2026-02-09 Tutti: Expressive Multi-Singer Synthesis via Structure-Level Timbre Control and Vocal Texture Modeling Jiatao Chen et.al. 2602.08233 null
2026-02-08 MARTI-MARS$^2$: Scaling Multi-Agent Self-Search via Reinforcement Learning for Code Generation Shijie Wang et.al. 2602.07848 null
2026-02-08 SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis Jiale Qian et.al. 2602.07803 null
2026-02-05 Private and interpretable clinical prediction with quantum-inspired tensor train models José Ramón Pareja Monturiol et.al. 2602.06110 null
2026-01-14 PersonaPlex: Voice and Role Control for Full Duplex Conversational Speech Models Rajarshi Roy et.al. 2602.06053 null
2026-02-05 Zero-Shot TTS With Enhanced Audio Prompts: Bsc Submission For The 2026 Wildspoof Challenge TTS Track Jose Giraldo et.al. 2602.05770 null
2026-02-05 EGSS: Entropy-guided Stepwise Scaling for Reliable Software Engineering Chenhui Mao et.al. 2602.05242 null
2026-02-05 ARCHI-TTS: A flow-matching-based Text-to-Speech Model with Self-supervised Semantic Aligner and Accelerated Inference Chunyat Wu et.al. 2602.05207 null
2026-02-04 HoliAntiSpoof: Audio LLM for Holistic Speech Anti-Spoofing Xuenan Xu et.al. 2602.04535 null
2026-02-04 SCALE: Self-uncertainty Conditioned Adaptive Looking and Execution for Vision-Language-Action Models Hyeonbeom Choi et.al. 2602.04208 null
2026-02-04 PFluxTTS: Hybrid Flow-Matching TTS with Robust Cross-Lingual Voice Cloning and Inference-Time Model Fusion Vikentii Pankov et.al. 2602.04160 null
2026-02-01 Decoding Ambiguous Emotions with Test-Time Scaling in Audio-Language Models Hong Jia et.al. 2602.03873 null
2026-02-03 CoCoEmo: Composable and Controllable Human-Like Emotional TTS via Activation Steering Siyi Wang et.al. 2602.03420 null
2026-02-03 SWE-World: Building Software Engineering Agents in Docker-Free Environments Shuang Sun et.al. 2602.03419 null
2026-02-24 SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training Huatong Song et.al. 2602.03411 null
2026-02-01 VividVoice: A Unified Framework for Scene-Aware Visually-Driven Speech Synthesis Chengyuan Ma et.al. 2602.02591 null
2026-02-02 LipSody: Lip-to-Speech Synthesis with Enhanced Prosody Consistency Jaejun Lee et.al. 2602.01908 null
2026-02-02 Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models Jinbin Bai et.al. 2602.01842 null
2026-02-03 ARTIS: Agentic Risk-Aware Test-Time Scaling via Iterative Simulation Xingshan Zeng et.al. 2602.01709 null
2026-02-01 Chronos: Learning Temporal Dynamics of Reasoning Chains for Test-Time Scaling Kai Zhang et.al. 2602.01208 null
2026-02-01 HierCon: Hierarchical Contrastive Attention for Audio Deepfake Detection Zhili Nicholas Liang et.al. 2602.01032 null
2026-02-09 APR: Penalizing Structural Redundancy in Large Reasoning Models via Anchor-based Process Rewards Kaiyan Chang et.al. 2602.00760 null
2026-01-30 Multi-Speaker Conversational Audio Deepfake: Taxonomy, Dataset and Pilot Study Alabi Ahmed et.al. 2602.00295 null
2026-01-30 Now You Hear Me: Audio Narrative Attacks Against Large Audio-Language Models Ye Yu et.al. 2601.23255 null
2026-01-30 Hearing is Believing? Evaluating and Analyzing Audio Language Model Sycophancy with SYAUDIO Junchi Yao et.al. 2601.23149 null
2026-01-30 DiffuSpeech: Silent Thought, Spoken Answer via Unified Speech-Text Diffusion Yuxuan Lou et.al. 2601.22889 null
2026-01-30 EmoShift: Lightweight Activation Steering for Enhanced Emotion-Aware Speech Synthesis Li Zhou et.al. 2601.22873 null
2026-01-30 Evaluating and Rewarding LALMs for Expressive Role-Play TTS via Mean Continuation Log-Probability Yong Ren et.al. 2601.22661 null
2026-01-29 Speech Quality-Based Localization of Low-Quality Speech and Text-to-Speech Synthesis Artefacts Michael Kuhlmann et.al. 2601.21886 null
2026-01-28 Audio Deepfake Detection in the Age of Advanced Text-to-Speech models Robin Singh et.al. 2601.20510 null
2026-01-28 Erasing Your Voice Before It’s Heard: Training-free Speaker Unlearning for Zero-shot Text-to-Speech Myungjin Lee et.al. 2601.20481 null
2026-01-29 Unit-Based Agent for Semi-Cascaded Full-Duplex Dialogue Systems Haoyuan Yu et.al. 2601.20230 null
2026-01-27 T-Mimi: A Transformer-based Mimi Decoder for Real-Time On-Phone TTS Haibin Wu et.al. 2601.20094 null
2026-01-26 Neural Multi-Speaker Voice Cloning for Nepali in Low-Resource Settings Aayush M. Shrestha et.al. 2601.18694 null
2026-01-26 UrgentMOS: Unified Multi-Metric and Preference Learning for Robust Speech Quality Assessment Wei Wang et.al. 2601.18438 null
2026-01-26 GAIA: A Data Flywheel System for Training GUI Test-Time Scaling Critic Models Shaokang Wang et.al. 2601.18197 null
2026-01-23 SonoEdit: Null-Space Constrained Knowledge Editing for Pronunciation Correction in LLM-Based TTS Ayush Pratap Singh et.al. 2601.17086 null
2026-01-22 Timbre-Aware LLM-based Direct Speech-to-Speech Translation Extendable to Multiple Language Pairs Lalaram Arya et.al. 2601.16023 null
2026-01-22 Qwen3-TTS Technical Report Hangrui Hu et.al. 2601.15621 null
2026-01-22 DeepASMR: LLM-Based Zero-Shot ASMR Speech Generation for Anyone of Any Voice Leying Zhang et.al. 2601.15596 null
2026-01-20 Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum Mohammed Salah Al-Radhi et.al. 2601.14472 null
2026-01-28 Quantifying Speaker Embedding Phonological Rule Interactions in Accented Speech Synthesis Thanathai Lertpetchpun et.al. 2601.14417 null
2026-01-20 Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches Changhao Pan et.al. 2601.13910 null
2026-01-19 Lombard Speech Synthesis for Any Voice with Controllable Style Embeddings Seymanur Akti et.al. 2601.12966 null
2026-01-18 A Unified Neural Codec Language Model for Selective Editable Text to Speech Generation Hanchen Pei et.al. 2601.12480 null
2026-01-18 ParaMETA: Towards Learning Disentangled Paralinguistic Speaking Styles Representations from Speech Haowei Lou et.al. 2601.12289 null
2026-01-18 Confidence-based Filtering for Speech Dataset Curation with Generative Speech Enhancement Using Discrete Tokens Kazuki Yamauchi et.al. 2601.12254 null
2026-01-17 Examining possible doubly topped baryon configurations M. Shekari Tousi et.al. 2601.11985 null
2026-01-16 FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning Tanyu Chen et.al. 2601.11141 null
2026-01-16 Redefining Machine Simultaneous Interpretation: From Incremental Translation to Human-Like Strategies Qianen Zhang et.al. 2601.11002 null
2026-01-20 VoiceSculptor: Your Voice, Designed By You Jingbin Hu et.al. 2601.10629 null
2026-01-15 Memo-SQL: Structured Decomposition and Experience-Driven Self-Correction for Training-Free NL2SQL Zerui Yang et.al. 2601.10011 null
2026-01-13 Decoding Order Matters in Autoregressive Speech Synthesis Minghui Zhao et.al. 2601.08450 null
2026-01-12 LJ-Spoof: A Generatively Varied Corpus for Audio Anti-Spoofing and Synthesis Source Tracing Surya Subramani et.al. 2601.07958 null
2026-01-11 Bridging Attribution and Open-Set Detection using Graph-Augmented Instance Learning in Synthetic Speech Mohd Mujtaba Akhtar et.al. 2601.07064 null
2026-01-10 Lightweight Resolution-Aware Audio Deepfake Detection via Cross-Scale Attention and Consistency Learning K. A. Shahriar et.al. 2601.06560 null
2026-01-10 3D CoCa v2: Contrastive Learners with Test-Time Search for Generalizable Spatial Intelligence Hao Tang et.al. 2601.06496 null
2026-01-09 SPAM: Style Prompt Adherence Metric for Prompt-based TTS Chanhee Cho et.al. 2601.05554 null
2026-01-08 Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Yuxiang Ji et.al. 2601.05432 null
2026-01-08 CosyEdit: Unlocking End-to-End Speech Editing Capability from Zero-Shot Text-to-Speech Models Junyang Chen et.al. 2601.05329 null
2026-01-08 FlexiVoice: Enabling Flexible Style Control in Zero-Shot TTS with Natural Language Instructions Dekun Chen et.al. 2601.04656 null
2026-01-04 LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with Generative Speech Models Zhiyuan Zhao et.al. 2601.04233 null
2026-01-07 Agentic Rubrics as Contextual Verifiers for SWE Agents Mohit Raghavendra et.al. 2601.04171 null
2026-01-09 IndexTTS 2.5 Technical Report Yunpei Li et.al. 2601.03888 null
2026-01-07 ReStyle-TTS: Relative and Continuous Style Control for Zero-Shot Speech Synthesis Haitao Li et.al. 2601.03632 null
2026-01-06 Tigrinya Number Verbalization: Rules, Algorithm, and Implementation Fitsum Gaim et.al. 2601.03403 null
2026-01-06 Segment-Aware Conditioning for Training-Free Intra-Utterance Emotion and Duration Control in Text-to-Speech Qifan Liang et.al. 2601.03170 null
2026-01-24 XLSR-MamBo: Scaling the Hybrid Mamba-Attention Backbone for Audio Deepfake Detection Kwok-Ho Ng et.al. 2601.02944 null
2026-01-06 Vulnerabilities of Audio-Based Biometric Authentication Systems Against Deepfake Speech Synthesis Mengze Hong et.al. 2601.02914 null
2026-01-06 Vclip: Face-based Speaker Generation by Face-voice Association Learning Yao Shi et.al. 2601.02753 null
2026-01-05 Towards Prosodically Informed Mizo TTS without Explicit Tone Markings Abhijit Mohanta et.al. 2601.02073 null
2026-01-05 A Training-Free Large Reasoning Model-based Knowledge Tracing Framework for Unified Prediction and Prescription Unggi Lee et.al. 2601.01708 null
2026-01-08 MM-Sonate: Multimodal Controllable Audio-Video Generation with Zero-Shot Voice Cloning Chunyu Qiang et.al. 2601.01568 null
2026-01-04 OV-InstructTTS: Towards Open-Vocabulary Instruct Text-to-Speech Yong Ren et.al. 2601.01459 null
2026-01-07 SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Chaofan Tao et.al. 2601.01426 null
2026-01-01 DepFlow: Disentangled Speech Generation to Mitigate Semantic Bias in Depression Detection Yuxin Li et.al. 2601.00303 null
2026-01-01 Latent Flow Matching for Expressive Singing Voice Synthesis Minhyeok Yun et.al. 2601.00217 null
2025-12-30 A closer look at the young stellar group around Sh 2-295 João Victor Corrêa-Rodrigues et.al. 2512.24388 null
2025-12-29 MiMo-Audio: Audio Language Models are Few-Shot Learners Xiaomi LLM-Core Team et.al. 2512.23808 link
2025-12-29 AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration Minjiang Huang et.al. 2512.23300 link
2025-12-31 Task-oriented Learnable Diffusion Timesteps for Universal Few-shot Learning of Dense Tasks Changgyoon Oh et.al. 2512.23210 null
2025-12-27 Scaling Unverifiable Rewards: A Case Study on Visual Insights Shuyu Gan et.al. 2512.22650 null
2025-12-27 ManchuTTS: Towards High-Quality Manchu Speech Synthesis via Flow Matching and Hierarchical Text Representation Suhua Wang et.al. 2512.22491 null
2025-12-26 SWE-RM: Execution-free Feedback For Software Engineering Agents KaShun Shum et.al. 2512.21919 null
2025-12-25 Zero-Shot to Zero-Lies: Detecting Bengali Deepfake Audio through Transfer Learning Most. Sharmin Sultana Samu et.al. 2512.21702 null
2025-12-22 Picosecond laser test unit for photosensor characterization at ambient and low temperatures Matthias Raphael Stock et.al. 2512.19667 null
2025-12-22 dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models Yi Xin et.al. 2512.19433 null
2025-12-22 JoyVoice: Long-Context Conditioning for Anthropomorphic Multi-Speaker Conversational Synthesis Fan Yu et.al. 2512.19090 null
2025-12-21 Smark: A Watermark for Text-to-Speech Diffusion Models via Discrete Wavelet Transform Yichuan Zhang et.al. 2512.18791 null
2025-12-21 Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis Pengchao Feng et.al. 2512.18699 null
2025-12-20 The MEVIR 2 Framework: A Virtue-Informed Moral-Epistemic Model of Human Trust Decisions Daniel Schwabe et.al. 2512.18539 null
2025-12-19 Training Text-to-Speech Model with Purely Synthetic Data: Feasibility, Sensitivity, and Generalization Capability Tingxiao Zhou et.al. 2512.17356 null
2025-12-19 Robust TTS Training via Self-Purifying Flow Matching for the WildSpoof 2026 TTS Track June Young Yi et.al. 2512.17293 null
2025-12-19 Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience Jiangjie Chen et.al. 2512.17260 null
2025-12-17 Rotatable IRS-Assisted 6DMA Communications: A Two-timescale Design Chao Zhou et.al. 2512.15092 null
2025-12-16 Robust Training of Singing Voice Synthesis Using Prior and Posterior Uncertainty Yiwen Zhao et.al. 2512.14653 null
2025-12-16 GLM-TTS Technical Report Jiayan Cui et.al. 2512.14291 null
2026-01-04 DisCo-Speech: Controllable Zero-Shot Speech Generation with A Disentangled Speech Codec Tao Li et.al. 2512.13251 null
2025-12-13 F5-TTS-RO: Extending F5-TTS to Romanian TTS via Lightweight Input Adaptation Radu-Gabriel Chivereanu et.al. 2512.12297 null
2025-12-11 Limits and Gains of Test-Time Scaling in Vision-Language Reasoning Mohammadjavad Ahmadpour et.al. 2512.11109 null
2025-12-11 CompanionCast: A Multi-Agent Conversational AI Framework with Spatial Audio for Social Co-Viewing Experiences Yiyang Wang et.al. 2512.10918 null
2025-12-10 DMP-TTS: Disentangled multi-modal Prompting for Controllable Text-to-Speech with Chained Guidance Kang Yin et.al. 2512.09504 null
2025-12-09 LG Uplus System with Multi-Speaker IDs and Discriminator-based Sub-Judges for the WildSpoof Challenge Jinyoung Park et.al. 2512.09000 null
2025-12-08 Beyond Unified Models: A Service-Oriented Approach to Low Latency, Context Aware Phonemization for Real Time TTS Mahta Fetrat et.al. 2512.08006 null
2025-12-09 Performance Benchmarking of Tensor Trains for accelerated Quantum-Inspired Homogenization on TPU, GPU and CPU architectures Sascha H. Hauck et.al. 2512.07811 null
2025-12-05 Simulating Life Paths with Digital Twins: AI-Generated Future Selves Influence Decision-Making and Expand Human Choice Rachel Poonsiriwong et.al. 2512.05397 null
2025-11-23 SyncVoice: Towards Video Dubbing with Vision-Augmented Pretrained TTS Model Kaidi Wang et.al. 2512.05126 null
2025-12-04 YingMusic-Singer: Zero-shot Singing Voice Synthesis and Editing with Annotation-free Melody Guidance Junjie Zheng et.al. 2512.04779 null
2025-12-04 M3-TTS: Multi-modal DiT Alignment & Mel-latent for Zero-shot High-fidelity Speech Synthesis Xiaopeng Wang et.al. 2512.04720 null
2025-12-04 RRPO: Robust Reward Policy Optimization for LLM-based Emotional TTS Cong Wang et.al. 2512.04552 null
2025-12-03 Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation Hang Xu et.al. 2512.03996 null
2025-12-02 Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach Siyuan Yang et.al. 2512.02834 null
2025-12-02 Generative Multi-modal Feedback for Singing Voice Synthesis Evaluation Xueyan Li et.al. 2512.02523 null
2025-12-01 The Art of Scaling Test-Time Compute for Large Language Models Aradhye Agarwal et.al. 2512.02008 null
2025-11-30 Arabic TTS with FastPitch: Reproducible Baselines, Adversarial Training, and Oversmoothing Analysis Lars Nippert et.al. 2512.00937 null
2025-11-29 FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal Hang Xu et.al. 2512.00438 null
2025-11-27 GLA-Grad++: An Improved Griffin-Lim Guided Diffusion Model for Speech Synthesis Teysir Baoueb et.al. 2511.22293 null
2025-11-27 VSpeechLM: A Visual Speech Language Model for Visual Text-to-Speech Task Yuyue Wang et.al. 2511.22229 null
2025-11-21 Asking LLMs to Verify First is Almost Free Lunch Shiguang Wu et.al. 2511.21734 null
2025-11-26 TSGM: Regular and Irregular Time-series Generation using Score-based Generative Models Haksoo Lim et.al. 2511.21335 null
2025-11-26 Multi-Reward GRPO for Stable and Prosodic Single-Codebook TTS LLMs at Scale Yicheng Zhong et.al. 2511.21270 null
2025-11-26 MUSE: Manipulating Unified Framework for Synthesizing Emotions in Images via Test-Time Optimization Yingjie Xia et.al. 2511.21051 null
2025-11-26 CartoonSing: Unifying Human and Nonhuman Timbres in Singing Generation Jionghao Han et.al. 2511.21045 null
2025-10-30 Transforming Higher Education with AI-Powered Video Lectures Dengsheng Zhang et.al. 2511.20660 null
2025-11-25 Continual Audio Deepfake Detection via Universal Adversarial Perturbation Wangjie Li et.al. 2511.19974 null
2025-11-26 Scale Where It Matters: Training-Free Localized Scaling for Diffusion Models Qin Ren et.al. 2511.19917 null
2025-11-23 InstructAudio: Unified speech and music generation with natural language instruction Chunyu Qiang et.al. 2511.18487 null
2025-11-22 A superpersuasive autonomous policy debating system Allen Roush et.al. 2511.17854 null
2025-11-21 AI in Music and Sound: Pedagogical Reflections, Post-Structuralist Approaches and Creative Outcomes in Seminar Practice Guilherme Coelho et.al. 2511.17425 null
2025-11-20 Codec2Vec: Self-Supervised Speech Representation Learning Using Neural Speech Codecs Wei-Cheng Tseng et.al. 2511.16639 null
2025-11-20 SceneGuard: Training-Time Voice Protection with Scene-Consistent Audible Background Noise Rui Sang et.al. 2511.16114 null
2025-11-24 PresentCoach: Dual-Agent Presentation Coaching through Exemplars and Interactive Feedback Sirui Chen et.al. 2511.15253 null
2025-11-18 Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech Nam-Gyu Kim et.al. 2511.14824 null
2025-11-06 The Impact of Prosodic Segmentation on Speech Synthesis of Spontaneous Speech Julio Cesar Galdino et.al. 2511.14779 null
2025-11-16 Hi-Reco: High-Fidelity Real-Time Conversational Digital Humans Hongbin Huang et.al. 2511.12662 null
2025-11-15 VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing Zhisheng Zheng et.al. 2511.12347 null
2025-11-14 CLARITY: Contextual Linguistic Adaptation and Accent Retrieval for Dual-Bias Mitigation in Text-to-Speech Generation Crystal Min Hui Poon et.al. 2511.11104 null
2025-11-14 Synthetic Voices, Real Threats: Evaluating Large Text-to-Speech Models in Generating Harmful Audio Guangke Chen et.al. 2511.10913 null
2025-11-13 Curved Worlds, Clear Boundaries: Generalizing Speech Deepfake Detection using Hyperbolic and Spherical Geometry Spaces Farhan Sheth et.al. 2511.10793 null
2025-11-13 VocalNet-M2: Advancing Low-Latency Spoken Language Modeling via Integrated Multi-Codebook Tokenization and Multi-Token Prediction Yuhao Wang et.al. 2511.10232 null
2025-11-13 Time-Layer Adaptive Alignment for Speaker Similarity in Flow-Matching Based Zero-Shot TTS Haoyu Li et.al. 2511.09995 null
2025-11-30 SpeechJudge: Towards Human-Level Judgment for Speech Naturalness Xueyao Zhang et.al. 2511.07931 link
2025-11-24 SynTTS-Commands: A Public Dataset for On-Device KWS via TTS-Synthesized Multilingual Speech Lu Gan et.al. 2511.07821 null
2025-11-10 Generating Novel and Realistic Speakers for Voice Conversion Meiying Melissa Chen et.al. 2511.07135 null
2025-10-26 Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation Canxiang Yan et.al. 2511.05516 null
2025-11-07 Shared Latent Representation for Joint Text-to-Audio-Visual Synthesis Dogucan Yaman et.al. 2511.05432 null
2025-11-07 Synthesizing speech with selected perceptual voice qualities - A case study with creaky voice Frederik Rautenberg et.al. 2511.05143 null
2025-11-19 Step-Audio-EditX Technical Report Chao Yan et.al. 2511.03601 null
2025-11-05 Seeing What You Say: Expressive Image Generation from Speech Jiyoung Lee et.al. 2511.03423 null
2025-11-05 PolyNorm: Few-Shot LLM-Based Text Normalization for Text-to-Speech Michel Wong et.al. 2511.03080 null
2025-11-04 Augmenting Open-Vocabulary Dysarthric Speech Assessment with Human Perceptual Supervision Kaimeng Jia et.al. 2511.02270 null
2025-11-03 Toward Objective and Interpretable Prosody Evaluation in Text-to-Speech: A Linguistically Motivated Approach Cedric Chan et.al. 2511.02104 null
2025-10-29 Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph Fali Wang et.al. 2511.00086 null
2025-10-31 Reconstructing Unseen Sentences from Speech-related Biosignals for Open-vocabulary Neural Communication Deok-Seon Kim et.al. 2510.27247 null
2025-10-30 Two-Timescale Optimization Framework for IAB-Enabled Heterogeneous UAV Networks Jikang Deng et.al. 2510.26578 null
2025-10-30 SP-MCQA: Evaluating Intelligibility of TTS Beyond the Word Level Hitomi Jin Ling Tee et.al. 2510.26190 null
2025-10-30 Reasoning Path Divergence: A New Metric and Curation Strategy to Unlock LLM Diverse Thinking Feng Ju et.al. 2510.26122 null
2025-10-30 Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks Davide Romano et.al. 2510.25623 null
2025-10-27 SFMS-ALR: Script-First Multilingual Speech Synthesis with Adaptive Locale Resolution Dharma Teja Donepudi et.al. 2510.25178 null
2025-10-28 Can Aha Moments Be Fake? Identifying True and Decorative Thinking Steps in Chain-of-Thought Jiachen Zhao et.al. 2510.24941 null
2025-10-28 Bayesian Speech synthesizers Can Learn from Multiple Teachers Ziyang Zhang et.al. 2510.24372 null
2025-10-28 SoulX-Podcast: Towards Realistic Long-form Podcasts with Dialectal and Paralinguistic Diversity Hanke Xie et.al. 2510.23541 null
2025-10-28 BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents Litu Ou et.al. 2510.23458 null
2025-10-26 UltraVoice: Scaling Fine-Grained Style-Controlled Speech Conversations for Spoken Dialogue Models Wenming Tu et.al. 2510.22588 null
2025-10-25 T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models Jindong Yang et.al. 2510.22366 null
2025-10-23 GuitarFlow: Realistic Electric Guitar Synthesis From Tablatures via Flow Matching and Style Transfer Jackson Loth et.al. 2510.21872 null
2025-10-24 StylePitcher: Generating Style-Following and Expressive Pitch Curves for Versatile Singing Tasks Jingyue Huang et.al. 2510.21685 null
2025-10-24 SHAP Meets Tensor Networks: Provably Tractable Explanations with Parallelism Reda Marzouk et.al. 2510.21599 null
2025-10-23 Vox-Evaluator: Enhancing Stability and Fidelity for Zero-shot TTS with A Multi-Level Evaluator Hualei Wang et.al. 2510.20210 null
2025-10-22 EchoFake: A Replay-Aware Dataset for Practical Speech Deepfake Detection Tong Zhang et.al. 2510.19414 null
2025-10-21 ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive Text-to-Speech Generation Haowei Lou et.al. 2510.18308 null
2025-10-19 U-Codec: Ultra Low Frame-rate Neural Speech Codec for Fast High-fidelity Speech Generation Xusheng Yang et.al. 2510.16718 null
2025-10-18 TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model Bin Yu et.al. 2510.16449 null
2025-10-22 VoiceMorph: How AI Voice Morphing Reveals the Boundaries of Auditory Self-Recognition Kye Shimizu et.al. 2510.16192 null
2025-10-15 Optimal Aggregation of LLM and PRM Signals for Efficient Test-Time Scaling Peng Kuang et.al. 2510.13918 null
2025-10-15 Generative Universal Verifier as Multimodal Meta-Reasoner Xinchen Zhang et.al. 2510.13804 null
2025-10-15 Closing the Gap Between Text and Speech Understanding in LLMs Santiago Cuervo et.al. 2510.13632 null
2025-10-15 Mismatch Aware Guidance for Robust Emotion Control in Auto-Regressive TTS Models Yizhou Peng et.al. 2510.13293 null
2025-10-15 StressTransfer: Stress-Aware Speech-to-Speech Translation with Emphasis Preservation Xi Chen et.al. 2510.13194 null
2025-10-23 Continuous-Token Diffusion for Speaker-Referenced TTS in Multimodal LLMs Xinlu He et.al. 2510.12995 null
2025-10-15 DiSTAR: Diffusion over a Scalable Token Autoregressive Representation for Speech Generation Yakun Song et.al. 2510.12210 null
2025-10-13 BridgeCode: A Dual Speech Representation Paradigm for Autoregressive Zero-Shot Text-to-Speech Synthesis Jingyuan Xing et.al. 2510.11646 null
2025-10-13 Perturbation Self-Supervised Representations for Cross-Lingual Emotion TTS: Stage-Wise Modeling of Emotion and Speaker Cheng Gong et.al. 2510.11124 null
2025-10-14 ParsVoice: A Large-Scale Multi-Speaker Persian Speech Corpus for Text-to-Speech Synthesis Mohammad Javad Ranjbar Kalahroodi et.al. 2510.10774 null
2025-10-17 MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations Wenxiang Guo et.al. 2510.10396 null
2025-10-11 Unifying Tree Search Algorithm and Reward Design for LLM Reasoning: A Survey Jiaqi Wei et.al. 2510.09988 null
2025-10-10 O_O-VC: Synthetic Data-Driven One-to-One Alignment for Any-to-Any Voice Conversion Huu Tuong Tu et.al. 2510.09061 null
2025-10-10 DiTSinger: Scaling Singing Voice Synthesis with Diffusion Transformer and Implicit Alignment Zongcai Du et.al. 2510.09016 null
2025-10-04 Less Diverse, Less Safe: The Indirect But Pervasive Risk of Test-Time Scaling in Large Language Models Shahriar Kabir Nahin et.al. 2510.08592 null
2025-10-09 DialoSpeech: Dual-Speaker Dialogue Generation with LLM and Flow Matching Hanke Xie et.al. 2510.08373 null
2025-10-09 IntMeanFlow: Few-step Speech Generation with Integral Velocity Distillation Wei Wang et.al. 2510.07979 null
2025-11-05 VoiceAgentBench: Are Voice Assistants ready for agentic tasks? Dhruv Jain et.al. 2510.07978 null
2025-10-09 Parallel Test-Time Scaling for Latent Reasoning Models Runyang You et.al. 2510.07745 null
2025-10-08 AsyncSpade: Efficient Test-Time Scaling with Asynchronous Sparse Decoding Shuqing Luo et.al. 2510.07486 null
2025-10-08 Making Machines Sound Sarcastic: LLM-Enhanced and Retrieval-Guided Sarcastic Speech Synthesis Zhu Li et.al. 2510.07096 null
2025-10-08 Towards Responsible Evaluation for Text-to-Speech Yifan Yang et.al. 2510.06927 null
2025-10-08 XLSR-Kanformer: A KAN-Intergrated model for Synthetic Speech Detection Phuong Tuan Dat et.al. 2510.06706 null
2025-10-07 Test-Time Scaling of Reasoning Models for Machine Translation Zihao Li et.al. 2510.06471 null
2025-10-07 TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning Jiaru Zou et.al. 2510.06217 null
2025-10-07 Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification Weihao Zeng et.al. 2510.06135 null
2025-10-07 ECTSpeech: Enhancing Efficient Speech Synthesis via Easy Consistency Tuning Tao Zhu et.al. 2510.05984 null
2025-10-07 Data-efficient Targeted Token-level Preference Optimization for LLM-based Text-to-Speech Rikuto Kotoge et.al. 2510.05799 null
2025-10-07 EMORL-TTS: Reinforcement Learning for Fine-Grained Emotion Control in LLM-based TTS Haoxun Li et.al. 2510.05758 null
2025-10-07 Sparse deepfake detection promotes better disentanglement Antoine Teissier et.al. 2510.05696 null
2025-10-09 Paper2Video: Automatic Video Generation from Scientific Papers Zeyu Zhu et.al. 2510.05096 null
2025-10-28 Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models Yolo Yunlong Tang et.al. 2510.05034 null
2025-10-06 Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba Baher Mohammad et.al. 2510.04738 null
2025-10-07 Synthetic Audio Forensics Evaluation (SAFE) Challenge Kirill Trapeznikov et.al. 2510.03387 null
2025-10-03 Evaluation of preprocessing pipelines in the creation of in-the-wild TTS datasets Matías Di Bernardo et.al. 2510.03111 null
2025-10-03 Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech Hieu-Nghia Huynh-Nguyen et.al. 2510.02848 null
2025-10-02 On the Role of Temperature Sampling in Test-Time Scaling Yuheng Wu et.al. 2510.02611 null
2025-10-02 Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement Jianing Yang et.al. 2510.01722 null
2025-09-30 BatonVoice: An Operationalist Framework for Enhancing Controllable Speech Synthesis with Linguistic Intelligence from LLMs Yue Wang et.al. 2509.26514 null
2025-09-30 Go with Your Gut: Scaling Confidence for Autoregressive Image Generation Harold Haodong Chen et.al. 2509.26376 null
2025-09-30 HiStyle: Hierarchical Style Embedding Predictor for Text-Prompt-Guided Controllable Speech Synthesis Ziyu Zhang et.al. 2509.25842 null
2025-09-29 Emotion-Aligned Generation in Diffusion Text to Speech Models via Preference-Guided Optimization Jiacheng Shi et.al. 2509.25416 null
2025-09-29 Incentive-Aligned Multi-Source LLM Summaries Yanchen Jiang et.al. 2509.25184 null
2025-09-29 MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech Chengyao Wang et.al. 2509.25131 null
2025-09-29 LatentEvolve: Self-Evolving Test-Time Scaling in Latent Space Guibin Zhang et.al. 2509.24771 null
2025-09-29 VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning Yixuan Zhou et.al. 2509.24650 null
2025-09-29 Word-Level Emotional Expression Control in Zero-Shot Text-to-Speech Synthesis Tianrui Wang et.al. 2509.24629 null
2025-09-29 ContextPRM: Leveraging Contextual Coherence for multi-domain Test-Time Scaling Haotian Zhang et.al. 2509.24460 null
2025-09-29 UniFlow-Audio: Unified Flow Matching for Audio Generation from Omni-Modalities Xuenan Xu et.al. 2509.24391 null
2025-09-28 Generalizable Speech Deepfake Detection via Information Bottleneck Enhanced Adversarial Alignment Pu Huang et.al. 2509.23618 null
2025-10-07 Training Vision-Language Process Reward Models for Test-Time Scaling in Multimodal Reasoning: Key Insights and Lessons Learned Brandon Ong et.al. 2509.23250 null
2025-09-27 BFA: Real-time Multilingual Text-to-speech Forced Alignment Abdul Rehman et.al. 2509.23147 null
2025-09-25 DiaMoE-TTS: A Unified IPA-Based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation Ziqi Chen et.al. 2509.22727 null
2025-09-24 PerformSinger: Multimodal Singing Voice Synthesis Leveraging Synchronized Lip Cues from Singing Performance Videos Ke Gu et.al. 2509.22718 null
2025-09-26 Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time Yixuan Han et.al. 2509.22572 null
2025-09-26 Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis Zhikang Niu et.al. 2509.22167 null
2025-09-26 Speaker Anonymisation for Speech-based Suicide Risk Detection Ziyun Cui et.al. 2509.22148 null
2025-09-26 Think Right, Not More: Test-Time Scaling for Numerical Claim Verification Primakov Chungkham et.al. 2509.22101 null
2025-09-26 Redefining Machine Simultaneous Interpretation: From Incremental Translation to Human-Like Strategies Qianen Zhang et.al. 2509.21801 null
2025-09-26 SPADE: Structured Pruning and Adaptive Distillation for Efficient LLM-TTS Tan Dat Nguyen et.al. 2509.20802 null
2025-09-24 Reconstruction-Based Adaptive Scheduling Using AI Inferences in Safety-Critical Systems Samer Alshaer et.al. 2509.20513 null
2025-09-24 Objective Evaluation of Prosody and Intelligibility in Speech Synthesis via Conditional Prediction of Discrete Tokens Ismail Rasim Ulgen et.al. 2509.20485 null
2025-09-20 Beyond Global Emotion: Fine-Grained Emotional Speech Synthesis with Dynamic Word-Level Modulation Sirui Wang et.al. 2509.20378 null
2025-09-25 Measuring Prosody Diversity in Zero-Shot TTS: A New Metric, Benchmark, and Exploration Yifan Yang et.al. 2509.19928 null
2025-09-24 CoMelSinger: Discrete Token-Based Zero-Shot Singing Synthesis With Structured Melody Control and Guidance Junchuan Zhao et.al. 2509.19883 null
2025-09-24 Eliminating stability hallucinations in llm-based tts models via attention guidance ShiMing Wang et.al. 2509.19852 null
2025-09-24 Efficient Speech Watermarking for Speech Synthesis via Progressive Knowledge Distillation Yang Cui et.al. 2509.19812 null
2025-09-24 PART: Progressive Alignment Representation Training for Multilingual Speech-To-Text with LLMs Pei Zhang et.al. 2509.19745 null
2025-09-24 Selective Classifier-free Guidance for Zero-shot Text-to-speech John Zheng et.al. 2509.19668 null
2025-09-23 Are We Scaling the Right Thing? A System Perspective on Test-Time Scaling Youpeng Zhao et.al. 2509.19645 null
2025-09-23 Finding My Voice: Generative Reconstruction of Disordered Speech for Automated Clinical Evaluation Karen Rosero et.al. 2509.19231 null
2025-09-23 Investigating Test-Time Scaling with Reranking for Machine Translation Shaomu Tan et.al. 2509.19020 null
2025-09-23 No Verifiable Reward for Prosody: Toward Preference-Guided Prosody Learning in TTS Seungyoun Shin et.al. 2509.18531 null
2025-09-22 Discrete-time diffusion-like models for speech synthesis Xiaozhou Tan et.al. 2509.18470 null
2025-09-22 TMD-TTS: A Unified Tibetan Multi-Dialect Text-to-Speech Synthesis for Ü-Tsang, Amdo and Kham Speech Dataset Generation Yutong Liu et.al. 2509.18060 null
2025-09-22 Variation in Verification: Understanding Verification Dynamics in Large Language Models Yefan Zhou et.al. 2509.17995 null
2025-09-22 Nord-Parl-TTS: Finnish and Swedish TTS Dataset from Parliament Speech Zirui Li et.al. 2509.17988 null
2025-09-23 Mitigating Strategy-Selection Bias in Reasoning for More Effective Test-Time Scaling Zongqian Wu et.al. 2509.17905 null
2025-09-22 Audiobook-CC: Controllable Long-context Speech Generation for Multicast Audiobook Min Liu et.al. 2509.17516 null
2025-09-21 Bridging the gap between training and inference in LM-based TTS models Ruonan Zhang et.al. 2509.17021 null
2025-09-21 MBCodec: Thorough disentangle for high-fidelity audio compression Ruonan Zhang et.al. 2509.17006 null
2025-09-19 Fed-PISA: Federated Voice Cloning via Personalized Identity-Style Adaptation Qi Wang et.al. 2509.16010 null
2025-09-19 VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency Nikita Torgashov et.al. 2509.15969 null
2025-09-19 Deep Dubbing: End-to-End Auto-Audiobook System with Text-to-Timbre and Context-Aware Instruct-TTS Ziqi Dai et.al. 2509.15845 null
2025-09-19 Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech Xinlei Niu et.al. 2509.15492 null
2025-09-18 Real-Time Streaming Mel Vocoding with Generative Flow Matching Simon Welker et.al. 2509.15085 null
2025-09-19 DAIEN-TTS: Disentangled Audio Infilling for Environment-Aware Text-to-Speech Synthesis Ye-Xin Lu et.al. 2509.14684 null
2025-09-23 Cross-Lingual F5-TTS: Towards Language-Agnostic Voice Cloning and Speech Synthesis Qingyu Liu et.al. 2509.14579 null
2025-10-01 SpeechWeave: Diverse Multilingual Synthetic Text & Audio Data Generation Pipeline for Training Text to Speech Models Karan Dua et.al. 2509.14270 null
2025-09-17 Slim-SC: Thought Pruning for Efficient Scaling with Self-Consistency Colin Hong et.al. 2509.13990 null
2025-09-22 Do You Hear What I Mean? Quantifying the Instruction-Perception Gap in Instruction-Guided Expressive Text-To-Speech Systems Yi-Cheng Lin et.al. 2509.13989 null
2025-09-16 MSR-Codec: A Low-Bitrate Multi-Stream Residual Codec for High-Fidelity Speech Generation with Information Disentanglement Jingyu Li et.al. 2509.13068 null
2025-09-21 LTA-thinker: Latent Thought-Augmented Training Framework for Large Language Models on Complex Reasoning Jiaqi Wang et.al. 2509.12875 null
2025-09-16 Towards personalized, precise and survey-free environment recognition: AI-enhanced sensor fusion without pre-deployment Ruichen Wang et.al. 2509.12870 null
2025-09-16 A Lightweight Pipeline for Noisy Speech Voice Cloning and Accurate Lip Sync Synthesis Javeria Amir et.al. 2509.12831 null
2025-09-21 Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization Jiahao Yu et.al. 2509.12434 null
2025-09-15 Preservation of Language Understanding Capabilities in Speech-aware Large Language Models Marek Kubis et.al. 2509.12171 null
2025-09-29 FuseCodec: Semantic-Contextual Fusion and Supervision for Neural Codecs Md Mubtasim Ahasan et.al. 2509.11425 null
2025-09-14 Length-Aware Rotary Position Embedding for Text-Speech Alignment Hyeongju Kim et.al. 2509.11084 null
2025-09-12 Towards Data Drift Monitoring for Speech Deepfake Detection in the context of MLOps Xin Wang et.al. 2509.10086 null
2025-09-11 DiTReducio: A Training-Free Acceleration for DiT-Based TTS via Progressive Calibration Yanru Huo et.al. 2509.09748 null
2025-09-12 DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-To-Speech Ngoc-Son Nguyen et.al. 2509.09631 null
2025-09-11 HISPASpoof: A New Dataset For Spanish Speech Forensics Maria Risques et.al. 2509.09155 null
2025-09-10 Accelerating Diffusion Transformer-Based Text-to-Speech with Transformer Layer Caching Siratish Sakpiboonchit et.al. 2509.08696 null
2025-09-14 Progressive Facial Granularity Aggregation with Bilateral Attribute-based Enhancement for Face-to-Speech Synthesis Yejin Jeon et.al. 2509.07376 null
2025-09-09 When Fine-Tuning is Not Enough: Lessons from HSAD on Hybrid and Adversarial Audio Spoof Detection Bin Hu et.al. 2509.07323 null
2025-09-08 Controllable Singing Voice Synthesis using Phoneme-Level Energy Sequence Yerin Ryu et.al. 2509.07038 null
2025-09-07 Multimodal Fine-grained Context Interaction Graph Modeling for Conversational Speech Synthesis Zhenqi Jia et.al. 2509.06074 null
2025-09-06 LatinX: Aligning a Multilingual TTS Model with Direct Preference Optimization Luis Felipe Chary et.al. 2509.05863 null
2025-09-08 Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework Jie Chen et.al. 2509.05007 null
2025-09-04 Say More with Less: Variable-Frame-Rate Speech Tokenization via Adaptive Clustering and Implicit Duration Coding Rui-Chen Zheng et.al. 2509.04685 null
2025-09-04 DarkStream: real-time speech anonymization with low latency Waris Quamer et.al. 2509.04667 null
2025-09-04 AUDETER: A Large-scale Dataset for Deepfake Audio Detection in Open Worlds Qizhou Wang et.al. 2509.04345 null
2025-09-04 Open-Source Full-Duplex Conversational Datasets for Natural and Interactive Speech Synthesis Zhitong Zhou et.al. 2509.04093 null
2025-09-04 LibriQuote: A Speech Dataset of Fictional Character Utterances for Expressive Zero-Shot Speech Synthesis Gaspard Michel et.al. 2509.04072 null
2025-09-16 SwinSRGAN: Swin Transformer-based Generative Adversarial Network for High-Fidelity Speech Super-Resolution Jiajun Yuan et.al. 2509.03913 null
2025-09-03 Multi-level SSL Feature Gating for Audio Deepfake Detection Hoan My Tran et.al. 2509.03409 null
2025-09-03 Improving Perceptual Audio Aesthetic Assessment via Triplet Loss and Self-Supervised Embeddings Dyah A. M. G. Wisnu et.al. 2509.03292 null
2025-09-03 AIVA: An AI-based Virtual Companion for Emotion-aware Interaction Chenxi Li et.al. 2509.03212 null
2025-09-02 Scale, Don’t Fine-tune: Guiding Multimodal LLMs for Efficient Visual Place Recognition at Test-Time Jintao Cheng et.al. 2509.02129 null
2025-09-04 FireRedTTS-2: Towards Long Conversational Speech Generation for Podcast and Chatbot Kun Xie et.al. 2509.02020 null
2025-09-03 MixedG2P-T5: G2P-free Speech Synthesis for Mixed-script texts using Speech Self-Supervised Learning and Language Model Joonyong Park et.al. 2509.01391 null
2025-08-31 MPO: Multidimensional Preference Optimization for Language Model-based Text-to-Speech Kangxiang Xia et.al. 2509.00685 null
2025-08-31 Speaker-Conditioned Phrase Break Prediction for Text-to-Speech with Phoneme-Level Pre-trained Language Model Dong Yang et.al. 2509.00675 null
2025-08-29 Democratizing Agentic AI with Fast Test-Time Scaling on the Edge Hao Mark Chen et.al. 2509.00195 null
2025-08-27 Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs Qibin Wang et.al. 2509.00084 null
2025-08-28 Multilingual Dataset Integration Strategies for Robust Audio Deepfake Detection: A SAFE Challenge System Hashim Ali et.al. 2508.20983 null
2025-08-26 Predicting the optimal noise strength for solving optimization problems with analog Ising machines Leen Mys et.al. 2508.19107 null
2025-08-26 CLEAR: Continuous Latent Autoregressive Modeling for High-quality and Low-latency Speech Synthesis Chun Yat Wu et.al. 2508.19098 null
2025-08-25 SwiftF0: Fast and Accurate Monophonic Pitch Detection Lars Nieradzik et.al. 2508.18440 null
2025-08-25 Unseen Speaker and Language Adaptation for Lightweight Text-To-Speech with Adapters Alessio Falai et.al. 2508.18006 null
2025-08-27 Vocoder-Projected Feature Discriminator Takuhiro Kaneko et.al. 2508.17874 null
2025-08-25 ClearMask: Noise-Free and Naturalness-Preserving Protection Against Voice Deepfake Attacks Yuanda Wang et.al. 2508.17660 null
2025-08-24 Improving French Synthetic Speech Quality via SSML Prosody Control Nassima Ould Ouali et.al. 2508.17494 null
2025-08-23 WildSpoof Challenge Evaluation Plan Yihan Wu et.al. 2508.16858 null
2025-09-09 Trust but Verify! A Survey on Verification Design for Test-time Scaling V Venktesh et.al. 2508.16665 null
2025-09-05 Mitigating Hallucinations in LM-Based TTS Models via Distribution Alignment Using GFlowNets Chenlin Liu et.al. 2508.15442 null
2025-08-25 Linear Preference Optimization: Decoupled Gradient Control via Absolute Regularization Rui Wang et.al. 2508.14947 null
2025-08-20 Long-Context Speech Synthesis with Context-Aware Memory Zhipeng Li et.al. 2508.14713 null
2025-08-20 Improving Resource-Efficient Speech Enhancement via Neural Differentiable DSP Vocoder Refinement Heitor R. Guimarães et.al. 2508.14709 null
2025-08-22 Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS Can Jin et.al. 2508.14313 null
2025-08-19 Who Gets the Mic? Investigating Gender Bias in the Speaker Assignment of a Speech-LLM Dariia Puhach et.al. 2508.13603 null
2025-08-18 Integrating Feedback Loss from Bi-modal Sarcasm Detector for Sarcastic Speech Synthesis Zhu Li et.al. 2508.13028 null
2025-08-18 Cooperative Sensing-Assisted Predictive Beam Tracking for MIMO-OFDM Networked ISAC Systems Xiaoyu Yang et.al. 2508.12723 null
2025-08-18 Real-Time Sign Language Gestures to Speech Transcription using Deep Learning Brandone Fonya et.al. 2508.12713 null
2025-08-19 FNH-TTS: A Fast, Natural, and Human-Like Speech Synthesis System with advanced prosodic modeling based on Mixture of Experts Qingliang Meng et.al. 2508.12001 null
2025-08-15 MoE-TTS: Enhancing Out-of-Domain Text Understanding for Description-based TTS via Mixture-of-Experts Heyang Xue et.al. 2508.11326 null
2025-10-07 EmoSSLSphere: Multilingual Emotional Speech Synthesis with Spherical Vectors and Discrete Speech Tokens Joonyong Park et.al. 2508.11273 null
2025-08-14 Facilitating Personalized TTS for Dysarthric Speakers Using Knowledge Anchoring and Curriculum Learning Yejin Jeon et.al. 2508.10412 null
2025-08-14 Towards Frame-level Quality Predictions of Synthetic Speech Michael Kuhlmann et.al. 2508.10374 null
2025-08-15 Training-Free Multimodal Large Language Model Orchestration Tianyu Xie et.al. 2508.10016 null
2025-09-16 UtterTune: LoRA-Based Target-Language Pronunciation Edit and Control in Multilingual Text-to-Speech Shuhei Kato et.al. 2508.09767 null
2025-08-12 ProMode: A Speech Prosody Model Conditioned on Acoustic and Textual Inputs Eray Eren et.al. 2508.09389 null
2025-08-12 Fake-Mamba: Real-Time Speech Deepfake Detection Using Bidirectional Mamba as Self-Attention’s Alternative Xi Xuan et.al. 2508.09294 null
2025-08-12 HumanOLAT: A Large-Scale Dataset for Full-Body Human Relighting and Novel-View Synthesis Timo Teufel et.al. 2508.09137 null
2025-08-12 QAMRO: Quality-aware Adaptive Margin Ranking Optimization for Human-aligned Assessment of Audio Generation Systems Chien-Chun Wang et.al. 2508.08957 null
2025-08-10 Scalable Controllable Accented TTS Henry Li Xinyuan et.al. 2508.07426 null
2025-08-10 KLASSify to Verify: Audio-Visual Deepfake Detection Using SSL-based Audio and Handcrafted Visual Features Ivan Kukanov et.al. 2508.07337 null
2025-08-12 XEmoRAG: Cross-Lingual Emotion Transfer with Controllable Intensity Using Retrieval-Augmented Generation Tianlun Zuo et.al. 2508.07302 null
2025-08-09 Maestro-EVC: Controllable Emotional Voice Conversion Guided by References and Explicit Prosody Jinsung Yoon et.al. 2508.06890 null
2025-08-09 Text to Speech System for Meitei Mayek Script Gangular Singh Irengbam et.al. 2508.06870 null
2025-08-08 Llasa+: Free Lunch for Accelerated and Streaming Llama-Based Speech Synthesis Wenjie Tian et.al. 2508.06262 null
2025-08-08 NEP: Autoregressive Image Editing via Next Editing Token Prediction Huimin Wu et.al. 2508.06044 null
2025-08-07 A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understanding Runchuan Ye et.al. 2508.05385 null
2025-08-15 Fairness in Dysarthric Speech Synthesis: Understanding Intrinsic Bias in Dysarthric Speech Cloning using F5-TTS M Anuprabha et.al. 2508.05102 null
2025-08-07 UniTalker: Conversational Speech-Visual Synthesis Yifan Hu et.al. 2508.04585 null
2025-08-06 The State Of TTS: A Case Study with Human Fooling Rates Praveen Srinivasa Varadhan et.al. 2508.04179 null
2025-08-29 Parallel GPT: Harmonizing the Independence and Interdependence of Acoustic and Semantic Information for Zero-Shot Text-to-Speech Jingyuan Xing et.al. 2508.04141 null
2025-07-04 Analyzing and Improving Speaker Similarity Assessment for Speech Synthesis Marc-André Carbonneau et.al. 2507.02176 null
2025-07-08 Pronunciation Editing for Finnish Speech using Phonetic Posteriorgrams Zirui Li et.al. 2507.02115 null
2025-07-03 Multi-interaction TTS toward professional recording reproduction Hiroki Kanagawa et.al. 2507.00808 null
2025-05-27 Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis Minsu Kim et.al. 2505.18972 null
2025-05-13 Lightweight End-to-end Text-to-speech Synthesis for low resource on-device applications Biel Tura Vecino et.al. 2505.07701 null
2025-01-16 Speech Synthesis along Perceptual Voice Quality Dimensions Frederik Rautenberg et.al. 2501.08791 null
2025-06-03 Low-Resource Text-to-Speech Synthesis Using Noise-Augmented Training of ForwardTacotron Kishor Kayyar Lakshminarayana et.al. 2501.05976 null
2024-12-31 Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting Wooseok Han et.al. 2412.20155 null
2024-11-12 Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis Shijia Liao et.al. 2411.01156 null
2024-11-01 Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis Théodor Lemerle et.al. 2410.23320 null
2024-10-29 Mitigating Unauthorized Speech Synthesis for Voice Protection Zhisheng Zhang et.al. 2410.20742 null
2025-01-13 MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion Sho Inoue et.al. 2409.09352 null
2024-09-10 AS-Speech: Adaptive Style For Speech Synthesis Zhipeng Li et.al. 2409.05730 null
2024-07-02 FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis Yinlin Guo et.al. 2407.00753 null
2024-06-13 Text-aware and Context-aware Expressive Audiobook Speech Synthesis Dake Guo et.al. 2406.05672 null
2024-10-25 FlashSpeech: Efficient Zero-Shot Speech Synthesis Zhen Ye et.al. 2404.14700 null
2024-04-03 Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation Rohan Chaudhury et.al. 2404.01339 link
2024-04-02 CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models Xiang Li et.al. 2404.00569 null
2024-03-21 Building speech corpus with diverse voice characteristics for its prompt-based representation Aya Watanabe et.al. 2403.13353 null
2024-03-19 EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech Ziqi Liang et.al. 2403.08164 null
2024-02-05 Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations Panos Kakoulidis et.al. 2402.01520 null
2024-02-19 Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis Vinotha R et.al. 2401.11771 null
2024-08-28 ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations Cheng Gong et.al. 2312.14398 null
2024-02-01 MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis Wenhao Guan et.al. 2312.10687 null
2023-11-28 HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis Sang-Hoon Lee et.al. 2311.12454 null
2023-12-19 High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models Chunyu Qiang et.al. 2309.15512 null
2024-10-28 Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition Ziyang Ma et.al. 2309.10294 null
2023-08-01 MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis Shun Lei et.al. 2307.16012 null
2023-07-17 Controllable Emphasis with zero data for text-to-speech Arnaud Joly et.al. 2307.07062 null
2023-07-12 On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis Siyang Wang et.al. 2307.05132 null
2024-01-26 Disentanglement in a GAN for Unconditional Speech Synthesis Matthew Baas et.al. 2307.01673 null
2023-06-29 UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data Heeseung Kim et.al. 2306.16083 null
2023-06-22 Visual-Aware Text-to-Speech Mohan Zhou et.al. 2306.12020 null
2023-06-21 CML-TTS A Multilingual Dataset for Speech Synthesis in Low-Resource Languages Frederico S. Oliveira et.al. 2306.10097 null
2023-06-02 EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis Haobin Tang et.al. 2306.00648 null
2023-05-23 MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting Neil Shah et.al. 2305.11926 null
2023-10-31 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model Zhen Ye et.al. 2305.06908 null
2023-12-19 Zero-shot text-to-speech synthesis conditioned using self-supervised speech representation model Kenichi Fujita et.al. 2304.11976 null
2023-05-31 NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers Kai Shen et.al. 2304.09116 null
2023-12-19 ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations Neil Shah et.al. 2303.01261 null
2023-02-20 Lip-to-Speech Synthesis in the Wild with Multi-task Learning Minsu Kim et.al. 2302.08841 null
2022-12-07 UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis Yi Lei et.al. 2212.01546 null
2022-11-30 Controllable speech synthesis by learning discrete phoneme-level prosodic representations Nikolaos Ellinas et.al. 2211.16307 null
2023-03-15 Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models Minki Kang et.al. 2211.09383 null
2024-10-01 Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder Jan Melechovsky et.al. 2211.03316 null
2022-10-03 Detection of Prosodic Boundaries in Speech Using Wav2Vec 2.0 Marie Kunešová et.al. 2209.15032 null
2022-05-25 TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS Xulong Zhang et.al. 2205.11824 null
2024-06-06 Parallel Synthesis for Autoregressive Speech Generation Po-chun Hsu et.al. 2204.11806 null
2023-02-07 The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance Lin Zhang et.al. 2204.05177 null
2022-03-30 Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise Tuomo Raitio et.al. 2203.10637 null
2022-01-27 J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis Shinnosuke Takamichi et.al. 2201.10896 null
2021-11-18 Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control Konstantinos Markopoulos et.al. 2111.09146 null
2022-08-01 Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech Sung-Feng Huang et.al. 2111.04040 null
2021-07-13 Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging Tamás Gábor Csapó et.al. 2107.05550 null
2021-07-08 VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis Hui Lu et.al. 2107.03298 null
2021-07-07 Location, Location: Enhancing the Evaluation of Text-to-Speech Synthesis Using the Rapid Prosody Transcription Paradigm Elijah Gutierrez et.al. 2107.02527 null
2021-07-06 Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input Tamás Gábor Csapó et.al. 2107.02003 null
2021-07-26 A Survey on Neural Speech Synthesis Xu Tan et.al. 2106.15561 null
2021-06-29 Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech Raahil Shah et.al. 2106.12896 null
2021-06-22 Non-native English lexicon creation for bilingual speech synthesis Arun Baby et.al. 2106.10870 null
2021-06-22 Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters Mohammed Salah Al-Radhi et.al. 2106.10481 null
2021-05-11 MASS: Multi-task Anthropomorphic Speech Synthesis Framework Jinyin Chen et.al. 2105.04124 null
2021-07-01 How do Voices from Past Speech Synthesis Challenges Compare Today? Erica Cooper et.al. 2105.02373 null
2022-02-25 Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis Erica Cooper et.al. 2104.12292 null
2021-04-06 Diff-TTS: A Denoising Diffusion Model for Text-to-Speech Myeonghun Jeong et.al. 2104.01409 null
2021-06-15 Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability Rui Liu et.al. 2104.01408 null
2021-03-09 AudioVisual Speech Synthesis: A brief literature review Efthymios Georgiou et.al. 2103.03927 null
2021-03-29 GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis Rui Liu et.al. 2010.12423 null
2020-10-19 Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion Shengkui Zhao et.al. 2010.08136 null
2021-01-07 Transfer Learning from Speech Synthesis to Voice Conversion with Non-Parallel Training Data Mingyang Zhang et.al. 2009.14399 null
2020-10-26 Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search Jaehyeon Kim et.al. 2005.11129 null
2020-05-22 Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario Zexin Cai et.al. 2005.10441 null
2020-02-18 Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech Vatsal Aggarwal et.al. 1911.12760 null
2019-09-26 Sequence to Sequence Neural Speech Synthesis with Prosody Modification Capabilities Slava Shechtman et.al. 1909.10302 null
2019-09-10 Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs Rob Clark et.al. 1909.03965 null
2019-08-28 Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis Xin Wang et.al. 1908.10256 null
2020-11-04 Using generative modelling to produce varied intonation for speech synthesis Zack Hodari et.al. 1906.04233 null
2019-09-24 Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with SampleRNN David Álvarez et.al. 1906.00733 null
2019-05-22 Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems Ohsung Kwon et.al. 1905.08486 null
2019-02-12 Generative Moment Matching Network-based Random Modulation Post-filter for DNN-based Singing Voice Synthesis and Neural Double-tracking Hiroki Tamaru et.al. 1902.03389 null
2018-08-21 Multimodal speech synthesis architecture for unsupervised speaker adaptation Hieu-Thi Luong et.al. 1808.06288 null
2019-01-04 Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis Ye Jia et.al. 1806.04558 null
2018-02-23 Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning Wei Ping et.al. 1710.07654 null
2017-09-26 Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks Yuki Saito et.al. 1709.08041 null
2017-09-25 Techniques and Challenges in Speech Synthesis David Ferris et.al. 1709.07552 null
2016-08-19 DNN-based Speech Synthesis for Indian Languages from ASCII text Srikanth Ronanki et.al. 1608.05374 null
2016-06-30 Penambahan emosi menggunakan metode manipulasi prosodi untuk sistem text to speech bahasa Indonesia Salita Ulitia Prini et.al. 1606.09222 null

🌐 Machine Translation

📊 712 papers

📅 Publish Date 📝 Title 👥 Authors 📄 PDF 💻 Code
2026-04-01 Translating With Feeling: Centering Translator Perspectives within Translation Technologies Daniel Chechelnitsky et.al. 2604.00758 null
2026-04-01 AfrIFact: Cultural Information Retrieval, Evidence Extraction and Fact Checking for African Languages Israel Abebe Azime et.al. 2604.00706 null
2026-03-11 Multi-lingual Multi-institutional Electronic Health Record based Predictive Model Kyunghoon Hur et.al. 2604.00027 null
2026-03-10 ASCAT: An Arabic Scientific Corpus and Benchmark for Advanced Translation Evaluation Serry Sibaee et.al. 2604.00015 null
2026-03-31 Rewrite the News: Tracing Editorial Reuse Across News Agencies Soveatin Kuntur et.al. 2603.29937 null
2026-03-31 Bringing Up a Bilingual BabyLM: Investigating Multilingual Language Acquisition Using Small-Scale Models Linda Zeng et.al. 2603.29552 null
2026-03-31 L-ReLF: A Framework for Lexical Dataset Creation Anass Sedrati et.al. 2603.29346 null
2026-03-31 Open Machine Translation for Esperanto Ona de Gibert et.al. 2603.29345 null
2026-03-31 Advancing LLM-based phoneme-to-grapheme for multilingual speech recognition Lukuang Dong et.al. 2603.29217 null
2026-03-30 On the limited utility of parallel data for learning shared multilingual representations Julius Leino et.al. 2603.29026 null
2026-03-30 Graphilosophy: Graph-Based Digital Humanities Computing with The Four Books Minh-Thu Do et.al. 2603.28755 null
2026-03-30 Top-down string-to-dependency Neural Machine Translation Shuhei Kondo et.al. 2603.27938 null
2026-03-29 Budget-Xfer: Budget-Constrained Source Language Selection for Cross-Lingual Transfer to African Languages Tewodros Kederalah Idris et.al. 2603.27651 null
2026-03-28 EuraGovExam: A Multilingual Multimodal Benchmark from Real-World Civil Service Exams JaeSeong Kim et.al. 2603.27223 null
2026-03-23 Do Multilingual VLMs Reason Equally? A Cross-Lingual Visual Reasoning Audit for Indian Languages Swastik R et.al. 2603.26742 null
2026-03-27 Toward Culturally Grounded Natural Language Processing Sina Bagheri Nezhad et.al. 2603.26013 null
2026-03-26 Translation Asymmetry in LLMs as a Data Augmentation Factor: A Case Study for 6 Romansh Language Varieties Jannis Vamvas et.al. 2603.25489 null
2026-03-26 Translation or Recitation? Calibrating Evaluation Scores for Machine Translation of Extremely Low-Resource Languages Danlu Chen et.al. 2603.25222 null
2026-03-26 Cross-Preference Learning for Sentence-Level and Context-Aware Machine Translation Ying Li et.al. 2603.25183 null
2026-03-26 Bilingual Text-to-Motion Generation: A New Benchmark and Baselines Wanjiang Weng et.al. 2603.25178 null
2026-03-26 Toward domain-specific machine translation and quality estimation systems Javad Pourmostafa Roshan Sharami et.al. 2603.24955 null
2026-03-29 POLY-SIM: Polyglot Speaker Identification with Missing Modality Grand Challenge 2026 Evaluation Plan Marta Moscati et.al. 2603.24569 null
2026-03-25 Samasāmayik: A Parallel Dataset for Hindi-Sanskrit Machine Translation N J Karthika et.al. 2603.24307 null
2026-03-25 MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition-Perception-Reasoning Guided Text-Image Machine Translation Gengluo Li et.al. 2603.23896 null
2026-03-07 Konkani LLM: Multi-Script Instruction Tuning and Evaluation for a Low-Resource Indian Language Reuben Chagas Fernandes et.al. 2603.23529 null
2026-03-24 From Synthetic to Native: Benchmarking Multilingual Intent Classification in Logistics Customer Service Haoyu He et.al. 2603.23172 null
2026-03-23 Rashid: A Cipher-Based Framework for Exploring In-Context Language Learning Niyati Bafna et.al. 2603.22497 null
2026-03-24 Adapting Self-Supervised Speech Representations for Cross-lingual Dysarthria Detection in Parkinson’s Disease Abner Hernandez et.al. 2603.22225 null
2026-03-23 Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation Ireh Kim et.al. 2603.22186 null
2026-03-23 DATASHI: A Parallel English-Tashlhiyt Corpus for Orthography Normalization and Low-Resource Language Processing Nasser-Eddine Monir et.al. 2603.21571 null
2026-03-22 Graph Fusion Across Languages using Large Language Models Kaung Myat Kyaw et.al. 2603.21248 null
2026-03-22 Left Behind: Cross-Lingual Transfer as a Bridge for Low-Resource Languages in Large Language Models Abdul-Salem Beibitkhan et.al. 2603.21036 null
2026-03-20 Span-Level Machine Translation Meta-Evaluation Stefano Perrella et.al. 2603.19921 null
2026-03-20 Neither Here Nor There: Cross-Lingual Representation Dynamics of Code-Mixed Text in Multilingual Encoders Debajyoti Mazumder et.al. 2603.19771 null
2026-03-19 Vocabulary shapes cross-lingual variation of word-order learnability in language models Jonas Mayer Martins et.al. 2603.19427 null
2026-02-26 HATL: Hierarchical Adaptive-Transfer Learning Framework for Sign Language Machine Translation Nada Shahin et.al. 2603.19260 null
2026-03-19 Why Better Cross-Lingual Alignment Fails for Better Cross-Lingual Transfer: Case of Encoders Yana Veitsman et.al. 2603.18863 null
2026-03-19 Cross-Lingual LLM-Judge Transfer via Evaluation Decomposition Ivaxi Sheth et.al. 2603.18557 null
2026-03-18 ConGA: Guidelines for Contextual Gender Annotation. A Framework for Annotating Gender in Machine Translation Argentina Anna Rescigno et.al. 2603.17962 null
2026-03-18 Gender Disambiguation in Machine Translation: Diagnostic Evaluation in Decoder-Only Architectures Chiara Manna et.al. 2603.17952 null
2026-03-18 ShapleyLaw: A Game-Theoretic Approach to Multilingual Scaling Laws Xuyang Cao et.al. 2603.17945 null
2026-03-18 Pretrained Multilingual Transformers Reveal Quantitative Distance Between Human Languages Yue Zhao et.al. 2603.17912 null
2026-03-19 Zipper-LoRA: Dynamic Parameter Decoupling for Speech-LLM based Multilingual Speech Recognition Yuxiang Mei et.al. 2603.17558 null
2026-03-31 Language on Demand, Knowledge at Core: Composing LLMs with Encoder-Decoder Translation Models for Extensible Multilinguality Mengyu Bu et.al. 2603.17512 null
2026-03-18 From Words to Worlds: Benchmarking Cross-Cultural Cultural Understanding in Machine Translation Bangju Han et.al. 2603.17303 null
2026-03-17 Knowledge Localization in Mixture-of-Experts LLMs Using Cross-Lingual Inconsistency Lucas Bandarkar et.al. 2603.17102 null
2026-03-17 Ensemble Self-Training for Unsupervised Machine Translation Ido Aharon et.al. 2603.17087 null
2026-03-17 Can Linguistically Related Languages Guide LLM Translation in Low-Resource Settings? Aishwarya Ramasethu et.al. 2603.16660 null
2026-03-18 Omnilingual SONAR: Cross-Lingual and Cross-Modal Sentence Embeddings Bridging Massively Multilingual Text and Speech Omnilingual SONAR Team et.al. 2603.16606 null
2026-03-17 Who Benchmarks the Benchmarks? A Case Study of LLM Evaluation in Icelandic Finnur Ágúst Ingimundarson et.al. 2603.16406 null
2026-03-18 Omnilingual MT: Machine Translation for 1,600 Languages Omnilingual MT Team et.al. 2603.16309 null
2026-03-16 Robust Language Identification for Romansh Varieties Charlotte Model et.al. 2603.15969 null
2026-03-16 Machine Translation in the Wild: User Reaction to Xiaohongshu’s Built-In Translation Feature Sui He et.al. 2603.15922 null
2026-03-16 Bidirectional Chinese and English Passive Sentences Dataset for Machine Translation Xinyue Ma et.al. 2603.15227 null
2026-03-16 Pretraining and Benchmarking Modern Encoders for Latvian Arturs Znotins et.al. 2603.15005 null
2026-03-29 ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation Yuzhe Shang et.al. 2603.14903 null
2026-03-16 Developing an English-Efik Corpus and Machine Translation System for Digitization Inclusion Offiong Bassey Edet et.al. 2603.14873 null
2026-03-16 Towards Privacy-Preserving Machine Translation at the Inference Stage: A New Task and Benchmark Wei Shao et.al. 2603.14756 null
2026-03-15 Multilingual TinyStories: A Synthetic Combinatorial Corpus of Indic Children’s Stories for Training Small Language Models Deepon Halder et.al. 2603.14563 null
2026-03-14 NepTam: A Nepali-Tamang Parallel Corpus and Baseline Machine Translation Experiments Rupak Raj Ghimire et.al. 2603.14053 null
2026-03-30 GhanaNLP Parallel Corpora: Comprehensive Multilingual Resources for Low-Resource Ghanaian Languages Lawrence Adu Gyamfi et.al. 2603.13793 null
2026-03-13 Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation Yifeng Liu et.al. 2603.13045 null
2026-03-16 Is Human Annotation Necessary? Iterative MBR Distillation for Error Span Detection in Machine Translation Boxuan Lyu et.al. 2603.12983 null
2026-03-13 HMS-BERT: Hybrid Multi-Task Self-Training for Multilingual and Multi-Label Cyberbullying Detection Zixin Feng et.al. 2603.12920 null
2026-03-13 Learning from Child-Directed Speech in Two-Language Scenarios: A French-English Case Study Liel Binyamin et.al. 2603.12906 null
2026-03-12 Translationese as a Rational Response to Translation Task Difficulty Maria Kunilovskaya et.al. 2603.12050 null
2026-03-12 Just Use XML: Revisiting Joint Translation and Label Projection Thennal D K et.al. 2603.12021 null
2026-03-12 Semi-Synthetic Parallel Data for Translation Quality Estimation: A Case Study of Dataset Building for an Under-Resourced Language Pair Assaf Siani et.al. 2603.11743 null
2026-03-12 Streaming Translation and Transcription Through Speech-to-Text Causal Alignment Roman Koshkin et.al. 2603.11578 null
2026-03-11 Evaluating Explainable AI Attribution Methods in Neural Machine Translation via Attention-Guided Knowledge Distillation Aria Nourbakhsh et.al. 2603.11342 null
2026-03-11 Large Language Models as Annotators for Machine Translation Quality Estimation Sidi Wang et.al. 2603.10775 null
2026-04-01 IMTBench: A Multi-Scenario Cross-Modal Collaborative Evaluation Benchmark for In-Image Machine Translation Jiahao Lyu et.al. 2603.10495 null
2026-03-11 Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck Hongbin Zhang et.al. 2603.10351 null
2026-02-15 Automated evaluation of LLMs for effective machine translation of Mandarin Chinese to English Yue Zhang et.al. 2603.09998 null
2026-03-10 Do What I Say: A Spoken Prompt Dataset for Instruction-Following Maike Züfle et.al. 2603.09881 null
2026-03-13 EPIC-EuroParl-UdS: Information-Theoretic Perspectives on Translation and Interpreting Maria Kunilovskaya et.al. 2603.09785 null
2026-03-11 AutoViVQA: A Large-Scale Automatically Constructed Dataset for Vietnamese Visual Question Answering Nguyen Anh Tuong et.al. 2603.09689 null
2026-03-10 LLM as a Meta-Judge: Synthetic Data for NLP Evaluation Metric Validation Lukáš Eigler et.al. 2603.09403 null
2026-03-10 ICDAR 2025 Competition on End-to-End Document Image Machine Translation Towards Complex Layouts Yaping Zhang et.al. 2603.09392 null
2026-03-10 Geometry-Aware Metric Learning for Cross-Lingual Few-Shot Sign Language Recognition on Static Hand Keypoints Chayanin Chamachot et.al. 2603.09213 null
2026-03-14 MultiGraSCCo: A Multilingual Anonymization Benchmark with Annotations of Personal Identifiers Ibrahim Baroud et.al. 2603.08879 null
2026-03-09 Using Multimodal and Language-Agnostic Sentence Embeddings for Abstractive Summarization Chaimae Chellaf et.al. 2603.08282 null
2026-03-09 Quantifying Cross-Lingual Transfer in Paralinguistic Speech Tasks Pol Buitrago et.al. 2603.08231 null
2026-03-09 Is continuous CoT better suited for multi-lingual reasoning? Ali Hamza Bashir et.al. 2603.08177 null
2026-03-09 Gender Bias in MT for a Genderless Language: New Benchmarks for Basque Amaia Murillo et.al. 2603.08153 null
2026-03-30 Nwāchā Munā: A Devanagari Speech Corpus and Proximal Transfer Benchmark for Nepal Bhasha ASR Rishikesh Kumar Sharma et.al. 2603.07554 null
2026-03-07 Domain-Specific Quality Estimation for Machine Translation in Low-Resource Scenarios Namrata Patil Gurav et.al. 2603.07372 null
2026-03-07 How Much Noise Can BERT Handle? Insights from Multilingual Sentence Difficulty Detection Nouran Khallaf et.al. 2603.07346 null
2026-03-10 Governance Architecture for Autonomous Agent Systems: Threats, Framework, and Engineering Practice Yuxu Ge et.al. 2603.07191 null
2026-03-06 LIT-RAGBench: Benchmarking Generator Capabilities of Large Language Models in Retrieval-Augmented Generation Koki Itai et.al. 2603.06198 null
2026-03-05 NeuronMoE: Neuron-Guided Mixture-of-Experts for Efficient Multilingual LLM Extension Rongzhi Li et.al. 2603.05046 null
2026-03-04 Hindsight Quality Prediction Experiments in Multi-Candidate Human-Post-Edited Machine Translation Malik Marmonier et.al. 2603.04083 null
2026-02-08 The Logovista English-Japanese Machine Translation System Barton D. Wright et.al. 2603.03311 null
2026-02-27 Universal Conceptual Structure in Neural Translation: Probing NLLB-200’s Multilingual Geometry Kyle Elliott Mathewson et.al. 2603.02258 null
2026-02-28 BLUFF: Benchmarking the Detection of False and Synthetic Content across 58 Low-Resource Languages Jason Lucas et.al. 2603.00634 null
2026-02-23 Distance Learning and Multilingual Education: A Case Study of Challenges and Pedagogical Perspectives in the Greek Border Region Ariadni Mandala et.al. 2603.00128 null
2026-02-27 Terminology Rarity Predicts Catastrophic Failure in LLM Translation of Low-Resource Ancient Languages: Evidence from Ancient Greek James L. Zainaldin et.al. 2602.24119 null
2026-03-04 Extending Czech Aspect-Based Sentiment Analysis with Opinion Terms: Dataset and LLM Benchmarks Jakub Šmíd et.al. 2602.22730 null
2026-02-26 Layer-Targeted Multilingual Knowledge Erasure in Large Language Models Taoran Li et.al. 2602.22562 null
2026-02-26 Multilingual Safety Alignment Via Sparse Weight Editing Jiaming Liang et.al. 2602.22554 null
2026-02-27 Bridging Latent Reasoning and Target-Language Generation via Retrieval-Transition Heads Shaswat Patel et.al. 2602.22453 null
2026-02-25 IndicIFEval: A Benchmark for Verifiable Instruction-Following Evaluation in 14 Indic Languages Thanmay Jayakumar et.al. 2602.22125 null
2026-02-25 TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition Cheng-Yeh Yang et.al. 2602.22039 null
2026-02-25 Global-Local Dual Perception for MLLMs in High-Resolution Text-Rich Image Translation Junxin Lu et.al. 2602.21956 null
2026-03-02 Mitigating Structural Noise in Low-Resource S2TT: An Optimized Cascaded Nepali-English Pipeline with Punctuation Restoration Tangsang Chongbang et.al. 2602.21647 null
2026-02-25 Enhancing Multilingual Embeddings via Multi-Way Parallel Text Alignment Barah Fazili et.al. 2602.21543 null
2026-02-24 Small Language Models for Privacy-Preserving Clinical Information Extraction in Low-Resource Languages Mohammadreza Ghaffarzadeh-Esfahani et.al. 2602.21374 null
2026-02-24 Naver Labs Europe @ WSDM CUP Multilingual Retrieval Thibault Formal et.al. 2602.20986
2026-02-23 Cross-lingual Matryoshka Representation Learning across Speech and Text Yaya Sy et.al. 2602.19991 null
2026-02-23 DEEP: Docker-based Execution and Evaluation Platform Sergio Gómez González et.al. 2602.19583 null
2026-03-16 TurkicNLP: An NLP Toolkit for Turkic Languages Sherzod Hakimov et.al. 2602.19174 null
2026-02-21 Why Agent Caching Fails and How to Fix It: Structured Intent Canonicalization with Few-Shot Learning Abhinaba Basu et.al. 2602.18922 null
2026-02-25 BURMESE-SAN: Burmese NLP Benchmark for Evaluating Large Language Models Thura Aung et.al. 2602.18788 null
2026-02-20 Tower of Babel in Cross-Cultural Communication: A Case Study of #Give Me a Chinese Name# Dialogues During the “TikTok Refugees” Event Jielin Feng et.al. 2602.18549 null
2026-02-05 Synthetic Media in Multilingual MOOCs: Deepfake Tutors, Pedagogical Effects, and Ethical-Policy Challenges Alexandros Gazis et.al. 2602.18457 null
2026-02-20 Learning Long-Range Dependencies with Temporal Predictive Coding Tom Potter et.al. 2602.18131 null
2026-02-19 What Language is This? Ask Your Tokenizer Clara Meister et.al. 2602.17655 null
2026-02-19 Evaluating Extremely Low-Resource Machine Translation: A Comparative Study of ChrF++ and BLEU Metrics Sanjeev Kumar et.al. 2602.17425 null
2026-02-19 WebFAQ 2.0: A Multilingual QA Dataset with Mined Hard Negatives for Dense Retrieval Michael Dinzinger et.al. 2602.17327 null
2026-02-19 Representation Collapse in Machine Translation Through the Lens of Angular Dispersion Evgeniia Tokarchuk et.al. 2602.17287 null
2026-02-19 Towards Cross-lingual Values Assessment: A Consensus-Pluralism Perspective Yukun Chen et.al. 2602.17283 null
2026-02-19 Evaluating Cross-Lingual Classification Approaches Enabling Topic Discovery for Multilingual Social Media Data Deepak Uniyal et.al. 2602.17051 null
2026-02-18 When Semantic Overlap Is Not Enough: Cross-Lingual Euphemism Transfer Between Turkish and English Hasan Can Biyik et.al. 2602.16957 null
2026-02-18 Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment Yuyan Bu et.al. 2602.16660 null
2026-02-18 Training Models on Dialects of Translationese Shows How Lexical Diversity and Source-Target Syntactic Similarity Shape Learning Jenny Kunz et.al. 2602.16469 null
2026-02-17 A Curious Class of Adpositional Multiword Expressions in Korean Junghyun Min et.al. 2602.16023 null
2026-01-22 KD4MT: A Survey of Knowledge Distillation for Machine Translation Ona de Gibert et.al. 2602.15845 null
2026-02-17 Operationalising the Superficial Alignment Hypothesis via Task Complexity Tomás Vergara-Browne et.al. 2602.15829 null
2026-02-17 LuxMT Technical Report Nils Rehlinger et.al. 2602.15506 null
2026-02-17 Bridging Day and Night: Target-Class Hallucination Suppression in Unpaired Image Translation Shuwei Li et.al. 2602.15383 null
2026-02-18 Indic-TunedLens: Interpreting Multilingual Models in Indian Languages Mihir Panchal et.al. 2602.15038 null
2026-02-16 Unlocking Reasoning Capability on Machine Translation in Large Language Models Sara Rajaee et.al. 2602.14763 null
2026-02-16 Crowdsourcing Piedmontese to Test LLMs on Non-Standard Orthography Gianluca Vico et.al. 2602.14675 null
2026-02-22 BETA-Labeling for Multilingual Dataset Construction in Low-Resource IR Md. Najib Hasan et.al. 2602.14488 null
2026-02-15 GRRM: Group Relative Reward Modeling for Machine Translation Sen Yang et.al. 2602.14028 null
2026-02-13 LLM-Powered Automatic Translation and Urgency in Crisis Scenarios Belu Ticona et.al. 2602.13452 null
2026-02-13 $\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models Yuang Cai et.al. 2602.12674 null
2026-02-25 Scaling Model and Data for Multilingual Machine Translation with Open Large Language Models Yuzhe Shang et.al. 2602.11961 null
2026-02-12 Cross-Modal Robustness Transfer (CMRT): Training Robust Speech Translation Models Using Adversarial Text Abderrahmane Issam et.al. 2602.11933 null
2026-02-11 Towards Reliable Machine Translation: Scaling LLMs for Critical Error Detection and Safety Muskaan Chopra et.al. 2602.11444 null
2026-02-09 SinFoS: A Parallel Dataset for Translating Sinhala Figures of Speech Johan Sofalas et.al. 2602.09866 null
2026-02-10 From FusHa to Folk: Exploring Cross-Lingual Transfer in Arabic Language Models Abdulmuizz Khalak et.al. 2602.09826 null
2026-02-10 Life Cycle-Aware Evaluation of Knowledge Distillation for Machine Translation: Environmental Impact and Translation Quality Trade-offs Joseph Attieh et.al. 2602.09691 null
2026-02-10 LEMUR: A Corpus for Robust Fine-Tuning of Multilingual Law Embedding Models for Retrieval Narges Baba Ahmadi et.al. 2602.09570 null
2026-02-10 AfriNLLB: Efficient Translation Models for African Languages Yasmin Moslem et.al. 2602.09373 null
2026-02-10 Unsupervised Cross-Lingual Part-of-Speech Tagging with Monolingual Corpora Only Jianyu Zheng et.al. 2602.09366 null
2026-02-10 Positive-Unlabelled Active Learning to Curate a Dataset for Orca Resident Interpretation Bret Nestor et.al. 2602.09295 null
2026-02-09 Generalizing Sports Feedback Generation by Watching Competitions and Reading Books: A Rock Climbing Case Study Arushi Rai et.al. 2602.08996 null
2026-02-09 Challenges in Translating Technical Lectures: Insights from the NPTEL Basudha Raje et.al. 2602.08698 null
2026-02-09 Do Multilingual LLMs have specialized language heads? Muhammad Naufil et.al. 2602.08625 null
2026-02-09 Beyond Scalar Scores: Reinforcement Learning for Error-Aware Quality Estimation of Machine Translation Archchana Sindhujan et.al. 2602.08600 null
2026-02-08 Lost in Translation? A Comparative Study on the Cross-Lingual Transfer of Composite Harms Vaibhav Shukla et.al. 2602.07963 null
2026-01-31 Vectra: A New Metric, Dataset, and Model for Visual Quality Assessment in E-Commerce In-Image Machine Translation Qingyu Wu et.al. 2602.07014 null
2026-02-06 MTQE.en-he: Machine Translation Quality Estimation for English-Hebrew Andy Rosenbaum et.al. 2602.06546 null
2026-02-05 Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training Junxiao Liu et.al. 2602.05940 null
2026-02-05 Polyglots or Multitudes? Multilingual LLM Answers to Value-laden Multiple-Choice Questions Léo Labat et.al. 2602.05932 null
2026-02-05 Consensus-Aligned Neuron Efficient Fine-Tuning Large Language Models for Multi-Domain Machine Translation Shuting Jiang et.al. 2602.05694 null
2026-02-05 BhashaSetu: Cross-Lingual Knowledge Transfer from High-Resource to Extreme Low-Resource Languages Subhadip Maji et.al. 2602.05599 null
2026-02-05 Cross-Lingual Empirical Evaluation of Large Language Models for Arabic Medical Tasks Chaimae Abouzahir et.al. 2602.05374 null
2026-02-04 Multilingual Extraction and Recognition of Implicit Discourse Relations in Speech and Text Ahmed Ruby et.al. 2602.05107 null
2026-02-04 Data Kernel Perspective Space Performance Guarantees for Synthetic Data from Transformer Models Michael Browder et.al. 2602.05106 null
2026-02-04 Beyond Many-Shot Translation: Scaling In-Context Demonstrations For Low-Resource Machine Translation Luis Frentzen Salim et.al. 2602.04764 null
2026-02-04 “Be My Cheese?”: Cultural Nuance Benchmarking for Machine Translation in Multilingual LLMs Madison Van Doren et.al. 2602.04729 null
2026-02-04 Disentangling meaning from language in LLM-based machine translation Théo Lasnier et.al. 2602.04613 null
2026-02-04 No One-Size-Fits-All: Building Systems For Translation to Bashkir, Kazakh, Kyrgyz, Tatar and Chuvash Using Synthetic And Original Data Dmitry Karpov et.al. 2602.04442 null
2026-02-14 Tokenization and Morphological Fidelity in Uralic NLP: A Cross-Lingual Evaluation Nuo Xu et.al. 2602.04241 null
2026-02-03 BIRDTurk: Adaptation of the BIRD Text-to-SQL Dataset to Turkish Burak Aktaş et.al. 2602.03633 null
2026-02-03 Assessing the Impact of Typological Features on Multilingual Machine Translation in the Age of Large Language Models Vitalii Hirak et.al. 2602.03551 null
2026-02-03 PEGRL: Improving Machine Translation by Post-Editing Guided Reinforcement Learning Yunzhi Shen et.al. 2602.03352 null
2026-02-03 Consensus Group Relative Policy Optimization for Text Generation Yuki Ichihara et.al. 2602.03102 null
2026-02-02 Controlled disagreement improves generalization in decentralized training Zesen Wang et.al. 2602.02899 null
2026-02-02 Large Language Models for Mental Health: A Multilingual Evaluation Nishat Raihan et.al. 2602.02440 null
2026-02-02 Cross-Lingual Stability of LLM Judges Under Controlled Generation: Evidence from Finno-Ugric Languages Isaac Chung et.al. 2602.02287 null
2026-02-02 BBPE16: UTF-16-based byte-level byte-pair encoding for improved multilingual speech recognition Hyunsik Kim et.al. 2602.01717 null
2026-02-02 SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia Panuthep Tasawong et.al. 2602.01618 null
2026-02-01 Who Transfers Safety? Identifying and Targeting Cross-Lingual Shared Safety Neurons Xianhui Zhang et.al. 2602.01283 null
2026-02-01 From Utterance to Vividity: Training Expressive Subtitle Translation LLM via Adaptive Local Preference Optimization Chaoqun Cui et.al. 2602.01068 null
2026-01-19 Extending Beacon to Hindi: Cultural Adaptation Drives Cross-Lingual Sycophancy Sarthak Sattigeri et.al. 2602.00046 null
2026-02-11 Bias Beyond Borders: Political Ideology Evaluation and Steering in Multilingual LLMs Afrozah Nadeem et.al. 2601.23001 null
2026-01-30 Benchmarking Machine Translation on Chinese Social Media Texts Kaiyan Zhao et.al. 2601.22931 null
2026-01-30 When Meanings Meet: Investigating the Emergence and Quality of Shared Concept Spaces during Multilingual Language Model Training Felicia Körner et.al. 2601.22851 null
2026-01-30 RASST: Fast Cross-modal Retrieval-Augmented Simultaneous Speech Translation Jiaxuan Luo et.al. 2601.22777 null
2026-01-29 TidyVoice 2026 Challenge Evaluation Plan Aref Farhadipour et.al. 2601.21960 null
2026-02-06 DimStance: Multilingual Datasets for Dimensional Stance Analysis Jonas Becker et.al. 2601.21483 null
2026-01-28 UrduBench: An Urdu Reasoning Benchmark using Contextually Ensembled Translations with Human-in-the-Loop Muhammad Ali Shafique et.al. 2601.21000 null
2026-01-28 When Flores Bloomz Wrong: Cross-Direction Contamination in Machine Translation Evaluation David Tan et.al. 2601.20858 null
2026-01-28 MiLorE-SSL: Scaling Multilingual Capabilities in Self-Supervised Models without Forgetting Jing Xu et.al. 2601.20300 null
2026-01-27 FFE-Hallu: Hallucinations in Fixed Figurative Expressions: Benchmark of Idioms and Proverbs in the Persian Language Faezeh Hosseini et.al. 2601.20105 null
2026-01-27 LinguaMap: Which Layers of LLMs Speak Your Language and How to Tune Them? J. Ben Tamo et.al. 2601.20009 null
2026-01-27 Reflective Translation: Improving Low-Resource Machine Translation via Structured Self-Reflection Nicholas Cheng et.al. 2601.19871 null
2026-01-27 Dynamic Multi-Expert Projectors with Stabilized Routing for Multilingual Speech Recognition Isha Pandey et.al. 2601.19451 null
2026-02-14 Do LLMs Truly Benefit from Longer Context in Automatic Post-Editing? Ahrii Kim et.al. 2601.19410 null
2026-01-27 Leveraging Sentence-oriented Augmentation and Transformer-Based Architecture for Vietnamese-Bahnaric Translation Tan Sang Nguyen et.al. 2601.19124 null
2026-01-26 XProvence: Zero-Cost Multilingual Context Pruning for Retrieval-Augmented Generation Youssef Mohamed et.al. 2601.18886 null
2026-01-26 Mitigating the OWASP Top 10 For Large Language Models Applications using Intelligent Agents Mohammad Fasha et.al. 2601.18105 null
2026-01-25 PEAR: Pairwise Evaluation for Automatic Relative Scoring in Machine Translation Lorenzo Proietti et.al. 2601.18006 null
2026-01-25 DIETA: A Decoder-only transformer-based model for Italian-English machine TrAnslation Pranav Kasela et.al. 2601.17823 null
2026-01-25 Cross-Lingual Probing and Community-Grounded Analysis of Gender Bias in Low-Resource Bengali Md Asgor Hossain Reaj et.al. 2601.17764 null
2026-01-25 Align to the Pivot: Dual Alignment with Self-Feedback for Multilingual Math Reasoning Chunxu Zhao et.al. 2601.17671 link
2026-01-24 CLM-Bench: Benchmarking and Analyzing Cross-lingual Misalignment of LLMs in Knowledge Editing Yucheng Hu et.al. 2601.17397 null
2026-01-23 Do LLM hallucination detectors suffer from low-resource effect? Debtanu Datta et.al. 2601.16766 null
2026-01-23 Typologically Informed Parameter Aggregation Stef Accou et.al. 2601.16629 null
2026-01-23 Cross-Lingual Activation Steering for Multilingual Language Models Rhitabrat Pokharel et.al. 2601.16390 null
2026-01-21 Large-Scale Multidimensional Knowledge Profiling of Scientific Literature Zhucun Xue et.al. 2601.15170 null
2026-01-21 Obscuring Data Contamination Through Translation: Evidence from Arabic Corpora Chaymaa Abbas et.al. 2601.14994 null
2026-01-20 PRiSM: Benchmarking Phone Realization in Speech Models Shikhar Bharadwaj et.al. 2601.14046 null
2026-01-20 On Temperature-Constrained Non-Deterministic Machine Translation: Potential and Evaluation Weichuan Wang et.al. 2601.13729 null
2026-01-19 Alexandria: A Multi-Domain Dialectal Arabic Machine Translation Dataset for Culturally Inclusive and Linguistically Diverse LLMs Abdellah El Mekki et.al. 2601.13099 null
2026-01-19 A Shared Geometry of Difficulty in Multilingual Language Models Stefano Civelli et.al. 2601.12731 null
2026-01-19 UbuntuGuard: A Culturally-Grounded Policy Benchmark for Equitable AI Safety in African Languages Tassallah Abdullahi et.al. 2601.12696 null
2026-01-18 Benchmarking Concept-Spilling Across Languages in LLMs Ilia Badanin et.al. 2601.12549 null
2026-02-04 Improving Low-Resource Machine Translation via Round-Trip Reinforcement Learning Ahmed Attia et.al. 2601.12535 null
2026-02-02 The Language You Ask In: Language-Conditioned Ideological Divergence in LLM Analysis of Contested Political Documents Oleg Smirnov et.al. 2601.12164 null
2026-01-17 GloCTM: Cross-Lingual Topic Modeling via a Global Context Space Nguyen Tien Phat et.al. 2601.11872 null
2026-01-16 Translation as a Scalable Proxy for Multilingual Evaluation Sheriff Issaka et.al. 2601.11778 null
2026-01-14 Semantic Differentiation for Tackling Challenges in Watermarking Low-Entropy Constrained Generation Outputs Nghia T. Le et.al. 2601.11629 null
2025-12-25 Compass-Embedding v4: Robust Contrastive Learning for Multilingual E-commerce Embeddings Pakorn Ueareeworakul et.al. 2601.11565 null
2026-01-16 MultiCaption: Detecting disinformation using multilingual visual claims Rafael Martins Frade et.al. 2601.11220 null
2026-01-15 BYOL: Bring Your Own Language Into LLMs Syed Waqas Zamir et.al. 2601.10804 null
2026-01-15 INDIC DIALECT: A Multi Task Benchmark to Evaluate and Translate in Indian Language Dialects Tarun Sharma et.al. 2601.10388 null
2026-01-15 Untangling Input Language from Reasoning Language: A Diagnostic Framework for Cross-Lingual Moral Alignment in LLMs Nan Li et.al. 2601.10257 null
2026-01-15 One Instruction Does Not Fit All: How Well Do Embeddings Align Personas and Instructions in Low-Resource Indian Languages? Arya Shah et.al. 2601.10205 null
2026-01-28 HOMURA: Taming the Sand-Glass for Time-Constrained LLM Translation via Reinforcement Learning Ziang Cui et.al. 2601.10187 null
2026-01-20 Multilingual-To-Multimodal (M2M): Unlocking New Languages with Monolingual Text Piyush Singh Pasi et.al. 2601.10096 null
2026-01-15 Context Volume Drives Performance: Tackling Domain Shift in Extremely Low-Resource Translation via RAG David Samuel Setiawan et.al. 2601.09982 null
2025-12-29 Benchmarking Cross-Lingual Semantic Alignment in Multilingual Embeddings Wen G. Gong et.al. 2601.09732 null
2026-01-16 Assessing and Improving Punctuation Robustness in English-Marathi Machine Translation Kaustubh Shivshankar Shejole et.al. 2601.09725 null
2025-12-24 Opportunities and Challenges of Natural Language Processing for Low-Resource Senegalese Languages in Social Science Research Derguene Mbaye et.al. 2601.09716 null
2026-01-14 Creating a Hybrid Rule and Neural Network Based Semantic Tagger using Silver Standard Data: the PyMUSAS framework for Multilingual Semantic Annotation Andrew Moore et.al. 2601.09648 null
2026-01-24 Layer-Parallel Training for Transformers Shuai Jiang et.al. 2601.09026 null
2026-01-19 TranslateGemma Technical Report Mara Finkelstein et.al. 2601.09012 null
2026-01-13 A Parallel Cross-Lingual Benchmark for Multimodal Idiomaticity Understanding Dilara Torunoğlu-Selamet et.al. 2601.08645 null
2026-01-13 Get away with less: Need of source side data curation to build parallel corpus for low resource Machine Translation Saumitra Yadav et.al. 2601.08629 null
2026-01-13 CLaS-Bench: A Cross-Lingual Alignment and Steering Benchmark Daniil Gurgurov et.al. 2601.08331 null
2026-01-12 Order in the Evaluation Court: A Critical Analysis of NLG Evaluation Trends Jing Yang et.al. 2601.07648 null
2026-01-12 Beyond Literal Mapping: Benchmarking and Improving Non-Literal Translation Evaluation Yanzhi Tian et.al. 2601.07338 null
2026-01-12 Mitrasamgraha: A Comprehensive Classical Sanskrit Machine Translation Dataset Sebastian Nehrdich et.al. 2601.07314 null
2026-01-11 When Abundance Conceals Weakness: Knowledge Conflict in Multilingual Models Jiaqi Zhao et.al. 2601.07041 null
2026-01-11 BiasLab: A Multilingual, Dual-Framing Framework for Robust Measurement of Output-Level Bias in Large Language Models William Guey et.al. 2601.06861 null
2026-01-10 Evaluating Cross-Lingual Unlearning in Multilingual Language Models Tyler Lizzo et.al. 2601.06675 null
2026-01-10 MITRA: A Large-Scale Parallel Corpus and Multilingual Pretrained Language Model for Machine Translation and Semantic Retrieval for Pāli, Sanskrit, Buddhist Chinese, and Tibetan Sebastian Nehrdich et.al. 2601.06400 null
2026-01-10 AfriqueLLM: How Data Mixing and Model Architecture Impact Continued Pre-training for African Languages Hao Yu et.al. 2601.06395 null
2026-01-09 Evaluating Robustness of Large Language Models in Enterprise Applications: Benchmarks for Perturbation Consistency Across Formats and Languages Tara Bogavelli et.al. 2601.06341 null
2026-01-09 A Rising Tide Lifts All Boats: MTQE Rewards for Idioms Improve General Translation Quality Ishika Agarwal et.al. 2601.06307 null
2026-01-09 AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs Chengming Cui et.al. 2601.06022 null
2026-01-09 CLewR: Curriculum Learning with Restarts for Machine Translation Preference Learning Alexandra Dragomir et.al. 2601.05858 null
2026-01-09 One Script Instead of Hundreds? On Pretraining Romanized Encoder Language Models Benedikt Ebing et.al. 2601.05776 null
2026-01-14 Afri-MCQA: Multimodal Cultural Question Answering for African Languages Atnafu Lambebo Tonja et.al. 2601.05699 null
2026-01-09 Multilingual Amnesia: On the Transferability of Unlearning in Multilingual LLMs Alireza Dehghanpour Farashah et.al. 2601.05641 null
2026-01-09 Text Detoxification in isiXhosa and Yorùbá: A Cross-Lingual Machine Learning Approach for Low-Resource African Languages Abayomi O. Agbeyangi et.al. 2601.05624 null
2026-01-09 Enabling Stroke-Level Structural Analysis of Hieroglyphic Scripts without Language-Specific Priors Fuwen Luo et.al. 2601.05508 null
2026-01-08 BanglaLorica: Design and Evaluation of a Robust Watermarking Algorithm for Large Language Models in Bangla Text Generation Amit Bin Tariqul et.al. 2601.04534 null
2026-01-07 The Overlooked Role of Graded Relevance Thresholds in Multilingual Dense Retrieval Tomer Wullach et.al. 2601.04395 null
2026-01-07 Dialect Matters: Cross-Lingual ASR Transfer for Low-Resource Indic Language Varieties Akriti Dhasmana et.al. 2601.04373 null
2026-01-07 Analyzing and Improving Cross-lingual Knowledge Transfer for Machine Translation David Stap et.al. 2601.04036 null
2026-01-12 NeoAMT: Neologism-Aware Agentic Machine Translation with Reinforcement Learning Zhongtao Miao et.al. 2601.03790 null
2026-01-07 Bootstrapping Code Translation with Weighted Multilanguage Exploration Yuhan Wu et.al. 2601.03512 null
2026-01-06 Eye-Q: A Multilingual Benchmark for Visual Word Puzzle Solving and Image-to-Phrase Reasoning Ali Najar et.al. 2601.03400 null
2026-01-06 Can Embedding Similarity Predict Cross-Lingual Transfer? A Systematic Study on African Languages Tewodros Kederalah Idris et.al. 2601.03168 null
2026-01-10 Improving Indigenous Language Machine Translation with Synthetic Data and Language-Specific Preprocessing Aashish Dhawan et.al. 2601.03135 null
2026-01-06 Enhancing Multilingual RAG Systems with Debiased Language Preference-Guided Query Fusion Jeonghyun Park et.al. 2601.02956 null
2026-01-10 Pearmut: Human Evaluation of Translation Made Trivial Vilém Zouhar et.al. 2601.02933 null
2026-01-05 Cost-Efficient Cross-Lingual Retrieval-Augmented Generation for Low-Resource Languages: A Case Study in Bengali Agricultural Advisory Md. Asif Hossain et.al. 2601.02065 null
2026-01-20 Semantic Alignment of Multilingual Knowledge Graphs via Contextualized Vector Projections Abhishek Kumar et.al. 2601.00814 null
2026-01-23 The Role of Mixed-Language Documents for Multilingual Large Language Model Pretraining Jiandong Shao et.al. 2601.00364 null
2026-01-01 Parallel Universes, Parallel Languages: A Comprehensive Study on LLM-based Multilingual Counterfactual Example Generation Qianli Wang et.al. 2601.00263 null
2025-12-31 Triangulation as an Acceptance Rule for Multilingual Mechanistic Interpretability Yanan Long et.al. 2512.24842 null
2025-12-30 HY-MT1.5 Technical Report Mao Zheng et.al. 2512.24092 null
2025-12-29 A Stepwise-Enhanced Reasoning Framework for Large Language Models Based on External Subgraph Generation Xin Zhang et.al. 2512.23356 null
2026-01-01 AlignAR: Generative Sentence Alignment for Arabic-English Parallel Corpora of Legal and Literary Texts Baorong Huang et.al. 2512.21842 null
2025-12-25 Ara-HOPE: Human-Centric Post-Editing Evaluation for Dialectal Arabic to Modern Standard Arabic Translation Abdullah Alabdullah et.al. 2512.21787 null
2025-12-29 Gamayun’s Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM Alexander Podolskiy et.al. 2512.21580 null
2025-12-23 SweRank+: Multilingual, Multi-Turn Code Ranking for Software Issue Localization Revanth Gangi Reddy et.al. 2512.20482 null
2025-12-23 Corpus of Cross-lingual Dialogues with Minutes and Detection of Misunderstandings Marko Čechovič et.al. 2512.20204 null
2025-12-23 Well Begun is Half Done: Location-Aware and Trace-Guided Iterative Automated Vulnerability Repair Zhenlei Ye et.al. 2512.20203 null
2025-12-22 MauBERT: Universal Phonetic Inductive Biases for Few-Shot Acoustic Units Discovery Angelo Ortiz Tandazo et.al. 2512.19612 null
2025-12-21 Remedy-R: Generative Reasoning for Machine Translation Evaluation without Error Annotations Shaomu Tan et.al. 2512.18906 null
2025-12-21 From Scratch to Fine-Tuned: A Comparative Study of Transformer Training Strategies for Legal Machine Translation Amit Barman et.al. 2512.18593 null
2025-12-19 Seeing Justice Clearly: Handwritten Legal Document Translation with OCR and Vision-Language Models Shubham Kumar Nigam et.al. 2512.18004 null
2025-12-17 Cross-Language Bias Examination in Large Language Models Yuxuan Liang et.al. 2512.16029 null
2025-12-17 An Empirical Study on Chinese Character Decomposition in Multiword Expression-Aware Neural Machine Translation Lifeng Han et.al. 2512.15556 null
2025-12-17 Yes-MT’s Submission to the Low-Resource Indic Language Translation Shared Task in WMT 2024 Yash Bhaskar et.al. 2512.15226 null
2025-12-16 Low-Resource, High-Impact: Building Corpora for Inclusive Language Technologies Ekaterina Artemova et.al. 2512.14576 link
2025-12-16 A Comparative Analysis of Retrieval-Augmented Generation Techniques for Bengali Standard-to-Dialect Machine Translation Using LLMs K. M. Jubair Sami et.al. 2512.14179 null
2025-12-16 Multilingual and Continuous Backchannel Prediction: A Cross-lingual Study Koji Inoue et.al. 2512.14085 null
2025-12-15 PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation Hour Kaing et.al. 2512.13552 null
2025-12-15 Advancing Bangla Machine Translation Through Informal Datasets Ayon Roy et.al. 2512.13487 null
2025-12-15 Scaling Laws for Code: Every Programming Language Matters Jian Yang et.al. 2512.13472 null
2025-12-12 Improving Translation Quality by Selecting Better Data for LLM Fine-Tuning: A Comparative Analysis Felipe Ribeiro Fujita de Mello et.al. 2512.11388 null
2025-12-11 MultiScript30k: Leveraging Multilingual Embeddings to Extend Cross Script Parallel Data Christopher Driggers-Ellis et.al. 2512.11074 null
2025-12-10 Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach Salvador Carrión et.al. 2512.09910 null
2025-12-10 Mitigating Social Bias in English and Urdu Language Models Using PRM-Guided Candidate Selection and Sequential Refinement Muneeb Ur Raheem Khan et.al. 2512.09854 null
2025-12-09 What Triggers my Model? Contrastive Explanations Inform Gender Choices by Translation Models Janiça Hackenbuchner et.al. 2512.08440 null
2025-12-30 Minimum Bayes Risk Decoding for Error Span Detection in Reference-Free Automatic Machine Translation Evaluation Boxuan Lyu et.al. 2512.07540 null
2025-12-08 SwissGov-RSD: A Human-annotated, Cross-lingual Benchmark for Token-level Recognition of Semantic Differences Between Related Documents Michelle Wastl et.al. 2512.07538 null
2025-12-08 Persian-Phi: Efficient Cross-Lingual Adaptation of Compact LLMs via Curriculum Learning Amir Mohammad Akhlaghi et.al. 2512.07454 null
2025-12-08 Efficient ASR for Low-Resource Languages: Leveraging Cross-Lingual Unlabeled Data Srihari Bandarupalli et.al. 2512.07277 null
2025-12-08 MASim: Multilingual Agent-Based Simulation for Social Science Xuan Zhang et.al. 2512.07195 null
2025-12-05 Grounded Multilingual Medical Reasoning for Question Answering with Large Language Models Pietro Ferrazzi et.al. 2512.05658 null
2025-12-04 Structured Document Translation via Format Reinforcement Learning Haiyue Song et.al. 2512.05100 null
2025-12-04 AdiBhashaa: A Community-Curated Benchmark for Machine Translation into Indian Tribal Languages Pooja Singh et.al. 2512.04765 null
2025-12-03 Adapting Large Language Models to Low-Resource Tibetan: A Two-Stage Continual and Supervised Fine-Tuning Study Lifeng Chen et.al. 2512.03976 null
2025-12-03 Zero-Shot Video Translation and Editing with Frame Spatial-Temporal Correspondence Shuai Yang et.al. 2512.03905 null
2025-12-03 HieroGlyphTranslator: Automatic Recognition and Translation of Egyptian Hieroglyphs to English Ahmed Nasser et.al. 2512.03817 null
2025-12-03 M3DR: Towards Universal Multilingual Multimodal Document Retrieval Adithya S Kolavi et.al. 2512.03514 null
2025-12-03 From Hypothesis to Premises: LLM-based Backward Logical Reasoning with Selective Symbolic Translation Qingchuan Li et.al. 2512.03360 null
2025-11-29 Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation Le Chen et.al. 2512.03086 null
2025-12-02 Fine-Tuned Large Language Models for Logical Translation: Reducing Hallucinations with Lang2Logic Muyu Pan et.al. 2512.02987 null
2025-12-02 Cross-Lingual Prompt Steerability: Towards Accurate and Robust LLM Behavior across Languages Lechen Zhang et.al. 2512.02841 null
2025-12-02 BOOM: Beyond Only One Modality KIT’s Multimodal Multilingual Lecture Companion Sai Koneru et.al. 2512.02817 null
2025-12-02 TriLex: A Framework for Multilingual Sentiment Analysis in Low-Resource South African Languages Mike Nkongolo et.al. 2512.02799 null
2025-12-02 Towards Language-Independent Face-Voice Association with Multimodal Foundation Models Aref Farhadipour et.al. 2512.02759 null
2025-12-03 Invariance under Structure Translation as the Origin of Host Immune Capacity Conservation from Noether’s Theorem Yexing Chen et.al. 2512.02730 null
2025-12-02 CREST: Universal Safety Guardrails Through Cluster-Guided Cross-Lingual Transfer Lavish Bansal et.al. 2512.02711 null
2025-12-02 Feedback Loops and Code Perturbations in LLM-based Software Engineering: A Case Study on a C-to-Rust Translation System Martin Weiss et.al. 2512.02567 null
2025-12-01 Cross-Lingual Interleaving for Speech Language Models Adel Moumen et.al. 2512.01865 null
2025-12-01 BHRAM-IL: A Benchmark for Hallucination Recognition and Assessment in Multiple Indian Languages Hrishikesh Terdalkar et.al. 2512.01852 null
2025-12-01 MCAT: Scaling Many-to-Many Speech-to-Text Translation with MLLMs to 70 Languages Yexing Du et.al. 2512.01512 null
2025-12-01 LAURA: Enhancing Code Review Generation with Context-Enriched Retrieval-Augmented LLM Yuxin Zhang et.al. 2512.01356 null
2025-12-15 Conveying Imagistic Thinking in Traditional Chinese Medicine Translation: A Prompt Engineering and LLM-Based Evaluation Framework Jiatong Han et.al. 2512.01198 null
2025-12-02 Multilingual Training-Free Remote Sensing Image Captioning Carlos Rebelo et.al. 2512.00887 null
2025-11-30 SHRAG: A Framework for Combining Human-Inspired Search with RAG Hyunseok Ryu et.al. 2512.00772 null
2025-11-30 MPR-GUI: Benchmarking and Enhancing Multilingual Perception and Reasoning in GUI Agents Ruihan Chen et.al. 2512.00756 null
2025-11-29 Partial Cross-Compilation and Mixed Execution for Accelerating Dynamic Binary Translation Yuhao Gu et.al. 2512.00487 null
2025-11-29 IndicParam: Benchmark to evaluate LLMs on low-resource Indic Languages Ayush Maheshwari et.al. 2512.00333 null
2025-11-29 Lost without translation – Can transformer (language models) understand mood states? Prakrithi Shivaprakash et.al. 2512.00274 null
2025-11-28 OmniFusion: Simultaneous Multilingual Multimodal Translations via Modular Fusion Sai Koneru et.al. 2512.00234 null
2025-11-28 Asm2SrcEval: Evaluating Large Language Models for Assembly-to-Source Code Translation Parisa Hamedi et.al. 2512.00134 null
2025-11-28 Unlocking Multilingual Reasoning Capability of LLMs and LVLMs through Representation Engineering Qiming Li et.al. 2511.23231 null
2025-12-09 Conveying Imagistic Thinking in Traditional Chinese Medicine Translation: A Prompt Engineering and LLM-Based Evaluation Framework Jiatong Han et.al. 2511.23059 null
2025-11-26 Advancing Automated In-Isolation Validation in Repository-Level Code Translation Kaiyao Ke et.al. 2511.21878 null
2025-11-24 LLMs for Low-Resource Dialect Translation Using Context-Aware Prompting: A Case Study on Sylheti Tabia Tanzin Prama et.al. 2511.21761 null
2025-11-16 On the Cross-lingual Transferability of Pre-trained wav2vec2-based Models Jonatas Grosman et.al. 2511.21704 null
2025-11-26 Rigidity of Solitons to the Mean Curvature Flow in $\mathbb{H}^3$ as Translation Surfaces Tarcios Andrey Ferreira et.al. 2511.21545 null
2025-11-26 Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation Lina Conti et.al. 2511.21517 null
2025-11-26 RosettaSpeech: Zero-Shot Speech-to-Speech Translation from Monolingual Data Zhisheng Zheng et.al. 2511.20974 null
2025-11-25 Winning with Less for Low Resource Languages: Advantage of Cross-Lingual English_Persian Argument Mining Model over LLM Augmentation Ali Jahan et.al. 2511.20872 null
2025-11-25 TReFT: Taming Rectified Flow Models For One-Step Image Translation Shengqian Li et.al. 2511.20307 null
2025-11-24 Generative Query Expansion with Multilingual LLMs for Cross-Lingual Information Retrieval Olivia Macmillan-Scott et.al. 2511.19325 null
2025-11-24 What Drives Cross-lingual Ranking? Retrieval Approaches with Multilingual Language Models Roksana Goworek et.al. 2511.19324 null
2025-11-24 Large Language Models for the Summarization of Czech Documents: From History to the Present Václav Tran et.al. 2511.18848 null
2025-11-23 DocPTBench: Benchmarking End-to-End Photographed Document Parsing and Translation Yongkun Du et.al. 2511.18434 null
2025-11-23 SmolKalam: Ensemble Quality-Filtered Translation at Scale for High Quality Arabic Post-Training Data Sultan Alrashed et.al. 2511.18411 null
2025-11-21 Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation Marii Ojastu et.al. 2511.17290 null
2025-11-21 Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation Chuancheng Shi et.al. 2511.17282 null
2025-11-21 Lost in Translation and Noise: A Deep Dive into the Failure Modes of VLMs on Real-World Tables Anshul Singh et.al. 2511.17238 null
2025-11-21 LangMark: A Multilingual Dataset for Automatic Post-Editing Diego Velazquez et.al. 2511.17153 null
2025-11-19 HinTel-AlignBench: A Framework and Benchmark for Hindi-Telugu with English-Aligned Samples Rishikant Chigrupaatii et.al. 2511.15183 null
2025-11-21 LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs Pei-Fu Guo et.al. 2511.14774 null
2025-11-18 NeuCLIRBench: A Modern Evaluation Collection for Monolingual, Cross-Language, and Multilingual Information Retrieval Dawn Lawrie et.al. 2511.14758 null
2025-11-18 TTA: Transcribe, Translate and Alignment for Cross-lingual Speech Representation Wei Liu et.al. 2511.14410 null
2025-11-17 Can QE-informed (Re)Translation lead to Error Correction? Govardhan Padmanabhan et.al. 2511.13884 null
2025-11-18 Crossing Borders: A Multimodal Challenge for Indian Poetry Translation and Image Generation Sofia Jamil et.al. 2511.13689 null
2025-11-23 Non-Linear Scoring Model for Translation Quality Evaluation Serge Gladkoff et.al. 2511.13467 null
2025-11-15 Exploring Parameter-Efficient Fine-Tuning and Backtranslation for the WMT 25 General Translation Task Felipe Fujita et.al. 2511.12109 null
2025-11-14 Do LLMs Really Struggle at NL-FOL Translation? Revealing their Strengths via a Novel Benchmarking Strategy Andrea Brunello et.al. 2511.11816 null
2025-11-14 Beyond Exascale: Dataflow Domain Translation on a Cerebras Cluster Tomas Oppelstrup et.al. 2511.11542 null
2025-11-14 Translation-Symmetric Market: Enabling Incentive Compatibility For DER Aggregation Ruike Lyu et.al. 2511.11453 null
2025-11-14 Comprehension of Multilingual Expressions Referring to Target Objects in Visual Inputs Francisco Nogueira et.al. 2511.11427 null
2025-11-14 OT-ALD: Aligning Latent Distributions with Optimal Transport for Accelerated Image-to-Image Translation Zhanpeng Wang et.al. 2511.11162 null
2025-12-17 DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains Xiying Zhao et.al. 2511.10984 null
2025-11-13 Towards Attribution of Generators and Emotional Manipulation in Cross-Lingual Synthetic Speech using Geometric Learning Girish et.al. 2511.10790 null
2025-11-13 TEDxTN: A Three-way Speech Translation Corpus for Code-Switched Tunisian Arabic - English Fethi Bougares et.al. 2511.10780 null
2025-11-13 Faithful Summarization of Consumer Health Queries: A Cross-Lingual Framework with LLMs Ajwad Abrar et.al. 2511.10768 null
2025-11-09 Towards Fine-Grained Code-Switch Speech Translation with Semantic Space Alignment Yan Gao et.al. 2511.10670 null
2025-11-05 Evaluating Modern Large Language Models on Low-Resource and Morphologically Rich Languages: A Cross-Lingual Benchmark Across Cantonese, Japanese, and Turkish Chengxuan Xia et.al. 2511.10664 null
2025-11-13 LangGPS: Language Separability Guided Data Pre-Selection for Joint Multilingual Instruction Tuning Yangfan Ye et.al. 2511.10229 null
2025-11-13 Fractional neural attention for efficient multiscale sequence processing Cheng Kevin Qu et.al. 2511.10208 null
2025-11-13 Scalable data-driven modeling of microstructure evolution by learning local dependency and spatiotemporal translation invariance rules in phase field simulation Zishuo Lan et.al. 2511.10171 null
2025-11-13 Language Drift in Multilingual Retrieval-Augmented Generation: Characterization and Decoding-Time Mitigation Bo Li et.al. 2511.09984 null
2025-11-14 HI-TransPA: Hearing Impairments Translation Personal Assistant Zhiming Ma et.al. 2511.09915 null
2025-11-12 Predicate-Argument Structure Divergences in Chinese and English Parallel Sentences and their Impact on Language Transfer Rocco Tripodi et.al. 2511.09796 null
2025-11-12 How Small Can You Go? Compact Language Models for On-Device Critical Error Detection in Machine Translation Muskaan Chopra et.al. 2511.09748 null
2025-11-12 NSL-MT: Linguistically Informed Negative Samples for Efficient Machine Translation in Low-Resource Languages Mamadou K. Keita et.al. 2511.09537 null
2025-11-12 Spatial Audio Rendering for Real-Time Speech Translation in Virtual Meetings Margarita Geleta et.al. 2511.09525 null
2025-11-12 POTSA: A Cross-Lingual Speech Alignment Framework for Low Resource Speech-to-Text Translation Xuanchen Li et.al. 2511.09232 null
2025-11-07 The LLM Pro Finance Suite: Multilingual Large Language Models for Financial Applications Gaëtan Caillaut et.al. 2511.08621 null
2025-11-11 Large Sign Language Models: Toward 3D American Sign Language Translation Sen Zhang et.al. 2511.08535 null
2025-11-11 Introducing A Bangla Sentence - Gloss Pair Dataset for Bangla Sign Language Translation and Research Neelavro Saha et.al. 2511.08507 null
2025-12-03 Focusing on Language: Revealing and Exploiting Language Attention Heads in Multilingual Large Language Models Xin Liu et.al. 2511.07498 null
2025-11-07 It Takes Two: A Dual Stage Approach for Terminology-Aware Translation Akshat Singh Jaswal et.al. 2511.07461 null
2025-11-10 Discourse Graph Guided Document Translation with Large Language Models Viet-Thanh Pham et.al. 2511.07230 null
2025-11-10 Llama-Embed-Nemotron-8B: A Universal Text Embedding Model for Multilingual and Cross-Lingual Tasks Yauhen Babakhin et.al. 2511.07025 null
2025-11-10 A Picture is Worth a Thousand (Correct) Captions: A Vision-Guided Judge-Corrector System for Multimodal Machine Translation Siddharth Betala et.al. 2511.07010 null
2025-11-10 Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs Yingfeng Luo et.al. 2511.07003 null
2025-11-10 CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition Hung-Yang Sung et.al. 2511.06860 null
2025-11-10 Steering LLMs toward Korean Local Speech: Iterative Refinement Framework for Faithful Dialect Translation Keunhyeung Park et.al. 2511.06680 null
2025-11-09 Ibom NLP: A Step Toward Inclusive Natural Language Processing for Nigeria’s Minority Languages Oluwadara Kalejaiye et.al. 2511.06531 null
2025-11-09 Rethinking what Matters: Effective and Robust Multilingual Realignment for Low-Resource Languages Quang Phuoc Nguyen et.al. 2511.06497 null
2025-11-07 A multimodal multiplex of the mental lexicon for multilingual individuals Maria Huynh et.al. 2511.05361 null
2025-11-07 Translation via Annotation: A Computational Study of Translating Classical Chinese into Japanese Zilong Li et.al. 2511.05239 null
2025-11-07 Mind the Gap… or Not? How Translation Errors and Evaluation Details Skew Multilingual Results Jan-Thorsten Peter et.al. 2511.05162 null
2025-11-07 Reasoning-Guided Claim Normalization for Noisy Multilingual Social Media Posts Manan Sharma et.al. 2511.05078 null
2025-11-12 MERaLiON-SER: Robust Speech Emotion Recognition Model for English and SEA Languages Hardik B. Sailor et.al. 2511.04914 null
2025-11-06 IndicVisionBench: Benchmarking Cultural and Multilingual Understanding in VLMs Ali Faraz et.al. 2511.04727 null
2025-11-01 Cross-Lingual SynthDocs: A Large-Scale Synthetic Corpus for Any to Arabic OCR and Document Understanding Haneen Al-Homoud et.al. 2511.04699 null
2025-11-06 Dynamic Jointly Batch Selection for Data Efficient Machine Translation Fine-Tuning Mohammad Amin Ghanizadeh et.al. 2511.04406 null
2025-11-06 Direct Semantic Communication Between Large Language Models via Vector Translation Fu-Chun Yang et.al. 2511.03945 null
2025-11-05 Evaluating Machine Translation Datasets for Low-Web Data Languages: A Gendered Lens Hellina Hailu Nigatu et.al. 2511.03880 null
2025-11-05 BanglaSTEM: A Parallel Corpus for Technical Domain Bangla-English Translation Kazi Reyazul Hasan et.al. 2511.03498 null
2025-11-18 Segmentation Beyond Defaults: Asymmetrical Byte Pair Encoding for Optimal Machine Translation Performance Saumitra Yadav et.al. 2511.03383 null
2025-11-11 How to Evaluate Speech Translation with Source-Aware Neural MT Metrics Mauro Cettolo et.al. 2511.03295 null
2025-11-05 Beyond Ranked Lists: The SARAL Framework for Cross-Lingual Document Set Retrieval Shantanu Agarwal et.al. 2511.03228 null
2025-11-04 Automatic Machine Translation Detection Using a Surrogate Multilingual Translation Model Cristian García-Romero et.al. 2511.02958 null
2025-11-04 PragExTra: A Multilingual Corpus of Pragmatic Explicitation in Translation Doreen Osmelak et.al. 2511.02721 null
2025-11-04 The Analysis of Lexical Errors in Machine Translation from English into Romanian Angela Stamatie et.al. 2511.02587 null
2025-11-05 HPLT 3.0: Very Large-Scale Multilingual Resources for LLM and MT. Mono- and Bi-lingual Data, Multilingual Evaluation, and Pre-Trained Models Stephan Oepen et.al. 2511.01066 null
2025-11-04 Do Methods to Jailbreak and Defend LLMs Generalize Across Languages? Berk Atil et.al. 2511.00689 null
2025-11-01 Leveraging the Cross-Domain & Cross-Linguistic Corpus for Low Resource NMT: A Case Study On Bhili-Hindi-English Parallel Corpus Pooja Singh et.al. 2511.00486 null
2025-10-31 POSESTITCH-SLT: Linguistically Inspired Pose-Stitching for End-to-End Sign Language Translation Abhinav Joshi et.al. 2511.00270 null
2025-10-31 Multilingual BERT language model for medical tasks: Evaluation on domain-specific adaptation and cross-linguality Yinghao Luo et.al. 2510.27552 null
2025-10-31 TransAlign: Machine Translation Encoders are Strong Word Aligners, Too Benedikt Ebing et.al. 2510.27337 null
2025-10-31 Languages are Modalities: Cross-Lingual Alignment via Encoder Injection Rajan Agarwal et.al. 2510.27254 null
2025-10-31 Simple Additions, Substantial Gains: Expanding Scripts, Languages, and Lineage Coverage in URIEL+ Mason Shipton et.al. 2510.27183 null
2025-10-30 Distilling Multilingual Vision-Language Models: When Smaller Models Stay Multilingual Sukrit Sriratanawilai et.al. 2510.26271 null
2025-10-29 Rethinking Cross-lingual Alignment: Balancing Transfer and Cultural Erasure in Multilingual LLMs HyoJung Han et.al. 2510.26024 null
2025-10-29 Semantic Label Drift in Cross-Cultural Translation Mohsinul Kabir et.al. 2510.25967 null
2025-11-04 Hybrid Quantum-Classical Recurrent Neural Networks Wenduan Xu et.al. 2510.25557 null
2025-10-29 Testing Cross-Lingual Text Comprehension In LLMs Using Next Sentence Prediction Ritesh Sunil Chavan et.al. 2510.25187 null
2025-10-29 Pretraining Strategies using Monolingual and Parallel Data for Low-Resource Machine Translation Idriss Nguepi Nguefack et.al. 2510.25116 null
2025-10-27 Cross-Lingual Summarization as a Black-Box Watermark Removal Attack Gokul Ganesan et.al. 2510.24789 null
2025-10-28 MQM Re-Annotation: A Technique for Collaborative Evaluation of Machine Translation Parker Riley et.al. 2510.24664 null
2025-10-28 Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation Snegha A et.al. 2510.24619 null
2025-10-28 Ko-MuSR: A Multistep Soft Reasoning Benchmark for LLMs Capable of Understanding Korean Chanwoo Park et.al. 2510.24150 null
2025-10-28 Challenging Multilingual LLMs: A New Taxonomy and Benchmark for Unraveling Hallucination in Translation Xinwei Wu et.al. 2510.24073 null
2025-10-27 AfriMTEB and AfriE5: Benchmarking and Adapting Text Embedding Models for African Languages Kosei Uemura et.al. 2510.23896 null
2025-10-27 A U-Net and Transformer Pipeline for Multilingual Image Translation Siddharth Sahay et.al. 2510.23554 null
2025-10-27 Quality-Aware Translation Tagging in Multilingual RAG system Hoyeon Moon et.al. 2510.23070 null
2025-10-27 Cross-Lingual Sponsored Search via Dual-Encoder and Graph Neural Networks for Context-Aware Query Translation in Advertising Platforms Ziyang Gao et.al. 2510.22957 null
2025-10-26 Iterative Layer Pruning for Efficient Translation Inference Yasmin Moslem et.al. 2510.22763 null
2025-11-05 TraceTrans: Translation and Spatial Tracing for Surgical Prediction Xiyu Luo et.al. 2510.22379 null
2025-10-24 Penalizing Length: Uncovering Systematic Bias in Quality Estimation Metrics Yilin Zhang et.al. 2510.22028 null
2025-10-24 Estonian Native Large Language Model Benchmark Helena Grete Lillepalu et.al. 2510.21193 null
2025-10-24 Bridging Language Gaps with Adaptive RAG: Improving Indonesian Language Question Answering William Christian et.al. 2510.21068 null
2025-10-23 Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost Runzhe Zhan et.al. 2510.20780 null
2025-10-23 Structure-Conditional Minimum Bayes Risk Decoding Bryan Eikema et.al. 2510.20700 null
2025-10-23 Assessing the Political Fairness of Multilingual LLMs: A Case Study based on a 21-way Multiparallel EuroParl Dataset Paul Lerner et.al. 2510.20508 null
2025-10-22 Conditions for Catastrophic Forgetting in Multilingual Translation Danni Liu et.al. 2510.19546 null
2025-10-22 Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition Yuu Jinnai et.al. 2510.19471 null
2025-10-22 Spatio-temporal Sign Language Representation and Translation Yasser Hamidullah et.al. 2510.19413 null
2025-10-22 SONAR-SLT: Multilingual Sign Language Translation via Language-Agnostic Sentence Embedding Supervision Yasser Hamidullah et.al. 2510.19398 null
2025-10-22 Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges Cheng Huang et.al. 2510.19144 null
2025-10-20 Transformer-Based Low-Resource Language Translation: A Study on Standard Bengali to Sylheti Mangsura Kabir Oni et.al. 2510.18898 null
2025-10-21 SemiAdapt and SemiLoRA: Efficient Domain Adaptation for Transformer-based Low-Resource Language Translation with a Case Study on Irish Josh McGiff et.al. 2510.18725 null
2025-10-20 Lingua Custodi’s participation at the WMT 2025 Terminology shared task Jingshu Liu et.al. 2510.17504 null
2025-10-20 Evaluating Large Language Models on Urdu Idiom Translation Muhammad Farmal Khan et.al. 2510.17460 null
2025-10-19 Zero-Shot Performance Prediction for Probabilistic Scaling Laws Viktoria Schram et.al. 2510.16743 null
2025-10-17 On Non-interactive Evaluation of Animal Communication Translators Orr Paradise et.al. 2510.15768 null
2025-10-16 Predicting Task Performance with Context-aware Scaling Laws Kyle Montgomery et.al. 2510.14919 null
2025-10-16 Semantic Prosody in Machine Translation: the English-Chinese Case of Passive Structures Xinyue Ma et.al. 2510.14662 null
2025-10-16 LiRA: Linguistic Robust Anchoring for Cross-lingual Large Language Models Haolin Li et.al. 2510.14466 null
2025-10-16 From Binary to Bilingual: How the National Weather Service is Using Artificial Intelligence to Develop a Comprehensive Translation Program Joseph E. Trujillo-Falcon et.al. 2510.14369 null
2025-10-15 Beyond Single-Reward: Multi-Pair, Multi-Perspective Preference Optimization for Machine Translation Hao Wang et.al. 2510.13434 null
2025-10-15 A fully automated and scalable Parallel Data Augmentation for Low Resource Languages using Image and Text Analytics Prawaal Sharma et.al. 2510.13211 null
2025-10-15 ACADATA: Parallel Dataset of Academic Data for Machine Translation Iñaki Lacunza et.al. 2510.12621 null
2025-10-14 Uncertainty Quantification for Hallucination Detection in Large Language Models: Foundations, Methodology, and Future Directions Sungmin Kang et.al. 2510.12040 null
2025-10-13 LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens Armel Zebaze et.al. 2510.11919 null
2025-10-12 Bhasha-Rupantarika: Algorithm-Hardware Co-design approach for Multilingual Neural Machine Translation Mukul Lokhande et.al. 2510.10676 null
2025-10-11 End-to-end Automatic Speech Recognition and Speech Translation: Integration of Speech Foundational Models and LLMs Nam Luu et.al. 2510.10329 null
2025-10-11 Toward Machine Translation Literacy: How Lay Users Perceive and Rely on Imperfect Translations Yimin Xiao et.al. 2510.09994 null
2025-10-10 Evaluating Robustness of Large Language Models Against Multilingual Typographical Errors Yihong Liu et.al. 2510.09536 null
2025-10-13 DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation Enze Zhang et.al. 2510.09116 null
2025-10-10 Quality Estimation Reranking for Document-Level Translation Krzysztof Mrozinski et.al. 2510.08870 null
2025-10-31 Ready to Translate, Not to Represent? Bias and Performance Gaps in Multilingual LLMs Across Language Families and Domains Md. Faiyaz Abdullah Sayeedi et.al. 2510.07877 null
2025-10-08 LuxInstruct: A Cross-Lingual Instruction Tuning Dataset For Luxembourgish Fred Philippy et.al. 2510.07074 null
2025-10-08 Revisiting Metric Reliability for Fine-grained Evaluation of Machine Translation and Summarization in Indian Languages Amir Hossein Yari et.al. 2510.07061 null
2025-10-08 GAMBIT+: A Challenge Set for Evaluating Gender Bias in Machine Translation Quality Estimation Metrics Giorgos Filandrianos et.al. 2510.06841 null
2025-10-08 Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks Qinhao Zhou et.al. 2510.06695 null
2025-10-11 TRepLiNa: Layer-wise CKA+REPINA Alignment Improves Low-Resource Machine Translation in Aya-23 8B Toshiki Nakai et.al. 2510.06249 null
2025-10-01 SynCED-EnDe 2025: A Synthetic and Curated English - German Dataset for Critical Error Detection in Machine Translation Muskaan Chopra et.al. 2510.05144 null
2025-09-27 Trainable Reference-Based Evaluation Metric for Identifying Quality of English-Gujarati Machine Translation System Nisheeth Joshi et.al. 2510.05113 null
2025-10-05 Enhancing OCR for Sino-Vietnamese Language Processing via Fine-tuned PaddleOCRv5 Minh Hoang Nguyen et.al. 2510.04003 null
2025-10-04 Rezwan: Leveraging Large Language Models for Comprehensive Hadith Text Processing: A 1.2M Corpus Development Majid Asgari-Bidhendi et.al. 2510.03781 null
2025-10-04 TreePrompt: Leveraging Hierarchical Few-Shot Example Selection for Improved English-Persian and English-German Translation Ramtin Kakavand et.al. 2510.03748 null
2025-09-30 Scaling Laws Revisited: Modeling the Role of Data Quality in Language Model Pretraining Anirudh Subramanyam et.al. 2510.03313 null
2025-09-24 GemDetox at TextDetox CLEF 2025: Enhancing a Massively Multilingual Model for Text Detoxification on Low-resource Languages Trung Duc Anh Dang et.al. 2510.01250 null
2025-10-01 Exposing the Cracks: Vulnerabilities of Retrieval-Augmented LLM-based Machine Translation Yanming Sun et.al. 2510.00829 null
2025-10-02 Tenyidie Syllabification corpus creation and deep learning applications Teisovi Angami et.al. 2510.00629 null
2025-09-30 Searching for Difficult-to-Translate Test Examples at Scale Wenda Xu et.al. 2509.26619 null
2025-10-02 Generating Difficult-to-Translate Texts Vilém Zouhar et.al. 2509.26592 null
2025-09-29 Don’t Sweat the Small Stuff: Segment-Level Meta-Evaluation Based on Pairwise Difference Correlation Colten DiIanni et.al. 2509.25546 null
2025-09-29 Aligning Multilingual Reasoning with Verifiable Semantics from a High-Resource Expert Model Fahim Faisal et.al. 2509.25543 null
2025-09-29 ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation Jiuhong Xiao et.al. 2509.24878 null
2025-10-02 The Hidden Costs of Translation Accuracy: Distillation, Quantization, and Environmental Impact Dhaathri Vijay et.al. 2509.23990 null
2025-09-27 Liaozhai through the Looking-Glass: On Paratextual Explicitation of Culture-Bound Terms in Machine Translation Sherrie Shen et.al. 2509.23395 null
2025-09-26 From tests to effect sizes: Quantifying uncertainty and statistical variability in multilingual and multitask NLP evaluation benchmarks Jonne Sälevä et.al. 2509.22612 null
2025-09-26 JGU Mainz’s Submission to the WMT25 Shared Task on LLMs with Limited Resources for Slavic Languages: MT and QA Hossain Shaikh Saadi et.al. 2509.22490 null
2025-09-26 MO-GRPO: Mitigating Reward Hacking of Group Relative Policy Optimization on Multi-Objective Problems Yuki Ichihara et.al. 2509.22047 null
2025-09-25 “Be My Cheese?”: Assessing Cultural Nuance in Multilingual LLM Translations Madison Van Doren et.al. 2509.21577 null
2025-09-24 SiniticMTError: A Machine Translation Dataset with Error Annotations for Sinitic Languages Hannah Liu et.al. 2509.20557 null
2025-09-24 Feeding Two Birds or Favoring One? Adequacy-Fluency Tradeoffs in Evaluation and Meta-Evaluation of Machine Translation Behzad Shayegh et.al. 2509.20287 null
2025-09-24 Low-Resource English-Tigrinya MT: Leveraging Multilingual Models, Custom Tokenizers, and Clean Evaluation Benchmarks Hailay Kidu Teklehaymanot et.al. 2509.20209 null
2025-09-24 CorIL: Towards Enriching Indian Language to Indian Language Parallel Corpora and Machine Translation Systems Soham Bhattacharjee et.al. 2509.19941 null
2025-09-24 EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation Sen Yang et.al. 2509.19770 null
2025-09-23 Evaluating Language Translation Models by Playing Telephone Syeda Jannatus Saba et.al. 2509.19611 null
2025-09-22 Transformer-Encoder Trees for Efficient Multilingual Machine Translation and Speech Translation Yiwen Guan et.al. 2509.17930 null
2025-09-22 Specification-Aware Machine Translation and Evaluation for Purpose Alignment Yoko Kayano et.al. 2509.17559 null
2025-09-22 Enhancing Cross-Lingual Transfer through Reversible Transliteration: A Huffman-Based Approach for Low-Resource Languages Wenhao Zhuang et.al. 2509.17493 null
2025-10-10 Filling in the Clinical Gaps in Benchmark: Case for HealthBench for the Japanese medical system Shohei Hisada et.al. 2509.17444 null
2025-09-22 Scaling, Simplification, and Adaptation: Lessons from Pretraining on Machine-Translated Text Dan John Velasco et.al. 2509.17317 null
2025-09-22 JPResUnet: A Joint Probability Density Function Translation Model in Partially Premixed Flames Hanying Yang et.al. 2509.17297 null
2025-09-21 Extending Automatic Machine Translation Evaluation to Book-Length Documents Kuang-Da Wang et.al. 2509.17249 null
2025-09-21 CUTE: A Multilingual Dataset for Enhancing Cross-Lingual Knowledge Transfer in Low-Resource Languages Wenhao Zhuang et.al. 2509.16914 null
2025-09-20 Angular Dispersion Accelerates $k$-Nearest Neighbors Machine Translation Evgeniia Tokarchuk et.al. 2509.16729 null
2025-09-19 Whisper-UT: A Unified Translation Framework for Speech and Text Cihan Xiao et.al. 2509.16375 null
2025-09-19 UPRPRC: Unified Pipeline for Reproducing Parallel Resources – Corpus from the United Nations Qiuyang Lu et.al. 2509.15789 null
2025-10-23 Multilingual LLM Prompting Strategies for Medical English-Vietnamese Machine Translation Nhu Vo et.al. 2509.15640 null
2025-09-18 RulER: Automated Rule-Based Semantic Error Localization and Repair for Code Translation Shuo Jin et.al. 2509.14829 null
2025-09-18 Evaluating Large Language Models for Cross-Lingual Retrieval Longfei Zuo et.al. 2509.14749 null
2025-09-17 Translate, then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification Samuel J. Bell et.al. 2509.14493 null
2025-09-17 You Are What You Train: Effects of Data Composition on Training Context-aware Machine Translation Models Paweł Mąka et.al. 2509.14031 null
2025-09-17 Audio-Based Crowd-Sourced Evaluation of Machine Translation Quality Sami Ul Haq et.al. 2509.14023 null
2025-09-17 Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale Hasan Abed Al Kader Hammoud et.al. 2509.14008 null
2025-09-17 Long-context Reference-based MT Quality Estimation Sami Ul Haq et.al. 2509.13980 null
2025-09-20 Data Augmentation for Maltese NLP using Transliterated and Machine Translated Arabic Data Kurt Micallef et.al. 2509.12853 null
2025-10-06 Human + AI for Accelerating Ad Localization Evaluation Harshit Rajgarhia et.al. 2509.12543 null
2025-09-15 A comparison of pipelines for the translation of a low resource language based on transformers Chiara Bonfanti et.al. 2509.12514 null
2025-09-14 PATIMT-Bench: A Multi-Scenario Benchmark for Position-Aware Text Image Machine Translation in Large Vision-Language Models Wanru Zhuang et.al. 2509.12278 null
2025-09-15 XplaiNLP at CheckThat! 2025: Multilingual Subjectivity Detection with Finetuned Transformers and Prompt-Based Inference with Large Language Models Ariana Sahitaj et.al. 2509.12130 null
2025-09-04 Optimal Multi-Task Learning at Regularization Horizon for Speech Translation Task JungHo Jung et.al. 2509.09701 null
2025-09-11 Mitigating Language Barriers in Education: Developing Multilingual Digital Learning Materials with Machine Translation Lucie Poláková et.al. 2509.09473 null
2025-09-09 Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost Mihai Nadas et.al. 2509.07829 null
2025-10-18 From Scarcity to Efficiency: Investigating the Effects of Data Augmentation on African Machine Translation Mardiyyah Oduwole et.al. 2509.07471 null
2025-09-09 Hunyuan-MT Technical Report Mao Zheng et.al. 2509.05209 null
2025-09-05 PRIM: Towards Practical In-Image Multilingual Machine Translation Yanzhi Tian et.al. 2509.05146 null
2025-09-28 Artificially Fluent: Swahili AI Performance Benchmarks Between English-Trained and Natively-Trained Datasets Sophie Jaffer et.al. 2509.04516 null
2025-09-04 Exploring NLP Benchmarks in an Extremely Low-Resource Setting Ulin Nuha et.al. 2509.03962 null
2025-09-04 Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation Jiaxin Guo et.al. 2509.03809 null
2025-09-24 Expanding the WMT24++ Benchmark with Rumantsch Grischun, Sursilvan, Sutsilvan, Surmiran, Puter, and Vallader Jannis Vamvas et.al. 2509.03148 null
2025-09-02 The Forgotten Code: Validating a Century-Old Translation System with AI Jean-Marie Le Ray et.al. 2509.02506 null
2025-09-18 CSRM-LLM: Embracing Multilingual LLMs for Cold-Start Relevance Matching in Emerging E-commerce Markets Yujing Wang et.al. 2509.01566 null
2025-08-28 The Uneven Impact of Post-Training Quantization in Machine Translation Benjamin Marie et.al. 2508.20893 null
2025-08-28 Languages Still Left Behind: Toward a Better Multilingual Machine Translation Benchmark Chihiro Taguchi et.al. 2508.20511 null
2025-09-06 FlowMalTrans: Unsupervised Binary Code Translation for Malware Detection Using Flow-Adapter Architecture Minghao Hu et.al. 2508.20212 null
2025-08-26 Improving Low-Resource Translation with Dictionary-Guided Fine-Tuning and RL: A Spanish-to-Wayuunaiki Study Manuel Mosquera et.al. 2508.19481 null
2025-09-03 The Ramon Llull’s Thinking Machine for Automated Ideation Xinran Zhao et.al. 2508.19200 null
2025-10-10 LaTeXTrans: Structured LaTeX Translation with Multi-Agent Coordination Ziming Zhu et.al. 2508.18791 null
2025-08-26 A New NMT Model for Translating Clinical Texts from English to Spanish Rumeng Li et.al. 2508.18607 null
2025-08-25 COMET-poly: Machine Translation Metric Grounded in Other Candidates Maike Züfle et.al. 2508.18549 null
2025-08-24 Evaluating the Impact of Verbal Multiword Expressions on Machine Translation Linfeng Liu et.al. 2508.17458 null
2025-08-22 Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish Yakup Abrek Er et.al. 2508.16431 null
2025-08-22 The Mediomatix Corpus: Parallel Data for Romansh Idioms via Comparable Schoolbooks Zachary Hopton et.al. 2508.16371 null
2025-10-06 OpenWHO: A Document-Level Parallel Corpus for Health Translation in Low-Resource Languages Raphaël Merx et.al. 2508.16048 null
2025-08-21 Confidence-Modulated Speculative Decoding for Large Language Models Jaydip Sen et.al. 2508.15371 null
2025-08-20 Improving LLMs for Machine Translation Using Synthetic Preference Data Dario Vajda et.al. 2508.14951 null
2025-08-24 Preliminary Ranking of WMT25 General Machine Translation Systems Tom Kocmi et.al. 2508.14909 null
2025-08-20 Filling the Gap for Uzbek: Creating Translation Resources for Southern Uzbek Mukhammadsaid Mamasaidov et.al. 2508.14586 null
2025-08-20 In2x at WMT25 Translation Task Lei Pang et.al. 2508.14472 null
2025-08-18 Overcoming Latency Bottlenecks in On-Device Speech Translation: A Cascaded Approach with Alignment-Based Streaming MT Zeeshan Ahmed et.al. 2508.13358 null
2025-09-29 DocHPLT: A Massively Multilingual Document-Level Translation Dataset Dayyán O’Brien et.al. 2508.13079 null
2025-08-18 From SALAMANDRA to SALAMANDRATA: BSC Submission for WMT25 General Machine Translation Shared Task Javier Garcia Gilabert et.al. 2508.12774 null
2025-08-25 SEA-BED: Southeast Asia Embedding Benchmark Wuttikorn Ponwitayarat et.al. 2508.12243 null
2025-08-14 Neural Machine Translation for Coptic-French: Strategies for Low-Resource Ancient Languages Nasma Chaoui et.al. 2508.10683 null
2025-08-14 Evaluating LLMs on Chinese Idiom Translation Cai Yang et.al. 2508.10421 null
2025-08-28 Estimating Machine Translation Difficulty Lorenzo Proietti et.al. 2508.10175 null
2025-08-12 TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation Armel Zebaze et.al. 2508.08680 null
2025-08-12 UWB at WASSA-2024 Shared Task 2: Cross-lingual Emotion Detection Jakub Šmíd et.al. 2508.08650 null
2025-08-11 Toward Machine Interpreting: Lessons from Human Interpreting Studies Matthias Sperber et.al. 2508.07964 null
2025-08-10 ALOPE: Adaptive Layer Optimization for Translation Quality Estimation using Large Language Models Archchana Sindhujan et.al. 2508.07484 null
2025-08-08 Testing the Limits of Machine Translation from One Book Jonathan Shaw et.al. 2508.06665 null
2025-08-08 Train It and Forget It: Merge Lists are Unnecessary for BPE Inference in Language Models Tomohiro Sawada et.al. 2508.06621 null
2025-08-07 PEACH: A sentence-aligned Parallel English-Arabic Corpus for Healthcare Rania Al-Sabbagh et.al. 2508.05722 null
2025-08-07 MELLA: Bridging Linguistic Capability and Cultural Groundedness for Low-Resource Language MLLMs Yufei Gao et.al. 2508.05502 null
2025-08-07 Optimal Corpus Aware Training for Neural Machine Translation Yi-Hsiu Liao et.al. 2508.05364 null
2025-08-11 REINA: Regularized Entropy Information-Based Loss for Efficient Simultaneous Speech Translation Nameer Hirschkind et.al. 2508.04946 null
2025-08-05 Marito: Structuring and Building Open Multilingual Terminologies for South African NLP Vukosi Marivate et.al. 2508.03529 null
2025-08-05 Investigation on deep learning-based galaxy image translation models Hengxin Ruan et.al. 2508.03291 null
2025-08-05 Cross-lingual Opinions and Emotions Mining in Comparable Documents Motaz Saad et.al. 2508.03112 null
2025-08-04 A Survey on Data Security in Large Language Models Kang Chen et.al. 2508.02312 null
2025-08-04 A French Version of the OLDI Seed Corpus Malik Marmonier et.al. 2508.02290 null
2025-08-04 SHAMI-MT: A Syrian Arabic Dialect to Modern Standard Arabic Bidirectional Machine Translation System Serry Sibaee et.al. 2508.02268 null
2025-08-25 CultureGuard: Towards Culturally-Aware Dataset and Guard Model for Multilingual Safety Applications Raviraj Joshi et.al. 2508.01710 null
2025-08-02 ArzEn-MultiGenre: An aligned parallel dataset of Egyptian Arabic song lyrics, novels, and subtitles, with English translations Rania Al-Sabbagh et.al. 2508.01411 null
2025-09-16 Sample-Aware Test-Time Adaptation for Medical Image-to-Image Translation Irene Iele et.al. 2508.00766 null
2025-07-31 Arabic Hate Speech Identification and Masking in Social Media using Deep Learning Models and Pre-trained Models Fine-tuning Salam Thabet Doghmash et.al. 2507.23661 null
2025-07-31 Beyond the Cloud: Assessing the Benefits and Drawbacks of Local LLM Deployment for Translators Peter Sandrini et.al. 2507.23399 null
2025-07-29 RL from Teacher-Model Refinement: Gradual Imitation Learning for Machine Translation Dongyub Jude Lee et.al. 2507.22219 null
2025-07-31 Multi-Hypothesis Distillation of Multilingual Neural Translation Models for Low-Resource Languages Aarón Galiano-Jiménez et.al. 2507.21568 null
2025-07-07 iLSU-T: an Open Dataset for Uruguayan Sign Language Translation Ariel E. Stassi et.al. 2507.21104 null
2025-07-28 Multilingual Self-Taught Faithfulness Evaluators Carlo Alfano et.al. 2507.20752 null
2025-09-02 Advancing Dialectal Arabic to Modern Standard Arabic Machine Translation Abdullah Alabdullah et.al. 2507.20301 null
2025-07-29 Mind the Language Gap in Digital Humanities: LLM-Aided Translation of SKOS Thesauri Felix Kraus et.al. 2507.19537 null
2025-07-25 LLaVA-NeuMT: Selective Layer-Neuron Modulation for Efficient Multilingual Multimodal Translation Jingxuan Wei et.al. 2507.18940 null
2025-07-24 GIIFT: Graph-guided Inductive Image-free Multimodal Machine Translation Jiafeng Xiong et.al. 2507.18562 null
2025-07-24 Uncertainty Quantification for Evaluating Machine Translation Bias Ieva Raminta Staliūnaitė et.al. 2507.18338 null
2025-07-25 Natural Language Processing for Tigrinya: Current State and Future Directions Fitsum Gaim et.al. 2507.17974 null
2025-07-23 Dual-branch Prompting for Multimodal Machine Translation Jie Wang et.al. 2507.17588 null
2025-07-22 Introducing Quality Estimation to Machine Translation Post-editing Workflow: An Empirical Study on Its Usefulness Siqi Liu et.al. 2507.16515 null
2025-07-22 GG-BBQ: German Gender Bias Benchmark for Question Answering Shalaka Satheesh et.al. 2507.16410 null
2025-07-21 Evaluating Text Style Transfer: A Nine-Language Benchmark for Text Detoxification Vitaly Protasov et.al. 2507.15557 null
2025-07-20 A Case Against Implicit Standards: Homophone Normalization in Machine Translation for Languages that use the Ge’ez Script Hellina Hailu Nigatu et.al. 2507.15142 null
2025-08-21 Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters Shanbo Cheng et.al. 2507.13618 null
2025-07-16 Mitigating Stylistic Biases of Machine Translation Systems via Monolingual Corpora Only Xuanqi Gao et.al. 2507.13395 null
2025-07-16 The first open machine translation system for the Chechen language Abu-Viskhan A. Umishov et.al. 2507.12672 null
2025-09-19 Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese Yikang Liu et.al. 2507.12260 null
2025-07-16 Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models Bo Zeng et.al. 2507.11882 null
2025-07-31 ILID: Native Script Language Identification for Indian Languages Yash Ingle et.al. 2507.11832 null
2025-08-30 How Important is ‘Perfect’ English for Machine Translation Prompts? Patrícia Schmidtová et.al. 2507.09509 null
2025-07-11 Improving MLLM’s Document Image Machine Translation via Synchronously Self-reviewing Its OCR Proficiency Yupu Liang et.al. 2507.08309 null
2025-07-10 Conditional Unigram Tokenization with Parallel Data Gianluca Vico et.al. 2507.07824 null
2025-07-10 Single-to-mix Modality Alignment with Multimodal Large Language Model for Document Image Machine Translation Yupu Liang et.al. 2507.07572 null
2025-07-09 Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation Kazi Mahathir Rahman et.al. 2507.06530 null
2025-07-09 Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings Russell Taylor et.al. 2507.06506 null
2025-07-07 A Tale of Two Scripts: Transliteration and Post-Correction for Judeo-Arabic Juan Moreno Gonzalez et.al. 2507.04746 null
2025-07-09 Losing our Tail – Again: On (Un)Natural Selection And Multilingual Large Language Models Eva Vanmassenhove et.al. 2507.03933 null
2025-07-17 Learning to Translate Ambiguous Terminology by Preference Optimization on Post-Edits Nathaniel Berger et.al. 2507.03580 null
2025-07-04 GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation Himanshu Dutta et.al. 2507.03311 null
2025-07-01 TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation Xi Xuan et.al. 2507.00875 null
2025-07-01 Neural translation for Stokes inversion and synthesis A. Asensio Ramos et.al. 2507.00594 null
2025-06-30 Natural language processing for African languages David Ifeoluwa Adelani et.al. 2507.00297 link
2025-06-30 Bridging the Gap with Retrieval-Augmented Generation: Making Prosthetic Device User Manuals Available in Marginalised Languages Ikechukwu Ogbonna et.al. 2506.23958 null
2025-07-07 CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation Yi Liu et.al. 2506.23347 null
2025-05-12 Do Not Change Me: On Transferring Entities Without Modification in Neural Machine Translation – a Multilingual Perspective Dawid Wisniewski et.al. 2505.06010 null
2024-12-30 Advancing Explainability in Neural Machine Translation: Analytical Metrics for Attention and Alignment Consistency Anurag Mishra et.al. 2412.18669 null
2025-07-31 Instruction-tuned Large Language Models for Machine Translation in the Medical Domain Miguel Rios et.al. 2408.16440 null
2024-08-07 Conditioning LLMs with Emotion in Neural Machine Translation Charles Brazier et.al. 2408.03150 null
2024-06-11 CantonMT: Cantonese to English NMT Platform with Fine-Tuned Models Using Synthetic Back-Translation Data Kung Yin Hong et.al. 2403.11346 null
2023-11-21 Context-aware Neural Machine Translation for English-Japanese Business Scene Dialogues Sumire Honda et.al. 2311.11976 null
2023-11-01 Is Robustness Transferable across Languages in Multilingual Neural Machine Translation? Leiyu Pan et.al. 2310.20162 null
2023-08-28 Ngambay-French Neural Machine Translation (sba-Fr) Sakayo Toadoum Sari et.al. 2308.13497 null
2023-07-18 Enhancing Supervised Learning with Contrastive Markings in Neural Machine Translation Training Nathaniel Berger et.al. 2307.08416 null
2023-05-29 Gender Lost In Translation: How Bridging The Gap Between Languages Affects Gender Bias in Zero-Shot Multilingual Translation Lena Cabrera et.al. 2305.16935 null
2023-05-15 Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization Pengzhi Gao et.al. 2305.07310 null
2022-09-07 Informative Language Representation Learning for Massively Multilingual Neural Machine Translation Renren Jin et.al. 2209.01530 null
2022-08-16 Fast Vocabulary Projection Method via Clustering for Multilingual Machine Translation on GPU Hossam Amer et.al. 2208.06874 null
2022-08-25 UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation Jian Yang et.al. 2207.04900 null
2022-04-12 MMTAfrica: Multilingual Machine Translation for African Languages Chris C. Emezue et.al. 2204.04306 null
2022-03-09 ViNMT: Neural Machine Translation Toolkit Nguyen Hoang Quan et.al. 2112.15272 null
2021-12-23 English2Gbe: A multilingual machine translation model for {Fon/Ewe}Gbe Gilles Hacheme et.al. 2112.11482 null
2022-04-14 Towards Making the Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation Guanhua Chen et.al. 2110.08547 null
2021-09-10 Generalised Unsupervised Domain Adaptation of Neural Machine Translation with Cross-Lingual Data Selection Thuy-Trang Vu et.al. 2109.04292 null
2021-11-08 Zero-shot Cross-lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders Guanhua Chen et.al. 2104.08757 null
2020-11-04 Cross-lingual Word Embeddings beyond Zero-shot Machine Translation Shifei Chen et.al. 2011.01682 null
2020-10-21 Complete Multilingual Neural Machine Translation Markus Freitag et.al. 2010.10239 null
2020-10-20 Diving Deep into Context-Aware Neural Machine Translation Jingjing Huo et.al. 2010.09482 null
2022-03-15 Rethinking Document-level Neural Machine Translation Zewei Sun et.al. 2010.08961 null
2020-10-07 Multi-task Learning for Multilingual Neural Machine Translation Yiren Wang et.al. 2010.02523 null
2020-10-16 Very Deep Transformers for Neural Machine Translation Xiaodong Liu et.al. 2008.07772 null
2020-08-10 A Multilingual Neural Machine Translation Model for Biomedical Data Alexandre Bérard et.al. 2008.02878 null
2020-05-27 The Unreasonable Volatility of Neural Machine Translation Models Marzieh Fadaee et.al. 2005.12398 null
2020-05-12 Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation Aditya Siddhant et.al. 2005.04816 null
2020-04-27 Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation Biao Zhang et.al. 2004.11867 null
2020-12-10 Transfer learning and subword sampling for asymmetric-resource one-to-many neural translation Stig-Arne Grönroos et.al. 2004.04002 null
2021-04-02 Cross-lingual Supervision Improves Unsupervised Neural Machine Translation Mingxuan Wang et.al. 2004.03137 null
2020-02-21 Compositional Neural Machine Translation by Removing the Lexicon from Syntax Tristan Thrush et.al. 2002.08899 null
2020-01-08 A Comprehensive Survey of Multilingual Neural Machine Translation Raj Dabre et.al. 2001.01115 null
2019-12-30 A Study of Multilingual Neural Machine Translation Xu Tan et.al. 1912.11625 null
2019-12-09 Pairwise Neural Machine Translation Evaluation Francisco Guzman et.al. 1912.03135 null
2019-12-09 Machine Translation Evaluation Meets Community Question Answering Francisco Guzmán et.al. 1912.02998 null
2020-10-01 Neural Machine Translation: A Review and Survey Felix Stahlberg et.al. 1912.02047 null
2019-12-04 Cross-lingual Pre-training Based Transfer for Zero-shot Neural Machine Translation Baijun Ji et.al. 1912.01214 null
2019-10-31 Adapting Multilingual Neural Machine Translation to Unseen Languages Surafel M. Lakew et.al. 1910.13998 null
2019-10-29 Multitask Learning For Different Subword Segmentations In Neural Machine Translation Tejas Srinivasan et.al. 1910.12368 null
2019-10-22 On the Importance of Word Boundaries in Character-level Neural Machine Translation Duygu Ataman et.al. 1910.06753 null
2019-10-02 Interrogating the Explanatory Power of Attention in Neural Machine Translation Pooya Moradi et.al. 1910.00139 null
2019-09-25 Data Ordering Patterns for Neural Machine Translation: An Empirical Study Siddhant Garg et.al. 1909.10642 null
2019-09-17 Multilingual Neural Machine Translation for Zero-Resource Languages Surafel M. Lakew et.al. 1909.07342 link
2019-11-13 Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation Aditya Siddhant et.al. 1909.00437 null
2019-10-09 Transductive Data-Selection Algorithms for Fine-Tuning Neural Machine Translation Alberto Poncelas et.al. 1908.09532 null
2019-08-27 Multilingual Neural Machine Translation with Language Clustering Xu Tan et.al. 1908.09324 null
2019-07-16 Simple Automatic Post-editing for Arabic-Japanese Machine Translation Ella Noll et.al. 1907.06210 null
2019-07-12 Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges Naveen Arivazhagan et.al. 1907.05019 null
2019-07-10 An Intrinsic Nearest Neighbor Analysis of Neural Machine Translation Architectures Hamidreza Ghader et.al. 1907.03885 null
2019-07-09 Exploiting Out-of-Domain Parallel Data through Multilingual Transfer Learning for Low-Resource Neural Machine Translation Aizhan Imankulova et.al. 1907.03060 null
2019-07-08 Interactive-Predictive Neural Machine Translation through Reinforcement and Imitation Tsz Kin Lam et.al. 1907.02326 null
2019-07-03 Improving Robustness in Real-World Neural Machine Translation Engines Rohit Gupta et.al. 1907.01279 null
2019-06-19 Generalizing Back-Translation in Neural Machine Translation Miguel Graça et.al. 1906.07286 null
2019-06-10 Word-based Domain Adaptation for Neural Machine Translation Shen Yan et.al. 1906.03129 null
2019-06-06 Effective Cross-lingual Transfer of Neural Machine Translation Models without Shared Vocabularies Yunsu Kim et.al. 1905.05475 null
2019-03-28 Using Monolingual Data in Neural Machine Translation: a Systematic Study Franck Burlot et.al. 1903.11437 null
2019-07-03 Massively Multilingual Neural Machine Translation Roee Aharoni et.al. 1903.00089 null
2018-12-04 The RGNLP Machine Translation Systems for WAT 2018 Atul Kr. Ojha et.al. 1812.00798 null
2018-11-06 Improving Zero-Shot Translation of Low-Resource Languages Surafel M. Lakew et.al. 1811.01389 null
2018-11-06 Transfer Learning in Multilingual Neural Machine Translation with Dynamic Vocabulary Surafel M. Lakew et.al. 1811.01137 null
2018-11-06 Neural Machine Translation into Language Varieties Surafel M. Lakew et.al. 1811.01064 null
2018-09-14 Zero-Shot Cross-lingual Classification Using Multilingual Neural Machine Translation Akiko Eriguchi et.al. 1809.04686 null
2018-09-11 Towards one-shot learning for rare-word translation with external experts Ngoc-Quan Pham et.al. 1809.03182 null
2020-07-09 Trivial Transfer Learning for Low-Resource Neural Machine Translation Tom Kocmi et.al. 1809.00357 null
2018-09-14 Parameter Sharing Methods for Multilingual Self-Attentional Translation Models Devendra Singh Sachan et.al. 1809.00252 null
2018-09-05 Denoising Neural Machine Translation Training with Trusted Data and Online Data Selection Wei Wang et.al. 1809.00068 null
2018-06-22 A Comparison of Transformer and Recurrent Neural Networks on Multilingual Neural Machine Translation Surafel M. Lakew et.al. 1806.06957 null
2018-06-14 Generative Neural Machine Translation Harshil Shah et.al. 1806.05138 null
2018-06-11 Multilingual Neural Machine Translation with Task-Specific Attention Graeme Blackwood et.al. 1806.03280 null
2018-06-11 Multi-Source Neural Machine Translation with Missing Data Yuta Nishimura et.al. 1806.02525 null
2020-09-14 On the Impact of Various Types of Noise on Neural Machine Translation Huda Khayrallah et.al. 1805.12282 null
2018-05-31 Bi-Directional Neural Machine Translation with Synthetic Parallel Data Xing Niu et.al. 1805.11213 null
2018-05-14 Bootstrapping Multilingual Intent Models via Machine Translation for Dialog Automation Nicholas Ruiz et.al. 1805.04453 null
2018-05-14 Deep Neural Machine Translation with Weakly-Recurrent Units Mattia Antonino Di Gangi et.al. 1805.04185 null
2018-05-08 Multi-Domain Neural Machine Translation Sander Tars et.al. 1805.02282 null
2021-09-15 Exploring Hyper-Parameter Optimization for Neural Machine Translation on GPU Architectures Robert Lim et.al. 1805.02094 null
2018-10-17 A neural interlingua for multilingual machine translation Yichao Lu et.al. 1804.08198 null
2021-05-20 Massively Parallel Cross-Lingual Learning in Low-Resource Target Language Translation Zhong Zhou et.al. 1804.07878 null
2018-02-13 Quantitative Fine-Grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian Filip Klubička et.al. 1802.01451 null
2018-09-19 A User-Study on Online Adaptation of Neural Machine Translation to Human Post-Edits Sariya Karimova et.al. 1712.04853 null
2017-10-06 Machine Translation Evaluation with Neural Networks Francisco Guzmán et.al. 1710.02095 null
2017-08-22 Neural Machine Translation with Extended Context Jörg Tiedemann et.al. 1708.05943 null
2017-08-22 The Helsinki Neural Machine Translation System Robert Östling et.al. 1708.05942 null
2017-08-04 Exploiting Linguistic Resources for Neural Machine Translation Using Multi-task Learning Jan Niehues et.al. 1708.00993 null
2017-08-01 Linguistically Motivated Vocabulary Reduction for Neural Machine Translation from Turkish to English Duygu Ataman et.al. 1707.09879 null
2017-06-30 Stronger Baselines for Trustable Results in Neural Machine Translation Michael Denkowski et.al. 1706.09733 null
2017-06-20 An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation Makoto Morishita et.al. 1706.05765 null
2017-06-14 Six Challenges for Neural Machine Translation Philipp Koehn et.al. 1706.03872 null
2018-12-19 Beam Search Strategies for Neural Machine Translation Markus Freitag et.al. 1702.01806 null
2017-07-19 Predicting Target Language CCG Supertags Improves Neural Machine Translation Maria Nadejde et.al. 1702.01147 null
2017-08-23 Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation Melvin Johnson et.al. 1611.04558 null
2016-10-21 Lexicons and Minimum Risk Training for Neural Machine Translation: NAIST-CMU at WAT2016 Graham Neubig et.al. 1610.06542 null
2016-01-07 Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism Orhan Firat et.al. 1601.01073 null
2016-06-06 Improving Neural Machine Translation Models with Monolingual Data Rico Sennrich et.al. 1511.06709 null
2015-09-30 Neural-based machine translation for medical text domain. Based on European Medicines Agency leaflet texts Krzysztof Wołk et.al. 1509.08644 null
2014-09-26 Semantically-Informed Syntactic Machine Translation: A Tree-Grafting Approach Kathryn Baker et.al. 1409.7085 null
2014-10-08 On the Properties of Neural Machine Translation: Encoder-Decoder Approaches Kyunghyun Cho et.al. 1409.1259 null
2014-10-08 Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation Jean Pouget-Abadie et.al. 1409.1257 null

⚡ Small Language Models

📊 3333 papers

📅 Publish Date 📝 Title 👥 Authors 📄 PDF 💻 Code
2026-04-01 Universal YOCO for Efficient Depth Scaling Yutao Sun et.al. 2604.01220 null
2026-04-01 LLM REgression with a Latent Iterative State Head Yiheng Su et.al. 2604.01206 null
2026-04-01 AdaLoRA-QAT: Adaptive Low-Rank and Quantization-Aware Segmentation Prantik Deb et.al. 2604.01167 null
2026-04-01 Lightweight Prompt-Guided CLIP Adaptation for Monocular Depth Estimation Reyhaneh Ahani Manghotay et.al. 2604.01118 null
2026-04-01 A Hierarchical Importance-Guided Multi-objective Evolutionary Framework for Deep Neural Network Pruning Zak Khan et.al. 2604.01076 null
2026-04-01 ONE-SHOT: Compositional Human-Environment Video Synthesis via Spatial-Decoupled Motion Injection and Hybrid Context Integration Fengyuan Yang et.al. 2604.01043 null
2026-04-01 Integer-State Dynamics of Quantized Spiking Neural Networks for Efficient Hardware Acceleration Lei Zhang et.al. 2604.01042 null
2026-04-01 Fast and Accurate Probing of In-Training LLMs’ Downstream Performances Zhichen Liu et.al. 2604.01025 null
2026-04-01 Parameter-Efficient Fine-Tuning of Machine-Learning Interatomic Potentials for Phonon and Thermal Properties Jonas Grandel et.al. 2604.01017 null
2026-04-01 Toral Chern-Simons TQFT via Geometric Quantization in Real Polarization Daniel Galviz et.al. 2604.01016 null
2026-04-01 PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding Nan Wang et.al. 2604.00886 null
2026-04-01 LinguDistill: Recovering Linguistic Ability in Vision-Language Models via Selective Cross-Modal Distillation Patrick Amadeus Irawan et.al. 2604.00829 null
2026-04-01 Video Patch Pruning: Efficient Video Instance Segmentation via Early Token Reduction Patrick Glandorf et.al. 2604.00827 null
2026-04-01 Compact Keyframe-Optimized Multi-Agent Gaussian Splatting SLAM Monica M. Q. Li et.al. 2604.00804 null
2026-04-01 From Baselines to Preferences: A Comparative Study of LoRA/QLoRA and Preference Optimization for Mental Health Text Classification Mihael Arcan et.al. 2604.00773 null
2026-04-01 IWP: Token Pruning as Implicit Weight Pruning in Large Vision Language Models Dong-Jae Lee et.al. 2604.00757 null
2026-04-01 Andreev-enhanced conductance quantization and gate-tunable induced superconducting gap in germanium Elyjah Kiyooka et.al. 2604.00755 null
2026-04-01 Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and Stiefel QR Retraction Björn Roman Kohlberger et.al. 2604.00733 null
2026-04-01 A Survey of On-Policy Distillation for Large Language Models Mingyang Song et.al. 2604.00626 null
2026-04-01 A Physical Imitation Learning Pipeline for Energy-Efficient Quadruped Locomotion Assisted by Parallel Elastic Joint Huyue Ma et.al. 2604.00611 null
2026-04-01 TALENT: Target-aware Efficient Tuning for Referring Image Segmentation Shuo Jin et.al. 2604.00609 null
2026-04-01 More Human, More Efficient: Aligning Annotations with Quantized SLMs Jiayu Wang et.al. 2604.00586 null
2026-04-01 Learning from Many and Adapting to the Unknown in Open-set Test Streams Xiao Zhang et.al. 2604.00533 null
2026-04-01 Formal Deformation quantization as a Fréchet algebra Qin Li et.al. 2604.00532 null
2026-04-01 MF-QAT: Multi-Format Quantization-Aware Training for Elastic Inference Zifei Xu et.al. 2604.00529 null
2026-04-01 Adaptive Parallel Monte Carlo Tree Search for Efficient Test-time Compute Scaling Hongbeen Kim et.al. 2604.00510 null
2026-04-01 VADMamba++: Efficient Video Anomaly Detection via Hybrid Modeling in Grayscale Space Jihao Lyu et.al. 2604.00360 null
2026-03-31 UCell: rethinking generalizability and scaling of bio-medical vision models Nicholas Kuang et.al. 2604.00243 null
2026-03-31 The Kormendy Relation in the First Billion Years: Evidence from $JWST$ Anshuman Borgohain et.al. 2604.00104 null
2026-03-31 Meteorology-Driven GPT4AP: A Multi-Task Forecasting LLM for Atmospheric Air Pollution in Data-Scarce Settings Prasanjit Dey et.al. 2603.29974 null
2026-03-31 Curvature-Guided LoRA: Steering in the pretrained NTK subspace Frédéric Zheng et.al. 2603.29824 null
2026-03-31 Compiling Code LLMs into Lightweight Executables Jieke Shi et.al. 2603.29813 null
2026-03-31 Big2Small: A Unifying Neural Network Framework for Model Compression Jing-Xiao Liao et.al. 2603.29768 null
2026-03-31 One-for-All: A Lightweight Stabilized and Parameter-Efficient Pre-trained LLM for Time Series Forecasting Prasanjit Dey et.al. 2603.29756 null
2026-03-31 Client-Verifiable and Efficient Federated Unlearning in Low-Altitude Wireless Networks Yuhua Xu et.al. 2603.29688 null
2026-03-31 Quantization with Unified Adaptive Distillation to enable multi-LoRA based one-for-all Generative Vision Models on edge Sowmya Vajrala et.al. 2603.29535 null
2026-03-31 Distilling Human-Aligned Privacy Sensitivity Assessment from Large Language Models Gabriel Loiseau et.al. 2603.29497 null
2026-03-31 SeGPruner: Semantic-Geometric Visual Token Pruner for 3D Question Answering Wenli Li et.al. 2603.29437 null
2026-03-31 AP-DRL: A Synergistic Algorithm-Hardware Framework for Automatic Task Partitioning of Deep Reinforcement Learning on Versal ACAP Enlai Li et.al. 2603.29369 null
2026-03-31 Long-Document QA with Chain-of-Structured-Thought and Fine-Tuned SLMs Zhuowen Liang et.al. 2603.29232 null
2026-03-31 Dual-Imbalance Continual Learning for Real-World Food Recognition Xiaoyan Zhang et.al. 2603.29133 null
2026-03-31 A Multi-Sensor Fusion Parking Barrier System with Lightweight Vision on Edge Yuwen Zhu et.al. 2603.29126 null
2026-03-30 PolarQuant: Optimal Gaussian Weight Quantization via Hadamard Rotation for LLM Compression Caio Vicentino et.al. 2603.29078 null
2026-03-30 A Unified Algebraic Framework for Subspace Pruning in Koopman Operator Approximation via Principal Vectors Dhruv Shah et.al. 2603.29001 null
2026-03-30 Zero-shot Cross-domain Knowledge Distillation: A Case study on YouTube Music Srivaths Ranganathan et.al. 2603.28994 null
2026-03-30 Linear Regression from 1-bit Quantized Data Daniel Hill et.al. 2603.28989 null
2026-03-30 Privacy Guard & Token Parsimony by Prompt and Context Handling and LLM Routing Alessio Langiu et.al. 2603.28972 null
2026-03-30 OneComp: One-Line Revolution for Generative AI Model Compression Yuma Ichikawa et.al. 2603.28845 null
2026-03-30 DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing Kailai Feng et.al. 2603.28713 null
2026-03-30 Trust-Aware Routing for Distributed Generative AI Inference at the Edge Chanh Nguyen et.al. 2603.28622 null
2026-03-30 Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aerial Systems Iman Sharifi et.al. 2603.28561 null
2026-03-30 Compressing Transformer Language Models via Matrix Product Operator Decomposition: A Case Study on PicoGPT Younes Javanmard et.al. 2603.28534 link
2026-03-30 HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention Yufei Xu et.al. 2603.28458 null
2026-03-31 LG-HCC: Local Geometry-Aware Hierarchical Context Compression for 3D Gaussian Splatting Xuan Deng et.al. 2603.28431 null
2026-03-30 IsoQuant: Hardware-Aligned SO(4) Isoclinic Rotations for LLM KV Cache Compression Zhongping Ji et.al. 2603.28430 null
2026-03-31 Resource-efficient quantum approximate optimization algorithm via Bayesian optimization and maximum-probability evaluation Siran Zhang et.al. 2603.28413 null
2026-03-30 EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation Sravanth Kodavanti et.al. 2603.28405 null
2026-03-30 DinoDental: Benchmarking DINOv3 as a Unified Vision Encoder for Dental Image Analysis Kun Tang et.al. 2603.28297 null
2026-03-30 Cost-Matching Model Predictive Control for Efficient Reinforcement Learning in Humanoid Locomotion Wenqi Cai et.al. 2603.28243 null
2026-03-30 TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation Minh-Khoi Do et.al. 2603.28233 null
2026-03-30 Spinning Particles around Einstein-Geometric Proca AdS Compact Objects Gulzoda Rakhimova et.al. 2603.28181 null
2026-03-30 CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning Siyuan Ma et.al. 2603.28135 null
2026-03-30 Q-DIVER: Integrated Quantum Transfer Learning and Differentiable Quantum Architecture Search with EEG Data Junghoon Justin Park et.al. 2603.28122 null
2026-03-30 DELTA: A DAG-aware Efficient OCS Logical Topology Optimization Framework for AIDCs Niangen Ye et.al. 2603.28096 null
2026-03-30 Octree-based Learned Point Cloud Geometry Compression: A Lossy Perspective Kaiyu Zheng et.al. 2603.28095 null
2026-03-30 Reducing Oracle Feedback with Vision-Language Embeddings for Preference-Based RL Udita Ghosh et.al. 2603.28053 null
2026-03-30 Adapting SAM to Nuclei Instance Segmentation and Classification via Cooperative Fine-Grained Refinement Jingze Su et.al. 2603.28027 null
2026-03-30 ExFusion: Efficient Transformer Training via Multi-Experts Fusion Jiacheng Ruan et.al. 2603.27965 null
2026-03-30 ITQ3_S: High-Fidelity 3-bit LLM Inference via Interleaved Ternary Quantization with Rotation-Domain Smoothing Edward J. Yoon et.al. 2603.27914 null
2026-03-29 Rényi Entropy: A New Token Pruning Metric for Vision Transformers Wei-Yuan Su et.al. 2603.27900 null
2026-03-29 Energy Efficient Orchestration in Multiple-Access Vehicular Aerial-Terrestrial 6G Networks Mohammad Farhoudi et.al. 2603.27870 null
2026-03-29 A Resource-Aligned Hybrid Quantum-Classical Framework for Multimodal Face Anti-Spoofing Wanqi Sun et.al. 2603.27852 null
2026-03-29 KVSculpt: KV Cache Compression as Distillation Bo Jiang et.al. 2603.27819 null
2026-03-29 Synergizing Discriminative Exemplars and Self-Refined Experience for MLLM-based In-Context Learning in Medical Diagnosis Wenkai Zhao et.al. 2603.27737 null
2026-03-29 Low-Rank Adaptation Reduces Catastrophic Forgetting in Sequential Transformer Encoder Fine-Tuning: Controlled Empirical Evidence and Frozen-Backbone Representation Probes Ashish Pandey et.al. 2603.27707 null
2026-03-29 Customized Visual Storytelling with Unified Multimodal LLMs Wei-Hua Li et.al. 2603.27690 null
2026-03-29 CrossHGL: A Text-Free Foundation Model for Cross-Domain Heterogeneous Graph Learning Xuanze Chen et.al. 2603.27685 null
2026-03-29 Prototype-Aligned Federated Soft-Prompts for Continual Web Personalization Canran Xiao et.al. 2603.27678 null
2026-03-29 Amped: Adaptive Multi-stage Non-edge Pruning for Edge Detection Yuhan Gao et.al. 2603.27661 null
2026-03-29 V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video Large Language Models Xinying Lin et.al. 2603.27650 null
2026-03-29 OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation Sanghyeon Lee et.al. 2603.27637 null
2026-03-29 KV Cache Quantization for Self-Forcing Video Generation: A 33-Method Empirical Study Suraj Ranganath et.al. 2603.27469 null
2026-03-29 TurboAngle: Near-Lossless KV Cache Compression via Uniform Angle Quantization Dipkumar Patel et.al. 2603.27467 null
2026-03-29 RSR-core: A High-Performance Engine for Low-Bit Matrix-Vector Multiplication Mohsen Dehghankar et.al. 2603.27462 link
2026-03-28 Decompose, Mix, Adapt: A Unified Framework for Parameter-Efficient Neural Network Recombination and Compression Nazia Tasnim et.al. 2603.27383 null
2026-03-28 TokenDance: Token-to-Token Music-to-Dance Generation with Bidirectional Mamba Ziyue Yang et.al. 2603.27314 null
2026-03-28 HiFlow: Tokenization-Free Scale-Wise Autoregressive Policy Learning via Flow Matching Daichi Yashima et.al. 2603.27281 null
2026-03-28 From Foundation ECG Models to NISQ Learners: Distilling ECGFounder into a VQC Student Giovanni dos Santos Franco et.al. 2603.27269 null
2026-03-27 PQuantML: A Tool for End-to-End Hardware-aware Model Compression Roope Niemi et.al. 2603.26595 null
2026-03-27 When Perplexity Lies: Generation-Focused Distillation of Hybrid Sequence Models Juan Gabriel Kostelec et.al. 2603.26556 null
2026-03-27 Learnable Quantum Efficiency Filters for Urban Hyperspectral Segmentation Imad Ali Shah et.al. 2603.26528 null
2026-03-27 SPECTRA: An Efficient Spectral-Informed Neural Network for Sensor-Based Activity Recognition Deepika Gurung et.al. 2603.26482 null
2026-03-27 Domain decomposition of large neural network surrogate models Timm Gödde et.al. 2603.26396 null
2026-03-27 From Pen to Pixel: Translating Hand-Drawn Plots into Graphical APIs via a Novel Benchmark and Efficient Adapter Zhenghao Xu et.al. 2603.26356 null
2026-03-27 From Pixels to Privacy: Temporally Consistent Video Anonymization via Token Pruning for Privacy Preserving Action Recognition Nazia Aslam et.al. 2603.26336 null
2026-03-27 Mitigating the Reasoning Tax in Vision-Language Fine-Tuning with Input-Adaptive Depth Aggregation Yiming Ren et.al. 2603.26330 null
2026-03-27 Query-Specific Pruning of RML Mappings (Extended Version) Sitt Min Oo et.al. 2603.26269 null
2026-03-27 ARTA: Adaptive Mixed-Resolution Token Allocation for Efficient Dense Feature Extraction David Hagerman et.al. 2603.26258 null
2026-03-27 Real-Time Branch-to-Tool Distance Estimation for Autonomous UAV Pruning: Benchmarking Five DEFOM-Stereo Variants from Simulation to Jetson Deployment Yida Lin et.al. 2603.26250 null
2026-03-27 Knowledge Distillation for Efficient Transformer-Based Reinforcement Learning in Hardware-Constrained Energy Management Systems Pascal Henrich et.al. 2603.26249 null
2026-03-27 EPDQ: Efficient and Privacy-Preserving Exact Distance Query on Encrypted Graphs Xuemei Fu et.al. 2603.26219 null
2026-03-27 4DRaL: Bridging 4D Radar with LiDAR for Place Recognition using Knowledge Distillation Ningyuan Huang et.al. 2603.26206 null
2026-03-27 Efficient Few-Shot Learning for Edge AI via Knowledge Distillation on MobileViT Shuhei Tsuyuki et.al. 2603.26145 null
2026-03-27 PruneFuse: Efficient Data Selection via Weight Pruning and Network Fusion Humaira Kousar et.al. 2603.26138 null
2026-03-27 InstaVSR: Taming Diffusion for Efficient and Temporally Consistent Video Super-Resolution Jintong Hu et.al. 2603.26134 null
2026-03-27 TurboESM: Ultra-Efficient 3-Bit KV Cache Quantization for Protein Language Models with Orthogonal Rotation and QJL Correction Yue Hu et.al. 2603.26110 null
2026-03-27 Learnable Instance Attention Filtering for Adaptive Detector Distillation Chen Liu et.al. 2603.26088 null
2026-03-27 Rethinking Token Pruning for Historical Screenshots in GUI Visual Agents: Semantic, Spatial, and Temporal Perspectives Daiqiang Li et.al. 2603.26041 null
2026-03-27 Learning to Trim: End-to-End Causal Graph Pruning with Dynamic Anatomical Feature Banks for Medical VQA Zibo Xu et.al. 2603.26028 null
2026-03-27 VeRA+: Vector-Based Lightweight Digital Compensation for Drift-Resilient RRAM In-Memory Computing Weirong Dong et.al. 2603.26016 null
2026-03-27 FairLLaVA: Fairness-Aware Parameter-Efficient Fine-Tuning for Large Vision-Language Assistants Mahesh Bhosale et.al. 2603.26008 null
2026-03-26 Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio Yijiong Yu et.al. 2603.25926 null
2026-03-26 GazeQwen: Lightweight Gaze-Conditioned LLM Modulation for Streaming Video Understanding Trong Thang Pham et.al. 2603.25841 null
2026-03-26 ETA-VLA: Efficient Token Adaptation via Temporal Fusion and Intra-LLM Sparsification for Vision-Language-Action Models Yiru Wang et.al. 2603.25766 null
2026-03-26 Transverse force tomography inside a proton from Basis Light-front Quantization Ziqi Zhang et.al. 2603.25548 null
2026-03-26 Investigating the Fundamental Limit: A Feasibility Study of Hybrid-Neural Archival Marcus Armstrong et.al. 2603.25526 null
2026-03-27 CLIP-RD: Relational Distillation for Efficient CLIP Knowledge Distillation Jeannie Chung et.al. 2603.25383 null
2026-03-26 Optimizing Entanglement Distribution Protocols: Maximizing Classical Information in Quantum Networks Ethan Sanchez Hidalgo et.al. 2603.25360 null
2026-03-26 How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models Hector Borobia et.al. 2603.25325 null
2026-03-26 Towards Controllable Low-Light Image Enhancement: A Continuous Multi-illumination Dataset and Efficient State Space Framework Hongru Han et.al. 2603.25296 null
2026-03-26 Non-Minimally Coupled Scalar Field, Area Quantization and Black Hole Entropy Sahil Devdutt et.al. 2603.25292 null
2026-03-26 SliderQuant: Accurate Post-Training Quantization for LLMs Shigeng Wang et.al. 2603.25284 null
2026-03-26 Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model Jiahao Wu et.al. 2603.25184 null
2026-03-26 SIGMA: Structure-Invariant Generative Molecular Alignment for Chemical Language Models via Autoregressive Contrastive Learning Xinyu Wang et.al. 2603.25062 null
2026-03-26 Mechanistically Interpreting Compression in Vision-Language Models Veeraraju Elluru et.al. 2603.25035 null
2026-03-26 A Public Theory of Distillation Resistance via Constraint-Coupled Reasoning Architectures Peng Wei et.al. 2603.25022 null
2026-03-26 Topological Quantization of Complex Velocity in Stochastic Spacetimes Jorge Meza-Domíguez et.al. 2603.25016 null
2026-03-26 LiteGuard: Efficient Task-Agnostic Model Fingerprinting with Enhanced Generalization Guang Yang et.al. 2603.24982 null
2026-03-26 Toward domain-specific machine translation and quality estimation systems Javad Pourmostafa Roshan Sharami et.al. 2603.24955 null
2026-03-26 Surrogates, Spikes, and Sparsity: Performance Analysis and Characterization of SNN Hyperparameters on Hardware Ilkin Aliyev et.al. 2603.24891 null
2026-03-25 Prune as You Generate: Online Rollout Pruning for Faster and Better RLVR Haobo Xu et.al. 2603.24840 null
2026-03-25 Coefficient-Decoupled Matrix Product Operators as an Interface to Linear-Combination-of-Unitaries Circuits Younes Javanmard et.al. 2603.24822 null
2026-03-25 Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration Danil Tokhchukov et.al. 2603.24800 null
2026-03-25 Quantization of Beta Functions in Self-Dual Backgrounds and Emergent Non-Commutative EFT Mithat Ünsal et.al. 2603.24799 null
2026-03-25 Rafture: Erasure-coded Raft with Post-Dissemination Pruning Rithwik Kerur et.al. 2603.24761 null
2026-03-25 Bound states of anyons: a geometric quantization approach Qingchen Li et.al. 2603.24701 null
2026-03-25 ReDiPrune: Relevance-Diversity Pre-Projection Token Pruning for Efficient Multimodal LLMs An Yu et.al. 2603.24680 null
2026-03-25 Demystifying When Pruning Works via Representation Hierarchies Shwai He et.al. 2603.24652 null
2026-03-25 From friction scaling to an efficient method for estimating bubble wall velocity Tomasz Krajewski et.al. 2603.24583 null
2026-03-25 Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving Linbo Wang et.al. 2603.24581 null
2026-03-25 TuneShift-KD: Knowledge Distillation and Transfer for Fine-tuned Models Yushi Guan et.al. 2603.24518 null
2026-03-25 JSSAnet: Theory-Guided Subchannel Partitioning and Joint Spatial Attention for Near-Field Channel Estimation Zhiming Zhu et.al. 2603.24505 null
2026-03-25 Marchuk: Efficient Global Weather Forecasting from Mid-Range to Sub-Seasonal Scales via Flow Matching Arsen Kuzhamuratov et.al. 2603.24428 null
2026-03-25 PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks Cheng Cui et.al. 2603.24373 null
2026-03-25 LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control Yifeng Zhang et.al. 2603.24361 null
2026-03-25 Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing Cheng Cui et.al. 2603.24326 null
2026-03-25 Powerful Teachers Matter: Text-Guided Multi-view Knowledge Distillation with Visual Prior Enhancement Xin Zhang et.al. 2603.24208 null
2026-03-25 Linear-Nonlinear Fusion Neural Operator for Partial Differential Equations Heng Wu et.al. 2603.24143 null
2026-03-25 MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare Shubham Kumar Nigam et.al. 2603.24132 null
2026-03-25 UW-VOS: A Large-Scale Dataset for Underwater Video Object Segmentation Hongshen Zhao et.al. 2603.24006 null
2026-03-25 Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score Jimyung Hong et.al. 2603.23985 null
2026-03-25 Towards Energy-aware Requirements Dependency Classification: Knowledge-Graph vs. Vector-Retrieval Augmented Inference with SLMs Shreyas Patil et.al. 2603.23954 null
2026-03-25 Attention-aware Inference Optimizations for Large Vision-Language Models with Memory-efficient Decoding Fatih Ilhan et.al. 2603.23914 null
2026-03-25 PowerFlow-DNN: Compiler-Directed Fine-Grained Power Orchestration for End-to-End Edge AI Inference Paul Chen et.al. 2603.23882 null
2026-03-25 Self-Evolving Multi-Agent Framework for Efficient Decision Making in Real-Time Strategy Scenarios Li Ma et.al. 2603.23875 null
2026-03-25 How Vulnerable Are Edge LLMs? Ao Ding et.al. 2603.23822 null
2026-03-24 An Adapter-free Fine-tuning Approach for Tuning 3D Foundation Models Sneha Paul et.al. 2603.23730 null
2026-03-24 Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language Models Mohammad Saleh Vahdatpour et.al. 2603.23668 null
2026-03-24 QuickQudits: A Framework for Efficient Simulation of Noisy Qudit Clifford Circuits via an Extended Stabilizer Tableau Formalism Nina Brandl et.al. 2603.23641 null
2026-03-24 APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs Meriem Bouzouad et.al. 2603.23575 null
2026-03-24 Deformation quantization for systems with second-class constraints in deformed fermionic phase space Bing-Sheng Lin et.al. 2603.23411 null
2026-03-24 GeoSANE: Learning Geospatial Representations from Models, Not Data Joelle Hanna et.al. 2603.23408 null
2026-03-24 Harnessing Lightweight Transformer with Contextual Synergic Enhancement for Efficient 3D Medical Image Segmentation Xinyu Liu et.al. 2603.23390 null
2026-03-24 Pruning for efficient deterministic global optimization over trained ReLU neural networks Giacomo Lastrucci et.al. 2603.23299 null
2026-03-24 Block Coordinate Descent for Dynamic Portfolio Optimization on Finite-Precision Coherent Ising Machines Keming He et.al. 2603.23200 null
2026-03-24 LiZIP: An Auto-Regressive Compression Framework for LiDAR Point Clouds Aditya Shibu et.al. 2603.23162 null
2026-03-24 Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair Aditya Kakade et.al. 2603.23129 null
2026-03-24 High-Resolution Tensor-Network Fourier Methods for Exponentially Compressed Non-Gaussian Aggregate Distributions Juan José Rodríguez-Aldavero et.al. 2603.23106 null
2026-03-24 Good for the Planet, Bad for Me? Intended and Unintended Consequences of AI Energy Consumption Disclosure Michael Klesel et.al. 2603.23075 null
2026-03-24 VLA-IAP: Training-Free Visual Token Pruning via Interaction Alignment for Vision-Language-Action Models Jintao Cheng et.al. 2603.22991 null
2026-03-24 Markov-Enforced Discrete Diffusion Model for Digital Semantic Symbol Error Correction Yoon Huh et.al. 2603.22983 null
2026-03-24 PersonalQ: Select, Quantize, and Serve Personalized Diffusion Models for Efficient Inference Qirui Wang et.al. 2603.22943 null
2026-03-24 Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning Anshul Solanki et.al. 2603.22942 null
2026-03-24 ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling Shaobo Ju et.al. 2603.22911 null
2026-03-24 Balancing Safety and Efficiency in Aircraft Health Diagnosis: A Task Decomposition Framework with Heterogeneous Long-Micro Scale Cascading and Knowledge Distillation-based Interpretability Xinhang Chen et.al. 2603.22885 null
2026-03-24 TRINE: A Token-Aware, Runtime-Adaptive FPGA Inference Engine for Multimodal AI Hyunwoo Oh et.al. 2603.22867 null
2026-03-24 Aerial Agentic AI: Synergizing LLM and SLM for Low-Altitude Wireless Networks Li Dong et.al. 2603.22866 null
2026-03-25 Two-dimensional bound excitons in the real space and Landau quantization space: a comparative study Kunxiang Li et.al. 2603.22715 null
2026-03-23 Communication-Efficient Approximate Gradient Coding Sifat Munim et.al. 2603.22514 null
2026-03-23 A Theoretical Framework for Energy-Aware Gradient Pruning in Federated Learning Emmanouil M. Athanasakos et.al. 2603.22465 null
2026-03-23 A Brief Comparison of Training-Free Multi-Vector Sequence Compression Methods Rohan Jha et.al. 2603.22434 null
2026-03-23 An Exact Conjugation Identity for the Many-Body Wilson-Loop Beyond Quantization Kai Watanabe et.al. 2603.22217 null
2026-03-23 Mixture of Mini Experts: Overcoming the Linear Layer Bottleneck in Multiple Instance Learning Daniel Shao et.al. 2603.22198 null
2026-03-23 Dual-Space Knowledge Distillation with Key-Query Matching for Large Language Models with Vocabulary Mismatch Stella Eva Tsiapali et.al. 2603.22056 null
2026-03-23 SegMaFormer: A Hybrid State-Space and Transformer Model for Efficient Segmentation Duy D. Nguyen et.al. 2603.22002 null
2026-03-23 Parameter-Efficient Fine-Tuning for Medical Text Summarization: A Comparative Study of LoRA, Prompt Tuning, and Full Fine-Tuning Ulugbek Shernazarov et.al. 2603.21970 null
2026-03-23 Suiren-1.0 Technical Report: A Family of Molecular Foundation Models Junyi An et.al. 2603.21942 null
2026-03-23 Camera-Agnostic Pruning of 3D Gaussian Splats via Descriptor-Based Beta Evidence Peter Fasogbon et.al. 2603.21933 null
2026-03-23 The Golden Subspace: Where Efficiency Meets Generalization in Continual Test-Time Adaptation Guannan Lai et.al. 2603.21928 null
2026-03-23 olLOSC: Unified and efficient density functional approximation to correct delocalization error in molecules and periodic materials Yichen Fan et.al. 2603.21906 null
2026-03-23 SmaAT-QMix-UNet: A Parameter-Efficient Vector-Quantized UNet for Precipitation Nowcasting Nikolas Stavrou et.al. 2603.21879 null
2026-03-23 Many-body mobility edges in one dimension revealed by efficient and interpretable feature-based learning with Kolmogorov-Arnold Networks Siqi Dai et.al. 2603.21807 null
2026-03-23 CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning Shuo Wang et.al. 2603.21725 null
2026-03-23 Rethinking Token Reduction for Large Vision-Language Models Yi Wang et.al. 2603.21701 null
2026-03-23 Distilling the knowledge with quantum neural networks Yuxuan Yan et.al. 2603.21586 null
2026-03-23 Rethinking SAR ATR: A Target-Aware Frequency-Spatial Enhancement Framework with Noise-Resilient Knowledge Guidance Yansong Lin et.al. 2603.21565 null
2026-03-23 Parameter-efficient Prompt Tuning and Hierarchical Textual Guidance for Few-shot Whole Slide Image Classification Jayanie Bogahawatte et.al. 2603.21504 null
2026-03-22 KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning Shuai Wang et.al. 2603.21440 null
2026-03-22 Uncertainty-Aware Knowledge Distillation for Multimodal Large Language Models Jingchen Sun et.al. 2603.21426 null
2026-03-22 Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs Mariela M. Nina et.al. 2603.21418 null
2026-03-22 Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models Jinghan Cao et.al. 2603.21389 null
2026-03-22 FluidWorld: Reaction-Diffusion Dynamics as a Predictive Substrate for World Models Fabien Polly et.al. 2603.21315 null
2026-03-22 DepthTCM: High Efficient Depth Compression via Physics-aware Transformer-CNN Mixed Architecture Young-Seo Chang et.al. 2603.21233 null
2026-03-22 QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression Zhongyang Li et.al. 2603.21232 null
2026-03-22 Emotion-Aware Quantization for Discrete Speech Representations: An Analysis of Emotion Preservation Haoguang Zhou et.al. 2603.21224 null
2026-03-22 Frequency Switching Mechanism for Parameter-Efficient Multi-Task Learning Shih-Wen Liu et.al. 2603.21111 null
2026-03-22 A lightweight Outlier Detection for Characterizing Radio- and Environment-Specific Link Quality Fluctuation in Low-Power Wireless Networks Zegeye Mekasha Kidane et.al. 2603.21107 null
2026-03-22 ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models Xu Li et.al. 2603.21105 null
2026-03-22 Learning Progressive Adaptation for Multi-Modal Tracking He Wang et.al. 2603.21100 null
2026-03-22 SkinCLIP-VL: Consistency-Aware Vision-Language Learning for Multimodal Skin Cancer Diagnosis Zhixiang Lu et.al. 2603.21010 null
2026-03-22 Structural Sensitivity in Compressed Transformers: Error Propagation, Lyapunov Stability, and Formally Verified Bounds Abhinaba Basu et.al. 2603.20991 null
2026-03-22 Joint Surrogate Learning of Objectives, Constraints, and Sensitivities for Efficient Multi-objective Optimization of Neural Dynamical Systems Frithjof Gressmann et.al. 2603.20984 null
2026-03-21 SozKZ: Training Efficient Small Language Models for Kazakh from Scratch Saken Tukenov et.al. 2603.20854 null
2026-03-21 HiCI: Hierarchical Construction-Integration for Long-Context Attention Xiangyu Zeng et.al. 2603.20843 null
2026-03-21 Lean Learning Beyond Clouds: Efficient Discrepancy-Conditioned Optical-SAR Fusion for Semantic Segmentation Chenxing Meng et.al. 2603.20811 null
2026-03-21 Less is More in Semantic Space: Intrinsic Decoupling via Clifford-M for Fundus Image Classification Yifeng Zheng et.al. 2603.20806 null
2026-03-21 VSD-MOT: End-to-End Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Distillation Jun Du et.al. 2603.20731 null
2026-03-21 Centrality-Based Pruning for Efficient Echo State Networks Sudip Laudari et.al. 2603.20684 null
2026-03-21 Enhancing Vision-Based Policies with Omni-View and Cross-Modality Knowledge Distillation for Mobile Robots Kai Li et.al. 2603.20679 null
2026-03-20 Understanding Behavior Cloning with Action Quantization Haoqun Cao et.al. 2603.20538 null
2026-03-20 AE-LLM: Adaptive Efficiency Optimization for Large Language Models Kaito Tanaka et.al. 2603.20492 null
2026-03-20 Developing an ESG-Oriented Large Language Model through ESG Practices Gabriel Assis et.al. 2603.20480 null
2026-03-20 Diffutron: A Masked Diffusion Language Model for Turkish Language Şuayp Talha Kocabay et.al. 2603.20466 null
2026-03-20 Accurate and efficient simulation-based inference for massive black-hole binaries with LISA Alice Spadaro et.al. 2603.20431 link
2026-03-20 TinyML Enhances CubeSat Mission Capabilities Luigi Capogrosso et.al. 2603.20174 null
2026-03-20 An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models Yuming Feng et.al. 2603.20100 null
2026-03-20 TAPAS: Efficient Two-Server Asymmetric Private Aggregation Beyond Prio(+) Harish Karthikeyan et.al. 2603.19949 null
2026-03-20 Timestep-Aware Block Masking for Efficient Diffusion Model Inference Haodong He et.al. 2603.19939 null
2026-03-20 SIMPLER: Efficient Foundation Model Adaptation via Similarity-Guided Layer Pruning for Earth Observation Víctor Barreiro et.al. 2603.19873 null
2026-03-20 Generalized Task-Driven Design of Soft Robots via Reduced-Order FEM-based Surrogate Modeling Yao Yao et.al. 2603.19794 null
2026-03-20 Growing Networks with Autonomous Pruning Charles De Lambilly et.al. 2603.19759 null
2026-03-20 FedPDPO: Federated Personalized Direct Preference Optimization for Large Language Model Alignment Kewen Zhu et.al. 2603.19741 null
2026-03-20 A two-step sequential approach for hyperparameter selection in finite context models José Contente et.al. 2603.19736 null
2026-03-20 Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification Baoding He et.al. 2603.19715 null
2026-03-20 RiboSphere: Learning Unified and Efficient Representations of RNA Structures Zhou Zhang et.al. 2603.19636 null
2026-03-20 BEAVER: A Training-Free Hierarchical Prompt Compression Method via Structure-Aware Page Selection Zhengpei Hu et.al. 2603.19635 null
2026-03-20 Dual-Domain Representation Alignment: Bridging 2D and 3D Vision via Geometry-Aware Architecture Search Haoyu Zhang et.al. 2603.19563 null
2026-03-20 Optimal Scalar Quantization for Matrix Multiplication: Closed-Form Density and Phase Transition Calvin Ang et.al. 2603.19559 null
2026-03-19 Vision Tiny Recursion Model (ViTRM): Parameter-Efficient Image Classification via Recursive State Refinement Ange-Clément Akazan et.al. 2603.19503 null
2026-03-19 VeloxNet: Efficient Spatial Gating for Lightweight Embedded Image Classification Md Meftahul Ferdaus et.al. 2603.19496 null
2026-03-19 F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World Ziyin Zhang et.al. 2603.19223 null
2026-03-19 Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Zhuolin Yang et.al. 2603.19220 null
2026-03-19 DyMoE: Dynamic Expert Orchestration with Mixed-Precision Quantization for Efficient MoE Inference on Edge Yuegui Huang et.al. 2603.19172 null
2026-03-19 Quasinormal Modes of Extremal Reissner-Nordstrom Black Holes via Seiberg-Witten Quantization Yi-Rong Wang et.al. 2603.19168 null
2026-03-19 A Pipelined Collaborative Speculative Decoding Framework for Efficient Edge-Cloud LLM Inference Yida Zhang et.al. 2603.19133 null
2026-03-19 LuMamba: Latent Unified Mamba for Electrode Topology-Invariant and Efficient EEG Modeling Danaé Broustail et.al. 2603.19100 null
2026-03-19 Towards Verifiable AI with Lightweight Cryptographic Proofs of Inference Pranay Anchuri et.al. 2603.19025 null
2026-03-19 End-to-End Simulation of Chemical Dynamics on a Quantum Computer Elliot C. Eklund et.al. 2603.19007 null
2026-03-19 Functional Subspace Watermarking for Large Language Models Zikang Ding et.al. 2603.18793 null
2026-03-19 6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models Rundong Su et.al. 2603.18742 null
2026-03-19 EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation Longfei Liu et.al. 2603.18739 null
2026-03-19 Multimodal Model for Computational Pathology: Representation Learning and Image Compression Peihang Wu et.al. 2603.18660 null
2026-03-19 AIMER: Calibration-Free Task-Agnostic MoE Pruning Zongfang Liu et.al. 2603.18492 null
2026-03-19 Prune-then-Quantize or Quantize-then-Prune? Understanding the Impact of Compression Order in Joint Model Compression Minjun Kim et.al. 2603.18426 null
2026-03-19 SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning Minjun Kim et.al. 2603.18423 null
2026-03-18 Energy-Aware Frame Rate Selection for Video Coding Geetha Ramasubbu et.al. 2603.18305 null
2026-03-18 LRConv-NeRV: Low Rank Convolution for Efficient Neural Video Compression Tamer Shanableh et.al. 2603.18261 null
2026-03-18 A Computationally Efficient Learning of Artificial Intelligence System Reliability Considering Error Propagation Fenglian Pan et.al. 2603.18201 null
2026-03-18 Q-Drift: Quantization-Aware Drift Correction for Diffusion Model Sampling Sooyoung Ryu et.al. 2603.18095 null
2026-03-18 Unified Spatio-Temporal Token Scoring for Efficient Video VLMs Jianrui Zhang et.al. 2603.18004 null
2026-03-18 Universal Skeleton Understanding via Differentiable Rendering and MLLMs Ziyi Wang et.al. 2603.18003 null
2026-03-18 AdaRadar: Rate Adaptive Spectral Compression for Radar-based Perception Jinho Park et.al. 2603.17979 null
2026-03-18 Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing Raghavv Goel et.al. 2603.17942 null
2026-03-18 Energy extraction from a rotating Buchdahl star via magnetic reconnection Ikhtiyor Eshtursunov et.al. 2603.17928 null
2026-03-18 RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Arpit Singh Gautam et.al. 2603.17891 null
2026-03-18 Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients Ziwei Xiang et.al. 2603.17809 null
2026-03-18 Exploring parameter-efficient fine-tuning (PEFT) of billion-parameter vision models with QLoRA and DoRA: insights into generalization for limited-data image classification under a 98:1 test-to-train regime Haiyu Yang et.al. 2603.17782 null
2026-03-18 Parameter-Efficient Modality-Balanced Symmetric Fusion for Multimodal Remote Sensing Semantic Segmentation Haocheng Li et.al. 2603.17705 null
2026-03-18 Halo: Domain-Aware Query Optimization for Long-Context Question Answering Pramod Chunduri et.al. 2603.17668 null
2026-03-18 ReLaGS: Relational Language Gaussian Splatting Yaxu Xie et.al. 2603.17605 null
2026-03-18 LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation Mohammad Robaitul Islam Bhuiyan et.al. 2603.17576 null
2026-03-18 EI: Early Intervention for Multimodal Imaging based Disease Recognition Qijie Wei et.al. 2603.17514 null
2026-03-18 ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression Ruibo Fan et.al. 2603.17435 null
2026-03-18 The Phasor Transformer: Resolving Attention Bottlenecks on the Unit Circle Dibakar Sigdel et.al. 2603.17433 null
2026-03-18 Motion-Adaptive Temporal Attention for Lightweight Video Generation with Stable Diffusion Rui Hong et.al. 2603.17398 null
2026-03-18 Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach Driven by Numerical and Structural Dual-Sensitivity Hengyuan Zhang et.al. 2603.17354 null
2026-03-18 DANCE: Dynamic 3D CNN Pruning: Joint Frame, Channel, and Feature Adaptation for Energy Efficiency on the Edge Mohamed Mejri et.al. 2603.17275 null
2026-03-18 Efficient and flexible preparation of photonic NOON states in a superconducting system Dong-Sheng Li et.al. 2603.17253 null
2026-03-18 KANtize: Exploring Low-bit Quantization of Kolmogorov-Arnold Networks for Efficient Inference Sohaib Errabii et.al. 2603.17230 null
2026-03-17 OPERA: Online Data Pruning for Efficient Retrieval Model Adaptation Haoyang Fang et.al. 2603.17205 null
2026-03-17 On quantization and the classical variational principle for the metric mean dimension Maria Carvalho et.al. 2603.17091 null
2026-03-17 ACE-LoRA: Graph-Attentive Context Enhancement for Parameter-Efficient Adaptation of Medical Vision-Language Models M. Arda Aydın et.al. 2603.17079 null
2026-03-17 Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization Wenhao Zhao et.al. 2603.17052 null
2026-03-17 Empirical Recipes for Efficient and Compact Vision-Language Models Jiabo Huang et.al. 2603.16987 null
2026-03-17 Integrating Inductive Biases in Transformers via Distillation for Financial Time Series Forecasting Yu-Chen Den et.al. 2603.16985 null
2026-03-17 Understanding Quantization of Optimizer States in LLM Pre-training: Dynamics of State Staleness and Effectiveness of State Resets Kristi Topollai et.al. 2603.16731 null
2026-03-17 Efficient generation of entangled photons in the telecommunications range using nonlinear metasurfaces integrated with ScAlN/GaN heterostructures Jaeyeon Yu et.al. 2603.16699 null
2026-03-17 Can Linguistically Related Languages Guide LLM Translation in Low-Resource Settings? Aishwarya Ramasethu et.al. 2603.16660 null
2026-03-17 FlowComposer: Composable Flows for Compositional Zero-Shot Learning Zhenqi He et.al. 2603.16641 null
2026-03-17 BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization Ji-Fu Li et.al. 2603.16590 null
2026-03-17 Exploring different approaches to customize language models for domain-specific text-to-code generation Luís Freire et.al. 2603.16526 null
2026-03-17 TinyGLASS: Real-Time Self-Supervised In-Sensor Anomaly Detection Pietro Bonazzi et.al. 2603.16451 null
2026-03-17 Fast-HaMeR: Boosting Hand Mesh Reconstruction using Knowledge Distillation Hunain Ahmed Jillani et.al. 2603.16444 null
2026-03-17 Capability-Guided Compression: Toward Interpretability-Aware Budget Allocation for Large Language Models Rishaank Gupta et.al. 2603.16440 null
2026-03-17 CD-FKD: Cross-Domain Feature Knowledge Distillation for Robust Single-Domain Generalization in Object Detection Junseok Lee et.al. 2603.16439 null
2026-03-17 VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization Yixuan Wang et.al. 2603.16435 null
2026-03-18 EngGPT2: Sovereign, Efficient and Open Intelligence G. Ciarfaglia et.al. 2603.16430 null
2026-03-17 PlotTwist: A Creative Plot Generation Framework with Small Language Models Abhinav Thorat et.al. 2603.16410 null
2026-03-17 DermaFlux: Synthetic Skin Lesion Generation with Rectified Flows for Enhanced Image Classification Stathis Galanakis et.al. 2603.16392 null
2026-03-17 RASLF: Representation-Aware State Space Model for Light Field Super-Resolution Zeqiang Wei et.al. 2603.16243 null
2026-03-17 SpecSteer: Synergizing Local Context and Global Reasoning for Efficient Personalized Generation Hang Lv et.al. 2603.16219 null
2026-03-17 SIA: A Synthesize-Inject-Align Framework for Knowledge-Grounded and Secure E-commerce Search LLMs with Industrial Deployment Zhouwei Zhai et.al. 2603.16137 null
2026-03-17 Knowledge Distillation for Collaborative Learning in Distributed Communications and Sensing Nhan Thanh Nguyen et.al. 2603.16116 null
2026-03-17 Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization Francesco Pio Monaco et.al. 2603.16105 null
2026-03-17 POaaS: Minimal-Edit Prompt Optimization as a Service to Lift Accuracy and Cut Hallucinations on On-Device sLLMs Jungwoo Shim et.al. 2603.16045 null
2026-03-17 Enhancing Linguistic Generalization of VLA: Fine-Tuning OpenVLA via Synthetic Instruction Augmentation Dongik Shin et.al. 2603.16044 null
2026-03-16 Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models Sijie Li et.al. 2603.16001 null
2026-03-16 Sparse but not Simpler: A Multi-Level Interpretability Analysis of Vision Transformers Siyu Zhang et.al. 2603.15919 null
2026-03-16 Agent-based imitation dynamics can yield efficiently compressed population-level vocabularies Nathaniel Imel et.al. 2603.15903 null
2026-03-16 Domain Adaptation Without the Compute Burden for Efficient Whole Slide Image Analysis Umar Marikkar et.al. 2603.15774 null
2026-03-16 S2Act: Simple Spiking Actor Ugur Akcal et.al. 2603.15725 null
2026-03-16 Mastering the Minority: An Uncertainty-guided Multi-Expert Framework for Challenging-tailed Sequence Learning Ye Wang et.al. 2603.15708 null
2026-03-16 TabKD: Tabular Knowledge Distillation through Interaction Diversity of Learned Feature Bins Shovon Niverd Pereira et.al. 2603.15481 null
2026-03-16 CLAG: Adaptive Memory Organization via Agent-Driven Clustering for Small Language Model Agents Taeyun Roh et.al. 2603.15421 null
2026-03-16 RESQ: A Unified Framework for REliability- and Security Enhancement of Quantized Deep Neural Networks Ali Soltan Mohammadi et.al. 2603.15413 null
2026-03-16 Spectral Rectification for Parameter-Efficient Adaptation of Foundation Models in Colonoscopy Depth Estimation Xiaoxian Zhang et.al. 2603.15374 null
2026-03-16 Physically Motivated Knowledge Distillation for Blind Geometric Correction of Side-Scan Sonar Imagery Can Lei et.al. 2603.15200 null
2026-03-16 Joint Routing and Model Pruning for Decentralized Federated Learning in Bandwidth-Constrained Multi-Hop Wireless Networks Xiaoyu He et.al. 2603.15188 null
2026-03-16 DAIT: Distillation from Vision-Language Models to Lightweight Classifiers with Adaptive Intermediate Teacher Transfer Zhengxu He et.al. 2603.15166 null
2026-03-16 An Efficient Cumulative Edge-Detection Method for Image Reconstruction Toluwani Okunola et.al. 2603.15151 null
2026-03-16 Accelerating Byzantine-Robust Distributed Learning with Compressed Communication via Double Momentum and Variance Reduction Yanghao Li et.al. 2603.15144 null
2026-03-16 PrototypeNAS: Rapid Design of Deep Neural Networks for Microcontroller Units Mark Deutel et.al. 2603.15106 null
2026-03-16 Affordable Precision Agriculture: A Deployment-Oriented Review of Low-Cost, Low-Power Edge AI and TinyML for Resource-Constrained Farming Systems Riya Samanta et.al. 2603.15085 null
2026-03-16 Edit2Interp: Adapting Image Foundation Models from Spatial Editing to Video Frame Interpolation with Few-Shot Learning Nasrin Rahimi et.al. 2603.15003 null
2026-03-16 Smooth finite time singularity formation without quantization Istvan Kadar et.al. 2603.14985 null
2026-03-16 Lightweight User-Personalization Method for Closed Split Computing Yuya Okada et.al. 2603.14958 null
2026-03-16 GT-PCQA: Geometry-Texture Decoupled Point Cloud Quality Assessment with MLLM Guohua Zhang et.al. 2603.14951 null
2026-03-16 Spiking Layer-Adaptive Magnitude-based Pruning Junqiao Wang et.al. 2603.14946 null
2026-03-16 Directional Routing in Transformers Kevin Taylor et.al. 2603.14923 null
2026-03-16 Photonic Quantum-Enhanced Knowledge Distillation Kuan-Cheng Chen et.al. 2603.14898 null
2026-03-16 RAZOR: Ratio-Aware Layer Editing for Targeted Unlearning in Vision Transformers and Diffusion Models Ravi Ranjan et.al. 2603.14819 null
2026-03-16 SimCert: Probabilistic Certification for Behavioral Similarity in Deep Neural Network Compression Jingyang Li et.al. 2603.14818 null
2026-03-16 Efficient Event Camera Volume System Juan Camilo Soto et.al. 2603.14738 null
2026-03-15 Parameter-Efficient Quality Estimation via Frozen Recursive Models Umar Abubacar et.al. 2603.14593 null
2026-03-15 FlashHead: Efficient Drop-In Replacement for the Classification Head in Language Model Inference Wilhelm Tranheden et.al. 2603.14591 null
2026-03-15 Multilingual TinyStories: A Synthetic Combinatorial Corpus of Indic Children’s Stories for Training Small Language Models Deepon Halder et.al. 2603.14563 null
2026-03-15 ASAP: Attention-Shift-Aware Pruning for Efficient LVLM Inference Surendra Pathak et.al. 2603.14549 null
2026-03-15 Distilling Latent Manifolds: Resolution Extrapolation by Variational Autoencoders Jiaming Chu et.al. 2603.14536 null
2026-03-15 LatSearch: Latent Reward-Guided Search for Faster Inference-Time Scaling in Video Diffusion Zengqun Zhao et.al. 2603.14526 null
2026-03-15 Uni-MDTrack: Learning Decoupled Memory and Dynamic States for Parameter-Efficient Visual Tracking in All Modality Wenrui Cai et.al. 2603.14452 null
2026-03-15 Flux Quantization on M-Strings Pinak Banerjee et.al. 2603.14440 null
2026-03-15 SPARQ: Spiking Early-Exit Neural Networks for Energy-Efficient Edge AI Parth Patne et.al. 2603.14380 null
2026-03-15 Direct Object-Level Reconstruction via Probabilistic Gaussian Splatting Shuai Guo et.al. 2603.14316 null
2026-03-15 All-day Multi-scenes Lifelong Vision-and-Language Navigation with Tucker Adaptation Xudong Wang et.al. 2603.14276 null
2026-03-15 On aggregation-quantization permutability problem for discrete-time Markov chains Adam Doliwa et.al. 2603.14269 null
2026-03-15 Not All Directions Matter: Toward Structured and Task-Aware Low-Rank Adaptation Xi Xiao et.al. 2603.14228 null
2026-03-15 Self-Indexing KVCache: Predicting Sparse Attention from Compressed Keys Xu Yang et.al. 2603.14224 null
2026-03-15 Safety-Potential Pruning for Enhancing Safety Prompts Against VLM Jailbreaking Without Retraining Chongxin Li et.al. 2603.14219 null
2026-03-15 Relationship-Aware Safety Unlearning for Multimodal LLMs Vishnu Narayanan Anilkumar et.al. 2603.14185 null
2026-03-15 Selective Fine-Tuning of GPT Architectures for Parameter-Efficient Clinical Text Classification Fariba Afrin Irany et.al. 2603.14183 null
2026-03-14 Universal method of selective detection of a wide range of pollutants in liquids using conductance quantization O. Pospelov et.al. 2603.14140 null
2026-03-13 MoEKD: Mixture-of-Experts Knowledge Distillation for Robust and High-Performing Compressed Code Models Md. Abdul Awal et.al. 2603.13213 null
2026-03-13 Resource-efficient Quantum Algorithms for Selected Hamiltonian Subspace Diagonalization Vincent Graves et.al. 2603.13160 null
2026-03-13 Steve-Evolving: Open-World Embodied Self-Evolution via Fine-Grained Diagnosis and Dual-Track Knowledge Distillation Zhengwei Xie et.al. 2603.13131 null
2026-03-13 ZO-SAM: Zero-Order Sharpness-Aware Minimization for Efficient Sparse Training Jie Ji et.al. 2603.13115 null
2026-03-13 Efficient and Interpretable Multi-Agent LLM Routing via Ant Colony Optimization Xudong Wang et.al. 2603.12933 null
2026-03-13 Consistent and Efficient MSCKF-based LiDAR-Inertial Odometry with Inferred Cluster-to-Plane Constraints for UAVs Jinwen Zhu et.al. 2603.12904 null
2026-03-13 Surrogates for Physics-based and Data-driven Modelling of Parametric Systems: Review and New Perspectives Matteo Giacomini et.al. 2603.12870 null
2026-03-13 HIFICL: High-Fidelity In-Context Learning for Multimodal Tasks Xiaoyu Li et.al. 2603.12760 null
2026-03-13 ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning Shuo Yang et.al. 2603.12740 null
2026-03-13 Vision Verification Enhanced Fusion of VLMs for Efficient Visual Reasoning Selim Furkan Tekin et.al. 2603.12669 null
2026-03-13 AVION: Aerial Vision-Language Instruction from Offline Teacher to Prompt-Tuned Network Yu Hu et.al. 2603.12659 null
2026-03-13 VGGT-World: Transforming VGGT into an Autoregressive Geometry World Model Xiangyu Sun et.al. 2603.12655 null
2026-03-13 Sobolev–Ricci Curvature Kyoichi Iwasaki et.al. 2603.12652 null
2026-03-13 LightMoE: Reducing Mixture-of-Experts Redundancy through Expert Replacing Jiawei Hao et.al. 2603.12645 null
2026-03-13 Early Pruning for Public Transport Routing Andrii Rohovyi et.al. 2603.12592 null
2026-03-13 CA-HFP: Curvature-Aware Heterogeneous Federated Pruning with Model Reconstruction Gang Hu et.al. 2603.12591 null
2026-03-13 Expert Pyramid Tuning: Efficient Parameter Fine-Tuning for Expertise-Driven Task Allocation Jia-Chen Zhang et.al. 2603.12577 null
2026-03-13 Spatio-Semantic Expert Routing Architecture with Mixture-of-Experts for Referring Image Segmentation Alaa Dalaq et.al. 2603.12538 null
2026-03-12 Efficient Quantum Simulation for Nonlinear Stochastic Differential Equations Xiangyu Li et.al. 2603.12398 null
2026-03-12 NeuroLoRA: Context-Aware Neuromodulation for Parameter-Efficient Multi-Task Adaptation Yuxin Yang et.al. 2603.12378 null
2026-03-12 Efficient Reasoning with Balanced Thinking Yulin Li et.al. 2603.12372 null
2026-03-12 Alternating Gradient Flow Utility: A Unified Metric for Structural Pruning and Dynamic Routing in Deep Networks Tianhao Qian et.al. 2603.12354 null
2026-03-12 Pruning-induced phases in fully-connected neural networks: the eumentia, the dementia, and the amentia Haining Pan et.al. 2603.12316 null
2026-03-12 HiAP: A Multi-Granular Stochastic Auto-Pruning Framework for Vision Transformers Andy Li et.al. 2603.12222 null
2026-03-12 ForensicZip: More Tokens are Better but Not Necessary in Forensic Vision-Language Models Yingxin Lai et.al. 2603.12208 null
2026-03-12 Long-Context Encoder Models for Polish Language Understanding Sławomir Dadas et.al. 2603.12191 null
2026-03-12 Space-Efficient Approximate Spherical Range Counting in High Dimensions Andreas Kalavas et.al. 2603.12106 null
2026-03-12 Resource-Efficient Iterative LLM-Based NAS with Feedback Memory Xiaojie Gu et.al. 2603.12091 null
2026-03-12 EmbTracker: Traceable Black-box Watermarking for Federated Language Models Haodong Zhao et.al. 2603.12089 null
2026-03-12 Intelligent 6G Edge Connectivity: A Knowledge Driven Optimization Framework for Small Cell Selection Tuğçe Bilen et.al. 2603.12086 null
2026-03-12 A Joint JSCC-Resource Allocation Framework for QoS-Aware Semantic Communication in LEO Satellite-based EO Missions Hung Nguyen-Kha et.al. 2603.12027 null
2026-03-12 Efficient Generative Modeling with Unitary Matrix Product States Using Riemannian Optimization Haotong Duan et.al. 2603.12026 null
2026-03-12 Asymptotically Efficient Recursive Identification Under One-Bit Communications Achieving Original CRLB Xingrui Liu et.al. 2603.11964 null
2026-03-12 PicoSAM3: Real-Time In-Sensor Region-of-Interest Segmentation Pietro Bonazzi et.al. 2603.11917 null
2026-03-12 Bielik-Minitron-7B: Compressing Large Language Models via Structured Pruning and Knowledge Distillation for the Polish Language Remigiusz Kinas et.al. 2603.11881 null
2026-03-12 AdaFuse: Accelerating Dynamic Adapter Inference via Token-Level Pre-Gating and Fused Kernel Optimization Qiyang Li et.al. 2603.11873 null
2026-03-12 A Further Efficient Algorithm with Best-of-Both-Worlds Guarantees for $m$-Set Semi-Bandit Problem Botao Chen et.al. 2603.11764 null
2026-03-12 UCAN: Unified Convolutional Attention Network for Expansive Receptive Fields in Lightweight Super-Resolution Cao Thien Tan et.al. 2603.11680 null
2026-03-12 Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning Jiaheng Hu et.al. 2603.11653 null
2026-03-12 MedPruner: Training-Free Hierarchical Token Pruning for Efficient 3D Medical Image Understanding in Vision-Language Models Shengyuan Liu et.al. 2603.11625 null
2026-03-12 DyWeight: Dynamic Gradient Weighting for Few-Step Diffusion Sampling Tong Zhao et.al. 2603.11607 null
2026-03-12 Quantum mechanical framework for quantization-based optimization: from Gradient flow to Schroedinger equation Jinwuk Seok et.al. 2603.11536 null
2026-03-12 Mobile-GS: Real-time Gaussian Splatting for Mobile Devices Xiaobiao Du et.al. 2603.11531 null
2026-03-12 Can Small Language Models Use What They Retrieve? An Empirical Study of Retrieval Utilization Across Model Scale Sanchit Pandey et.al. 2603.11513 null
2026-03-12 AutoVeriFix+: High-Correctness RTL Generation via Trace-Aware Causal Fix and Semantic Redundancy Pruning Yan Tan et.al. 2603.11489 null
2026-03-11 Evaluating Explainable AI Attribution Methods in Neural Machine Translation via Attention-Guided Knowledge Distillation Aria Nourbakhsh et.al. 2603.11342 null
2026-03-11 Unified Flavor: Lattice Quantization, Chain Locality, and a Dynamical Origin of Hierarchical Yukawas Vernon Barger et.al. 2603.11341 null
2026-03-11 Reversible Lifelong Model Editing via Semantic Routing-Based LoRA Haihua Luo et.al. 2603.11239 null
2026-03-11 Representation Finetuning for Continual Learning Haihua Luo et.al. 2603.11201 null
2026-03-11 Efficient Approximation to Analytic and $L^p$ functions by Height-Augmented ReLU Networks ZeYu Li et.al. 2603.11128 null
2026-03-11 Leech Lattice Vector Quantization for Efficient LLM Compression Tycho F. A. van der Ouderaa et.al. 2603.11021 null
2026-03-11 Med-DualLoRA: Local Adaptation of Foundation Models for 3D Cardiac MRI Joan Perramon-Llussà et.al. 2603.10967 null
2026-03-11 GLM-OCR Technical Report Shuaiqi Duan et.al. 2603.10910 null
2026-03-11 LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation Jinwoo Ahn et.al. 2603.10899 null
2026-03-11 Continuous Diffusion Transformers for Designing Synthetic Regulatory Elements Jonathan Liu et.al. 2603.10885 null
2026-03-11 From Images to Words: Efficient Cross-Modal Knowledge Distillation to Language Models from Black-box Teachers Ayan Sengupta et.al. 2603.10877 null
2026-03-11 Denoising diffusion and latent diffusion models for physics field simulations Yuan Jia et.al. 2603.10799 null
2026-03-11 From path integral quantization to stochastic quantization: a pedestrian’s journey Dario Benedetti et.al. 2603.10761 null
2026-03-11 Double-Precision Matrix Multiplication Emulation via Ozaki-II Scheme with FP8 Quantization Yuki Uchino et.al. 2603.10634 null
2026-03-11 TacLoc: Global Tactile Localization on Objects from a Registration Perspective Zirui Zhang et.al. 2603.10565 null
2026-03-11 Quantization Robustness of Monotone Operator Equilibrium Networks James Li et.al. 2603.10562 null
2026-03-11 PET-F2I: A Comprehensive Benchmark and Parameter-Efficient Fine-Tuning of LLMs for PET/CT Report Impression Generation Yuchen Liu et.al. 2603.10560 null
2026-03-11 SCORE: Replacing Layer Stacking with Contractive Recurrent Depth Guillaume Godin et.al. 2603.10544 null
2026-03-11 In-Memory ADC-Based Nonlinear Activation Quantization for Efficient In-Memory Computing Shuai Dong et.al. 2603.10540 null
2026-03-11 DepthCache: Depth-Guided Training-Free Visual Token Merging for Vision-Language-Action Model Inference Yuquan Li et.al. 2603.10469 null
2026-03-11 The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training Hengjie Cao et.al. 2603.10444 null
2026-03-11 AgentServe: Algorithm-System Co-Design for Efficient Agentic AI Serving on a Consumer-Grade GPU Yuning Zhang et.al. 2603.10342 null
2026-03-11 GaLoRA: Parameter-Efficient Graph-Aware LLMs for Node Classification Mayur Choudhary et.al. 2603.10298 null
2026-03-10 WME: Extending CDCL-based Model Enumeration with Weights Giuseppe Spallitta et.al. 2603.10236 null
2026-03-10 ARCHE: Autoregressive Residual Compression with Hyperprior and Excitation Sofia Iliopoulou et.al. 2603.10188 null
2026-03-10 ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning Ruizhong Qiu et.al. 2603.10160 null
2026-03-10 Batalin-Fradkin-Vilkovisky quantization of Einstein gravity with off-diagonal solutions encoding Hořava type generating functions Elşen Veli Veliev et.al. 2603.10082 null
2026-03-10 When Learning Rates Go Wrong: Early Structural Signals in PPO Actor-Critic Alberto Fernández-Hernández et.al. 2603.09950 null
2026-03-11 A Voronoi Cell Formulation for Principled Token Pruning in Late-Interaction Retrieval Models Yash Kankanampati et.al. 2603.09933 null
2026-03-10 GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection Kai Yao et.al. 2603.09865 null
2026-03-10 Multi-spacecraft constraints on relativistic solar energetic particle transport in the widespread 28 October 2021 event E. Lavasa et.al. 2603.09839 null
2026-03-10 Exploiting Label-Aware Channel Scoring for Adaptive Channel Pruning in Split Learning Jialei Tan et.al. 2603.09792 null
2026-03-10 A Multi-Prototype-Guided Federated Knowledge Distillation Approach in AI-RAN Enabled Multi-Access Edge Computing System Luyao Zou et.al. 2603.09727 null
2026-03-10 TemporalDoRA: Temporal PEFT for Robust Surgical Video Question Answering Luca Carlini et.al. 2603.09696 null
2026-03-10 On Catastrophic Forgetting in Low-Rank Decomposition-Based Parameter-Efficient Fine-Tuning Muhammad Ahmad et.al. 2603.09684 null
2026-03-10 X-GS: An Extensible Open Framework Unifying 3DGS Architectures with Downstream Multimodal Models Yueen Ma et.al. 2603.09632 null
2026-03-10 Decoder-Free Distillation for Quantized Image Restoration S. M. A. Sharif et.al. 2603.09624 null
2026-03-10 BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers Chaodong Xiao et.al. 2603.09582 null
2026-03-10 Randomized Distributed Function Computation (RDFC): Ultra-Efficient Semantic Communication Applications to Privacy Onur Günlü et.al. 2603.09577 null
2026-03-10 Routing without Forgetting Alessio Masano et.al. 2603.09576 null
2026-03-10 Efficiently Aligning Draft Models via Parameter- and Data-Efficient Adaptation Luxi Lin et.al. 2603.09527 null
2026-03-10 Beyond Short-Horizon: VQ-Memory for Robust Long-Horizon Manipulation in Non-Markovian Simulation Benchmarks Wang Honghui et.al. 2603.09513 null
2026-03-10 TrainDeeploy: Hardware-Accelerated Parameter-Efficient Fine-Tuning of Small Transformer Models at the Extreme Edge Run Wang et.al. 2603.09511 null
2026-03-10 Evolving Prompt Adaptation for Vision-Language Models Enming Zhang et.al. 2603.09493 null
2026-03-11 Prune Redundancy, Preserve Essence: Vision Token Compression in VLMs via Synergistic Importance-Diversity Zhengyao Fang et.al. 2603.09480 null
2026-03-10 Reviving ConvNeXt for Efficient Convolutional Diffusion Models Taesung Kwon et.al. 2603.09408 null
2026-03-10 Deep Learning Search for Gravitational Waves from Compact Binary Coalescence Lorenzo Mobilia et.al. 2603.09386 null
2026-03-10 MIL-PF: Multiple Instance Learning on Precomputed Features for Mammography Classification Nikola Jovišić et.al. 2603.09374 null
2026-03-10 Efficient Reasoning at Fixed Test-Time Cost via Length-Aware Attention Priors and Gain-Aware Training Rian Atri et.al. 2603.09253 null
2026-03-10 LooComp: Leverage Leave-One-Out Strategy to Encoder-only Transformer for Efficient Query-aware Context Compression Thao Do et.al. 2603.09222 null
2026-03-10 Explainable Innovation Engine: Dual-Tree Agent-RAG with Methods-as-Nodes and Verifiable Write-Back Renwei Meng et.al. 2603.09192 null
2026-03-10 Point Cloud as a Foreign Language for Multi-modal Large Language Model Sneha Paul et.al. 2603.09173 null
2026-03-10 RTFDNet: Fusion-Decoupling for Robust RGB-T Segmentation Kunyu Tan et.al. 2603.09149 null
2026-03-09 Predictive first-principles simulations for co-designing next-generation energy-efficient AI systems Denis Mamaluy et.al. 2603.08995 null
2026-03-09 The $qs$ Inequality: Quantifying the Double Penalty of Mixture-of-Experts at Inference Vignesh Adhinarayanan et.al. 2603.08960 null
2026-03-09 An implicit restriction in the Dirac quantization Han Geurdes et.al. 2603.08516 null
2026-03-09 Oracle-Guided Soft Shielding for Safe Move Prediction in Chess Prajit T Rajendran et.al. 2603.08506 null
2026-03-09 Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck Fabio Valerio Massoli et.al. 2603.08462 null
2026-03-09 LycheeCluster: Efficient Long-Context Inference with Structure-Aware Chunking and Hierarchical KV Indexing Dongfang Li et.al. 2603.08453 null
2026-03-09 Alfa: Attentive Low-Rank Filter Adaptation for Structure-Aware Cross-Domain Personalized Gaze Estimation He-Yen Hsieh et.al. 2603.08445 null
2026-03-09 $Δ$VLA: Prior-Guided Vision-Language-Action Models via World Knowledge Variation Yijie Zhu et.al. 2603.08361 null
2026-03-09 Rethinking Attention Output Projection: Structured Hadamard Transforms for Efficient Transformers Shubham Aggarwal et.al. 2603.08343 null
2026-03-09 PRIME: Efficient Algorithm for Token Graph Routing Problem Haotian Xu et.al. 2603.08337 null
2026-03-09 WaDi: Weight Direction-aware Distillation for One-step Image Synthesis Lei Wang et.al. 2603.08258 null
2026-03-09 NCL-UoR at SemEval-2026 Task 5: Embedding-Based Methods, Fine-Tuning, and LLMs for Word Sense Plausibility Rating Tong Wu et.al. 2603.08256 null
2026-03-09 SRNeRV: A Scale-wise Recursive Framework for Neural Video Representation Jia Wang et.al. 2603.08227 null
2026-03-09 SERQ: Saliency-Aware Low-Rank Error Reconstruction for LLM Quantization Yeonsik Park et.al. 2603.08185 null
2026-03-09 Adaptive MLP Pruning for Large Vision Transformers Chengchao Shen et.al. 2603.08100 null
2026-03-09 High-Fidelity Pruning for Large Language Models Yijun Zhu et.al. 2603.08083 null
2026-03-09 Deterministic Differentiable Structured Pruning for Large Language Models Weiyu Huang et.al. 2603.08065 null
2026-03-09 Stabilized Fine-Tuning with LoRA in Federated Learning: Mitigating the Side Effect of Client Size and Rank via the Scaling Factor Jiayu Huang et.al. 2603.08058 null
2026-03-09 Distributed Coordination Algorithms with Efficient Communication for Open Multi-Agent Systems with Dynamic Communication Links and Processing Delays Jiaqi Hu et.al. 2603.08038 null
2026-03-09 Capacity-Aware Mixture Law Enables Efficient LLM Data Optimization Jingwei Li et.al. 2603.08022 null
2026-03-09 Model-Free DRL Control for Power Inverters: From Policy Learning to Real-Time Implementation via Knowledge Distillation Yang Yang et.al. 2603.07964 null
2026-03-09 PSTNet: Physically-Structured Turbulence Network Boris Kriuk et.al. 2603.07957 null
2026-03-09 Geometric Transformation-Embedded Mamba for Learned Video Compression Hao Wei et.al. 2603.07912 null
2026-03-09 DyQ-VLA: Temporal-Dynamic-Aware Quantization for Embodied Vision-Language-Action Models Zihao Zheng et.al. 2603.07904 null
2026-03-08 DistillGuard: Evaluating Defenses Against LLM Knowledge Distillation Bo Jiang et.al. 2603.07835 null
2026-03-08 GazeShift: Unsupervised Gaze Estimation and Dataset for VR Gil Shapira et.al. 2603.07832 null
2026-03-08 SGI: Structured 2D Gaussians for Efficient and Compact Large Image Representation Zixuan Pan et.al. 2603.07789 null
2026-03-08 Geometric Knowledge-Assisted Federated Dual Knowledge Distillation Approach Towards Remote Sensing Satellite Imagery Luyao Zou et.al. 2603.07774 null
2026-03-08 Compressed Proximal Federated Learning for Non-Convex Composite Optimization on Heterogeneous Data Pu Qiu et.al. 2603.07654 null
2026-03-08 Efficient RGB-D Scene Understanding via Multi-task Adaptive Learning and Cross-dimensional Feature Guidance Guodong Sun et.al. 2603.07570 null
2026-03-08 CONSTANT: Towards High-Quality One-Shot Handwriting Generation with Patch Contrastive Enhancement and Style-Aware Quantization Anh-Duy Le et.al. 2603.07543 null
2026-03-08 TableMind++: An Uncertainty-Aware Programmatic Agent for Tool-Augmented Table Reasoning Mingyue Cheng et.al. 2603.07528 null
2026-03-08 GP-Tree: An in-memory spatial index combining adaptive grid cells with a prefix tree for efficient spatial querying Xiangyang Yang et.al. 2603.07517 null
2026-03-08 FedEU: Evidential Uncertainty-Driven Federated Fine-Tuning of Vision Foundation Models for Remote Sensing Image Segmentation Xiaokang Zhang et.al. 2603.07468 null
2026-03-08 Selective Transfer Learning of Cross-Modality Distillation for Monocular 3D Object Detection Rui Ding et.al. 2603.07464 null
2026-03-08 SLNet: A Super-Lightweight Geometry-Adaptive Network for 3D Point Cloud Recognition Mohammad Saeid et.al. 2603.07454 null
2026-03-08 Adaptive Capacity Allocation for Vision Language Action Fine-tuning Donghoon Kim et.al. 2603.07404 null
2026-03-07 Explainable and Hardware-Efficient Jamming Detection for 5G Networks Using the Convolutional Tsetlin Machine Vojtech Halenka et.al. 2603.07336 null
2026-03-07 Faster-HEAL: An Efficient and Privacy-Preserving Collaborative Perception Framework for Heterogeneous Autonomous Vehicles Armin Maleki et.al. 2603.07314 null
2026-03-07 LightMedSeg: Lightweight 3D Medical Image Segmentation with Learned Spatial Anchors Kavyansh Tyagi et.al. 2603.07228 null
2026-03-07 FastSTAR: Spatiotemporal Token Pruning for Efficient Autoregressive Video Synthesis Sungwoong Yune et.al. 2603.07192 null
2026-03-07 The Model Knows Which Tokens Matter: Automatic Token Selection via Noise Gating Landi He et.al. 2603.07135 null
2026-03-07 Enhancing User Fairness in Two-Layer RSMA: A Movable Antenna Approach Ji Luo et.al. 2603.07127 null
2026-03-07 Efficient Personalized Reranking with Semi-Autoregressive Generation and Online Knowledge Distillation Kai Cheng et.al. 2603.07107 null
2026-03-07 Exploring the Reasoning Depth of Small Language Models in Software Architecture: A Multidimensional Evaluation Framework Towards Software Engineering 2.0 Ha Vo et.al. 2603.07091 null
2026-03-07 Can Safety Emerge from Weak Supervision? A Systematic Analysis of Small Language Models Punyajoy Saha et.al. 2603.07017 null
2026-03-07 Two-Stage Path Following for Mobile Manipulators via Dimensionality-Reduced Graph Search and Numerical Optimization Fuyu Guo et.al. 2603.07003 null
2026-03-06 NOBLE: Accelerating Transformers with Nonlinear Low-Rank Branches Ethan Smith et.al. 2603.06492 null
2026-03-06 History-Conditioned Spatio-Temporal Visual Token Pruning for Efficient Vision-Language Navigation Qitong Wang et.al. 2603.06480 null
2026-03-06 GreenRFM: Toward a resource-efficient radiology foundation model Yingtai Li et.al. 2603.06467 null
2026-03-06 Spinor moving frame, type II superparticle quantization, hidden $SU(8)$ symmetry of linearized 10D supergravity, and superamplitudes Igor Bandos et.al. 2603.06404 null
2026-03-06 HiPP-Prune: Hierarchical Preference-Conditioned Structured Pruning for Vision-Language Models Lincen Bai et.al. 2603.06270 null
2026-03-06 TaPD: Temporal-adaptive Progressive Distillation for Observation-Adaptive Trajectory Forecasting in Autonomous Driving Mingyu Fan et.al. 2603.06231 null
2026-03-06 SPOT: Span-level Pause-of-Thought for Efficient and Interpretable Latent Reasoning in Large Language Models Yunlong Chu et.al. 2603.06222 null
2026-03-06 Multimodal Behavior Tree Generation: A Small Vision-Language Model for Robot Task Planning Cristiano Battistini et.al. 2603.06084 null
2026-03-06 EvoESAP: Non-Uniform Expert Pruning for Sparse MoE Zongfang Liu et.al. 2603.06003 null
2026-03-06 Balancing Latency and Accuracy of Code Completion via Local-Cloud Model Cascading Hanzhen Lu et.al. 2603.05974 null
2026-03-06 CR-QAT: Curriculum Relational Quantization-Aware Training for Open-Vocabulary Object Detection Jinyeong Park et.al. 2603.05964 null
2026-03-06 Omni-Masked Gradient Descent: Memory-Efficient Optimization via Mask Traversal with Improved Convergence Hui Yang et.al. 2603.05960 null
2026-03-06 Energy-Driven Adaptive Visual Token Pruning for Efficient Vision-Language Models Jialuo He et.al. 2603.05950 null
2026-03-06 Implicit Style Conditioning: A Structured Style-Rewrite Framework for Low-Resource Character Modeling Chanhui Zhu et.al. 2603.05933 null
2026-03-06 ROSE: Reordered SparseGPT for More Accurate One-Shot Large Language Models Pruning Mingluo Su et.al. 2603.05878 link
2026-03-06 Shifting Adaptation from Weight Space to Memory Space: A Memory-Augmented Agent for Medical Image Segmentation Bowen Chen et.al. 2603.05873 null
2026-03-06 Chiral Terahertz Amplification and Lasing using Two-Dimensional Materials with Berry Curvature Dipole Amin Hakimi et.al. 2603.05825 null
2026-03-06 Self-Auditing Parameter-Efficient Fine-Tuning for Few-Shot 3D Medical Image Segmentation Son Thai Ly et.al. 2603.05822 null
2026-03-06 Training-free Latent Inter-Frame Pruning with Attention Recovery Dennis Menn et.al. 2603.05811 null
2026-03-06 MoE Lens – An Expert Is All You Need Marmik Chaudhari et.al. 2603.05806 null
2026-03-06 Sparse Crosscoders for diffing MoEs and Dense models Marmik Chaudhari et.al. 2603.05805 null
2026-03-06 A Quantization-Aware Training Based Lightweight Method for Neural Distinguishers Guangwei Xiong et.al. 2603.05791 null
2026-03-05 LTLGuard: Formalizing LTL Specifications with Compact Language Models and Lightweight Symbolic Reasoning Medina Andresel et.al. 2603.05728 null
2026-03-05 Interpretable Motion Artifact Detection in structural Brain MRI Naveetha Nithianandam et.al. 2603.05726 null
2026-03-05 Gabor Primitives for Accelerated Cardiac Cine MRI Reconstruction Wenqi Huang et.al. 2603.05681 null
2026-03-05 Keeping the Evidence Chain: Semantic Evidence Allocation for Training-Free Token Pruning in Video Temporal Grounding Jiaqi Li et.al. 2603.05663 null
2026-03-05 Bias In, Bias Out? Finding Unbiased Subnetworks in Vanilla Models Ivan Luiz De Moura Matos et.al. 2603.05582 null
2026-03-05 POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation Zeju Qiu et.al. 2603.05500 null
2026-03-05 Efficient simulation of Bose-Einstein condensates in nontrivial topologies Abel Beregi et.al. 2603.05447 null
2026-03-05 MobileFetalCLIP: Selective Repulsive Knowledge Distillation for Mobile Fetal Ultrasound Analysis Numan Saeed et.al. 2603.05421 null
2026-03-05 Preserving Continuous Symmetry in Discrete Spaces: Geometric-Aware Quantization for SO(3)-Equivariant GNNs Haoyu Zhou et.al. 2603.05343 null
2026-03-05 Med-V1: Small Language Models for Zero-shot and Scalable Biomedical Evidence Attribution Qiao Jin et.al. 2603.05308 null
2026-03-05 WavSLM: Single-Stream Speech Language Modeling via WavLM Distillation Luca Della Libera et.al. 2603.05299 null
2026-03-05 SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity Hanyong Shao et.al. 2603.05232 null
2026-03-05 Stable-LoRA: Stabilizing Feature Learning of Low-Rank Adaptation Yize Wu et.al. 2603.05204 link
2026-03-05 An efficient and accurate numerical method for computing the ground states of three-dimensional rotating dipolar Bose-Einstein condensates under strongly anisotropic trap Qinglin Tang et.al. 2603.05194 null
2026-03-05 CRISP: Correlation-Resilient Indexing via Subspace Partitioning Dimitris Dimitropoulos et.al. 2603.05180 null
2026-03-05 Trainable Bitwise Soft Quantization for Input Feature Compression Karsten Schrödter et.al. 2603.05172 null
2026-03-05 Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity Di Zhang et.al. 2603.05168 null
2026-03-05 FedBCD: Communication-Efficient Accelerated Block Coordinate Gradient Descent for Federated Learning Junkang Liu et.al. 2603.05116 null
2026-03-05 Diff-ES: Stage-wise Structural Diffusion Pruning via Evolutionary Search Zongfang Liu et.al. 2603.05105 null
2026-03-05 Beyond Positional Encoding: A 5D Spatio-Directional Hash Encoding Philippe Weier et.al. 2603.05079 null
2026-03-05 Constrained Symplectic Quantization: Disclosing the Deterministic Framework Behind Quantum Mechanics Martina Giachello et.al. 2603.05072 null
2026-03-05 MCEL: Margin-Based Cross-Entropy Loss for Error-Tolerant Quantized Neural Networks Mikail Yayla et.al. 2603.05048 null
2026-03-05 A loop quantization of the marginally bound Lemaître-Tolman-Bondi dust model Luca Cafaro et.al. 2603.04995 null
2026-03-05 Programmable superconducting neuron with intrinsic in-memory computation and dual-timescale plasticity for ultra-efficient neuromorphic computing Muen Wang et.al. 2603.04966 null
2026-03-05 VisionPangu: A Compact and Fine-Grained Multimodal Assistant with 1.7B Parameters Jiaxin Fan et.al. 2603.04957 null
2026-03-05 WaterSIC: information-theoretically (near) optimal linear layer quantization Egor Lifar et.al. 2603.04956 null
2026-03-05 AILS-NTUA at SemEval-2026 Task 3: Efficient Dimensional Aspect-Based Sentiment Analysis Stavros Gazetas et.al. 2603.04933 null
2026-03-05 MASQuant: Modality-Aware Smoothing Quantization for Multimodal Large Language Models Lulu Hu et.al. 2603.04800 null
2026-03-05 Stacked from One: Multi-Scale Self-Injection for Context Window Extension Wei Han et.al. 2603.04759 null
2026-03-05 A Benchmark Study of Neural Network Compression Methods for Hyperspectral Image Classification Sai Shi et.al. 2603.04720 null
2026-03-05 Detection of Illicit Content on Online Marketplaces using Large Language Models Quoc Khoa Tran et.al. 2603.04707 null
2026-03-04 Unified Integer and Fractional Quantum Hall Effects from Boundary-Induced Edge-State Quantization Pedro Pereyra et.al. 2603.04652 null
2026-03-04 An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs Waleed Afandi et.al. 2603.04545 null
2026-03-04 Dissecting Quantization Error: A Concentration-Alignment Perspective Marco Federici et.al. 2603.04359 null
2026-03-04 Efficient Refusal Ablation in LLM through Optimal Transport Geraldin Nanfack et.al. 2603.04355 null
2026-03-04 Direct derivation of the modified Langevin noise formalism from the canonical quantization of macroscopic electromagnetism Alessandro Ciattoni et.al. 2603.04336 null
2026-03-04 Activation Outliers in Transformer Quantization: Reproduction, Statistical Analysis, and Deployment Tradeoffs Pranav Kumar Kaliaperumal et.al. 2603.04308 null
2026-03-04 Constraint-Aware Generative Re-ranking for Multi-Objective Optimization in Advertising Feeds Chenfei Li et.al. 2603.04227 null
2026-03-05 Bielik-Q2-Sharp: A Comparative Study of Extreme 2-bit Quantization Methods for a Polish 11B Language Model Jakub Prejzner et.al. 2603.04162 null
2026-03-04 Unbiased Dynamic Pruning for Efficient Group-Based Policy Optimization Haodong Zhu et.al. 2603.04135 null
2026-03-04 BeamPERL: Parameter-Efficient RL with Verifiable Rewards Specializes Compact LLMs for Structured Beam Mechanics Reasoning Tarjei Paule Hage et.al. 2603.04124 null
2026-03-04 Wasserstein Gradient Flows of semi-discret energies: evolution of urban areas and uniform quantization Joao Miguel Machado et.al. 2603.04088 null
2026-03-04 Tuning Just Enough: Lightweight Backdoor Attacks on Multi-Encoder Diffusion Models Ziyuan Chen et.al. 2603.04064 null
2026-03-05 LoRA-MME: Multi-Model Ensemble of LoRA-Tuned Encoders for Code Comment Classification Md Akib Haider et.al. 2603.03959 null
2026-03-04 Vector-Quantized Soft Label Compression for Dataset Distillation Ali Abbasi et.al. 2603.03808 null
2026-03-04 Adaptive Enhancement and Dual-Pooling Sequential Attention for Lightweight Underwater Object Detection with YOLOv10 Md. Mushibur Rahman et.al. 2603.03807 null
2026-03-04 LEA: Label Enumeration Attack in Vertical Federated Learning Wenhao Jiang et.al. 2603.03777 null
2026-03-04 Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning Chuang Zhang et.al. 2603.03752 null
2026-03-04 EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs Yuhao Chen et.al. 2603.03681 null
2026-03-04 ARMOR: Robust and Efficient CNN-Based SAR ATR through Model-Hardware Co-Design Sachini Wickramasinghe et.al. 2603.03598 null
2026-03-03 Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts Sanae Lotfi et.al. 2603.03535 null
2026-03-03 Raising Bars, Not Parameters: LilMoo Compact Language Model for Hindi Shiza Fatimah et.al. 2603.03508 null
2026-03-03 DKD-KAN: A Lightweight knowledge-distilled KAN intrusion detection framework, based on MLP and KAN Mohammad Alikhani et.al. 2603.03486 null
2026-03-03 Towards Improved Sentence Representations using Token Graphs Krishna Sri Ipsit Mantri et.al. 2603.03389 null
2026-03-03 LiteVLA-Edge: Quantized On-Device Multimodal Control for Embedded Robotics Justin Williams et.al. 2603.03380 null
2026-03-03 No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models Omer Sela et.al. 2603.03203 null
2026-03-03 Channel-Adaptive Edge AI: Maximizing Inference Throughput by Adapting Computational Complexity to Channel States Jierui Zhang et.al. 2603.03146 null
2026-03-03 TinyIceNet: Low-Power SAR Sea Ice Segmentation for On-Board FPGA Inference Mhd Rashed Al Koutayni et.al. 2603.03075 null
2026-03-03 Stability properties of Minimal Gated Unit neural networks Stefano De Carli et.al. 2603.03017 null
2026-03-03 Reproducing and Comparing Distillation Techniques for Cross-Encoders Victor Morand et.al. 2603.03010 null
2026-03-03 QAOA-Predictor: Forecasting Success Probabilities and Minimal Depths for Efficient Fixed-Parameter Optimization Rodrigo Coelho et.al. 2603.02990 null
2026-03-03 ProGIC: Progressive and Lightweight Generative Image Compression with Residual Vector Quantization Hao Cao et.al. 2603.02897 null
2026-03-03 MuxTune: Efficient Multi-Task LLM Fine-Tuning in Multi-Tenant Datacenters via Spatial-Temporal Backbone Multiplexing Chunyu Xue et.al. 2603.02885 null
2026-03-03 SemanticDialect: Semantic-Aware Mixed-Format Quantization for Video Diffusion Transformers Wonsuk Jang et.al. 2603.02883 null
2026-03-03 Fast and memory-efficient classical simulation of quantum machine learning via forward and backward gate fusion Yoshiaki Kawase et.al. 2603.02804 null
2026-03-03 Hardware Implementation of Photonic Spiking Hash Retrieval Shangxuan Shi et.al. 2603.02738 null
2026-03-03 Gated Differential Linear Attention: A Linear-Time Decoder for High-Fidelity Medical Segmentation Hongbo Zheng et.al. 2603.02727 null
2026-03-03 SUN: Shared Use of Next-token Prediction for Efficient Multi-LLM Disaggregated Serving Sunghyeon Woo et.al. 2603.02599 null
2026-03-03 Synthetic-Child: An AIGC-Based Synthetic Data Pipeline for Privacy-Preserving Child Posture Estimation Taowen Zeng et.al. 2603.02598 null
2026-03-03 Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation Chonghua Lv et.al. 2603.02554 null
2026-03-03 Semantic Forwarding and Codebook-Enhanced Model Division Multiple Access for Satellite-Terrestrial Networks Jinghong Huang et.al. 2603.02536 null
2026-03-03 Learning Object-Centric Spatial Reasoning for Sequential Manipulation in Cluttered Environments Chrisantus Eze et.al. 2603.02511 null
2026-03-02 A Unified Revisit of Temperature in Classification-Based Knowledge Distillation Logan Frank et.al. 2603.02430 null
2026-03-02 From Fewer Samples to Fewer Bits: Reframing Dataset Distillation as Joint Optimization of Precision and Compactness My H. Dinh et.al. 2603.02411 null
2026-03-02 Fast and Versatile RNA Design via Motif-level Divide-and-Conquer and Structure-level Rival Search Tianshuo Zhou et.al. 2603.02283 null
2026-03-02 Deep Unfolding for SIM-Assisted Multiband MU-MISO Downlink Systems Muhammad Ibrahim et.al. 2603.02122 null
2026-03-02 MetaRCA: A Generalizable Root Cause Analysis Framework for Cloud-Native Systems Powered by Meta Causal Knowledge Shuai Liang et.al. 2603.02032 null
2026-03-02 Rich Insights from Cheap Signals: Efficient Evaluations via Tensor Factorization Felipe Maia Polo et.al. 2603.02029 null
2026-03-02 KDFlow: A User-Friendly and Efficient Knowledge Distillation Framework for Large Language Models Songming Zhang et.al. 2603.01875 null
2026-03-02 FreeAct: Freeing Activations for LLM Quantization Xiaohao Liu et.al. 2603.01776 null
2026-03-02 Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning Zichen Tian et.al. 2603.01759 null
2026-03-02 StepVAR: Structure-Texture Guided Pruning for Visual Autoregressive Models Keli Liu et.al. 2603.01757 null
2026-03-02 CA-AFP: Cluster-Aware Adaptive Federated Pruning Om Govind Jha et.al. 2603.01739 null
2026-03-02 FastLightGen: Fast and Light Video Generation with Fewer Steps and Parameters Shao Shitong et.al. 2603.01685 null
2026-03-02 Beyond the Grid: Layout-Informed Multi-Vector Retrieval with Parsed Visual Document Representations Yibo Yan et.al. 2603.01666 null
2026-03-02 Boosting Entropy with Bell Box Quantization Ningfeng Yang et.al. 2603.01599 null
2026-03-02 Keyword-based Community Search in Bipartite Spatial-Social Networks (Technical Report) Kovan A. Bavi et.al. 2603.01500 null
2026-03-02 Token Reduction via Local and Global Contexts Optimization for Efficient Video Large Language Models Jinlong Li et.al. 2603.01400 null
2026-03-02 Quasar: Quantized Self-Speculative Acceleration for Rapid Inference via Memory-Efficient Verification Guang Huang et.al. 2603.01399 null
2026-03-02 3BASiL: An Algorithmic Framework for Sparse plus Low-Rank Compression of LLMs Mehdi Makni et.al. 2603.01376 null
2026-03-02 MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention Zilong Zhao et.al. 2603.01361 null
2026-03-01 AgilePruner: An Empirical Study of Attention and Diversity for Adaptive Visual Token Pruning in Large Vision-Language Models Changwoo Baek et.al. 2603.01236 link
2026-03-01 VP-Hype: A Hybrid Mamba-Transformer Framework with Visual-Textual Prompting for Hyperspectral Image Classification Abdellah Zakaria Sellam et.al. 2603.01174 null
2026-03-01 GuiDINO: Rethinking Vision Foundation Model in Medical Image Segmentation Zhuonan Liang et.al. 2603.01115 null
2026-03-01 Mobile-VTON: High-Fidelity On-Device Virtual Try-On Zhenchen Wan et.al. 2603.00947 null
2026-03-01 On the Exact Algorithmic Extraction of Finite Tesselations Through Prime Extraction of Minimal Representative Forms Sushish Baral et.al. 2603.00911 null
2026-03-01 Curvature-Weighted Capacity Allocation: A Minimum Description Length Framework for Layer-Adaptive Large Language Model Optimization Theophilus Amaefuna et.al. 2603.00910 null
2026-03-01 Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models Yichao Wu et.al. 2603.00846 null
2026-03-01 MedGPT-oss: Training a General-Purpose Vision-Language Model for Biomedicine Kai Zhang et.al. 2603.00842 null
2026-02-28 BornoViT: A Novel Efficient Vision Transformer for Bengali Handwritten Basic Characters Classification Rafi Hassan Chowdhury et.al. 2603.00755 null
2026-02-28 MARS: Harmonizing Multimodal Convergence via Adaptive Rank Search Minkyoung Cho et.al. 2603.00720 null
2026-02-28 Preliminary study of the $H$ dibaryon in $N_{\rm f}=2+1$ lattice QCD André Baião Raposo et.al. 2603.00698 null
2026-02-28 Specializing Foundation Models via Mixture of Low-Rank Experts for Comprehensive Head CT Analysis Youngjin Yoo et.al. 2603.00675 null
2026-02-28 Exploring 3D Dataset Pruning Xiaohan Zhao et.al. 2603.00651 null
2026-02-28 Linking Modality Isolation in Heterogeneous Collaborative Perception Changxing Liu et.al. 2603.00609 null
2026-02-28 CoMoL: Efficient Mixture of LoRA Experts via Dynamic Core Space Merging Jie Cao et.al. 2603.00573 null
2026-02-28 TP-Spikformer: Token Pruned Spiking Transformer Wenjie Wei et.al. 2603.00527 null
2026-02-28 What Do Visual Tokens Really Encode? Uncovering Sparsity and Redundancy in Multimodal Large Language Models Yingqi Fan et.al. 2603.00510 null
2026-02-28 COLE$^+$: Towards Practical Column-based Learned Storage for Blockchain Systems Ce Zhang et.al. 2603.00509 null
2026-02-28 Improved Adversarial Diffusion Compression for Real-World Video Super-Resolution Bin Chen et.al. 2603.00458 null
2026-02-28 TAP-SLF: Parameter-Efficient Adaptation of Vision Foundation Models for Multi-Task Ultrasound Image Analysis Hui Wan et.al. 2603.00433 null
2026-02-28 Efficient Decoder Scaling Strategy for Neural Routing Solvers Qing Luo et.al. 2603.00430 null
2026-02-28 Weight Updates as Activation Shifts: A Principled Framework for Steering Dyah Adila et.al. 2603.00425 null
2026-02-27 Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation Zhengbo Wang et.al. 2602.24283 null
2026-02-27 Efficient Discovery of Approximate Causal Abstractions via Neural Mechanism Sparsification Amir Asiaee et.al. 2602.24266 null
2026-02-27 Joint Geometric and Trajectory Consistency Learning for One-Step Real-World Super-Resolution Chengyan Deng et.al. 2602.24240 null
2026-02-27 Task-Centric Acceleration of Small-Language Models Dor Tsur et.al. 2602.24174 null
2026-02-27 Prune Wisely, Reconstruct Sharply: Compact 3D Gaussian Splatting via Adaptive Pruning and Difference-of-Gaussian Primitives Haoran Wang et.al. 2602.24136 null
2026-02-27 Quant Experts: Token-aware Adaptive Error Reconstruction with Mixture of Experts for Large Vision-Language Models Quantization Chenwei Jia et.al. 2602.24059 null
2026-02-27 GPU-Native Approximate Nearest Neighbor Search with IVF-RaBitQ: Fast Index Build and Search Jifan Shi et.al. 2602.23999 null
2026-02-27 Towards Efficient and Generalizable Retrieval: Adaptive Semantic Quantization and Residual Knowledge Transfer Huimu Wang et.al. 2602.23978 null
2026-02-27 Bandwidth-adaptive Cloud-Assisted 360-Degree 3D Perception for Autonomous Vehicles Faisal Hawladera et.al. 2602.23871 null
2026-02-27 ULW-SleepNet: An Ultra-Lightweight Network for Multimodal Sleep Stage Scoring Zhaowen Wang et.al. 2602.23852 null
2026-02-27 GRAIL: Post-hoc Compensation by Linear Reconstruction for Compressed Networks Wenwu Tang et.al. 2602.23795 null
2026-02-27 UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking Hao Wu et.al. 2602.23734 null
2026-02-27 HiDrop: Hierarchical Vision Token Reduction in MLLMs via Late Injection, Concave Pyramid Pruning, and Early Exit Hao Wu et.al. 2602.23699 null
2026-02-27 ProtoDCS: Towards Robust and Efficient Open-Set Test-Time Adaptation for Vision-Language Models Wei Luo et.al. 2602.23653 null
2026-02-27 From quantum time to manifestly covariant QFT: on the need for a quantum-action-based quantization N. L. Diaz et.al. 2602.23625 null
2026-02-27 PDF: PUF-based DNN Fingerprinting for Knowledge Distillation Traceability Ning Lyu et.al. 2602.23587 null
2026-02-27 Construct, Merge, Solve & Adapt with Reinforcement Learning for the min-max Multiple Traveling Salesman Problem Guillem Rodríguez-Corominas et.al. 2602.23579 null
2026-02-27 Hybrid Quantum Temporal Convolutional Networks Junghoon Justin Park et.al. 2602.23578 null
2026-02-26 BiKA: Kolmogorov-Arnold-Network-inspired Ultra Lightweight Neural Network Hardware Accelerator Yuhao Liu et.al. 2602.23455 null
2026-02-26 U-CAN: Utility-Aware Contrastive Attenuation for Efficient Unlearning in Generative Recommendation Zezheng Wu et.al. 2602.23400 null
2026-02-26 A Dataset is Worth 1 MB Elad Kimchi Shoshani et.al. 2602.23358 null
2026-02-26 FlashOptim: Optimizers for Memory Efficient Training Jose Javier Gonzalez Ortiz et.al. 2602.23349 null
2026-02-26 Bitwise Systolic Array Architecture for Runtime-Reconfigurable Multi-precision Quantized Multiplication on Hardware Accelerators Yuhao Liu et.al. 2602.23334 null
2026-02-26 Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction Rafael R. Baptista et.al. 2602.23312 null
2026-02-26 Data-Efficient Generative Modeling of Non-Gaussian Global Climate Fields via Scalable Composite Transformations Johannes Brachem et.al. 2602.23311 null
2026-02-26 Efficient evaluation of fundamental sensitivity limits and full counting statistics for continuously monitored Gaussian quantum systems Francesco Albarelli et.al. 2602.23304 null
2026-02-26 Workload-Aware Incremental Reclustering in Cloud Data Warehouses Yipeng Liu et.al. 2602.23289 null
2026-02-26 Real-Time Stream Compaction for Sparse Machine Learning on FPGAs Marc Neu et.al. 2602.23281 null
2026-02-26 AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning Yutong Wang et.al. 2602.23258 null
2026-02-26 A Scaling Law for Bandwidth Under Quantization Maximilian Kalcher et.al. 2602.23252 null
2026-02-26 Spatio-Temporal Token Pruning for Efficient High-Resolution GUI Agents Zhou Xu et.al. 2602.23235 null
2026-02-27 Motion-aware Event Suppression for Event Cameras Roberto Pellerito et.al. 2602.23204 null
2026-02-26 InnerQ: Hardware-aware Tuning-free Quantization of KV Cache for Large Language Models Sayed Mohammadreza Tayaranian Hosseini et.al. 2602.23200 null
2026-02-26 FairQuant: Fairness-Aware Mixed-Precision Quantization for Medical Image Classification Thomas Woergaard et.al. 2602.23192 null
2026-02-26 Efficient Real-Time Adaptation of ROMs for Unsteady Flows Using Data Assimilation Ismaël Zighed et.al. 2602.23188 null
2026-02-26 Efficient Encoder-Free Fourier-based 3D Large Multimodal Model Guofeng Mei et.al. 2602.23153 null
2026-02-26 TriLite: Efficient Weakly Supervised Object Localization with Universal Visual Features and Tri-Region Disentanglement Arian Sabaghi et.al. 2602.23120 null
2026-02-26 Learning Physical Operators using Neural Operators Vignesh Gopakumar et.al. 2602.23113 null
2026-02-26 Align then Adapt: Rethinking Parameter-Efficient Transfer Learning in 4D Perception Yiding Sun et.al. 2602.23069 null
2026-02-26 PackUV: Packed Gaussian UV Maps for 4D Volumetric Video Aashish Rai et.al. 2602.23040 null
2026-02-26 Sequential Regression for Continuous Value Prediction using Residual Quantization Runpeng Cui et.al. 2602.23012 null
2026-02-26 Holomorphic Quantization in Constant Curvature Backgrounds Dmitri Bykov et.al. 2602.22984 null
2026-02-26 ToProVAR: Efficient Visual Autoregressive Modeling via Tri-Dimensional Entropy-Aware Semantic Analysis and Sparsity Optimization Jiayu Chen et.al. 2602.22948 null
2026-02-26 pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation Shentong Mo et.al. 2602.22938 null
2026-02-26 NoRA: Breaking the Linear Ceiling of Low-Rank Adaptation via Manifold Expansion Hung-Hsuan Chen et.al. 2602.22911 null
2026-02-27 DySL-VLA: Efficient Vision-Language-Action Model Inference via Dynamic-Static Layer-Skipping for Robot Manipulation Zebin Yang et.al. 2602.22896 null
2026-02-26 Beyond Detection: Multi-Scale Hidden-Code for Natural Image Deepfake Recovery and Factual Retrieval Yuan-Chih Chen et.al. 2602.22759 null
2026-02-27 GFRRN: Explore the Gaps in Single Image Reflection Removal Yu Chen et.al. 2602.22695 null
2026-02-26 LoR-LUT: Learning Compact 3D Lookup Tables via Low-Rank Residuals Ziqi Zhao et.al. 2602.22607 null
2026-02-26 pQuant: Towards Effective Low-Bit Language Models via Decoupled Linear Quantization-Aware Training Wenzheng Zhang et.al. 2602.22592 null
2026-02-26 Quantum corrected thermodynamics and horizon quantization of the Reissner–Nordström black hole S. Jalalzadeh et.al. 2602.22559 null
2026-02-26 Autoregressive Visual Decoding from EEG Signals Sicheng Dai et.al. 2602.22555 null
2026-02-26 Agentic AI for Intent-driven Optimization in Cell-free O-RAN Mohammad Hossein Shokouhi et.al. 2602.22539 null
2026-02-26 Reinforcement-aware Knowledge Distillation for LLM Reasoning Zhaoyang Zhang et.al. 2602.22495 null
2026-02-25 Efficient Continual Learning in Language Models via Thalamically Routed Cortical Columns Afshin Khadangi et.al. 2602.22479 null
2026-02-25 MammoWise: Multi-Model Local RAG Pipeline for Mammography Report Generation Raiyan Jahangir et.al. 2602.22462 null
2026-02-25 How Do Latent Reasoning Methods Perform Under Weak and Strong Supervision? Yingqian Cui et.al. 2602.22441 null
2026-02-25 veScale-FSDP: Flexible and High-Performance FSDP at Scale Zezhou Wang et.al. 2602.22437 null
2026-02-25 Decoder-based Sense Knowledge Distillation Qitong Wang et.al. 2602.22351 null
2026-02-25 Structure and Redundancy in Large Language Models: A Spectral Study via Random Matrix Theory Davide Ettori et.al. 2602.22345 null
2026-02-25 Queue occupancy and server size distribution of a queue length dependent vacation queue with an optional service Ashish Verma et.al. 2602.22295 null
2026-02-25 SigmaQuant: Hardware-Aware Heterogeneous Quantization Method for Edge DNN Inference Qunyou Liu et.al. 2602.22136 null
2026-02-25 SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents Patrick Tser Jern Kon et.al. 2602.22124 null
2026-02-25 PatchDenoiser: Parameter-efficient multi-scale patch learning and fusion denoiser for medical images Jitindra Fartiyal et.al. 2602.21987 null
2026-02-25 Compact Circulant Layers with Spectral Priors Joseph Margaryan et.al. 2602.21965 null
2026-02-25 D-COT: Disciplined Chain-of-Thought Learning for Efficient Reasoning in Small Language Models Shunsuke Ubukata et.al. 2602.21786 null
2026-02-25 XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression Zunhai Su et.al. 2602.21780 null
2026-02-25 Learning from Yesterday’s Error: An Efficient Online Learning Method for Traffic Demand Prediction Xiannan Huang et.al. 2602.21757 null
2026-02-25 DWA-KD: Dual-Space Weighting and Time-Warped Alignment for Cross-Tokenizer Knowledge Distillation Duc Trung Vu et.al. 2602.21669 null
2026-02-25 HybridINR-PCGC: Hybrid Lossless Point Cloud Geometry Compression Bridging Pretrained Model and Implicit Neural Representation Wenjie Huang et.al. 2602.21662 null
2026-02-25 Sparsity Induction for Accurate Post-Training Pruning of Large Language Models Minhao Jiang et.al. 2602.21652 null
2026-02-25 AQR-HNSW: Accelerating Approximate Nearest Neighbor Search via Density-aware Quantization and Multi-stage Re-ranking Ganap Ashit Tewary et.al. 2602.21600 null
2026-02-25 CADC: Content Adaptive Diffusion-Based Generative Image Compression Xihua Sheng et.al. 2602.21591 null
2026-02-24 MINAR: Mechanistic Interpretability for Neural Algorithmic Reasoning Jesse He et.al. 2602.21442 null
2026-02-24 Efficient Uncoupled Learning Dynamics with $\tilde{O}\left(T^{-1/4}\right)$ Last-Iterate Convergence in Bilinear Saddle-Point Problems over Convex Sets under Bandit Feedback Arnab Maiti et.al. 2602.21436 null
2026-02-24 MMLoP: Multi-Modal Low-Rank Prompting for Efficient Vision-Language Adaptation Sajjad Ghiasvand et.al. 2602.21397 null
2026-02-24 Momentum Memory for Knowledge Distillation in Computational Pathology Yongxin Guo et.al. 2602.21395 null
2026-02-24 Small Language Models for Privacy-Preserving Clinical Information Extraction in Low-Resource Languages Mohammadreza Ghaffarzadeh-Esfahani et.al. 2602.21374 null
2026-02-24 OmniOCR: Generalist OCR for Ethnic Minority Languages Bonan Liu et.al. 2602.21042 null
2026-02-24 HiSAC: Hierarchical Sparse Activation Compression for Ultra-long Sequence Modeling in Recommenders Kun Yuan et.al. 2602.21009 null
2026-02-25 Constraints on dynamically-formed massive black holes in Little Red Dots from X-ray non-detections M. Liempi et.al. 2602.21002 null
2026-02-24 ParkDiffusion++: Ego Intention Conditioned Joint Multi-Agent Trajectory Prediction for Automated Parking using Diffusion Models Jiarong Wei et.al. 2602.20923 null
2026-02-24 Don’t Ignore the Tail: Decoupling top-K Probabilities for Efficient Language Model Distillation Sayantan Dasgupta et.al. 2602.20816 null
2026-02-24 CHESS: Context-aware Hierarchical Efficient Semantic Selection for Long-Context LLM Inference Chao Fei et.al. 2602.20732 null
2026-02-24 ID-LoRA: Efficient Low-Rank Adaptation Inspired by Matrix Interpolative Decomposition Xindian Ma et.al. 2602.20727 null
2026-02-24 PRECTR-V2: Unified Relevance-CTR Framework with Cross-User Preference Mining, Exposure Bias Correction, and LLM-Distilled Encoder Optimization Shuzhi Cao et.al. 2602.20676 null
2026-02-24 CAMEL: Confidence-Gated Reflection for Reward Modeling Zirui Zhu et.al. 2602.20670 null
2026-02-24 TOM: A Ternary Read-only Memory Accelerator for LLM-powered Edge Intelligence Hongyi Guan et.al. 2602.20662 null
2026-02-24 Dataset Color Quantization: A Training-Oriented Framework for Dataset-Level Compression Chenyue Yu et.al. 2602.20650 null
2026-02-24 OptiLeak: Efficient Prompt Reconstruction via Reinforcement Learning in Multi-tenant LLM Services Longxiang Wang et.al. 2602.20595 null
2026-02-24 BFA++: Hierarchical Best-Feature-Aware Token Prune for Multi-View Vision Language Action Model Haosheng Li et.al. 2602.20566 null
2026-02-24 PFGNet: A Fully Convolutional Frequency-Guided Peripheral Gating Network for Efficient Spatiotemporal Predictive Learning Xinyong Cai et.al. 2602.20537 null
2026-02-24 Elimination-compensation pruning for fully-connected neural networks Enrico Ballini et.al. 2602.20467 null
2026-02-23 CLIPoint3D: Language-Grounded Few-Shot Unsupervised 3D Point Cloud Domain Adaptation Mainak Singha et.al. 2602.20409 null
2026-02-23 Highly Efficient Selection of High-Redshift Emission-Line Galaxies for future DESI-like surveys with Deep Multi-band Imaging Yoquelbin Salcedo Hernandez et.al. 2602.20405 null
2026-02-25 QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models Jingxuan Zhang et.al. 2602.20309 null
2026-02-23 Mitigating Artifacts in Pre-quantization Based Scientific Data Compressors with Quantization-aware Interpolation Pu Jiao et.al. 2602.20097 null
2026-02-23 CQ-CiM: Hardware-Aware Embedding Shaping for Robust CiM-Based Retrieval Xinzhao Li et.al. 2602.20083 null
2026-02-23 Token-UNet: A New Case for Transformers Integration in Efficient and Interpretable 3D UNets for Brain Imaging Segmentation Louis Fabrice Tshimanga et.al. 2602.20008 null
2026-02-23 A Computationally Efficient Multidimensional Vision Transformer Alaa El Ichi et.al. 2602.19982 null
2026-02-23 Unlearning Noise in PINNs: A Selective Pruning Framework for PDE Inverse Problems Yongsheng Chen et.al. 2602.19967 null
2026-02-23 Rethinking LoRA for Privacy-Preserving Federated Learning in Large Models Jin Liu et.al. 2602.19926 null
2026-02-23 DerMAE: Improving skin lesion classification through conditioned latent diffusion and MAE distillation Francisco Filho et.al. 2602.19848 null
2026-02-23 Path-conditioned training: a principled way to rescale ReLU neural networks Arthur Lebeurrier et.al. 2602.19799 null
2026-02-23 Transcendental momentum quantization in semiconducting Rashba nanowires and zero energy states in their normal and superconducting phase Nico Leumer et.al. 2602.19796 null
2026-02-23 Training Deep Stereo Matching Networks on Tree Branch Imagery: A Benchmark Study for Real-Time UAV Forestry Applications Yida Lin et.al. 2602.19763 null
2026-02-23 Multimodal Dataset Distillation Made Simple by Prototype-Guided Data Synthesis Junhyeok Choi et.al. 2602.19756 null
2026-02-23 RAP: Fast Feedforward Rendering-Free Attribute-Guided Primitive Importance Score Prediction for Efficient 3D Gaussian Splatting Processing Kaifa Yang et.al. 2602.19753 null
2026-02-24 NEXUS: A compact neural architecture for high-resolution spatiotemporal air quality forecasting in Delhi National Capital Region Rampunit Kumar et.al. 2602.19654 null
2026-02-24 Nacrith: Neural Lossless Compression via Ensemble Context Modeling and High-Precision CDF Coding Roberto Tacconelli et.al. 2602.19626 null
2026-02-23 VecFormer: Towards Efficient and Generalizable Graph Transformer with Graph Token Attention Jingbo Zhou et.al. 2602.19622 null
2026-02-23 Sculpting the Vector Space: Towards Efficient Multi-Vector Visual Document Retrieval via Prune-then-Merge Framework Yibo Yan et.al. 2602.19549 null
2026-02-23 A Text-Guided Vision Model for Enhanced Recognition of Small Instances Hyun-Ki Jung et.al. 2602.19503 null
2026-02-23 Decoupling Vision and Language: Codebook Anchored Visual Adaptation Jason Wu et.al. 2602.19449 null
2026-02-23 FinSight-Net: A Physics-Aware Decoupled Network with Frequency-Domain Compensation for Underwater Fish Detection in Smart Aquaculture Jinsong Yang et.al. 2602.19437 null
2026-02-22 Adaptive Data Augmentation with Multi-armed Bandit: Sample-Efficient Embedding Calibration for Implicit Pattern Recognition Minxue Tang et.al. 2602.19385 null
2026-02-22 Prompt Tuning for CLIP on the Pretrained Manifold Xi Yang et.al. 2602.19198 null
2026-02-22 PositionOCR: Augmenting Positional Awareness in Multi-Modal Models via Hybrid Specialist Integration Chen Duan et.al. 2602.19188 null
2026-02-22 S$^3$GND: An Effective Learning-Based Approach for Subgraph Similarity Search Under Generalized Neighbor Difference Semantics (Technical Report) Qi Wen et.al. 2602.19167 null
2026-02-22 Flash-VAED: Plug-and-Play VAE Decoders for Efficient Video Generation Lunjie Zhu et.al. 2602.19161 null
2026-02-22 Mapping Networks Lord Sen et.al. 2602.19134 null
2026-02-22 Learning from Complexity: Exploring Dynamic Sample Pruning of Spatio-Temporal Training Wei Chen et.al. 2602.19113 null
2026-02-22 Astra: Activation-Space Tail-Eigenvector Low-Rank Adaptation of Large Language Models Kainan Liu et.al. 2602.19111 null
2026-02-22 Do LLMs and VLMs Share Neurons for Inference? Evidence and Mechanisms of Cross-Modal Transfer Chenhang Cui et.al. 2602.19058 null
2026-02-22 SKYLIGHT: A Scalable Hundred-Channel 3D Photonic In-Memory Tensor Core Architecture for Real-time AI Inference Meng Zhang et.al. 2602.19031 null
2026-02-22 GUIDE-US: Grade-Informed Unpaired Distillation of Encoder Knowledge from Histopathology to Micro-UltraSound Emma Willis et.al. 2602.19005 null
2026-02-21 PCA-VAE: Differentiable Subspace Quantization without Codebook Collapse Hao Lu et.al. 2602.18904 null
2026-02-21 Beyond Stationarity: Rethinking Codebook Collapse in Vector Quantization Hao Lu et.al. 2602.18896 null
2026-02-21 Structure-Level Disentangled Diffusion for Few-Shot Chinese Font Generation Jie Li et.al. 2602.18874 null
2026-02-21 Joint Post-Training Quantization of Vision Transformers with Learned Prompt-Guided Data Generation Shile Li et.al. 2602.18861 null
2026-02-21 Hyperbolic Busemann Neural Networks Ziheng Chen et.al. 2602.18858 null
2026-02-21 DUET-VLM: Dual stage Unified Efficient Token reduction for VLM Training and Inference Aditya Kumar Singh et.al. 2602.18846 null
2026-02-21 UFO: Unlocking Ultra-Efficient Quantized Private Inference with Protocol and Algorithm Co-Optimization Wenxuan Zeng et.al. 2602.18758 null
2026-02-21 Federated Reasoning Distillation Framework with Model Learnability-Aware Data Allocation Wei Guo et.al. 2602.18749 null
2026-02-21 Deep LoRA-Unfolding Networks for Image Restoration Xiangming Wang et.al. 2602.18697 null
2026-02-21 In-Context Planning with Latent Temporal Abstractions Baiting Luo et.al. 2602.18694 null
2026-02-20 Communication-Efficient Personalized Adaptation via Federated-Local Model Merging Yinan Zou et.al. 2602.18658 null
2026-02-20 Ensemble Prediction of Task Affinity for Efficient Multi-Task Learning Afiya Ayman et.al. 2602.18591 null
2026-02-20 GIST: Targeted Data Selection for Instruction Tuning via Coupled Optimization Geometry Guanghui Min et.al. 2602.18584 null
2026-02-20 Luna-2: Scalable Single-Token Evaluation with Small Language Models Vatsal Goel et.al. 2602.18583 null
2026-02-20 SPQ: An Ensemble Technique for Large Language Model Compression Jiamin Yao et.al. 2602.18420 null
2026-02-20 MD-AirComp+: Adaptive Quantization for Blind Massive Digital Over-the-Air Computation Li Qiao et.al. 2602.18332 null
2026-02-20 Neural-HSS: Hierarchical Semi-Separable Neural PDE Solver Pietro Sittoni et.al. 2602.18248 null
2026-02-20 Parameter-Efficient Domain Adaptation of Physics-Informed Self-Attention based GNNs for AC Power Flow Prediction Redwanul Karim et.al. 2602.18227 null
2026-02-20 Cut Less, Fold More: Model Compression through the Lens of Projection Geometry Olga Saukh et.al. 2602.18116 null
2026-02-20 MUOT_3M: A 3 Million Frame Multimodal Underwater Benchmark and the MUTrack Tracking Method Ahsan Baidar Bakht et.al. 2602.18006 null
2026-02-20 Higher order quantization conditions for two-body scattering with spin Lucas Chandler et.al. 2602.17924 null
2026-02-19 Calibrated Adaptation: Bayesian Stiefel Manifold Priors for Reliable Parameter-Efficient Fine-Tuning Ibne Farabi Shihab et.al. 2602.17809 null
2026-02-19 Hardware-Aware Design of a GNN-Based Hit Filtering Algorithm for the Belle II Level-1 Trigger Greta Heine et.al. 2602.17761 null
2026-02-19 Sink-Aware Pruning for Diffusion Language Models Aidar Myrzakhan et.al. 2602.17664 null
2026-02-19 Reverso: Efficient Time Series Foundation Models for Zero-shot Forecasting Xinghong Fu et.al. 2602.17634 null
2026-02-19 Revisiting Weight Regularization for Low-Rank Continual Learning Yaoyue Zheng et.al. 2602.17559 null
2026-02-19 LORA-CRAFT: Cross-layer Rank Adaptation via Frozen Tucker Decomposition of Pre-trained Attention Weights Kasun Dewage et.al. 2602.17510 null
2026-02-19 Analytical Derivation of Quantization Error in Threshold Level Quantizers Using Bipolar PFM Ricardo Carrero et.al. 2602.17471 null
2026-02-19 SpectralGCD: Spectral Concept Selection and Cross-modal Representation Learning for Generalized Category Discovery Lorenzo Caselli et.al. 2602.17395 null
2026-02-20 Contact-Anchored Proprioceptive Odometry for Quadruped Robots Minxing Sun et.al. 2602.17393 null
2026-02-19 Efficient privacy loss accounting for subsampling and random allocation Vitaly Feldman et.al. 2602.17284 null
2026-02-19 EntropyPrune: Matrix Entropy Guided Visual Token Pruning for Multimodal Large Language Models Yahong Wang et.al. 2602.17196 null
2026-02-19 Bonsai: A Framework for Convolutional Neural Network Acceleration Using Criterion-Based Pruning Joseph Bingham et.al. 2602.17145 null
2026-02-19 Efficient Parallel Algorithm for Decomposing Hard CircuitSAT Instances Victor Kondratiev et.al. 2602.17130 null
2026-02-19 FLoRG: Federated Fine-tuning with Low-rank Gram Matrices and Procrustes Alignment Chuiyang Meng et.al. 2602.17095 null
2026-02-19 Sign Lock-In: Randomly Initialized Weight Signs Persist and Bottleneck Sub-Bit Model Compression Akira Sakai et.al. 2602.17063 null
2026-02-19 Amber-Image: Efficient Compression of Large-Scale Diffusion Transformers Chaojie Yang et.al. 2602.17047 null
2026-02-18 BrainRVQ: A High-Fidelity EEG Foundation Model via Dual-Domain Residual Quantization and Hierarchical Autoregression Mingzhe Cui et.al. 2602.16951 null
2026-02-18 Numerical study of electron acceleration by microwave-driven plasma wakefields in rectangular waveguides Jesús E. López et.al. 2602.16896 null
2026-02-18 ML-driven detection and reduction of ballast information in multi-modal datasets Yaroslav Solovko et.al. 2602.16876 null
2026-02-18 Training Large Reasoning Models Efficiently via Progressive Thought Encoding Zeliang Zhang et.al. 2602.16839 null
2026-02-18 NeST: Neuron Selective Tuning for LLM Safety Sasha Behrouzi et.al. 2602.16835 null
2026-02-18 U-FedTomAtt: Ultra-lightweight Federated Learning with Attention for Tomato Disease Recognition Romiyal George et.al. 2602.16749 null
2026-02-18 One Hand to Rule Them All: Canonical Representations for Unified Dexterous Manipulation Zhenyu Wei et.al. 2602.16712 null
2026-02-20 Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environments Yangjie Xu et.al. 2602.16653 null
2026-02-18 Quecto-V1: Empirical Analysis of 8-bit Quantized Small Language Models for On-Device Legal Retrieval Subrit Dikshit et.al. 2602.16640 null
2026-02-18 A Scalable Approach to Solving Simulation-Based Network Security Games Michael Lanier et.al. 2602.16564 null
2026-02-18 Subtractive Modulative Network with Learnable Periodic Activations Tiou Wang et.al. 2602.16337 null
2026-02-18 RefineFormer3D: Efficient 3D Medical Image Segmentation via Adaptive Multi-Scale Transformer with Cross Attention Fusion Kavyansh Tyagi et.al. 2602.16320 null
2026-02-18 AFFMAE: Scalable and Efficient Vision Pretraining for Desktop Graphics Cards David Smerkous et.al. 2602.16249 null
2026-02-18 Uncertainty-Guided Inference-Time Depth Adaptation for Transformer-Based Visual Tracking Patrick Poggi et.al. 2602.16160 null
2026-02-18 Rethinking ANN-based Retrieval: Multifaceted Learnable Index for Large-scale Recommendation System Jiang Zhang et.al. 2602.16124 null
2026-02-18 Collaborative Zone-Adaptive Zero-Day Intrusion Detection for IoBT Amirmohammad Pasdar et.al. 2602.16098 null
2026-02-17 LGQ: Learning Discretization Geometry for Scalable and Stable Image Tokenization Idil Bilge Altun et.al. 2602.16086 null
2026-02-17 ROIX-Comp: Optimizing X-ray Computed Tomography Imaging Strategy for Data Reduction and Reconstruction Amarjit Singh et.al. 2602.15917 null
2026-02-17 QwaveMPS: An efficient open-source Python package for simulating non-Markovian waveguide-QED using matrix product states Sofia Arranz Regidor et.al. 2602.15826 null
2026-02-17 Quantitative local recovery of Kerr-de Sitter parameters from high-frequency equatorial quasinormal modes Ruiliang Li et.al. 2602.15764 null
2026-02-17 Enabling Low-Latency Machine learning on Radiation-Hard FPGAs with hls4ml Katya Govorkova et.al. 2602.15751 null
2026-02-17 Learning to Retrieve Navigable Candidates for Efficient Vision-and-Language Navigation Shutian Gu et.al. 2602.15724 null
2026-02-18 ToaSt: Token Channel Selection and Structured Pruning for Efficient ViT Hyunchan Moon et.al. 2602.15720 null
2026-02-17 1-Bit Wonder: Improving QAT Performance in the Low-Bit Regime through K-Means Quantization Sohir Maskey et.al. 2602.15563 null
2026-02-17 Efficient Road Renovation Scheduling under Uncertainty using Lower Bound Pruning Robbert Bosch et.al. 2602.15554 null
2026-02-17 jina-embeddings-v5-text: Task-Targeted Embedding Distillation Mohammad Kalim Akram et.al. 2602.15547 null
2026-02-17 LEADER: Lightweight End-to-End Attention-Gated Dual Autoencoder for Robust Minutiae Extraction Raffaele Cappelli et.al. 2602.15493 null
2026-02-17 The Vision Wormhole: Latent-Space Communication in Heterogeneous Multi-Agent Systems Xiaoze Liu et.al. 2602.15382 null
2026-02-17 Orchestration-Free Customer Service Automation: A Privacy-Preserving and Flowchart-Guided Framework Mengze Hong et.al. 2602.15377 null
2026-02-17 Sparse Additive Model Pruning for Order-Based Causal Structure Learning Kentaro Kanamori et.al. 2602.15306 null
2026-02-16 Pruning distance of upset-decomposable persistence modules Roy Nicolas Nehme et.al. 2602.15243 null
2026-02-16 Phase Transitions in Neural Networks Pruning Diego Pesce et.al. 2602.15224 null
2026-02-16 COMPOT: Calibration-Optimized Matrix Procrustes Orthogonalization for Transformers Compression Denis Makhov et.al. 2602.15200 null
2026-02-16 ScrapeGraphAI-100k: A Large-Scale Dataset for LLM-Based Web Information Extraction William Brach et.al. 2602.15189 null
2026-02-16 Quantization as a Categorical Equivalence for Hilbert Bimodules and Lagrangian Relations Benjamin H. Feintzeig et.al. 2602.15188 null
2026-02-16 Learning Data-Efficient and Generalizable Neural Operators via Fundamental Physics Knowledge Siying Ma et.al. 2602.15184 null
2026-02-16 Synthesizing Trajectory Queries from Examples Stephen Mell et.al. 2602.15164 null
2026-02-16 Protecting Language Models Against Unauthorized Distillation through Trace Rewriting Xinhang Ma et.al. 2602.15143 null
2026-02-16 CGRA-DeBERTa Concept Guided Residual Augmentation Transformer for Theologically Islamic Understanding Tahir Hussain et.al. 2602.15139 null
2026-02-16 Text Style Transfer with Parameter-efficient LLM Finetuning and Round-trip Translation Ruoxi Liu et.al. 2602.15013 null
2026-02-16 Scaling QAOA: transferring optimal adiabatic schedules from small-scale to large-scale variational circuits Ugo Nzongani et.al. 2602.14986 null
2026-02-16 DRAMA: Domain Retrieval using Adaptive Module Allocation Pranav Kasela et.al. 2602.14960 null
2026-02-16 Algorithmic Simplification of Neural Networks with Mosaic-of-Motifs Pedram Bakhtiarifard et.al. 2602.14896 link
2026-02-16 Depth Completion as Parameter-Efficient Test-Time Adaptation Bingxin Ke et.al. 2602.14751 null
2026-02-16 D2-LoRA: A Synergistic Approach to Differential and Directional Low-Rank Adaptation Nozomu Fujisawa et.al. 2602.14728 null
2026-02-16 GradMAP: Faster Layer Pruning with Gradient Metric and Projection Compensation Hao Liu et.al. 2602.14649 null
2026-02-16 RNM-TD3: N:M Semi-structured Sparse Reinforcement Learning From Scratch Isam Vrce et.al. 2602.14578 null
2026-02-16 Efficient Text-Guided Convolutional Adapter for the Diffusion Model Aryan Das et.al. 2602.14514 null
2026-02-16 Parameter-Efficient Fine-Tuning of LLMs with Mixture of Space Experts Buze Zhang et.al. 2602.14490 null
2026-02-16 S2D: Selective Spectral Decay for Quantization-Friendly Conditioning of Neural Activations Arnav Chavan et.al. 2602.14432 null
2026-02-16 LLM-Guided Knowledge Distillation for Temporal Knowledge Graph Reasoning Wang Xing et.al. 2602.14428 null
2026-02-15 Train Less, Learn More: Adaptive Efficient Rollout Optimization for Group-Based Reinforcement Learning Zhi Zhang et.al. 2602.14338 null
2026-02-15 Floe: Federated Specialization for Real-Time LLM-SLM Inference Chunlin Tian et.al. 2602.14302 null
2026-02-15 DeepFusion: Accelerating MoE Training via Federated Knowledge Distillation from Heterogeneous Edge Devices Songyuan Li et.al. 2602.14301 null
2026-02-15 Energy-Efficient Over-the-Air Federated Learning via Pinching Antenna Systems Saba Asaad et.al. 2602.14250 null
2026-02-15 Towards Spatial Transcriptomics-driven Pathology Foundation Models Konstantin Hemker et.al. 2602.14177 null
2026-02-15 ROAST: Rollout-based On-distribution Activation Steering Technique Xuanbo Su et.al. 2602.14143 null
2026-02-15 TabTracer: Monte Carlo Tree Search for Complex Table Reasoning with Large Language Models Zhizhao Luo et.al. 2602.14089 null
2026-02-15 Policy Gradient with Adaptive Entropy Annealing for Continual Fine-Tuning Yaqian Zhang et.al. 2602.14078 null
2026-02-15 LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts Yang Liu et.al. 2602.14060 null
2026-02-15 Explainability-Inspired Layer-Wise Pruning of Deep Neural Networks for Efficient Object Detection Abhinav Shukla et.al. 2602.14040 null
2026-02-15 Extended Universal Joint Source-Channel Coding for Digital Semantic Communications: Improving Channel-Adaptability Eunsoo Kim et.al. 2602.14018 null
2026-02-15 A Deployment-Friendly Foundational Framework for Efficient Computational Pathology Yu Cai et.al. 2602.14010 null
2026-02-15 Elastic Diffusion Transformer Jiangshan Wang et.al. 2602.13993 null
2026-02-15 Efficient Off-Grid Near-Field Cascade Channel Estimation for XL-IRS Systems via Tucker Decomposition Wenzhou Cao et.al. 2602.13988 null
2026-02-15 QuRL: Efficient Reinforcement Learning with Quantized Rollout Yuhang Li et.al. 2602.13953 null
2026-02-14 Evaluating Prompt Engineering Techniques for RAG in Small Language Models: A Multi-Hop QA Approach Amir Hossein Mohammadi et.al. 2602.13890 null
2026-02-14 Parameter-Efficient Fine-Tuning of DINOv2 for Large-Scale Font Classification Daniel Chen et.al. 2602.13889 null
2026-02-14 Bridging the Multilingual Safety Divide: Efficient, Culturally-Aware Alignment for Global South Languages Somnath Banerjee et.al. 2602.13867 null
2026-02-14 High-Fidelity Causal Video Diffusion Models for Real-Time Ultra-Low-Bitrate Semantic Communication Cem Eteke et.al. 2602.13837 null
2026-02-14 NeuroMambaLLM: Dynamic Graph Learning of fMRI Functional Connectivity in Autistic Brains Using Mamba and Language Model Reasoning Yasaman Torabi et.al. 2602.13770 null
2026-02-14 MOTIF: Learning Action Motifs for Few-shot Cross-Embodiment Transfer Heng Zhi et.al. 2602.13764 null
2026-02-14 HBVLA: Pushing 1-Bit Post-Training Quantization for Vision-Language-Action Models Xin Yan et.al. 2602.13710 null
2026-02-14 A WDLoRA-Based Multimodal Generative Framework for Clinically Guided Corneal Confocal Microscopy Image Synthesis in Diabetic Neuropathy Xin Zhang et.al. 2602.13693 null
2026-02-14 HyFunc: Accelerating LLM-based Function Calls for Agentic AI through Hybrid-Model Cascade and Dynamic Templating Weibin Liao et.al. 2602.13665 null
2026-02-14 Layer-Guided UAV Tracking: Enhancing Efficiency and Occlusion Robustness Yang Zhou et.al. 2602.13636 null
2026-02-14 GEMs: Breaking the Long-Sequence Barrier in Generative Recommendation with a Multi-Stream Decoder Yu Zhou et.al. 2602.13631 null
2026-02-14 Compact LLM Deployment and World Model Assisted Offloading in Mobile Edge Computing Ruichen Zhang et.al. 2602.13628 null
2026-02-14 Parametric-Sensitivity Aware Retransmission for Efficient AI Downloading You Zhou et.al. 2602.13607 null
2026-02-14 The Quantization Trap: Breaking Linear Scaling Laws in Multi-Hop Reasoning Henry Han et.al. 2602.13595 null
2026-02-14 Unleash the Potential of Long Semantic IDs for Generative Recommendation Ming Xia et.al. 2602.13573 null
2026-02-14 DistillLens: Symmetric Knowledge Distillation Through Logit Lens Manish Dhakal et.al. 2602.13567 null
2026-02-13 Quantization-Robust LLM Unlearning via Low-Rank Adaptation João Vitor Boer Abitante et.al. 2602.13151 null
2026-02-13 FlashSchNet: Fast and Accurate Coarse-Grained Neural Network Molecular Dynamics Pingzhi Li et.al. 2602.13140 null
2026-02-13 EXCODER: EXplainable Classification Of DiscretE time series Representations Yannik Hahn et.al. 2602.13087 null
2026-02-13 LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning Juneyoung Park et.al. 2602.13073 null
2026-02-13 Quantization-Aware Collaborative Inference for Large Embodied AI Models Zhonghao Lyu et.al. 2602.13052 null
2026-02-13 Resource-Efficient Gesture Recognition through Convexified Attention Daniel Schwartz et.al. 2602.13030 null
2026-02-13 A two-step approach for speech enhancement in low-SNR scenarios using cyclostationary beamforming and DNNs Giovanni Bologni et.al. 2602.12986 null
2026-02-13 Multi-Dimensional Visual Data Recovery: Scale-Aware Tensor Modeling and Accelerated Randomized Computation Wenjin Qin et.al. 2602.12982 null
2026-02-13 Limits of Thermal Conductance Quantization in Chiral Topological Josephson Junctions Daniel Gresta et.al. 2602.12947 null
2026-02-13 Unleashing MLLMs on the Edge: A Unified Framework for Cross-Modal ReID via Adaptive SVD Distillation Hongbo Jiang et.al. 2602.12936 null
2026-02-13 WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning Junjie Wang et.al. 2602.12852 null
2026-02-13 Adaptive Structured Pruning of Convolutional Neural Networks for Time Series Classification Javidan Abdullayev et.al. 2602.12744 null
2026-02-13 Trust the uncertain teacher: distilling dark knowledge via calibrated uncertainty Jeonghyun Kim et.al. 2602.12687 null
2026-02-13 $\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models Yuang Cai et.al. 2602.12674 null
2026-02-13 PMG: Parameterized Motion Generator for Human-like Locomotion Control Chenxi Han et.al. 2602.12656 null
2026-02-13 Vision Token Reduction via Attention-Driven Self-Compression for Efficient Multimodal Large Language Models Omer Faruk Deniz et.al. 2602.12618 null
2026-02-13 QuEPT: Quantized Elastic Precision Transformers with One-Shot Calibration for Multi-Bit Switching Ke Xu et.al. 2602.12609 null
2026-02-13 Monte Carlo Tree Search with Reasoning Path Refinement for Small Language Models in Conversational Text-to-NoSQL Xubang Xiong et.al. 2602.12574 null
2026-02-13 Constraint-Rectified Training for Efficient Chain-of-Thought Qinhang Wu et.al. 2602.12526 null
2026-02-12 Human-Like Coarse Object Representations in Vision Models Andrey Gizdov et.al. 2602.12486 null
2026-02-12 Rational Neural Networks have Expressivity Advantages Maosen Tang et.al. 2602.12390 null
2026-02-12 LLaMo: Scaling Pretrained Language Models for Unified Motion Understanding and Generation with Continuous Autoregressive Tokens Zekun Li et.al. 2602.12370 null
2026-02-12 On-Policy Context Distillation for Language Models Tianzhu Ye et.al. 2602.12275 null
2026-02-13 DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing Dianyi Wang et.al. 2602.12205 null
2026-02-12 SAM3-LiteText: An Anatomical Study of the SAM3 Text Encoder for Efficient Vision-Language Segmentation Chengxi Zeng et.al. 2602.12173 null
2026-02-12 Pedagogically-Inspired Data Synthesis for Language Model Knowledge Distillation Bowei He et.al. 2602.12172 null
2026-02-12 Compress, Cross and Scale: Multi-Level Compression Cross Networks for Efficient Scaling in Recommender Systems Heng Yu et.al. 2602.12041 null
2026-02-12 Improved state mixing in higher-order and block diagonal linear recurrent networks Igor Dubinin et.al. 2602.12021 null
2026-02-13 LaCy: What Small Language Models Can and Should Learn is Not Just a Question of Loss Szilvia Ujváry et.al. 2602.12005 null
2026-02-12 Manifold-Aware Temporal Domain Generalization for Large Language Models Yiheng Yao et.al. 2602.11965 null
2026-02-12 Extending Puzzle for Mixture-of-Experts Reasoning Models with Application to GPT-OSS Acceleration Akhiad Bercovich et.al. 2602.11937 null
2026-02-12 Optimal Quantization for Nonuniform Densities on Spherical Curves Silpi Saha et.al. 2602.11926 null
2026-02-12 Improving Code Generation via Small Language Model-as-a-judge Giuseppe Crupi et.al. 2602.11911 null
2026-02-12 Where Bits Matter in World Model Planning: A Paired Mixed-Bit Study for Efficient Spatial Reasoning Suraj Ranganath et.al. 2602.11882 null
2026-02-12 MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling MiniCPM Team et.al. 2602.11761 null
2026-02-12 Dopamine: Brain Modes, Not Brains Shervin Ghasemlou et.al. 2602.11726 null
2026-02-12 LAER-MoE: Load-Adaptive Expert Re-layout for Efficient Mixture-of-Experts Training Xinyi Liu et.al. 2602.11686 null
2026-02-12 U-Net with Hadamard Transform and DCT Latent Spaces for Next-day Wildfire Spread Prediction Yingyi Luo et.al. 2602.11672 null
2026-02-12 LoRA-based Parameter-Efficient LLMs for Continuous Learning in Edge-based Malware Detection Christian Rondanini et.al. 2602.11655 null
2026-02-12 Quantization Mapping on Dirac Dynamics via Voltage-Driven Charge Density in Monolayer Graphene: A Klein Paradox and Entropy-Ruled Wavevector Mechanics Study Karuppuchamy Navamani et.al. 2602.11604 null
2026-02-12 Move What Matters: Parameter-Efficient Domain Adaptation via Optimal Transport Flow for Collaborative Perception Zesheng Jia et.al. 2602.11565 null
2026-02-12 Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm Jinrui Zhang et.al. 2602.11543 null
2026-02-12 Differentially Private and Communication Efficient Large Language Model Split Inference via Stochastic Quantization and Soft Prompt Yujie Gu et.al. 2602.11513 null
2026-02-11 Investigation of Toroidal Rotation Effects on Spherical Torus Equilibria using the Fast Spectral Solver VEQ-R Xingyu Li et.al. 2602.11422 null
2026-02-11 Efficient Simulation of Pre-Born-Oppenheimer Dynamics on a Quantum Computer Matthew Pocrnic et.al. 2602.11272 null
2026-02-11 Reed-Muller Error-Correction Code Encoder for SFQ-to-CMOS Interface Circuits Yerzhan Mustafa et.al. 2602.11140 null
2026-02-11 PuriLight: A Lightweight Shuffle and Purification Framework for Monocular Depth Estimation Yujie Chen et.al. 2602.11066 null
2026-02-11 ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression Ammar Ali et.al. 2602.11008 null
2026-02-11 Enhancing Predictability of Multi-Tenant DNN Inference for Autonomous Vehicles’ Perception Liangkai Liu et.al. 2602.11004 null
2026-02-11 LoRA-Squeeze: Simple and Effective Post-Tuning and In-Tuning Compression of LoRA Modules Ivan Vulić et.al. 2602.10993 null
2026-02-11 Deformation quantization of symplectic vector fields Haoyuan Gao et.al. 2602.10988 null
2026-02-11 MoEEdit: Efficient and Routing-Stable Knowledge Editing for Mixture-of-Experts LLMs Yupu Gu et.al. 2602.10965 null
2026-02-11 Agentic Knowledge Distillation: Autonomous Training of Small Language Models for SMS Threat Detection Adel ElZemity et.al. 2602.10869 null
2026-02-11 Enhancing Multivariate Time Series Forecasting with Global Temporal Retrieval Fanpu Cao et.al. 2602.10847 null
2026-02-11 Resource-Efficient RGB-Only Action Recognition for Edge Deployment Dongsik Yoon et.al. 2602.10818 null
2026-02-11 EST: Towards Efficient Scaling Laws in Click-Through Rate Prediction via Unified Modeling Mingyang Liu et.al. 2602.10811 null
2026-02-11 GoodVibe: Security-by-Vibe for LLM-Based Code Generation Maximilian Thang et.al. 2602.10778 null
2026-02-12 Efficient Operator Selection and Warm-Start Strategy for Excitations in Variational Quantum Eigensolvers Max Haas et.al. 2602.10776 null
2026-02-11 Kalman Linear Attention: Parallel Bayesian Filtering For Efficient Language Modelling and State Tracking Vaisakh Shaj et.al. 2602.10743 null
2026-02-11 SnapMLA: Efficient Long-Context MLA Decoding via Hardware-Aware FP8 Quantized Pipelining Yifan Zhang et.al. 2602.10718 null
2026-02-11 Spend Search Where It Pays: Value-Guided Structured Sampling and Optimization for Generative Recommendation Jie Jiang et.al. 2602.10699 null
2026-02-11 Bridging the Compression-Precision Paradox: A Hybrid Architecture for Clinical EEG Report Generation with Guaranteed Measurement Accuracy Wuyang Zhang et.al. 2602.10544 null
2026-02-11 Efficient Computation of Maximum Flexi-Clique in Networks Song Kim et.al. 2602.10459 null
2026-02-11 Compute Only Once: UG-Separation for Efficient Large Recommendation Models Hui Lu et.al. 2602.10455 null
2026-02-11 End-to-End Semantic ID Generation for Generative Advertisement Recommendation Jie Jiang et.al. 2602.10445 null
2026-02-11 QTALE: Quantization-Robust Token-Adaptive Layer Execution for LLMs Kanghyun Noh et.al. 2602.10431 null
2026-02-11 Modular Multi-Task Learning for Chemical Reaction Prediction Jiayun Pang et.al. 2602.10404 null
2026-02-10 Theoretical Analysis of Contrastive Learning under Imbalanced Data: From Training Dynamics to a Pruning Solution Haixu Liao et.al. 2602.10357 null
2026-02-10 Efficient Policy Adaptation for Voltage Control Under Unknown Topology Changes Jie Feng et.al. 2602.10355 null
2026-02-10 Efficient reduction of stellar contamination and noise in planetary transmission spectra using neural networks David S. Duque-Castaño et.al. 2602.10330 null
2026-02-10 R2RAG-Flood: A reasoning-reinforced training-free retrieval augmentation generation framework for flood damage nowcasting Lipai Huang et.al. 2602.10312 null
2026-02-10 Optimal Bounds-Only Pruning for Spatial AkNN Joins Dominik Winecki et.al. 2602.10027 null
2026-02-10 Answer First, Reason Later: Aligning Search Relevance via Mode-Balanced Reinforcement Learning Shijie Zhang et.al. 2602.10006 null
2026-02-10 AdaTSQ: Pushing the Pareto Frontier of Diffusion Transformers via Temporal-Sensitivity Quantization Shaoqiu Zhang et.al. 2602.09883 null
2026-02-10 BabyMamba-HAR: Lightweight Selective State Space Models for Efficient Human Activity Recognition on Resource Constrained Devices Mridankan Mandal et.al. 2602.09872 null
2026-02-11 Text summarization via global structure awareness Jiaquan Zhang et.al. 2602.09821 null
2026-02-10 CompSplat: Compression-aware 3D Gaussian Splatting for Real-world Video Hojun Song et.al. 2602.09816 null
2026-02-10 From Lightweight CNNs to SpikeNets: Benchmarking Accuracy-Energy Tradeoffs with Pruned Spiking SqueezeNet Radib Bin Kabir et.al. 2602.09717 null
2026-02-10 Stellar-mass black holes in young massive and open stellar clusters – VII. Comparisons with gravitational-wave events until LVK-O4a and Gaia compact binaries Sambaran Banerjee et.al. 2602.09694 null
2026-02-10 Life Cycle-Aware Evaluation of Knowledge Distillation for Machine Translation: Environmental Impact and Translation Quality Trade-offs Joseph Attieh et.al. 2602.09691 null
2026-02-10 Talking with the Latents – how to convert your LLM into an astronomer Ilay Kamai et.al. 2602.09670 null
2026-02-10 MATA: Multi-Agent Framework for Reliable and Flexible Table Question Answering Sieun Hyeon et.al. 2602.09642 null
2026-02-10 TeleGate: Whole-Body Humanoid Teleoperation via Gated Expert Selection with Motion Prior Jie Li et.al. 2602.09628 null
2026-02-10 Multimode fiber laser cavities as nonlinear optical processors Dilem Eşlik et.al. 2602.09519 null
2026-02-11 Beyond Student: An Asymmetric Network for Neural Network Inheritance Yiyun Zhou et.al. 2602.09509 null
2026-02-10 Beyond Next-Token Alignment: Distilling Multimodal Large Language Models via Token Interactions Lin Chen et.al. 2602.09483 null
2026-02-10 Personalized Parameter-Efficient Fine-Tuning of Foundation Models for Multimodal Recommendation Sunwoo Kim et.al. 2602.09445 null
2026-02-10 Sparse Layer Sharpness-Aware Minimization for Efficient Fine-Tuning Yifei Cheng et.al. 2602.09395 null
2026-02-10 AfriNLLB: Efficient Translation Models for African Languages Yasmin Moslem et.al. 2602.09373 null
2026-02-10 LLM-CoOpt: A Co-Design and Optimization Framework for Efficient LLM Inference on Heterogeneous Platforms Jie Kong et.al. 2602.09323 null
2026-02-10 Effective MoE-based LLM Compression by Exploiting Heterogeneous Inter-Group Experts Routing Frequency and Information Density Zhendong Mi et.al. 2602.09316 null
2026-02-09 A Lightweight Multi-View Approach to Short-Term Load Forecasting Julien Guité-Vinet et.al. 2602.09220 null
2026-02-09 Train Less, Infer Faster: Efficient Model Finetuning and Compression via Structured Sparsity Jonathan Svirsky et.al. 2602.09169 null
2026-02-09 UniComp: A Unified Evaluation of Large Language Model Compression via Pruning, Quantization and Distillation Jonathan von Rad et.al. 2602.09130 null
2026-02-09 Looping Back to Move Forward: Recursive Transformers for Efficient and Flexible Large Multimodal Models Ruihan Xu et.al. 2602.09080 null
2026-02-09 CLUE: Crossmodal disambiguation via Language-vision Understanding with attEntion Mouad Abrini et.al. 2602.08999 null
2026-02-09 AMS-HD: Hyperdimensional Computing for Real-Time and Energy-Efficient Acute Mountain Sickness Detection Abu Masum et.al. 2602.08916 null
2026-02-09 Efficient and Stable Reinforcement Learning for Diffusion Language Models Jiawei Liu et.al. 2602.08905 null
2026-02-09 FlattenGPT: Depth Compression for Transformer with Layer Flattening Ruihan Xu et.al. 2602.08858 null
2026-02-09 Omni-Video 2: Scaling MLLM-Conditioned Diffusion for Unified Video Generation and Editing Hao Yang et.al. 2602.08820 null
2026-02-09 FlexMoRE: A Flexible Mixture of Rank-heterogeneous Experts for Efficient Federatedly-trained Large Language Models Annemette Brok Pirchert et.al. 2602.08818 null
2026-02-09 Reliable one-bit quantization of bandlimited graph data via single-shot noise shaping Johannes Maly et.al. 2602.08669 null
2026-02-09 OneLive: Dynamically Unified Generative Framework for Live-Streaming Recommendation Shen Wang et.al. 2602.08612 null
2026-02-09 Beyond Scalar Scores: Reinforcement Learning for Error-Aware Quality Estimation of Machine Translation Archchana Sindhujan et.al. 2602.08600 null
2026-02-09 SDFed: Bridging Local Global Discrepancy via Subspace Refinement and Divergence Control in Federated Prompt Learning Yicheng Di et.al. 2602.08590 null
2026-02-09 M-Loss: Quantifying Model Merging Compatibility with Limited Unlabeled Data Tiantong Wang et.al. 2602.08564 null
2026-02-09 Are Vision Foundation Models Foundational for Electron Microscopy Image Segmentation? Caterina Fuster-Barceló et.al. 2602.08505 null
2026-02-09 RIFLE: Robust Distillation-based FL for Deep Model Deployment on Resource-Constrained IoT Networks Pouria Arefijamal et.al. 2602.08446 null
2026-02-09 OJBKQ: Objective-Joint Babai-Klein Quantization Xinyu Wang et.al. 2602.08376 null
2026-02-09 Quantization-aware Photonic Homodyne computing for Accelerated Artificial Intelligence and Scientific Simulation Lian Zhou et.al. 2602.08269 null
2026-02-09 PTS-SNN: A Prompt-Tuned Temporal Shift Spiking Neural Networks for Efficient Speech Emotion Recognition Xun Su et.al. 2602.08240 null
2026-02-09 Linearization Explains Fine-Tuning in Large Language Models Zahra Rahimi Afzal et.al. 2602.08239 null
2026-02-10 Efficient-SAM2: Accelerating SAM2 with Object-Aware Visual Encoding and Memory Retrieval Jing Zhang et.al. 2602.08224 null
2026-02-09 CADO: From Imitation to Cost Minimization for Heatmap-based Solvers in Combinatorial Optimization Hyungseok Song et.al. 2602.08210 null
2026-02-09 DAS-SK: An Adaptive Model Integrating Dual Atrous Separable and Selective Kernel CNN for Agriculture Semantic Segmentation Mei Ling Chee et.al. 2602.08168 null
2026-02-10 AFDM: Evolving OFDM Towards 6G+ Hyeon Seok Rou et.al. 2602.08163 null
2026-02-08 Robustness of Vision Language Models Against Split-Image Harmful Input Attacks Md Rafi Ur Rashid et.al. 2602.08136 null
2026-02-08 Prune, Don’t Rebuild: Efficiently Tuning $α$-Reachable Graphs for Nearest Neighbor Search Tian Zhang et.al. 2602.08097 null
2026-02-08 Efficient and Adaptable Detection of Malicious LLM Prompts via Bootstrap Aggregation Shayan Ali Hassan et.al. 2602.08062 null
2026-02-08 Bielik Guard: Efficient Polish Language Safety Classifiers for LLM Content Moderation Krzysztof Wróbel et.al. 2602.07954 null
2026-02-08 Rethinking Practical and Efficient Quantization Calibration for Vision-Language Models Zhenhao Shang et.al. 2602.07899 null
2026-02-08 Efficient Anti-exploration via VQVAE and Fuzzy Clustering in Offline Reinforcement Learning Long Chen et.al. 2602.07889 null
2026-02-08 LQA: A Lightweight Quantized-Adaptive Framework for Vision-Language Models on the Edge Xin Wang et.al. 2602.07849 null
2026-02-08 Pruning as a Cooperative Game: Surrogate-Assisted Layer Contribution Estimation for Large Language Models Xuan Ding et.al. 2602.07804 null
2026-02-08 Accelerating Black Hole Image Generation via Latent Space Diffusion Models Ao Liu et.al. 2602.07786 null
2026-02-07 Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs Sagnik Mukherjee et.al. 2602.07729 null
2026-02-07 High-Resolution Solvers for 3D Helmholtz Scattering Problems Using PFFT and Eigenvector-Based Preconditioning Yury Gryazin et.al. 2602.07711 null
2026-02-07 SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models Juntong Wu et.al. 2602.07616 null
2026-02-07 Astro: Activation-guided Structured Regularization for Outlier-Robust LLM Post-Training Quantization Xi Chen et.al. 2602.07596 null
2026-02-07 ViCA: Efficient Multimodal LLMs with Vision-Only Cross-Attention Wenjie Liu et.al. 2602.07574 null
2026-02-07 VISOR: VIsual Spatial Object Reasoning for Language-driven Object Navigation Francesco Taioli et.al. 2602.07555 null
2026-02-07 Linguistic properties and model scale in brain encoding: from small to compressed language models Subba Reddy Oota et.al. 2602.07547 null
2026-02-07 Physical Analog Kolmogorov-Arnold Networks based on Reconfigurable Nonlinear-Processing Units Manuel Escudero et.al. 2602.07518 null
2026-02-07 ODELoRA: Training Low-Rank Adaptation by Solving Ordinary Differential Equations Yihang Gao et.al. 2602.07479 null
2026-02-07 On the Importance of a Multi-Scale Calibration for Quantization Seungwoo Son et.al. 2602.07465 null
2026-02-07 Efficient Post-Training Pruning of Large Language Models with Statistical Correction Peiqi Yu et.al. 2602.07375 null
2026-02-07 TernaryLM: Memory-Efficient Language Modeling via Native 1-Bit Quantization with Adaptive Layer-wise Scaling Nisharg Nargund et.al. 2602.07374 null
2026-02-07 Semantic Search At LinkedIn Fedor Borisyuk et.al. 2602.07309 null
2026-02-05 Shared LoRA Subspaces for almost Strict Continual Learning Prakhar Kaushik et.al. 2602.06043 null
2026-02-05 Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering Miranda Muqing Miao et.al. 2602.06022 null
2026-02-05 MambaVF: State Space Model for Efficient Video Fusion Zixiang Zhao et.al. 2602.06017 null
2026-02-05 Layer-wise LoRA fine-tuning: a similarity metric approach Keith Ando Ogawa et.al. 2602.05988 null
2026-02-05 CLIP-Map: Structured Matrix Mapping for Parameter-Efficient CLIP Compression Kangjie Zhang et.al. 2602.05909 null
2026-02-05 Regularized Calibration with Successive Rounding for Post-Training Quantization Seohyeon Cha et.al. 2602.05902 null
2026-02-05 Learning Compact Boolean Networks Shengpu Wang et.al. 2602.05830 null
2026-02-05 Focus-Scan-Refine: From Human Visual Perception to Efficient Visual Token Pruning Enwei Tong et.al. 2602.05809 null
2026-02-05 Price of universality in vector quantization is at most 0.11 bit Alina Harbuzova et.al. 2602.05790 null
2026-02-05 OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale Jingze Shi et.al. 2602.05711 null
2026-02-05 Cost-Efficient RAG for Entity Matching with LLMs: A Blocking-based Exploration Chuangtao Ma et.al. 2602.05708 null
2026-02-05 Consensus-Aligned Neuron Efficient Fine-Tuning Large Language Models for Multi-Domain Machine Translation Shuting Jiang et.al. 2602.05694 null
2026-02-05 Time-Complexity Characterization of NIST Lightweight Cryptography Finalists Najmul Hasan et.al. 2602.05641 null
2026-02-05 Shiva-DiT: Residual-Based Differentiable Top-$k$ Selection for Efficient Diffusion Transformers Jiaji Zhang et.al. 2602.05605 null
2026-02-05 MAGPrompt: Message-Adaptive Graph Prompt Tuning for Graph Neural Networks Long D. Nguyen et.al. 2602.05567 null
2026-02-05 Mapper-GIN: Lightweight Structural Graph Abstraction for Corrupted 3D Point Cloud Classification Jeongbin You et.al. 2602.05522 null
2026-02-05 VGGT-Motion: Motion-Aware Calibration-Free Monocular SLAM for Long-Range Consistency Zhuang Xiong et.al. 2602.05508 null
2026-02-05 SDFP: Speculative Decoding with FIT-Pruned Models for Training-Free and Plug-and-Play LLM Acceleration Hanyu Wei et.al. 2602.05499 null
2026-02-05 DistillER: Knowledge Distillation in Entity Resolution with Large Language Models Alexandros Zeakis et.al. 2602.05452 null
2026-02-05 RaBiT: Residual-Aware Binarization Training for Accurate and Efficient LLMs Youngcheon You et.al. 2602.05367 null
2026-02-05 AgentXRay: White-Boxing Agentic Systems via Workflow Reconstruction Ruijie Shi et.al. 2602.05353 null
2026-02-05 Consistency-Preserving Concept Erasure via Unsafe-Safe Pairing and Directional Fisher-weighted Adaptation Yongwoo Kim et.al. 2602.05339 null
2026-02-05 MentorCollab: Selective Large-to-Small Inference-Time Guidance for Efficient Reasoning Haojin Wang et.al. 2602.05307 null
2026-02-05 High-Performance Moment-Encoded Lattice Boltzmann Method with Stability-Guided Quantization Yixin Chen et.al. 2602.05295 null
2026-02-05 Unlocking Prototype Potential: An Efficient Tuning Framework for Few-Shot Class-Incremental Learning Shengqin Jiang et.al. 2602.05271 null
2026-02-05 CORP: Closed-Form One-shot Representation-Preserving Structured Pruning for Vision Transformers Boxiang Zhang et.al. 2602.05243 null
2026-02-05 Radon–Wasserstein Gradient Flows for Interacting-Particle Sampling in High Dimensions Elias Hess-Childs et.al. 2602.05227 null
2026-02-05 Diffusion-aided Extreme Video Compression with Lightweight Semantics Guidance Maojun Zhang et.al. 2602.05201 null
2026-02-05 An introduction to string states and their interactions Chrysoula Markou et.al. 2602.05173 null
2026-02-05 CoSA: Compressed Sensing-Based Adaptation of Large Language Models Songtao Wei et.al. 2602.05148 null
2026-02-04 Locas: Your Models are Principled Initializers of Locally-Supported Parametric Memories Sidi Lu et.al. 2602.05085 null
2026-02-04 Gabor Fields: Orientation-Selective Level-of-Detail for Volume Rendering Jorge Condor et.al. 2602.05081 null
2026-02-04 SynthForensics: A Multi-Generator Benchmark for Detecting Synthetic Video Deepfakes Roberto Leotta et.al. 2602.04939 null
2026-02-04 TurboBoA: Faster and Exact Attention-aware Quantization without Backpropagation Junhan Kim et.al. 2602.04929 null
2026-02-04 Pruning Minimal Reasoning Graphs for Efficient Retrieval-Augmented Generation Ning Wang et.al. 2602.04926 null
2026-02-04 The Key to State Reduction in Linear Attention: A Rank-based Perspective Philipp Nazari et.al. 2602.04852 null
2026-02-04 Light Forcing: Accelerating Autoregressive Video Diffusion via Sparse Attention Chengtao Lv et.al. 2602.04789 null
2026-02-04 Knowledge Distillation for mmWave Beam Prediction Using Sub-6 GHz Channels Sina Tavakolian et.al. 2602.04703 null
2026-02-04 REDistill: Robust Estimator Distillation for Balancing Robustness and Efficiency Ondrej Tybl et.al. 2602.04677 null
2026-02-04 Delving into Muon and Beyond: Deep Analysis and Extensions Xianbiao Qi et.al. 2602.04669 null
2026-02-04 Harmonia: Algorithm-Hardware Co-Design for Memory- and Compute-Efficient BFP-based LLM Inference Xinyu Wang et.al. 2602.04595 null
2026-02-04 Rethinking Weight Tying: Pseudo-Inverse Tying for Stable LM Training and Updates Jian Gu et.al. 2602.04556 null
2026-02-04 An Efficient Bayesian Framework for Inverse Problems via Optimization and Inversion: Surrogate Modeling, Parameter Inference, and Uncertainty Quantification Mihaela Chiappetta et.al. 2602.04537 null
2026-02-04 Greedy-Gnorm: A Gradient Matrix Norm-Based Alternative to Attention Entropy for Head Pruning Yuxi Guo et.al. 2602.04491 null
2026-02-04 Fine-tuning Pre-trained Vision-Language Models in a Human-Annotation-Free Manner Qian-Wei Wang et.al. 2602.04337 null
2026-02-04 Canonical Quantization of Cylindrical Waveguides: A Gauge-Based Approach Alexandre Delattre et.al. 2602.04295 null
2026-02-04 MiniRec: Data-Efficient Reinforcement Learning for LLM-based Recommendation Lin Wang et.al. 2602.04278 null
2026-02-04 Decoupled Hierarchical Distillation for Multimodal Emotion Recognition Yong Li et.al. 2602.04260 null
2026-02-04 Constructing Compact ADAPT Unitary Coupled-Cluster Ansatz with Parameter-Based Criterion Runhong He et.al. 2602.04253 null
2026-02-04 Provable Target Sample Complexity Improvements as Pre-Trained Models Scale Kazuto Fukuchi et.al. 2602.04233 null
2026-02-04 OAT: Ordered Action Tokenization Chaoqi Liu et.al. 2602.04215 null
2026-02-04 LatentTune: Efficient Tuning of High Dimensional Database Parameters via Latent Representation Learning Sein Kwon et.al. 2602.04190 null
2026-02-04 HoloEv-Net: Efficient Event-based Action Recognition via Holographic Spatial Embedding and Global Spectral Gating Weidong Hao et.al. 2602.04182 null
2026-02-04 Topology-Aware Revival for Efficient Sparse Training Meiling Jin et.al. 2602.04166 null
2026-02-04 BPDQ: Bit-Plane Decomposition Quantization on a Variable Grid for Large Language Models Junyu Chen et.al. 2602.04163 null
2026-02-04 Pruning for Generalization: A Transfer-Oriented Spatiotemporal Graph Framework Zihao Jing et.al. 2602.04153 null
2026-02-04 Interfaze: The Future of AI is built on Task-Specific Small Models Harsha Vardhan Khurdula et.al. 2602.04101 null
2026-02-03 Efficient Subgroup Analysis via Optimal Trees with Global Parameter Fusion Zhongming Xie et.al. 2602.04077 null
2026-02-03 Understanding and Guiding Layer Placement in Parameter-Efficient Fine-Tuning of Large Language Models Yichen Xu et.al. 2602.04019 null
2026-02-03 Efficient Long-Horizon Vision-Language-Action Models via Static-Dynamic Disentanglement Weikang Qiu et.al. 2602.03983 null
2026-02-03 Active Epistemic Control for Query-Efficient Verified Planning Shuhui Qu et.al. 2602.03974 null
2026-02-03 Entropy Reveals Block Importance in Masked Self-Supervised Vision Transformers Peihao Xiang et.al. 2602.03918 null
2026-02-03 Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing Tong Zheng et.al. 2602.03845 null
2026-02-03 Understanding and Exploiting Weight Update Sparsity for Communication-Efficient Distributed RL Erfan Miahi et.al. 2602.03839 null
2026-02-03 They Said Memes Were Harmless-We Found the Ones That Hurt: Decoding Jokes, Symbols, and Cultural References Sahil Tripathi et.al. 2602.03822 null
2026-02-03 Fast-Slow Efficient Training for Multimodal Large Language Models via Visual Token Pruning Dingkun Zhang et.al. 2602.03815 null
2026-02-03 On the Quantization-Dequantization Correspondence for (co)Poisson Hopf Algebras Andrea Rivezzi et.al. 2602.03810 null
2026-02-03 QVLA: Not All Channels Are Equal in Vision-Language-Action Model’s Quantization Yuhao Xu et.al. 2602.03782 null
2026-02-03 Edge-Optimized Vision-Language Models for Underground Infrastructure Assessment Johny J. Lopez et.al. 2602.03742 null
2026-02-03 Conflict-Resolving and Sharpness-Aware Minimization for Generalized Knowledge Editing with Multiple Updates Duy Nguyen et.al. 2602.03696 null
2026-02-03 Efficient Sequential Neural Network with Spatial-Temporal Attention and Linear LSTM for Robust Lane Detection Using Multi-Frame Images Sandeep Patil et.al. 2602.03669 null
2026-02-03 CALM: A Self-Adaptive Orchestration Approach for QoS-Aware Routing in Small Language Model based Systems Hemang Jain et.al. 2602.03632 null
2026-02-03 KTV: Keyframes and Key Tokens Selection for Efficient Training-Free Video LLMs Baiyang Song et.al. 2602.03615 null
2026-02-03 Quantization-Aware Regularizers for Deep Neural Networks Compression Dario Malchiodi et.al. 2602.03614 null
2026-02-03 APEX: Probing Neural Networks via Activation Perturbation Tao Ren et.al. 2602.03586 null
2026-02-03 Constrained Dynamic Gaussian Splatting Zihan Zheng et.al. 2602.03538 null
2026-02-03 MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization Maximilian Kleinegger et.al. 2602.03537 null
2026-02-03 PnP-U3D: Plug-and-Play 3D Framework Bridging Autoregression and Diffusion for Unified Understanding and Generation Yongwei Chen et.al. 2602.03533 null
2026-02-03 WARP Logic Neural Networks Lino Gerlach et.al. 2602.03527 null
2026-02-03 Generative Decompression: Optimal Lossy Decoding Against Distribution Mismatch Saeed R. Khosravirad et.al. 2602.03505 null
2026-02-03 DALI: A Workload-Aware Offloading Framework for Efficient MoE Inference on Local PCs Zeyu Zhu et.al. 2602.03495 null
2026-02-03 Inlier-Centric Post-Training Quantization for Object Detection Models Minsu Kim et.al. 2602.03472 null
2026-02-03 MeKi: Memory-based Expert Knowledge Injection for Efficient LLM Scaling Ning Ding et.al. 2602.03359 null
2026-02-03 RDT2: Exploring the Scaling Limit of UMI Data Towards Zero-Shot Cross-Embodiment Generalization Songming Liu et.al. 2602.03310 null
2026-02-03 POP: Prefill-Only Pruning for Efficient Large Model Inference Junhui He et.al. 2602.03295 null
2026-02-03 Merging Beyond: Streaming LLM Updates via Activation-Guided Rotations Yuxuan Yao et.al. 2602.03237 null
2026-02-03 PokeFusion Attention: Enhancing Reference-Free Style-Conditioned Generation Jingbang Tang et.al. 2602.03220 null
2026-02-03 FARTrack: Fast Autoregressive Visual Tracking with High Performance Guijie Wang et.al. 2602.03214 null
2026-02-03 WebSplatter: Enabling Cross-Device Efficient Gaussian Splatting in Web Browsers via WebGPU Yudong Han et.al. 2602.03207 null
2026-02-03 LSGQuant: Layer-Sensitivity Guided Quantization for One-Step Diffusion Real-World Video Super-Resolution Tianxing Wu et.al. 2602.03182 null
2026-02-03 BinaryDemoire: Moiré-Aware Binarization for Image Demoiréing Zheng Chen et.al. 2602.03176 null
2026-02-03 FASA: Frequency-aware Sparse Attention Yifei Wang et.al. 2602.03152 null
2026-02-03 Analyzing Zigbee Traffic: Datasets, Classification and Storage Trade-offs Antonio Boiano et.al. 2602.03140 null
2026-02-03 SwiftVLM: Efficient Vision-Language Model Inference via Cross-Layer Token Bypass Chen Qian et.al. 2602.03134 null
2026-02-03 Sharp $C^{1,\bar1}$ estimates in Kähler quantization and non-pluripolar Radon measures Zbigniew Błocki et.al. 2602.03111 null
2026-02-03 IVC-Prune: Revealing the Implicit Visual Coordinates in LVLMs for Vision Token Pruning Zhichao Sun et.al. 2602.03060 null
2026-02-03 SAES-SVD: Self-Adaptive Suppression of Accumulated and Local Errors for SVD-based LLM Compression Xing Hu et.al. 2602.03051 null
2026-02-03 SAFE-KD: Risk-Controlled Early-Exit Distillation for Vision Backbones Salim Khazem et.al. 2602.03043 null
2026-02-03 STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models Jiliang Ni et.al. 2602.03022 null
2026-02-03 FedKRSO: Communication and Memory Efficient Federated Fine-Tuning of Large Language Models Guohao Yang et.al. 2602.03019 null
2026-02-03 Agent Alpha: Tree Search Unifying Generation, Exploration and Evaluation for Computer-Use Agents Sizhe Tang et.al. 2602.02995 null
2026-02-03 Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization Haocheng Xi et.al. 2602.02958 null
2026-02-03 Nüwa: Mending the Spatial Integrity Torn by VLM Token Pruning Yihong Huang et.al. 2602.02951 null
2026-02-02 TraceNAS: Zero-shot LLM Pruning via Gradient Trace Correlation Prajna G. Malettira et.al. 2602.02891 null
2026-02-02 Efficiency Optimizations for Superblock-based Sparse Retrieval Parker Carlson et.al. 2602.02883 null
2026-02-02 Zero Sum SVD: Balancing Loss Sensitivity for Low Rank LLM Compression Ali Abbasi et.al. 2602.02848 null
2026-02-02 Col-Bandit: Zero-Shot Query-Time Pruning for Late-Interaction Retrieval Roi Pony et.al. 2602.02827 null
2026-02-02 When Efficient Communication Explains Convexity Ashvin Ranjan et.al. 2602.02821 null
2026-02-02 Efficient Counterfactual Estimation of Conditional Greeks via Malliavin-based Weak Derivatives Vikram Krishnamurthy et.al. 2602.02811 null
2026-02-02 De-Linearizing Agent Traces: Bayesian Inference of Latent Partial Orders for Efficient Execution Dongqing Li et.al. 2602.02806 null
2026-02-02 Scaling Small Agents Through Strategy Auctions Lisa Alazraki et.al. 2602.02751 null
2026-02-02 TopoPrune: Robust Data Pruning via Unified Latent Space Topology Arjun Roy et.al. 2602.02739 null
2026-02-02 Dynamic Mix Precision Routing for Efficient Multi-step LLM Interaction Yuanzhe Li et.al. 2602.02711 null
2026-02-02 Graph-Augmented Reasoning with Large Language Models for Tobacco Pest and Disease Management Siyu Li et.al. 2602.02635 null
2026-02-02 Rethinking Test-Time Training: Tilting The Latent Distribution For Few-Shot Source-Free Adaptation Tahir Qasim Syed et.al. 2602.02633 null
2026-02-02 Performance of Small Language Model Pretraining on FABRIC: An Empirical Study Praveen Rao et.al. 2602.02632 null
2026-02-02 Age-Aware Edge-Blind Federated Learning via Over-the-Air Aggregation Ahmed M. Elshazly et.al. 2602.02469 null
2026-02-02 Hierarchical Federated Learning with SignSGD: A Highly Communication-Efficient Approach Amirreza Kazemi et.al. 2602.02355 null
2026-02-02 Rethinking Generative Recommender Tokenizer: Recsys-Native Encoding and Semantic Quantization Beyond LLMs Yu Liang et.al. 2602.02338 null
2026-02-02 Enhancing Indoor Occupancy Prediction via Sparse Query-Based Multi-Level Consistent Knowledge Distillation Xiang Li et.al. 2602.02318 null
2026-02-02 MAIN-VLA: Modeling Abstraction of Intention and eNvironment for Vision-Language-Action Models Zheyuan Zhou et.al. 2602.02212 null
2026-02-02 More Than a Quick Glance: Overcoming the Greedy Bias in KV-Cache Compression Aryan Sood et.al. 2602.02199 null
2026-02-02 ECHO-2: A Large Scale Distributed Rollout Framework for Cost-efficient Reinforcement Learning Jie Xiao et.al. 2602.02192 null
2026-02-02 Reg4Pru: Regularisation Through Random Token Routing for Token Pruning Julian Wyatt et.al. 2602.02163 null
2026-02-02 Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing Lingkun Long et.al. 2602.02159 null
2026-02-02 Revisiting Adaptive Rounding with Vectorized Reparameterization for LLM Quantization Yuli Zhou et.al. 2602.02151 null
2026-02-02 Two-Stage Grid Optimization for Group-wise Quantization of LLMs Junhan Kim et.al. 2602.02126 null
2026-02-02 An Empirical Study of World Model Quantization Zhongqian Fu et.al. 2602.02110 null
2026-02-02 Teacher-Guided Student Self-Knowledge Distillation Using Diffusion Model Yu Wang et.al. 2602.02107 null
2026-02-02 UrbanGS: A Scalable and Efficient Architecture for Geometrically Accurate Large-Scene Reconstruction Changbai Li et.al. 2602.02089 null
2026-02-02 A global potential constrained by the Bohr-Sommerfeld quantization condition for $α$-decay half-lives of even-even nuclei Nguyen Gia Huy et.al. 2602.02070 null
2026-02-02 Ultrafast On-chip Online Learning via Spline Locality in Kolmogorov-Arnold Networks Duc Hoang et.al. 2602.02056 link
2026-02-02 Dissecting Outlier Dynamics in LLM NVFP4 Pretraining Peijie Dong et.al. 2602.02047 null
2026-02-02 Bandwidth-Efficient Multi-Agent Communication through Information Bottleneck and Vector Quantization Ahmad Farooq et.al. 2602.02035 null
2026-02-02 Hippasus: Effective and Efficient Automatic Feature Augmentation for Machine Learning Tasks on Relational Data Serafeim Papadias et.al. 2602.02025 null
2026-02-02 Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation Zhanghao Hu et.al. 2602.02007 null
2026-02-02 Preserve-Then-Quantize: Balancing Rank Budgets for Quantization Error Reconstruction in LLMs Yoonjun Cho et.al. 2602.02001 null
2026-02-02 On the Limits of Layer Pruning for Generative Reasoning in LLMs Safal Shrestha et.al. 2602.01997 null
2026-02-02 FlyPrompt: Brain-Inspired Random-Expanded Routing with Temporal-Ensemble Experts for General Continual Learning Hongwei Yan et.al. 2602.01976 null
2026-02-02 IntraSlice: Towards High-Performance Structural Pruning with Block-Intra PCA for LLMs Meng Li et.al. 2602.01975 null
2026-02-02 Efficient Epistemic Uncertainty Estimation for Large Language Models via Knowledge Distillation Seonghyeon Park et.al. 2602.01956 null
2026-02-02 Q Cache: Visual Attention is Valuable in Less than Half of Decode Layers for Multimodal Large Language Model Jiedong Zhuang et.al. 2602.01901 null
2026-02-02 ProxyImg: Towards Highly-Controllable Image Representation via Hierarchical Disentangled Proxy Embedding Ye Chen et.al. 2602.01881 null
2026-02-02 BTGenBot-2: Efficient Behavior Tree Generation with Small Language Models Riccardo Andrea Izzo et.al. 2602.01870 null
2026-02-02 Zero-Shot Knowledge Base Resizing for Rate-Adaptive Digital Semantic Communication Shumin Yao et.al. 2602.01829 null
2026-02-02 ParaGSE: Parallel Generative Speech Enhancement with Group-Vector-Quantization-based Neural Speech Codec Fei Liu et.al. 2602.01793 null
2026-02-02 Efficient Cross-Architecture Knowledge Transfer for Large-Scale Online User Response Prediction Yucheng Wu et.al. 2602.01775 null
2026-02-02 Reduced Phase Space Quantization and Quantum Corrected Entropy of Schwarzschild-de Sitter Horizons S. Jalalzadeh et.al. 2602.01767 null
2026-02-02 Tail-Aware Post-Training Quantization for 3D Geometry Models Sicheng Pan et.al. 2602.01741 null
2026-02-02 A Practical Tensor-Network Compression Pipeline for Production-Scale Large Language Models Sergii Kozyrev et.al. 2602.01613 null
2026-02-02 Token Pruning for In-Context Generation in Diffusion Transformers Junqing Lin et.al. 2602.01609 null
2026-02-02 Spectral-Aligned Pruning for Universal Error-Correcting Code Transformers Sanghyeon Cho et.al. 2602.01602 null
2026-02-02 Plain Transformers are Surprisingly Powerful Link Predictors Quang Truong et.al. 2602.01553 null
2026-02-02 NeuroAI Temporal Neural Networks (NeuTNNs): Microarchitecture and Design Framework for Specialized Neuromorphic Processing Units Shanmuga Venkatachalam et.al. 2602.01546 null
2026-02-02 When Is Rank-1 Enough? Geometry-Guided Initialization for Parameter-Efficient Fine-Tuning Haoran Zhao et.al. 2602.01522 null
2026-02-02 HDSense: An efficient method for ranking observable sensitivity Benoît Assi et.al. 2602.01509 null
2026-02-01 ConPress: Learning Efficient Reasoning from Multi-Question Contextual Pressure Jie Deng et.al. 2602.01472 null
2026-02-01 Rethinking Selective Knowledge Distillation Almog Tavor et.al. 2602.01395 null
2026-02-01 The Enhanced Physics-Informed Kolmogorov-Arnold Networks: Applications of Newton’s Laws in Financial Deep Reinforcement Learning (RL) Algorithms Trang Thoi et.al. 2602.01388 null
2026-02-01 Gradient-Aligned Calibration for Post-Training Quantization of Diffusion Models Dung Anh Hoang et.al. 2602.01289 null
2026-02-01 Q-DiT4SR: Exploration of Detail-Preserving Diffusion Transformer Quantization for Real-World Image Super-Resolution Xun Zhang et.al. 2602.01273 null
2026-02-01 Mixture-of-World Models: Scaling Multi-Task Reinforcement Learning with Modular Latent Dynamics Boxuan Zhang et.al. 2602.01270 null
2026-01-30 Geometric Quantization by Paths, Part III: The Metaplectic Anomaly Patrick Iglesias-Zemmour et.al. 2601.23259 null
2026-01-30 Agile Reinforcement Learning through Separable Neural Architecture Rajib Mostakim et.al. 2601.23225 null
2026-01-30 High-quality generation of dynamic game content via small language models: A proof of concept Morten I. K. Munk et.al. 2601.23206 null
2026-01-30 Segment Any Events with Language Seungjun Lee et.al. 2601.23159 null
2026-01-30 Compressed BC-LISTA via Low-Rank Convolutional Decomposition Han Wang et.al. 2601.23148 null
2026-01-30 Lossy Compression of Cellular Network KPIs Andrea Pimpinella et.al. 2601.23105 null
2026-01-30 FlexLoRA: Entropy-Guided Flexible Low-Rank Adaptation Muqing Liu et.al. 2601.22905 null
2026-01-30 Decomposing and Composing: Towards Efficient Vision-Language Continual Learning via Rank-1 Expert Pool in a Single LoRA Zhan Fa et.al. 2601.22828 null
2026-01-30 CVeDRL: An Efficient Code Verifier via Difficulty-aware Reinforcement Learning Ji Shi et.al. 2601.22803 null
2026-01-30 Float8@2bits: Entropy Coding Enables Data-Free Model Compression Patrick Putzky et.al. 2601.22787 null
2026-01-30 Active Learning-Driven Lightweight YOLOv9: Enhancing Efficiency in Smart Agriculture Hung-Chih Tu et.al. 2601.22732 null
2026-01-30 Breaking the Blocks: Continuous Low-Rank Decomposed Scaling for Unified LLM Quantization and Adaptation Pingzhi Tang et.al. 2601.22716 null
2026-01-30 Gated Relational Alignment via Confidence-based Distillation for Efficient VLMs Yanlong Chen et.al. 2601.22709 null
2026-01-30 A Unified Study of LoRA Variants: Taxonomy, Review, Codebase, and Empirical Evaluation Haonan He et.al. 2601.22708 null
2026-01-30 FNF: Functional Network Fingerprint for Large Language Models Yiheng Liu et.al. 2601.22692 null
2026-01-30 Fire on Motion: Optimizing Video Pass-bands for Efficient Spiking Action Recognition Shuhan Ye et.al. 2601.22675 null
2026-01-30 DART-ing Through the Drift: Dynamic Tracing of Knowledge Neurons for Adaptive Inference-Time Pruning Abhishek Tyagi et.al. 2601.22632 null
2026-01-30 PEFT-MuTS: A Multivariate Parameter-Efficient Fine-Tuning Framework for Remaining Useful Life Prediction based on Cross-domain Time Series Representation Model En Fu et.al. 2601.22631 null
2026-01-30 Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry Zhuochun Li et.al. 2601.22588 null
2026-01-30 EUGens: Efficient, Unified, and General Dense Layers Sang Min Kim et.al. 2601.22563 null
2026-01-29 Understanding Efficiency: Quantization, Batching, and Serving Strategies in LLM Energy Use Julien Delavande et.al. 2601.22362 null
2026-01-29 MixQuant: Pushing the Limits of Block Rotations in Post-Training Quantization Sai Sanjeet et.al. 2601.22347 null
2026-01-29 Symmetry Breaking in Transformers for Efficient and Interpretable Training Eva Silverstein et.al. 2601.22257 null
2026-01-29 Is Hierarchical Quantization Essential for Optimal Reconstruction? Shirin Reyhanian et.al. 2601.22244 null
2026-01-29 Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts Yingfa Chen et.al. 2601.22156 null
2026-01-29 Pay for Hints, Not Answers: LLM Shepherding for Cost-Efficient Inference Ziming Dong et.al. 2601.22132 null
2026-01-29 A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine Anran Li et.al. 2601.22124 null
2026-01-29 ReactEMG Stroke: Healthy-to-Stroke Few-shot Adaptation for sEMG-Based Intent Detection Runsheng Wang et.al. 2601.22090 null
2026-01-29 Making Foundation Models Probabilistic via Singular Value Ensembles Mehmet Ozgur Turkoglu et.al. 2601.22068 null
2026-01-30 PocketDP3: Efficient Pocket-Scale 3D Visuomotor Policy Jinhao Zhang et.al. 2601.22018 null
2026-01-29 OVD: On-policy Verbal Distillation Jing Xiong et.al. 2601.21968 null
2026-01-29 From Generative Modeling to Clinical Classification: A GPT-Based Architecture for EHR Notes Fariba Afrin Irany et.al. 2601.21955 null
2026-01-29 KnowBias: Mitigating Social Bias in LLMs via Know-Bias Neuron Enhancement Jinhao Pan et.al. 2601.21864 null
2026-01-29 Visual Disentangled Diffusion Autoencoders: Scalable Counterfactual Generation for Foundation Models Sidney Bender et.al. 2601.21851 null
2026-01-29 Enhancing Language Models for Robust Greenwashing Detection Neil Heinrich Braun et.al. 2601.21722 null
2026-01-29 Why Attention Patterns Exist: A Unifying Temporal Perspective Analysis Qingyue Yang et.al. 2601.21709 null
2026-01-29 Can David Beat Goliath? On Multi-Hop Reasoning with Resource-Constrained Agents Hojae Han et.al. 2601.21699 null
2026-01-29 Do Not Waste Your Rollouts: Recycling Search Experience for Efficient Test-Time Scaling Xinglin Wang et.al. 2601.21684 null
2026-01-29 SWE-Spot: Building Small Repo-Experts with Repository-Centric Learning Jinjun Peng et.al. 2601.21649 null
2026-01-29 Leveraging rapid parameter estimates for efficient gravitational-wave Bayesian inference via posterior repartitioning Metha Prathaban et.al. 2601.21630 null
2026-01-29 HeRo-Q: A General Framework for Stable Low Bit Quantization via Hessian Conditioning Jinhao Zhang Yunquan Zhang et.al. 2601.21626 null
2026-01-29 Thinking Broad, Acting Fast: Latent Reasoning Distillation from Multi-Perspective Chain-of-Thought for E-Commerce Relevance Baopu Qiu et.al. 2601.21611 null
2026-01-29 Representation Unlearning: Forgetting through Information Compression Antonio Almudévar et.al. 2601.21564 null
2026-01-29 On the Adversarial Robustness of Large Vision-Language Models under Visual Token Compression Xinwei Zhang et.al. 2601.21531 null
2026-01-29 Adaptive Confidence Gating in Multi-Agent Collaboration for Efficient and Optimized Code Generation Haoji Zhang et.al. 2601.21469 null
2026-01-29 ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation Zihao Huang et.al. 2601.21420 null
2026-01-29 Rethinking Federated Graph Foundation Models: A Graph-Language Alignment-based Approach Yinlin Zhu et.al. 2601.21369 null
2026-01-29 Small models, big threats: Characterizing safety challenges from low-compute AI models Prateek Puri et.al. 2601.21365 null
2026-01-29 L2R: Low-Rank and Lipschitz-Controlled Routing for Mixture-of-Experts Minghao Yang et.al. 2601.21349 null
2026-01-29 Semantic-Guided Dynamic Sparsification for Pre-Trained Model-based Class-Incremental Learning Ruiqi Liu et.al. 2601.21345 null
2026-01-29 A Time-Domain Dual-Edge Asynchronous Pipelined SAR ADC Featuring Reset-Free Quantization at Multi-GS/s Richard Zeng et.al. 2601.21308 null
2026-01-29 Mam-App: A Novel Parameter-Efficient Mamba Model for Apple Leaf Disease Classification Md Nadim Mahamood et.al. 2601.21307 null
2026-01-29 Grounding and Enhancing Informativeness and Utility in Dataset Distillation Shaobo Wang et.al. 2601.21296 null
2026-01-29 Drive-KD: Multi-Teacher Distillation for VLMs in Autonomous Driving Weitong Lian et.al. 2601.21288 null
2026-01-29 An efficient implicit scheme for the multimaterial Euler equations in Lagrangian coordinates Simone Chiocchetti et.al. 2601.21241 null
2026-01-29 PTQ4ARVG: Post-Training Quantization for AutoRegressive Visual Generation Models Xuewen Liu et.al. 2601.21238 null
2026-01-29 Soft Quantization: Model Compression Via Weight Coupling Daniel T. Bernstein et.al. 2601.21219 null
2026-01-29 Temporal Context and Architecture: A Benchmark for Naturalistic EEG Decoding Mehmet Ergezer et.al. 2601.21215 null
2026-01-29 ZipMoE: Efficient On-Device MoE Serving via Lossless Compression and Cache-Affinity Scheduling Yuchen Yang et.al. 2601.21198 null
2026-01-29 Generative Recall, Dense Reranking: Learning Multi-View Semantic IDs for Efficient Text-to-Video Retrieval Zecheng Zhao et.al. 2601.21193 null
2026-01-28 ChunkWise LoRA: Adaptive Sequence Partitioning for Memory-Efficient Low-Rank Adaptation and Accelerated LLM Inference Ketan Thakkar et.al. 2601.21109 null
2026-01-28 CompSRT: Quantization and Pruning for Image Super Resolution Transformers Dorsa Zeinali et.al. 2601.21069 null
2026-01-28 PatchFormer: A Patch-Based Time Series Foundation Model with Hierarchical Masked Reconstruction and Cross-Domain Transfer Learning for Zero-Shot Multi-Horizon Forecasting Olaf Yunus Laitinen Imanov et.al. 2601.20845 null
2026-01-28 MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents Vishnu Sashank Dorbala et.al. 2601.20831 null
2026-01-28 REASON: Accelerating Probabilistic Logical Reasoning for Scalable Neuro-Symbolic Intelligence Zishen Wan et.al. 2601.20784 null
2026-01-28 Leveraging Second-Order Curvature for Efficient Learned Image Compression: Theory and Empirical Evidence Yichi Zhang et.al. 2601.20769 null
2026-01-28 HESTIA: A Hessian-Guided Differentiable Quantization-Aware Training Framework for Extremely Low-Bit LLMs Guoan Wang et.al. 2601.20745 null
2026-01-28 One Step Is Enough: Dispersive MeanFlow Policy Optimization Guowei Zou et.al. 2601.20701 null
2026-01-28 When Vision Meets Texts in Listwise Reranking Hongyi Cai et.al. 2601.20623 null
2026-01-28 DiffVC-RT: Towards Practical Real-Time Diffusion-based Perceptual Neural Video Compression Wenzhuo Ma et.al. 2601.20564 null
2026-01-28 Weaker quantization dimension results for self-similar measures Saurabh Verma et.al. 2601.20531 null
2026-01-28 IOTA: Corrective Knowledge-Guided Prompt Learning via Black-White Box Framework Shaokun Wang et.al. 2601.20526 null
2026-01-28 AnomalyVFM – Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors Matic Fučka et.al. 2601.20524 null
2026-01-28 CtrlCoT: Dual-Granularity Chain-of-Thought Compression for Controllable Reasoning Zhenxuan Fan et.al. 2601.20467 null
2026-01-28 RepSFNet: A Single Fusion Network with Structural Reparameterization for Crowd Counting Mas Nurul Achmadiah et.al. 2601.20369 null
2026-01-28 PalmBridge: A Plug-and-Play Feature Alignment Framework for Open-Set Palmprint Verification Chenke Zhang et.al. 2601.20351 null
2026-01-28 Improving Diffusion Language Model Decoding through Joint Search in Generation Order and Token Space Yangyi Shen et.al. 2601.20339 null
2026-01-28 Window-Diffusion: Accelerating Diffusion Language Model Inference with Windowed Token Pruning and Caching Fengrui Zuo et.al. 2601.20332 null
2026-01-28 VersaQ-3D: A Reconfigurable Accelerator Enabling Feed-Forward and Generalizable 3D Reconstruction via Versatile Quantization Yipu Zhang et.al. 2601.20317 null
2026-01-28 Towards Compact and Robust DNNs via Compression-aware Sharpness Minimization Jialuo He et.al. 2601.20301 null
2026-01-28 MiLorE-SSL: Scaling Multilingual Capabilities in Self-Supervised Models without Forgetting Jing Xu et.al. 2601.20300 null
2026-01-28 Quantum Cosmology as a Hydrogen atom: Discrete $Λ$ and cyclic Universes from Wheeler-DeWitt quantization Dipayan Mukherjee et.al. 2601.20286 null
2026-01-28 SATA: Sparsity-Aware Scheduling for Selective Token Attention Zhenkun Fan et.al. 2601.20267 null
2026-01-28 Shallow-π: Knowledge Distillation for Flow-based VLAs Boseong Jeon et.al. 2601.20262 null
2026-01-28 Certificate-Guided Pruning for Stochastic Lipschitz Optimization Ibne Farabi Shihab et.al. 2601.20231 null
2026-01-28 MERGE: Next-Generation Item Indexing Paradigm for Large-Scale Streaming Recommendation Jing Yan et.al. 2601.20199 null
2026-01-28 Efficient Token Pruning for LLaDA-V Zhewen Wan et.al. 2601.20168 null
2026-01-27 Look in the Middle: Structural Anchor Pruning for Scalable Visual RAG Indexing Zhuchenyang Liu et.al. 2601.20107 null
2026-01-27 Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery Meng Xin et.al. 2601.20088 null
2026-01-27 E2HiL: Entropy-Guided Sample Selection for Efficient Real-World Human-in-the-Loop Reinforcement Learning Haoyuan Deng et.al. 2601.19969 null
2026-01-27 Melvin–Bonnor and Bertotti–Robinson spacetimes with Baryonic charge José Barrientos et.al. 2601.19858 null
2026-01-27 A Latent Space Framework for Modeling Transient Engine Emissions Using Joint Embedding Predictive Architectures Ganesh Sundaram et.al. 2601.19822 null
2026-01-27 Component-Aware Pruning Framework for Neural Network Controllers via Gradient-Based Importance Estimation Ganesh Sundaram et.al. 2601.19794 null
2026-01-27 Interpretable and backpropagation-free Green Learning for efficient multi-task echocardiographic segmentation and classification Jyun-Ping Kao et.al. 2601.19743 null
2026-01-27 LoPRo: Enhancing Low-Rank Quantization via Permuted Block-Wise Rotation Hongyaoxing Gu et.al. 2601.19675 null
2026-01-27 AC^2-VLA: Action-Context-Aware Adaptive Computation in Vision-Language-Action Models for Efficient Robotic Manipulation Wenda Yu et.al. 2601.19634 null
2026-01-27 GradPruner: Gradient-Guided Layer Pruning Enabling Efficient Fine-Tuning and Inference for LLMs Wei Huang et.al. 2601.19503 null
2026-01-27 StableQAT: Stable Quantization-Aware Training at Ultra-Low Bitwidths Tianyi Chen et.al. 2601.19320 null
2026-01-27 Reinforced Rate Control for Neural Video Compression via Inter-Frame Rate-Distortion Awareness Wuyang Cong et.al. 2601.19293 null
2026-01-27 DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference Fuliang Liu et.al. 2601.19278 null
2026-01-27 M$^2$XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization Weiming Hu et.al. 2601.19213 null
2026-01-27 Optimized $k$-means color quantization of digital images in machine-based and human perception-based colorspaces Ranjan Maitra et.al. 2601.19117 null
2026-01-27 EPAS: Efficient Training with Progressive Activation Sharing Rezaul Karim et.al. 2601.19089 null
2026-01-26 Is Finer Better? The Limits of Microscaling Formats in Large Language Models Andrea Fasoli et.al. 2601.19026 null
2026-01-26 EVEREST: An Evidential, Tail-Aware Transformer for Rare-Event Time-Series Forecasting Antanas Zilinskas et.al. 2601.19022 null
2026-01-26 FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning Haozheng Luo et.al. 2601.19001 null
2026-01-26 How Is Uncertainty Propagated in Knowledge Distillation? Ziyao Cui et.al. 2601.18909 null
2026-01-26 XProvence: Zero-Cost Multilingual Context Pruning for Retrieval-Augmented Generation Youssef Mohamed et.al. 2601.18886 null
2026-01-26 Low-Bit Quantization of Bandlimited Graph Signals via Iterative Methods Felix Krahmer et.al. 2601.18782 null
2026-01-26 Goal-oriented Communication for Fast and Robust Robotic Fault Detection and Recovery Shutong Chen et.al. 2601.18765 null
2026-01-26 Efficient Trotter-Suzuki Schemes for Long-time Quantum Dynamics Marko Maležič et.al. 2601.18756 null
2026-01-26 Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models Siyan Zhao et.al. 2601.18734 null
2026-01-26 AI-enabled Satellite Edge Computing: A Single-Pixel Feature based Shallow Classification Model for Hyperspectral Imaging Li Fang et.al. 2601.18560 null
2026-01-26 XFit: Global Optimization and Degeneracy Mapping in X-ray Spectral Modeling Austin MacMaster et.al. 2601.18542 null
2026-01-26 Hybrid Radar Fusion with Quantization: CRB-Rate Trade-offs and ADC Dynamic Range Akhileswar Chowdary et.al. 2601.18539 null
2026-01-26 DisasterInsight: A Multimodal Benchmark for Function-Aware and Grounded Disaster Assessment Sara Tehrani et.al. 2601.18493 null
2026-01-26 DV-VLN: Dual Verification for Reliable LLM-Based Vision-and-Language Navigation Zijun Li et.al. 2601.18492 null
2026-01-27 An Adaptive Purification Controller for Quantum Networks: Dynamic Protocol Selection and Multipartite Distillation Pranav Kulkarni et.al. 2601.18351 null
2026-01-26 Orchestrating Specialized Agents for Trustworthy Enterprise RAG Xincheng You et.al. 2601.18267 null
2026-01-26 Facial Emotion Recognition on FER-2013 using an EfficientNetB2-Based Approach Sahil Naik et.al. 2601.18228 null
2026-01-26 Multi-Perspective Subimage CLIP with Keyword Guidance for Remote Sensing Image-Text Retrieval Yifan Li et.al. 2601.18190 null
2026-01-27 Quantum Recurrent Unit: A Parameter-Efficient Quantum Neural Network Architecture for NISQ Devices Tzong-Daw Wu et.al. 2601.18164 null
2026-01-26 From LLMs to LRMs: Rethinking Pruning for Reasoning-Centric Models Longwei Ding et.al. 2601.18091 null
2026-01-25 Systematic Characterization of Minimal Deep Learning Architectures: A Unified Analysis of Convergence, Pruning, and Quantization Ziwei Zheng et.al. 2601.17987 null
2026-01-25 SD-E$^2$: Semantic Exploration for Reasoning Under Token Budgets Kshitij Mishra et.al. 2601.17982 null
2026-01-25 From Specialist to Generalist: Unlocking SAM’s Learning Potential on Unlabeled Medical Images Vi Vu et.al. 2601.17934 null
2026-01-25 RemEdit: Efficient Diffusion Editing with Riemannian Geometry Eashan Adhikarla et.al. 2601.17927 null
2026-01-25 ShapLoRA: Allocation of Low-rank Adaption on Large Language Models via Shapley Value Inspired Importance Estimation Yi Zhao et.al. 2601.17921 null
2026-01-25 Streaming-dLLM: Accelerating Diffusion LLMs via Suffix Pruning and Dynamic Decoding Zhongyu Xiao et.al. 2601.17917 null
2026-01-25 Adaptive Weighting in Knowledge Distillation: An Axiomatic Framework for Multi-Scale Teacher Ensemble Optimization Aaron R. Flouro et.al. 2601.17910 null
2026-01-25 Assessment of Generative Named Entity Recognition in the Era of Large Language Models Qi Zhan et.al. 2601.17898 null
2026-01-25 VidLaDA: Bidirectional Diffusion Large Language Models for Efficient Video Understanding Zhihao He et.al. 2601.17868 null
2026-01-25 ViTCoP: Accelerating Large Vision-Language Models via Visual and Textual Semantic Collaborative Pruning Wen Luo et.al. 2601.17818 null
2026-01-25 Residual neural-field ptychography for dose-efficient electron, X-ray, and optical nanoscopy Qianhao Zhao et.al. 2601.17694 null
2026-01-24 BrainDistill: Implantable Motor Decoding with Task-Specific Knowledge Distillation Yuhan Xie et.al. 2601.17625 null
2026-01-24 Split-on-Share: Mixture of Sparse Experts for Task-Agnostic Continual Learning Fatema Siddika et.al. 2601.17616 null
2026-01-24 Travelling Waves in Wolbachia Spread Dynamics Zhuolin Qu et.al. 2601.17590 null
2026-01-24 Saliency Driven Imagery Preprocessing for Efficient Compression – Industrial Paper Justin Downes et.al. 2601.17555 null
2026-01-24 Reconstructing Training Data from Adapter-based Federated Large Language Models Silong Chen et.al. 2601.17533 null
2026-01-24 Less is More for RAG: Information Gain Pruning for Generator-Aligned Reranking and Evidence Selection Zhipeng Song et.al. 2601.17532 null
2026-01-24 Efficient Dilated Squeeze and Excitation Neural Operator for Differential Equations Prajwal Chauhan et.al. 2601.17407 null
2026-01-24 SMV-EAR: Bring Spatiotemporal Multi-View Representation Learning into Efficient Event-Based Action Recognition Rui Fan et.al. 2601.17391 null
2026-01-24 Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws Mohammed Fasha et.al. 2601.17364 null
2026-01-24 Spectral Geometry for Deep Learning: Compression and Hallucination Detection via Random Matrix Theory Davide Ettori et.al. 2601.17357 null
2026-01-24 Dynamic Meta-Ensemble Framework for Efficient and Accurate Deep Learning in Plant Leaf Disease Detection on Resource-Constrained Edge Devices Weloday Fikadu Moges et.al. 2601.17290 null
2026-01-24 Latent-Space Contrastive Reinforcement Learning for Stable and Efficient LLM Reasoning Lianlei Shan et.al. 2601.17275 null
2026-01-23 JetFormer: A Scalable and Efficient Transformer for Jet Tagging from Offline Analysis to FPGA Triggers Ruoqing Zheng et.al. 2601.17215 null
2026-01-23 AstroTimer: Rethinking Non-Access Stratum Timers in LEO Constellations Arshiya Rezaie Hezaveh et.al. 2601.17195 null
2026-01-23 High-Rate Quantized Matrix Multiplication: Theory and Practice Or Ordentlich et.al. 2601.17187 null
2026-01-23 Constrained Symplectic Quantization I: the Quantum Harmonic Oscillator Martina Giachello et.al. 2601.16963 null
2026-01-23 Is BatchEnsemble a Single Model? On Calibration and Diversity of Efficient Ensembles Anton Zamyatin et.al. 2601.16936 null
2026-01-23 Evaluating Large Vision-language Models for Surgical Tool Detection Nakul Poudel et.al. 2601.16895 null
2026-01-23 PocketDVDNet: Realtime Video Denoising for Real Camera Noise Crispian Morris et.al. 2601.16780 null
2026-01-23 SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents Yuhang Wang et.al. 2601.16746 null
2026-01-23 Dirac-Bergmann algorithm and canonical quantization of $k$-essence cosmology Andrés Lueiza et.al. 2601.16703 null
2026-01-23 Fast, faithful and photorealistic diffusion-based image super-resolution with enhanced Flow Map models Maxence Noble et.al. 2601.16660 null
2026-01-23 Typologically Informed Parameter Aggregation Stef Accou et.al. 2601.16629 null
2026-01-23 AuroraEdge-V-2B: A Faster And Stronger Edge Visual Large Language Model Xiang Chen et.al. 2601.16615 null
2026-01-23 Spiking Neural Networks for Communication Systems: Encoding Schemes, Learning Algorithms, and Equalization Techniques Eike-Manuel Edelmann et.al. 2601.16550 null
2026-01-23 LLM is Not All You Need: A Systematic Evaluation of ML vs. Foundation Models for text and image based Medical Classification Meet Raval et.al. 2601.16549 null
2026-01-23 W4A16 Mixed-Precision Matrix Multiplication on Decoupled Architecture: Kernel Design and Memory Bottleneck Analysis for Ascend NPUs Yuanhong He et.al. 2601.16536 null
2026-01-23 Indefinite Causal Order from Failure-to-Glue: Contextual Semantics and Parametric Time Partha Ghose et.al. 2601.16494 null
2026-01-23 Log-Likelihood Loss for Semantic Compression Anuj Kumar Yadav et.al. 2601.16461 null
2026-01-22 EdgeSpot: Efficient and High-Performance Few-Shot Model for Keyword Spotting Oguzhan Buyuksolak et.al. 2601.16316 null
2026-01-22 Teaching and Evaluating LLMs to Reason About Polymer Design Related Tasks Dikshya Mohanty et.al. 2601.16312 null
2026-01-22 LiDMaS: Architecture-Level Modeling of Fault-Tolerant Magic-State Injection in GKP Photonic Qubits Dennis Delali Kwesi Wayo et.al. 2601.16244 null
2026-01-22 CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback Wenhang Ge et.al. 2601.16214 null
2026-01-22 PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation Onkar Susladkar et.al. 2601.16210 null
2026-01-22 Domain-Incremental Continual Learning for Robust and Efficient Keyword Spotting in Resource Constrained Systems Prakash Dhungana et.al. 2601.16158 null
2026-01-22 SAMTok: Representing Any Mask with Two Words Yikang Zhou et.al. 2601.16093 null
2026-01-22 DSFedMed: Dual-Scale Federated Medical Image Segmentation via Mutual Distillation Between Foundation and Lightweight Models Hanwen Zhang et.al. 2601.16073 null
2026-01-22 DTP: A Simple yet Effective Distracting Token Pruning Framework for Vision-Language Action Models Chenyang Li et.al. 2601.16065 null
2026-01-22 An Efficient Algorithm to Generate all Labeled Triangle-free Graphs with a given Graphical Degree Sequence Kai Wang et.al. 2601.15943 null
2026-01-22 A Lightweight Brain-Inspired Machine Learning Framework for Coronary Angiography: Hybrid Neural Representation and Robust Learning Strategies Jingsong Xia et.al. 2601.15865 null
2026-01-22 TinySense: Effective CSI Compression for Scalable and Accurate Wi-Fi Sensing Toan Gian et.al. 2601.15838 null
2026-01-22 Improving the efficiency of QAOA using efficient parameter transfer initialization and targeted-single-layer regularized optimization with minimal performance degradation Shubham Patel et.al. 2601.15760 null
2026-01-22 Communication-efficient Federated Graph Classification via Generative Diffusion Modeling Xiuling Wang et.al. 2601.15722 null
2026-01-22 FlexLLM: Composable HLS Library for Flexible Hybrid LLM Accelerator Design Jiahao Zhang et.al. 2601.15710 null
2026-01-22 D-Optimality-Guided Reinforcement Learning for Efficient Open-Loop Calibration of a 3-DOF Ankle Rehabilitation Robot Qifan Hu et.al. 2601.15707 null
2026-01-22 Integrating Knowledge Distillation Methods: A Sequential Multi-Stage Framework Yinxi Tian et.al. 2601.15657 null
2026-01-22 Scaling-Based Quantization of Spacetime Microstructure Weihu Ma et.al. 2601.15649 null
2026-01-21 QUAIL: Quantization Aware Unlearning for Mitigating Misinformation in LLMs Himanshu Mishra et.al. 2601.15538 null
2026-01-21 SAGE-FM: A lightweight and interpretable spatial transcriptomics foundation model Xianghao Zhan et.al. 2601.15504 null
2026-01-21 Memorization Dynamics in Knowledge Distillation for Language Models Jaydeep Borkar et.al. 2601.15394 null
2026-01-21 FedUMM: A General Framework for Federated Learning with Unified Multimodal Models Zhaolong Su et.al. 2601.15390 null
2026-01-21 Towards Understanding Best Practices for Quantization of Vision-Language Models Gautom Das et.al. 2601.15287 null
2026-01-21 Lightweight LLMs for Network Attack Detection in IoT Networks Piyumi Bhagya Sudasinghe et.al. 2601.15269 null
2026-01-21 Metadata Conditioned Large Language Models for Localization Anjishnu Mukherjee et.al. 2601.15236 null
2026-01-21 Overcoming In-Memory Bottlenecks in Graph Foundation Models via Retrieval-Augmented Generation Haonan Yuan et.al. 2601.15124 null
2026-01-21 Parameter-Efficient Multi-Task Fine-Tuning in Code-Related Tasks Md Zahidul Haque et.al. 2601.15094 null
2026-01-21 LoRAP: Low-Rank Aggregation Prompting for Quantized Graph Neural Networks Training Chenyu Liu et.al. 2601.15079 null
2026-01-21 Efficient and Minimax-optimal In-context Nonparametric Regression with Transformers Michelle Ching et.al. 2601.15014 null
2026-01-21 Solution-derived barium titanate waveguides for integrated electro-optic modulation Virginia Falcone et.al. 2601.14938 null
2026-01-21 What Makes Low-Bit Quantization-Aware Training Work for Reasoning LLMs? A Systematic Study Keyu Lv et.al. 2601.14888 null
2026-01-21 POTR: Post-Training 3DGS Compression Bert Ramlot et.al. 2601.14821 null
2026-01-21 Efficient Beamforming for Discrete SIM-Aided Multiuser Systems Under Statistical CSI Yuhui Jiao et.al. 2601.14803 null
2026-01-21 Training-Efficient Text-to-Music Generation with State-Space Modeling Wei-Jaw Lee et.al. 2601.14786 null
2026-01-21 RefProtoFL: Communication-Efficient Federated Learning via External-Referenced Prototype Alignment Hongyue Wu et.al. 2601.14746 null
2026-01-21 PULSE: Socially-Aware User Representation Modeling Toward Parameter-Efficient Graph Collaborative Filtering Doyun Choi et.al. 2601.14720 null
2026-01-21 Triage knowledge distillation for speaker verification Ju-ho Kim et.al. 2601.14699 null
2026-01-21 Maximum Edge-based Quasi-Clique: Novel Iterative Frameworks Hongbo Xia et.al. 2601.14619 null
2026-01-21 IntelliSA: An Intelligent Static Analyzer for IaC Security Smell Detection Using Symbolic Rules and Neural Inference Qiyue Mei et.al. 2601.14595 null
2026-01-21 Breaking the accuracy-resource dilemma: a lightweight adaptive video inference enhancement Wei Ma et.al. 2601.14568 null
2026-01-21 QMC: Efficient SLM Edge Inference via Outlier-Aware Quantization and Emergent Memories Co-Design Nilesh Prasad Pandey et.al. 2601.14549 null
2026-01-22 Structured Image-based Coding for Efficient Gaussian Splatting Compression Pedro Martin et.al. 2601.14510 null
2026-01-20 Neutrino production mechanisms in strongly magnetized quark matter: Current status and open questions Igor A. Shovkovy et.al. 2601.14450 null
2026-01-20 Layer-adaptive Expert Pruning for Pre-Training of Mixture-of-Experts Large Language Models YuanLab.ai et.al. 2601.14327 null
2026-01-20 LRC-DHVC: Towards Local Rate Control in Neural Video Compression Marc Windsheimer et.al. 2601.14240 null
2026-01-20 Domain-Adaptation through Synthetic Data: Fine-Tuning Large Language Models for German Law Ali Hamza Bashir et.al. 2601.14160 null
2026-01-20 LLMOrbit: A Circular Taxonomy of Large Language Models - From Scaling Walls to Agentic AI Systems Badri N. Patro et.al. 2601.14053 null
2026-01-20 Kakugo: Distillation of Low-Resource Languages into Small Language Models Peter Devine et.al. 2601.14051 null
2026-01-20 Differentiable Logic Synthesis: Spectral Coefficient Selection via Sinkhorn-Constrained Composition Gorgi Pavlov et.al. 2601.13953 null
2026-01-21 Chain-of-Thought Compression Should Not Be Blind: V-Skip for Efficient Multimodal Reasoning via Dual-Path Anchoring Dongxu Zhang et.al. 2601.13879 null
2026-01-20 An efficient treatment of heat-flux boundary conditions in GSIS for rarefied gas flows Yanbing Zhang et.al. 2601.13870 null
2026-01-20 MirageNet: A Secure, Efficient, and Scalable On-Device Model Protection in Heterogeneous TEE and GPU System Huadi Zheng et.al. 2601.13826 null
2026-01-20 Three-dimensional properties of a coronal shock and the longitudinal distribution of its related solar energetic particles Yue Zhou et.al. 2601.13692 null
2026-01-20 Ultra-Lightweight Network for Ship-Radiated Sound Classification on Embedded Deployment Sangwon Park et.al. 2601.13679 null
2026-01-20 Direct Finite-Time Contraction (Step-Log) Profiling–Driven Optimization of Parallel Schemes for Nonlinear Problems on Multicore Architectures Mudassir Shams et.al. 2601.13637 null
2026-01-20 A Kubernetes custom scheduler based on reinforcement learning for compute-intensive pods Hanlin Zhou et.al. 2601.13579 null
2026-01-21 ButterflyMoE: Sub-Linear Ternary Experts via Structured Butterfly Orbits Aryan Karmore et.al. 2601.13563 null
2026-01-20 DIS2: Disentanglement Meets Distillation with Classwise Attention for Robust Remote Sensing Segmentation under Missing Modalities Nhi Kieu et.al. 2601.13502 null
2026-01-19 Quantum Circuit Pruning: Improving Fidelity via Compilation-Aware Circuit Approximation Pau Escofet et.al. 2601.13322 null
2026-01-19 Verifying Local Robustness of Pruned Safety-Critical Networks Minh Le et.al. 2601.13303 null
2026-01-19 An efficient model of cosmology dependence in the covariance matrix of the matter power spectrum Theodore Steele et.al. 2601.13245 null
2026-01-19 Co-Channel Interference Mitigation Using Deep Learning for Drone-Based Large-Scale Antenna Measurements Kadyrzhan Tortayev et.al. 2601.13205 null
2026-01-19 Onsager’s Mean Field Theory of Vortex Flows with Singular Sources: Blow-Up and Concentration without Quantization Daniele Bartolucci et.al. 2601.13192 null
2026-01-19 Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference Zimeng Wu et.al. 2601.13155 null
2026-01-19 Recursive Meta-Distillation: An Axiomatic Framework for Iterative Knowledge Refinement Aaron R. Flouro et.al. 2601.13100 null
2026-01-19 PaperGuide: Making Small Language-Model Paper-Reading Agents More Efficient Zijian Wang et.al. 2601.12988 null
2026-01-19 Sparse ActionGen: Accelerating Diffusion Policy with Real-time Pruning Kangye Ji et.al. 2601.12894 null
2026-01-19 SCULPT: Constraint-Guided Pruned MCTS that Carves Efficient Paths for Mathematical Reasoning Qitong Fang et.al. 2601.12842 null
2026-01-19 CSGaussian: Progressive Rate-Distortion Compression and Segmentation for 3D Gaussian Splatting Yu-Jen Tseng et.al. 2601.12814 null
2026-01-19 Distilling Time Series Foundation Models for Efficient Forecasting Yuqi Li et.al. 2601.12785 null
2026-01-19 CodeSep: Low-Bitrate Codec-Driven Speech Separation with Base-Token Disentanglement and Auxiliary-Token Serial Prediction Hui-Peng Du et.al. 2601.12757 null
2026-01-19 P2L-CA: An Effective Parameter Tuning Framework for Rehearsal-Free Multi-Label Class-Incremental Learning Songlin Dong et.al. 2601.12714 null
2026-01-19 BlocksecRT-DETR: Decentralized Privacy-Preserving and Token-Efficient Federated Transformer Learning for Secure Real-Time Object Detection in ITS Mohoshin Ara Tahera et.al. 2601.12693 null
2026-01-19 Mixed Precision PointPillars for Efficient 3D Object Detection with TensorRT Ninnart Fuengfusin et.al. 2601.12638 null
2026-01-18 Mixtenna: A Self-Biased Nonlinear Patch Antenna for Passive Third-Harmonic Radiation Yishai Brill et.al. 2601.12462 null
2026-01-18 LiQSS: Post-Transformer Linear Quantum-Inspired State-Space Tensor Networks for Real-Time 6G Farhad Rezazadeh et.al. 2601.12375 null
2026-01-18 Efficient classical simulation of time dynamics in Fermi-Hubbard models with imaginary interactions Raul A. Santos et.al. 2601.12368 null
2026-01-18 FlowIID: Single-Step Intrinsic Image Decomposition via Latent Flow Matching Mithlesh Singla et.al. 2601.12329 null
2026-01-18 Adaptive Multi-Scale Correlation Meta-Network for Few-Shot Remote Sensing Image Classification Anurag Kaushish et.al. 2601.12308 null
2026-01-18 AgenticPruner: MAC-Constrained Neural Network Compression via LLM-Driven Strategy Search Shahrzad Esmat et.al. 2601.12272 null
2026-01-16 MHA2MLA-VLM: Enabling DeepSeek’s Economical Multi-Head Latent Attention across Vision-Language Models Xiaoran Fan et.al. 2601.11464 null
2026-01-16 IMS: Intelligent Hardware Monitoring System for Secure SoCs Wadid Foudhaili et.al. 2601.11447 null
2026-01-16 FEATHer: Fourier-Efficient Adaptive Temporal Hierarchy Forecaster for Time-Series Forecasting Jaehoon Lee et.al. 2601.11350 null
2026-01-16 X-Distill: Cross-Architecture Vision Distillation for Visuomotor Learning Maanping Shao et.al. 2601.11269 null
2026-01-16 Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation Pingzhi Tang et.al. 2601.11258 null
2026-01-16 Language-Agnostic Visual Embeddings for Cross-Script Handwriting Retrieval Fangke Chen et.al. 2601.11248 null
2026-01-16 SDFLoRA: Selective Dual-Module LoRA for Federated Fine-tuning with Heterogeneous Clients Zhikang Shen et.al. 2601.11219 null
2026-01-16 FAQ: Mitigating Quantization Error via Regenerating Calibration Data with Family-Aware Quantization Haiyang Xiao et.al. 2601.11200 null
2026-01-16 Democratizing planetary-scale analysis: An ultra-lightweight Earth embedding database for accurate and flexible global land monitoring Shuang Chen et.al. 2601.11183 null
2026-01-16 PruneRAG: Confidence-Guided Query Decomposition Trees for Efficient Retrieval-Augmented Generation Shuguang Jiao et.al. 2601.11024 null
2026-01-15 EncodeRec: An Embedding Backbone for Recommendation Systems Guy Hadad et.al. 2601.10837 null
2026-01-15 Mugi: Value Level Parallelism For Efficient LLMs Daniel Price et.al. 2601.10823 null
2026-01-15 Towards Tensor Network Models for Low-Latency Jet Tagging on FPGAs Alberto Coppi et.al. 2601.10801 null
2026-01-15 Astrometric microlensing probes of the isolated neutron star population with Roman Zofia Kaczmarek et.al. 2601.10789 null
2026-01-14 Pruning as Evolution: Emergent Sparsity Through Selection Dynamics in Neural Networks Zubair Shah et.al. 2601.10765 null
2026-01-15 From One-to-One to Many-to-Many: Dynamic Cross-Layer Injection for Deep Vision-Language Fusion Cheng Chen et.al. 2601.10710 null
2026-01-15 Communication-Efficient and Privacy-Adaptable Mechanism – a Federated Learning Scheme with Convergence Analysis Chun Hei Michael Shiu et.al. 2601.10701 null
2026-01-15 PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution Minghao Yan et.al. 2601.10657 null
2026-01-15 Representation-Aware Unlearning via Activation Signatures: From Suppression to Knowledge-Signature Erasure Syed Naveed Mahmood et.al. 2601.10566 null
2026-01-15 TF3-RO-50M: Training Compact Romanian Language Models from Scratch on Synthetic Moral Microfiction Mihai Dan Nadas et.al. 2601.10410 null
2026-01-15 coTherapist: A Behavior-Aligned Small Language Model to Support Mental Healthcare Experts Prottay Kumar Adhikary et.al. 2601.10246 null
2026-01-15 LOOKAT: Lookup-Optimized Key-Attention for Memory-Efficient Transformers Aryan Karmore et.al. 2601.10155 null
2026-01-15 Privacy Enhanced PEFT: Tensor Train Decomposition Improves Privacy Utility Tradeoffs under DP-SGD Pradip Kunwar et.al. 2601.10045 null
2026-01-15 Instruction Finetuning LLaMA-3-8B Model Using LoRA for Financial Named Entity Recognition Zhiming Lian et.al. 2601.10043 null
2026-01-15 Resistive Memory based Efficient Machine Unlearning and Continual Learning Ning Lin et.al. 2601.10037 null
2026-01-15 FaTRQ: Tiered Residual Quantization for LLM Vector Search in Far-Memory-Aware ANNS Systems Tianqi Zhang et.al. 2601.09985 null
2026-01-14 Advancing Model Refinement: Muon-Optimized Distillation and Quantization for LLM Deployment Jacob Sander et.al. 2601.09865 null
2026-01-16 NanoSD: Edge Efficient Foundation Model for Real Time Image Restoration Subhajit Sanyal et.al. 2601.09823 null
2026-01-14 QFed: Parameter-Compact Quantum-Classical Federated Learning Samar Abdelghani et.al. 2601.09809 null
2026-01-14 ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation Sicong Liu et.al. 2601.09703 null
2026-01-14 COMPOSE: Hypergraph Cover Optimization for Multi-view 3D Human Pose Estimation Tony Danjun Wang et.al. 2601.09698 null
2026-01-14 LLMs can Compress LLMs: Adaptive Pruning by Agents Sai Varun Kodathala et.al. 2601.09694 null
2026-01-14 Disentangling Task Conflicts in Multi-Task LoRA via Orthogonal Gradient Projection Ziyu Yang et.al. 2601.09684 null
2026-01-14 Quantization Commutes with Reduction of Chern-Simons Gauge Theory Geyang Dai et.al. 2601.09666 null
2026-01-14 Exploring Fine-Tuning for Tabular Foundation Models Aditya Tanna et.al. 2601.09654 null
2026-01-14 Benchmarking Post-Training Quantization of Large Language Models under Microscaling Floating Point Formats Manyi Zhang et.al. 2601.09555 null
2026-01-14 Strange quark star I: the maximum gravitational mass and deformation of magnetized spinning model Fatemeh Kayanikhoo et.al. 2601.09529 null
2026-01-14 CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion Ralf Römer et.al. 2601.09512 null
2026-01-14 Unifying Search and Recommendation in LLMs via Gradient Multi-Subspace Tuning Jujia Zhao et.al. 2601.09496 null
2026-01-14 How many users have been here for a long time? Efficient solutions for counting long aggregated visits Peyman Afshani et.al. 2601.09489 null
2026-01-14 Analysis of the Maximum Prediction Gain of Short-Term Prediction on Sustained Speech Reemt Hinrichs et.al. 2601.09461 null
2026-01-14 GeoRA: Geometry-Aware Low-Rank Adaptation for RLVR Jiaying Zhang et.al. 2601.09361 null
2026-01-14 Spectral Complex Autoencoder Pruning: A Fidelity-Guided Criterion for Extreme Structured Channel Compression Wei Liu et.al. 2601.09352 null
2026-01-14 Arbitrary fractional quantization in Dirac systems Christos Papapanos et.al. 2601.09331 null
2026-01-14 On-Device Large Language Models for Sequential Recommendation Xin Xia et.al. 2601.09306 null
2026-01-14 TIDI-GS: Floater Suppression in 3D Gaussian Splatting for Enhanced Indoor Scene Fidelity Sooyeun Yang et.al. 2601.09291 null
2026-01-14 RISER: Orchestrating Latent Reasoning Skills for Adaptive Activation Steering Wencheng Ye et.al. 2601.09269 null
2026-01-14 A Theoretical Framework for Rate-Distortion Limits in Learned Image Compression Changshuo Wang et.al. 2601.09254 null
2026-01-14 Integrating Diverse Assignment Strategies into DETRs Yiwei Zhang et.al. 2601.09247 null
2026-01-14 CLIDD: Cross-Layer Independent Deformable Description for Efficient and Discriminative Local Feature Representation Haodi Yao et.al. 2601.09230 null
2026-01-14 Pairing-free Group-level Knowledge Distillation for Robust Gastrointestinal Lesion Classification in White-Light Endoscopy Qiang Hu et.al. 2601.09209 null
2026-01-14 From Performance to Practice: Knowledge-Distilled Segmentator for On-Premises Clinical Workflows Qizhen Lan et.al. 2601.09191 null
2026-01-14 OrthoGeoLoRA: Geometric Parameter-Efficient Fine-Tuning for Structured Social Science Concept Retrieval on the Web Zeqiang Wang et.al. 2601.09185 null
2026-01-14 $D^2Prune$: Sparsifying Large Language Models via Dual Taylor Expansion and Attention Distribution Awareness Lang Xiong et.al. 2601.09176 null
2026-01-14 N-EIoU-YOLOv9: A Signal-Aware Bounding Box Regression Loss for Lightweight Mobile Detection of Rice Leaf Diseases Dung Ta Nguyen Duc et.al. 2601.09170 null
2026-01-14 Multi-Teacher Ensemble Distillation: A Mathematical Framework for Probability-Domain Knowledge Aggregation Aaron R. Flouro et.al. 2601.09165 null
2026-01-14 SkinFlow: Efficient Information Transmission for Open Dermatological Diagnosis via Dynamic Visual Encoding and Staged RL Lijun Liu et.al. 2601.09136 null
2026-01-14 LPCAN: Lightweight Pyramid Cross-Attention Network for Rail Surface Defect Detection Using RGB-D Data Jackie Alex et.al. 2601.09118 null
2026-01-14 LP-LLM: End-to-End Real-World Degraded License Plate Text Recognition via Large Multimodal Models Haoyan Gong et.al. 2601.09116 null
2026-01-14 Hidden States as Early Signals: Step-level Trace Evaluation and Pruning for Efficient Test-Time Scaling Zhixiang Liang et.al. 2601.09093 null
2026-01-14 Efficient Multilingual Dialogue Processing via Translation Pipelines and Distilled Language Models Santiago Martínez Novoa et.al. 2601.09059 null
2026-01-13 Semiparametric Efficient Data Integration Using the Dual-Frame Sampling Framework Kosuke Morikawa et.al. 2601.08707 null
2026-01-13 Efficient Parameter Calibration of Numerical Weather Prediction Models via Evolutionary Sequential Transfer Optimization Heping Fang et.al. 2601.08663 null
2026-01-13 SfMamba: Efficient Source-Free Domain Adaptation via Selective Scan Modeling Xi Chen et.al. 2601.08608 null
2026-01-13 Bridging Theory and Experiment in Virtually Imaged Phased Array (VIPA) Spectrometers Kiumars Aryana et.al. 2601.08589 null
2026-01-13 Ministral 3 Alexander H. Liu et.al. 2601.08584 null
2026-01-13 JudgeRLVR: Judge First, Generate Second for Efficient Reasoning Jiangshan Duo et.al. 2601.08468 null
2026-01-13 Hybrid Distillation with CoT Guidance for Edge-Drone Control Code Generation Yizhan Feng et.al. 2601.08412 null
2026-01-13 An Efficient Algorithm to Sample Quantum Low-Density Parity-Check Codes Paolo Santini et.al. 2601.08387 null
2026-01-13 RotCurves: A PYTHON package for efficient modelling and fitting of galactic rotation curves at high-z A. Nestor Shachar et.al. 2601.08348 null
2026-01-13 ReCo-KD: Region- and Context-Aware Knowledge Distillation for Efficient 3D Medical Image Segmentation Qizhen Lan et.al. 2601.08301 null
2026-01-13 Variable-Length Wideband CSI Feedback via Loewner Interpolation and Deep Learning Meilin Li et.al. 2601.08300 null
2026-01-13 Human-inspired Global-to-Parallel Multi-scale Encoding for Lightweight Vision Models Wei Xu et.al. 2601.08190 null
2026-01-13 Relational Knowledge Distillation Using Fine-tuned Function Vectors Andrea Kang et.al. 2601.08169 null
2026-01-13 Q-realign: Piggybacking Realignment on Quantization for Safe and Efficient LLM Deployment Qitao Tan et.al. 2601.08089 null
2026-01-12 LUT-Compiled Kolmogorov-Arnold Networks for Lightweight DoS Detection on IoT Edge Devices Oleksandr Kuznetsov et.al. 2601.08044 null
2026-01-12 InfGraND: An Influence-Guided GNN-to-MLP Knowledge Distillation Amir Eskandari et.al. 2601.08033 null
2026-01-12 DYCP: Dynamic Context Pruning for Long-Form Dialogue with LLMs Nayoung Choi et.al. 2601.07994 null
2026-01-12 LWMSCNN-SE: A Lightweight Multi-Scale Network for Efficient Maize Disease Classification on Edge Devices Fikadu Weloday et.al. 2601.07957 null
2026-01-12 Sherry: Hardware-Efficient 1.25-Bit Ternary Quantization via Fine-grained Sparsification Hong Huang et.al. 2601.07892 null
2026-01-12 KVzap: Fast, Adaptive, and Faithful KV Cache Pruning Simon Jegou et.al. 2601.07891 null
2026-01-12 Vision-Language Model for Accurate Crater Detection Patrick Bauer et.al. 2601.07795 null
2026-01-12 Benchmarking Small Language Models and Small Reasoning Language Models on System Log Severity Classification Yahya Masri et.al. 2601.07790 null
2026-01-13 Free-RBF-KAN: Kolmogorov-Arnold Networks with Adaptive Radial Basis Functions for Efficient Function Learning Shao-Ting Chiu et.al. 2601.07760 null
2026-01-12 Tab-TRM: Tiny Recursive Model for Insurance Pricing on Tabular Data Kishan Padayachy et.al. 2601.07675 null
2026-01-12 Adaptive Layer Selection for Layer-Wise Token Pruning in LLM Inference Rei Taniguchi et.al. 2601.07667 null
2026-01-12 Quantization-scheme-Independent Energy and Its Implications for Holographic Bounds Ze Li et.al. 2601.07607 null
2026-01-12 Vector Quantized-Aided XL-MIMO CSI Feedback with Channel Adaptive Transmission Yuhang Ma et.al. 2601.07584 null
2026-01-12 Backpropagation-Free Test-Time Adaptation for Lightweight EEG-Based Brain-Computer Interfaces Siyang Li et.al. 2601.07556 null
2026-01-12 High-Rank Structured Modulation for Parameter-Efficient Fine-Tuning Yongkang Liu et.al. 2601.07507 null
2026-01-12 ARCQuant: Boosting NVFP4 Quantization with Augmented Residual Channels for LLMs Haoqian Meng et.al. 2601.07475 null
2026-01-12 Knowledge Distillation for LLM-Based Human Activity Recognition in Homes Julien Cumin et.al. 2601.07469 null
2026-01-12 From Sketch to Fresco: Efficient Diffusion Transformer with Progressive Resolution Shikang Zheng et.al. 2601.07462 null
2026-01-12 SDHSI-Net: Learning Better Representations for Hyperspectral Images via Self-Distillation Prachet Dev Singh et.al. 2601.07416 null
2026-01-12 Forecast the Principal, Stabilize the Residual: Subspace-Aware Feature Caching for Efficient Diffusion Transformers Guantao Chen et.al. 2601.07396 null
2026-01-12 Software-Hardware Co-optimization for Modular E2E AV Paradigm: A Unified Framework of Optimization Approaches, Simulation Environment and Evaluation Metrics Chengzhi Ji et.al. 2601.07393 null
2026-01-12 MI-PRUN: Optimize Large Language Model Pruning via Mutual Information Hao Zhang et.al. 2601.07212 null
2026-01-12 Active Context Compression: Autonomous Memory Management in LLM Agents Nikhil Verma et.al. 2601.07190 null
2026-01-12 Stable On-Policy Distillation through Adaptive Target Reformulation Ijun Jang et.al. 2601.07155 null
2026-01-11 Robust Mean Estimation under Quantization Pedro Abdalla et.al. 2601.07074 null
2026-01-11 Jasper: ANNS Quantized for Speed, Built for Change on GPU Hunter McCoy et.al. 2601.07048 null
2026-01-11 Magnetic winds in resistive compact binary discs Marc Van den Bossche et.al. 2601.06994 null
2026-01-11 HAS-VQ: Hessian-Adaptive Sparse Vector Quantization for High-Fidelity LLM Compression Vladimer Khasia et.al. 2601.06959 null
2026-01-11 TagSpeech: End-to-End Multi-Speaker ASR and Diarization with Fine-Grained Temporal Grounding Mingyue Huo et.al. 2601.06896 null
2026-01-11 SecMoE: Communication-Efficient Secure MoE Inference via Select-Then-Compute Bowen Shen et.al. 2601.06790 null
2026-01-11 Artificial Entanglement in the Fine-Tuning of Large Language Models Min Chen et.al. 2601.06788 null
2026-01-11 Garbage Attention in Large Language Models: BOS Sink Heads and Sink-aware Pruning Jaewon Sok et.al. 2601.06787 null
2026-01-10 GRASP LoRA: GRPO Guided Adapter Sparsity Policy for Cross Lingual Transfer Besher Hassan et.al. 2601.06702 null
2026-01-10 Families of Toeplitz operators, family index and deformation quantization Clément Cren et.al. 2601.06619 null
2026-01-10 Joint Impact of ADC and Fronthaul Quantization in Cell-Free Massive MIMO-OFDM Uplink Özlem Tuğfe Demir et.al. 2601.06483 null
2026-01-10 PRISP: Privacy-Safe Few-Shot Personalization via Lightweight Adaptation Junho Park et.al. 2601.06471 null
2026-01-10 SecureDyn-FL: A Robust Privacy-Preserving Federated Learning Framework for Intrusion Detection in IoT Networks Imtiaz Ali Soomro et.al. 2601.06466 null
2026-01-10 Gecko: An Efficient Neural Architecture Inherently Processing Sequences with Arbitrary Lengths Xuezhe Ma et.al. 2601.06463 null
2026-01-09 Monkey Jump: MoE-Style PEFT for Efficient Multi-Task Learning Nusrat Jahan Prottasha et.al. 2601.06356 null
2026-01-09 Why LoRA Fails to Forget: Regularized Low-Rank Adaptation Against Backdoors in Language Models Hoang-Chau Luong et.al. 2601.06305 null
2026-01-09 Real-Time Image Processing Algorithms for Embedded Systems Soundes Oumaima Boufaida et.al. 2601.06243 null
2026-01-09 Distilling Lightweight Domain Experts from Large ML Models by Identifying Relevant Subspaces Pattarawat Chormai et.al. 2601.05913 null
2026-01-09 FLRQ: Faster LLM Quantization with Flexible Low-Rank Matrix Sketching Hongyaoxing Gul et.al. 2601.05684 null
2026-01-09 Compressing image encoders via latent distillation Caroline Mazini Rodrigues et.al. 2601.05639 null
2026-01-09 LatentVLA: Efficient Vision-Language Models for Autonomous Driving via Latent Action Prediction Chengen Xie et.al. 2601.05611 null
2026-01-09 AntibodyDesignBFN: High-Fidelity Fixed-Backbone Antibody Design via Discrete Bayesian Flow Networks Yue Hu et.al. 2601.05605 null
2026-01-09 Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection Hanyi Wang et.al. 2601.05580 null
2026-01-09 One Language-Free Foundation Model Is Enough for Universal Vision Anomaly Detection Bin-Bin Gao et.al. 2601.05552 null
2026-01-09 Discrete Homogeneity and Quantizer Design for Nonlinear Homogeneous Control Systems Yu Zhou et.al. 2601.05526 null
2026-01-08 Efficient Inference for Noisy LLM-as-a-Judge Evaluation Yiqun T Chen et.al. 2601.05420 null
2026-01-08 Interactive Distillation for Cooperative Multi-Agent Reinforcement Learning Minwoo Cho et.al. 2601.05407 null
2026-01-08 Markovian Compression: Looking to the Past Helps Accelerate the Future Andrey Veprikov et.al. 2601.05398 null
2026-01-08 Sketch&Patch++: Efficient Structure-Aware 3D Gaussian Representation Yuang Shi et.al. 2601.05394 null
2026-01-08 Knowledge Distillation of a Protein Language Model Yields a Foundational Implicit Solvent Model Justin Airas et.al. 2601.05388 null
2026-01-08 EdgeLDR: Quaternion Low-Displacement Rank Neural Networks for Edge-Efficient Deep Learning Vladimir Frants et.al. 2601.05379 null
2026-01-08 MOSAIC-GS: Monocular Scene Reconstruction via Advanced Initialization for Complex Dynamic Environments Svitlana Morkva et.al. 2601.05368 null
2026-01-08 STResNet & STYOLO: A New Family of Compact Classification and Object Detection Models for MCUs Sudhakar Sah et.al. 2601.05364 null
2026-01-08 Microscopic Unitarity and the Quantization of Black Hole Evaporation Time Ahmad Adel Abutaleb et.al. 2601.05305 null
2026-01-08 RelayLLM: Efficient Reasoning via Collaborative Decoding Chengsong Huang et.al. 2601.05167 null
2026-01-08 Learning Mixture Models via Efficient High-dimensional Sparse Fourier Transforms Alkis Kalavasis et.al. 2601.05157 null
2026-01-08 ArcAligner: Adaptive Recursive Aligner for Compressed Context Embeddings in RAG Jianbo Li et.al. 2601.05038 link
2026-01-08 Guided Variational Network for Image Decomposition Alessandro Lanza et.al. 2601.04999 null
2026-01-08 ConMax: Confidence-Maximizing Compression for Efficient Chain-of-Thought Reasoning Minda Hu et.al. 2601.04973 null
2026-01-08 DR-LoRA: Dynamic Rank LoRA for Mixture-of-Experts Adaptation Guanzhi Deng et.al. 2601.04823 null
2026-01-08 GPU-Accelerated INT8 Quantization for KV Cache Compression in Large Language Models Maanas Taneja et.al. 2601.04719 null
2026-01-08 PROMISE: Process Reward Models Unlock Test-Time Scaling Laws in Generative Recommendations Chengcheng Guo et.al. 2601.04674 null
2026-01-08 FedKDX: Federated Learning with Negative Knowledge Distillation for Enhanced Healthcare AI Systems Quang-Tu Pham et.al. 2601.04587 null
2026-01-08 TokenSeg: Efficient 3D Medical Image Segmentation via Hierarchical Visual Token Compression Sen Zeng et.al. 2601.04519 null
2026-01-07 Exact Multimode Quantization of Superconducting Circuits via Boundary Admittance Mustafa Bakr et.al. 2601.04407 null
2026-01-07 SCAR-GS: Spatial Context Attention for Residuals in Progressive Gaussian Splatting Diego Revilla et.al. 2601.04348 null
2026-01-07 MemKD: Memory-Discrepancy Knowledge Distillation for Efficient Time Series Classification Nilushika Udayangani et.al. 2601.04264 null
2026-01-07 Learning to Reason: Temporal Saliency Distillation for Interpretable Knowledge Transfer Nilushika Udayangani Hewa Dehigahawattage et.al. 2601.04263 null
2026-01-07 Safety-Utility Conflicts Are Not Global: Surgical Alignment via Head-Level Diagnosis Wang Cai et.al. 2601.04262 null
2026-01-07 ToTMNet: FFT-Accelerated Toeplitz Temporal Mixing Network for Lightweight Remote Photoplethysmography Vladimir Frants et.al. 2601.04159 null
2026-01-07 Hybrid Downlink Beamforming with Outage Constraints under Imperfect CSI using Model-Driven Deep Learning Lukas Schynol et.al. 2601.04069 null
2026-01-07 A Scheduling Framework for Efficient MoE Inference on Edge GPU-NDP Systems Qi Wu et.al. 2601.03992 null
2026-01-07 Using Small Language Models to Reverse-Engineer Machine Learning Pipelines Structures Nicolas Lacroix et.al. 2601.03988 null
2026-01-07 FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection Mingyu Ouyang et.al. 2601.03928 null
2026-01-07 Evaluating Small Decoder-Only Language Models for Grammar Correction and Text Simplification Anthony Lamelas et.al. 2601.03874 null
2026-01-07 MPM-QIR: Measurement-Probability Matching for Quantum Image Representation and Compression via Variational Quantum Circuit Chong-Wei Wang et.al. 2601.03855 null
2026-01-07 Rethinking Table Pruning in TableQA: From Sequential Revisions to Gold Trajectory-Supervised Parallel Search Yu Guo et.al. 2601.03851 null
2026-01-07 Unified and Efficient Analysis of Machining Chatter and Surface Location Error Woraphrut Kornmaneesang et.al. 2601.03819 null
2026-01-07 AI Generated Text Detection Adilkhan Alikhanov et.al. 2601.03812 null
2026-01-07 Improving Compactness and Reducing Ambiguity of CFIRE Rule-Based Explanations Sebastian Müller et.al. 2601.03776 null
2026-01-07 Topological quantization of vector meson anomalous couplings Chao-Qiang Geng et.al. 2601.03740 null
2026-01-07 Investigating Knowledge Distillation Through Neural Networks for Protein Binding Affinity Prediction Wajid Arshad Abbasi et.al. 2601.03704 null
2026-01-07 ELO: Efficient Layer-Specific Optimization for Continual Pretraining of Multilingual LLMs HanGyeol Yoo et.al. 2601.03648 null
2026-01-07 PhysicsFormer: An Efficient and Fast Attention-Based Physics Informed Neural Network for Solving Incompressible Navier Stokes Equations Biswanath Barman et.al. 2601.03613 null
2026-01-07 Policy-Guided Search on Tree-of-Thoughts for Efficient Problem Solving with Bounded Language Model Queries Sumedh Pendurkar et.al. 2601.03606 null
2026-01-07 Stratified Pseudobundles and Quantization Ethan Ross et.al. 2601.03544 null
2026-01-07 Cyberattack Detection in Virtualized Microgrids Using LightGBM and Knowledge-Distilled Classifiers Osasumwen Cedric Ogiesoba-Eguakun et.al. 2601.03495 null
2026-01-07 From Bits to Chips: An LLM-based Hardware-Aware Quantization Agent for Streamlined Deployment of LLMs Kaiyuan Deng et.al. 2601.03484 null
2026-01-06 Implicit Graph, Explicit Retrieval: Towards Efficient and Interpretable Long-horizon Memory for Large Language Models Xin Zhang et.al. 2601.03417 null
2026-01-06 PIVONet: A Physically-Informed Variational Neuro ODE Model for Efficient Advection-Diffusion Fluid Simulation Hei Shing Cheung et.al. 2601.03397 null
2026-01-06 Edit2Restore: Few-Shot Image Restoration via Parameter-Efficient Adaptation of Pre-trained Editing Models M. Akın Yılmaz et.al. 2601.03391 null
2026-01-06 Optimal Quantization of Finite Uniform Data on the Sphere Mrinal Kanti Roychowdhury et.al. 2601.03333 null
2026-01-06 LUT-KAN: Segment-wise LUT Quantization for Fast KAN Inference Oleksandr Kuznetsov et.al. 2601.03332 null
2026-01-06 Fine-tuning Small Language Models as Efficient Enterprise Search Relevance Labelers Yue Kang et.al. 2601.03211 null
2026-01-06 Sparse Knowledge Distillation: A Mathematical Framework for Probability-Domain Temperature Scaling and Multi-Stage Compression Aaron R. Flouro et.al. 2601.03195 null
2026-01-06 Do LLMs Encode Functional Importance of Reasoning Tokens? Janvijay Singh et.al. 2601.03066 null
2026-01-06 From Memorization to Creativity: LLM as a Designer of Novel Neural-Architectures Waleed Khalid et.al. 2601.02997 null
2026-01-06 Few-shot learning for security bug report identification Muhammad Laiq et.al. 2601.02971 null
2026-01-06 Reliability-Aware Adaptive Self-Consistency for Efficient Sampling in LLM Reasoning Junseok Kim et.al. 2601.02970 null
2026-01-06 RPIQ: Residual-Projected Multi-Collaboration Closed-Loop and Single Instance Quantization for Visually Impaired Assistance Xuanyu Wang et.al. 2601.02888 null
2026-01-06 Sample-Efficient Neurosymbolic Deep Reinforcement Learning Celeste Veronese et.al. 2601.02850 null
2026-01-06 AnyDepth: Depth Estimation Made Easy Zeyu Ren et.al. 2601.02760 null
2026-01-06 CRoPE: Efficient Parametrization of Rotary Positional Embedding Beicheng Lou et.al. 2601.02728 null
2026-01-06 Transform and Entropy Coding in AV2 Alican Nalci et.al. 2601.02712 null
2026-01-06 Adversarial Contrastive Learning for LLM Quantization Attacks Dinghong Song et.al. 2601.02680 null
2026-01-06 Iterative Structured Pruning for Large Language Models with Multi-Domain Calibration Guangxin Wu et.al. 2601.02674 null
2026-01-05 Compressed code: the hidden effects of quantization and distillation on programming tokens Viacheslav Siniaev et.al. 2601.02563 null
2026-01-05 ModeX: Evaluator-Free Best-of-N Selection for Open-Ended Generation Hyeong Kyu Choi et.al. 2601.02535 null
2026-01-05 GEM-Style Constraints for PEFT with Dual Gradient Projection in LoRA Brian Tekmen et.al. 2601.02500 null
2026-01-05 TAP-ViTs: Task-Adaptive Pruning for On-Device Deployment of Vision Transformers Zhibo Wang et.al. 2601.02437 null
2026-01-04 A Dynamic Retrieval-Augmented Generation System with Selective Memory and Remembrance Okan Bursa et.al. 2601.02428 null
2026-01-05 DARC: Drum accompaniment generation with fine-grained rhythm control Trey Brosnan et.al. 2601.02357 null
2026-01-05 Meta-Learning Guided Pruning for Few-Shot Plant Pathology on Edge Devices Shahnawaz Alam et.al. 2601.02353 null
2026-01-05 Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Falcon LLM Team et.al. 2601.02346 null
2026-01-05 Power-of-Two Quantization-Aware-Training (PoT-QAT) in Large Language Models (LLMs) Mahmoud Elgenedy et.al. 2601.02298 null
2026-01-05 TopoLoRA-SAM: Topology-Aware Parameter-Efficient Adaptation of Foundation Segmenters for Thin-Structure and Cross-Domain Binary Semantic Segmentation Salim Khazem et.al. 2601.02273 null
2026-01-05 SLGNet: Synergizing Structural Priors and Language-Guided Modulation for Multimodal Object Detection Xiantai Xiang et.al. 2601.02249 null
2026-01-05 Quantized SO(3)-Equivariant Graph Neural Networks for Efficient Molecular Property Prediction Haoyu Zhou et.al. 2601.02213 null
2026-01-05 Parameter-Efficient Domain Adaption for CSI Crowd-Counting via Self-Supervised Learning with Adapter Modules Oliver Custance et.al. 2601.02203 null
2026-01-05 HFRWKV: A High-Performance Fully On-Chip Hardware Accelerator for RWKV Liu Shijie et.al. 2601.02135 null
2026-01-05 MindChat: A Privacy-preserving Large Language Model for Mental Health Support Dong Xue et.al. 2601.01993 null
2026-01-05 Vector Search for the Future: From Memory-Resident, Static Heterogeneous Storage, to Cloud-Native Architectures Yitong Song et.al. 2601.01937 null
2026-01-05 RRNet: Configurable Real-Time Video Enhancement with Arbitrary Local Lighting Variations Wenlong Yang et.al. 2601.01865 null
2026-01-05 Causality-Aware Temporal Projection for Video Understanding in Video-LLMs Zhengjian Kang et.al. 2601.01804 null
2026-01-05 Subsymmetry-protected compact edge states Ruoqi Cheng et.al. 2601.01721 null
2026-01-05 Digital Twin-Driven Communication-Efficient Federated Anomaly Detection for Industrial IoT Mohammed Ayalew Belay et.al. 2601.01701 null
2026-01-05 Real-Time Lane Detection via Efficient Feature Alignment and Covariance Optimization for Low-Power Embedded Systems Yian Liu et.al. 2601.01696 null
2026-01-04 DiffKD-DCIS: Predicting Upgrade of Ductal Carcinoma In Situ with Diffusion Augmentation and Knowledge Distillation Tao Li et.al. 2601.01507 null
2026-01-04 SGD-Based Knowledge Distillation with Bayesian Teachers: Theory and Guidelines Itai Morad et.al. 2601.01484 null
2026-01-04 Efficient Cover Construction for Ball Mapper via Accelerated Range Queries Jay-Anne Bulauan et.al. 2601.01405 null
2026-01-04 Empowering Small Language Models with Factual Hallucination-Aware Reasoning for Financial Classification Han Yuan et.al. 2601.01378 null
2026-01-03 T3C: Test-Time Tensor Compression with Consistency Guarantees Ismail Lamaakal et.al. 2601.01299 null
2026-01-03 MambaFormer: Token-Level Guided Routing Mixture-of-Experts for Accurate and Efficient Clinical Assistance Hamad Khan et.al. 2601.01260 null
2026-01-03 Racka: Efficient Hungarian LLM Adaptation on Academic Infrastructure Zsolt Csibi et.al. 2601.01244 null
2026-01-03 XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression Zunhai Su et.al. 2601.01204 null
2026-01-03 EmoLoom-2B: Fast Base-Model Screening for Emotion Classification and VAD with Lexicon-Weak Supervision and KV-Off Evaluation Zilin Li et.al. 2601.01112 null
2026-01-03 Efficient Hyperspectral Image Reconstruction Using Lightweight Separate Spectral Transformers Jianan Li et.al. 2601.01064 null
2026-01-03 Multi-Dimensional Prompt Chaining to Improve Open-Domain Dialogue Generation Livia Leong Hui Teng et.al. 2601.01037 null
2026-01-02 Lightweight Channel Attention for Efficient CNNs Prem Babu Kanaparthi et.al. 2601.01002 null
2026-01-02 KDPhys: An Attention Guided 3D to 2D Knowledge Distillation for Real-time Video-Based Physiological Measurement Nicky Nirlipta Sahoo et.al. 2601.00714 null
2026-01-02 QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models Rachmad Vidya Wicaksana Putra et.al. 2601.00679 null
2026-01-02 Sparse FEONet: A Low-Cost, Memory-Efficient Operator Network via Finite-Element Local Sparsity for Parametric PDEs Seungchan Ko et.al. 2601.00672 null
2026-01-02 CoCo-Fed: A Unified Framework for Memory- and Communication-Efficient Federated Learning at the Wireless Edge Zhiheng Guo et.al. 2601.00549 null
2026-01-02 Variable Elimination in Hybrid Factor Graphs for Discrete-Continuous Inference & Estimation Varun Agrawal et.al. 2601.00545 null
2026-01-02 ECR: Manifold-Guided Semantic Cues for Compact Language Models Chung-Wei Victor Yuan et.al. 2601.00543 null
2026-01-02 Federated Customization of Large Models: Approaches, Experiments, and Insights Yuchuan Ye et.al. 2601.00526 null
2026-01-02 Optimizing LSTM Neural Networks for Resource-Constrained Retail Sales Forecasting: A Model Compression Study Ravi Teja Pagidoju et.al. 2601.00525 null
2026-01-01 Fisher-Information-Driven Adaptive Acquisition for Photon-Efficient FLIM: A Dual-Implementation Framework for TCSPC and Programmable Time-Gating J. Sumaya-Martinez et.al. 2601.00490 null
2026-01-01 A Comparative Study of Adaptation Strategies for Time Series Foundation Models in Anomaly Detection Miseon Park et.al. 2601.00446 null
2026-01-01 Time–to–Digital Converter (TDC)–Based Resonant Compute–in–Memory for INT8 CNNs with Layer–Optimized SRAM Mapping Dhandeep Challagundla et.al. 2601.00434 null
2026-01-01 Efficient Prediction of Dense Visual Embeddings via Distillation and RGB-D Transformers Söhnke Benedikt Fischedick et.al. 2601.00359 null
2026-01-01 Can Optimal Transport Improve Federated Inverse Reinforcement Learning? David Millard et.al. 2601.00309 null
2026-01-01 VisNet: Efficient Person Re-Identification via Alpha-Divergence Loss, Feature Fusion and Dynamic Multi-Task Learning Anns Ijaz et.al. 2601.00307 null
2026-01-01 Can Large Language Models Still Explain Themselves? Investigating the Impact of Quantization on Self-Explanations Qianli Wang et.al. 2601.00282 null
2026-01-01 Equivariant Cohomology, BRST Quantization, and Analytic Localization: A Unified Framework Lixin Xu et.al. 2601.00256 null
2026-01-01 An Empirical Evaluation of LLM-Based Approaches for Code Vulnerability Detection: RAG, SFT, and Dual-Agent Systems Md Hasan Saju et.al. 2601.00254 null
2026-01-01 GRIT – Geometry-Aware PEFT with K-FAC Preconditioning, Fisher-Guided Reprojection, and Dynamic Rank Adaptation Pritish Saha et.al. 2601.00231 null
2026-01-01 Robust Graph Fine-Tuning with Adversarial Graph Prompting Ziyan Zhang et.al. 2601.00229 null
2026-01-01 LooC: Effective Low-Dimensional Codebook for Compositional Vector Quantization Jie Li et.al. 2601.00222 null
2026-01-01 Knowledge Distillation for Temporal Knowledge Graph Reasoning with Large Language Models Wang Xing et.al. 2601.00202 null
2025-12-31 Evaluating the Impact of Compression Techniques on the Robustness of CNNs under Natural Corruptions Itallo Patrick Castro Alves Da Silva et.al. 2512.24971 null
2025-12-31 OFL-SAM2: Prompt SAM2 with Online Few-shot Learner for Efficient Medical Image Segmentation Meng Lan et.al. 2512.24861 null
2025-12-31 HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment Yunsheng Pang et.al. 2512.24787 null
2025-12-31 Control of Microrobots with Reinforcement Learning under On-Device Compute Constraints Yichen Liu et.al. 2512.24740 null
2025-12-31 FPGA Co-Design for Efficient N:M Sparse and Quantized Model Inference Fen-Yu Hsieh et.al. 2512.24713 null
2025-12-31 Average Consensus with Dynamic Quantization Framing and Finite-Time Termination over Limited-Bandwidth Directed Networks Evagoras Makridis et.al. 2512.24700 null
2025-12-31 Distributed Bilevel Optimization with Dual Pruning for Resource-limited Clients Mingyi Li et.al. 2512.24667 null
2025-12-31 Renormalization Group Guided Tensor Network Structure Search Maolin Wang et.al. 2512.24663 null
2025-12-31 Geometric Quantization by Paths Part II: The General Case Patrick Iglesias-Zemmour et.al. 2512.24627 null
2025-12-31 AutoFed: Manual-Free Federated Traffic Prediction via Personalized Prompt Zijian Zhao et.al. 2512.24625 null
2025-12-31 Collaborative Low-Rank Adaptation for Pre-Trained Vision Transformers Zheng Liu et.al. 2512.24603 null
2025-12-31 Hierarchical Vector-Quantized Latents for Perceptual Low-Resolution Video Compression Manikanta Kotthapalli et.al. 2512.24547 null
2025-12-31 More Than Bits: Multi-Envelope Double Binary Factorization for Extreme Quantization Yuma Ichikawa et.al. 2512.24545 null
2025-12-30 Implementing the three-neutron quantization condition Wilder Schaaf et.al. 2512.24508 null
2025-12-30 Spectroscopy of Quantum Phase Slips: Visualizing Complex Real-Time Instantons Foster Thompson et.al. 2512.24495 null
2025-12-30 PackKV: Reducing KV Cache Memory Footprint through LLM-Aware Lossy Compression Bo Jiang et.al. 2512.24449 link
2025-12-30 FAST-IDS: A Fast Two-Stage Intrusion Detection System with Hybrid Compression for Real-Time Threat Detection in Connected and Autonomous Vehicles Devika S et.al. 2512.24391 null
2025-12-30 Incremental Certificate Learning for Hybrid Neural Network Verification. A Solver Architecture for Piecewise-Linear Safety Queries Chandrasekhar Gokavarapu et.al. 2512.24379 null
2025-12-30 Efficient Decoding of Twisted GRS Codes and Roth–Lempel Codes Runtian Zhu et.al. 2512.24217 null
2025-12-30 OptRot: Mitigating Weight Outliers via Data-Free Rotations for Post-Training Quantization Advait Gadhikar et.al. 2512.24124 null
2025-12-30 One-Shot Structured Pruning of Quantum Neural Networks via $q$-Group Engineering and Quantum Geometric Metrics Haijian Shao et.al. 2512.24019 null
2025-12-30 Structure-Guided Allocation of 2D Gaussians for Image Representation and Compression Huanxiong Liang et.al. 2512.24018 null
2025-12-30 HERO-Sign: Hierarchical Tuning and Efficient Compiler-Time GPU Optimizations for SPHINCS+ Signature Generation Yaoyun Zhou et.al. 2512.23969 null
2025-12-30 Hardware Acceleration for Neural Networks: A Comprehensive Survey Bin Xu et.al. 2512.23914 null
2025-12-29 Efficient Deep Learning for Short-Term Solar Irradiance Time Series Forecasting: A Benchmark Study in Ho Chi Minh City Tin Hoang et.al. 2512.23898 null
2025-12-29 Probing the Limits of Compressive Memory: A Study of Infini-Attention in Small-Scale Pretraining Ruizhe Huang et.al. 2512.23862 null
2025-12-29 FRoD: Full-Rank Efficient Fine-Tuning with Rotational Degrees for Fast Convergence Guoan Wan et.al. 2512.23485 null
2025-12-29 Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Ang Lv et.al. 2512.23447 null
2025-12-29 Mobile-Efficient Speech Emotion Recognition Using DistilHuBERT: A Cross-Corpus Validation Study Saifelden M. Ismail et.al. 2512.23435 null
2025-12-29 Electro-optical modulation of light polarization in a nonlocal lithium niobate metasurface Agostino Di Francescantonio et.al. 2512.23393 null
2025-12-29 Post-Training Quantization of OpenPangu Models for Efficient Deployment on Atlas A2 Yilun Luo et.al. 2512.23367 null
2025-12-29 Deep learning for pedestrians: backpropagation in Transformers Laurent Boué et.al. 2512.23329 null
2025-12-29 Interpretable Safety Alignment via SAE-Constructed Low-Rank Subspace Adaptation Dianyun Wang et.al. 2512.23260 null
2025-12-29 RS-Prune: Training-Free Data Pruning at High Ratios for Efficient Remote Sensing Diffusion Foundation Models Fan Wei et.al. 2512.23239 null
2025-12-29 Energy and Memory-Efficient Federated Learning With Ordered Layer Freezing Ziru Niu et.al. 2512.23200 null
2025-12-29 A Simple, Optimal and Efficient Algorithm for Online Exp-Concave Optimization Yi-Han Wang et.al. 2512.23190 null
2025-12-30 Evaluating Parameter Efficient Methods for RLVR Qingyu Yin et.al. 2512.23165 null
2025-12-28 Efficient flip-chip and on-chip-based modulation of flux-tunable superconducting resonators Achintya Paradkar et.al. 2512.23119 null
2025-12-28 Taming the Tail: Stable LLM Reinforcement Learning via Dynamic Vocabulary Pruning Yingru Li et.al. 2512.23087 null
2025-12-28 Rethinking Fine-Tuning: Unlocking Hidden Capabilities in Vision-Language Models Mingyuan Zhang et.al. 2512.23073 null
2025-12-28 Federated Learning With L0 Constraint Via Probabilistic Gates For Sparsity Krishna Harsha Kovelakuntla Huthasana et.al. 2512.23071 null
2025-12-28 TYTAN: Taylor-series based Non-Linear Activation Engine for Deep Learning Accelerators Soham Pramanik et.al. 2512.23062 null
2025-12-28 The topological life of Dynkin indices: universal scaling and matter selection Mboyo Esole et.al. 2512.23041 null
2025-12-28 Interpretable Gallbladder Ultrasound Diagnosis: A Lightweight Web-Mobile Software Platform with Real-Time XAI Fuyad Hasan Bhoyan et.al. 2512.23033 null
2025-12-28 Merge before Forget: A Single LoRA Continual Learning via Continual Merging Fuli Qiao et.al. 2512.23017 null
2025-12-28 Improving Generalization in LLM Structured Pruning via Function-Aware Neuron Grouping Tao Yu et.al. 2512.23014 null
2025-12-28 YOLO-IOD: Towards Real Time Incremental Object Detection Shizhou Zhang et.al. 2512.22973 null
2025-12-28 Gauge Symmetry in Quantum Simulation Masanori Hanada et.al. 2512.22932 null
2025-12-28 Covering in Hamming and Grassmann Spaces: New Bounds and Reed–Solomon-Based Constructions Samin Riasat et.al. 2512.22911 null
2025-12-28 Hash Grid Feature Pruning Yangzhi Ma et.al. 2512.22882 null
2025-12-28 Parallel Diffusion Solver via Residual Dirichlet Policy Optimization Ruoyu Wang et.al. 2512.22796 null
2025-12-28 TrimTokenator-LC: Towards Adaptive Visual Token Pruning for Large Multimodal Models with Long Contexts Hao Zhang et.al. 2512.22748 null
2025-12-28 Robust LLM-based Column Type Annotation via Prompt Augmentation with LoRA Tuning Hanze Meng et.al. 2512.22742 null
2025-12-27 Fragile Knowledge, Robust Instruction-Following: The Width Pruning Dichotomy in Llama-3.2 Pere Martra et.al. 2512.22671 null
2025-12-27 The Quest for Winning Tickets in Low-Rank Adapters Hamed Damirchi et.al. 2512.22495 null
2025-12-27 Scalpel-SAM: A Semi-Supervised Paradigm for Adapting SAM to Infrared Small Object Detection Zihan Liu et.al. 2512.22483 null
2025-12-27 AFA-LoRA: Enabling Non-Linear Adaptations in LoRA with Activation Function Annealing Jiacheng Li et.al. 2512.22455 null
2025-12-26 Lightweight Inference-Time Personalization for Frozen Knowledge Graph Embeddings Ozan Oguztuzun et.al. 2512.22398 null
2025-12-26 Integrating Wide and Deep Neural Networks with Squeeze-and-Excitation Blocks for Multi-Target Property Prediction in Additively Manufactured Fiber Reinforced Composites Behzad Parvaresh et.al. 2512.22397 null
2025-12-26 Towards Efficient Post-Training via Fourier-Driven Adapter Architectures Donggyun Bae et.al. 2512.22378 null
2025-12-26 The Effectiveness of Approximate Regularized Replay for Efficient Supervised Fine-Tuning of Large Language Models Matthew Riemer et.al. 2512.22337 null
2025-12-26 PortionNet: Distilling 3D Geometric Knowledge for Food Nutrition Estimation Darrin Bright et.al. 2512.22304 null
2025-12-26 Pruning as a Game: Equilibrium-Driven Sparsification of Neural Networks Zubair Shah et.al. 2512.22106 null
2025-12-26 A Lightweight Multi-Scale Attention Framework for Real-Time Spinal Endoscopic Instance Segmentation Qi Lai et.al. 2512.21984 null
2025-12-26 Breaking Alignment Barriers: TPS-Driven Semantic Correlation Learning for Alignment-Free RGB-T Salient Object Detection Lupiao Hu et.al. 2512.21856 null
2025-12-26 Knowledge Reasoning of Large Language Models Integrating Graph-Structured Information for Pest and Disease Control in Tobacco Siyu Li et.al. 2512.21837 null
2025-12-26 LIME: Accelerating Collaborative Lossless LLM Inference on Memory-Constrained Edge Devices Mingyu Sun et.al. 2512.21835 null
2025-12-25 InstructMoLE: Instruction-Guided Mixture of Low-rank Experts for Multi-Conditional Image Generation Jinqi Xiao et.al. 2512.21788 null
2025-12-25 An Information Theoretic Perspective on Agentic System Design Shizhe He et.al. 2512.21720 null
2025-12-25 MoRAgent: Parameter Efficient Agent Tuning with Mixture-of-Roles Jing Han et.al. 2512.21708 null
2025-12-25 Rethinking Output Alignment For 1-bit Post-Training Quantization of Large Language Models Dung Anh Hoang et.al. 2512.21651 null
2025-12-25 UltraLBM-UNet: Ultralight Bidirectional Mamba-based Model for Skin Lesion Segmentation Linxuan Fan et.al. 2512.21584 null
2025-12-25 Gamayun’s Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM Alexander Podolskiy et.al. 2512.21580 null
2025-12-25 Quantum $SL^+(N,\mathbb{R})$ as a locally compact quantum group K. De Commer et.al. 2512.21579 null
2025-12-25 Towards Long-window Anchoring in Vision-Language Model Distillation Haoyi Zhou et.al. 2512.21576 null
2025-12-25 World-Coordinate Human Motion Retargeting via SAM 3D Body Zhangzheng Tu et.al. 2512.21573 null
2025-12-25 RefineBridge: Generative Bridge Models Improve Financial Forecasting by Foundation Models Anthony Bolton et.al. 2512.21572 null
2025-12-25 Hierarchy-Aware Fine-Tuning of Vision-Language Models Jiayu Li et.al. 2512.21529 null
2025-12-25 Selective LLM-Guided Regularization for Enhancing Recommendation Models Shanglin Yang et.al. 2512.21526 null
2025-12-25 Fixed-Budget Parameter-Efficient Training with Frozen Encoders Improves Multimodal Chest X-Ray Classification Md Ashik Khan et.al. 2512.21508 null
2025-12-24 A Graph-Augmented knowledge Distillation based Dual-Stream Vision Transformer with Region-Aware Attention for Gastrointestinal Disease Classification with Explainable AI Md Assaduzzaman et.al. 2512.21372 null
2025-12-24 Fast SAM2 with Text-Driven Token Pruning Avilasha Mandal et.al. 2512.21333 null
2025-12-24 Model Merging via Multi-Teacher Knowledge Distillation Seyed Arshan Dalili et.al. 2512.21288 null
2025-12-24 SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance Divij Dudeja et.al. 2512.21280 null
2025-12-24 TGC-Net: A Structure-Aware and Semantically-Aligned Framework for Text-Guided Medical Image Segmentation Gaoren Lin et.al. 2512.21135 null
2025-12-24 Classical reservoir approach for efficient molecular ground state preparation Zekun He et.al. 2512.21069 null
2025-12-24 Formal $O(N^3)$ scaling GW calculations by block tensor decomposition for large molecule systems Yueyang Zhang et.al. 2512.21022 null
2025-12-24 Efficient and Robust Video Defense Framework against 3D-field Personalized Talking Face Rui-qing Sun et.al. 2512.21019 null
2025-12-24 Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation Wei-Rui Chen et.al. 2512.21002 null
2025-12-24 Leveraging Overfitting for Low-Complexity and Modality-Agnostic Joint Source-Channel Coding Haotian Wu et.al. 2512.20981 null
2025-12-24 Universal Transient Stability Analysis: A Large Language Model-Enabled Dynamics Prediction Framework Chao Shen et.al. 2512.20970 null
2025-12-24 AirGS: Real-Time 4D Gaussian Streaming for Free-Viewpoint Video Experiences Zhe Wang et.al. 2512.20943 null
2025-12-24 RevFFN: Memory-Efficient Full-Parameter Fine-Tuning of Mixture-of-Experts LLMs with Reversible Blocks Ningyuan Liu et.al. 2512.20920 null
2025-12-24 Beyond Weight Adaptation: Feature-Space Domain Injection for Cross-Modal Ship Re-Identification Tingfeng Xian et.al. 2512.20892 null
2025-12-24 Architectural Trade-offs in Small Language Models Under Compute Constraints Shivraj Singh Bhatti et.al. 2512.20877 null
2025-12-25 Learning to Sense for Driving: Joint Optics-Sensor-Model Co-Design for Semantic Segmentation Reeshad Khan et.al. 2512.20815 null
2025-12-23 Making Large Language Models Efficient Dense Retrievers Yibin Lei et.al. 2512.20612 null
2025-12-23 FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models Kaitong Cai et.al. 2512.20561 null
2025-12-23 Simplifying Multi-Task Architectures Through Task-Specific Normalization Mihai Suteu et.al. 2512.20420 null
2025-12-23 Branch Learning in MRI: More Data, More Models, More Training Yuyang Li et.al. 2512.20330 null
2025-12-23 Mixture-of-Experts with Gradient Conflict-Driven Subspace Topology Pruning for Emergent Modularity Yuxing Gan et.al. 2512.20291 null
2025-12-23 Generative Latent Coding for Ultra-Low Bitrate Image Compression Zhaoyang Jia et.al. 2512.20194 null
2025-12-23 HEART-VIT: Hessian-Guided Efficient Dynamic Attention and Token Pruning in Vision Transformer Mohammad Helal Uddin et.al. 2512.20120 null
2025-12-23 Neural Compression of 360-Degree Equirectangular Videos using Quality Parameter Adaptation Daichi Arai et.al. 2512.20093 null
2025-12-23 Rethinking Knowledge Distillation in Collaborative Machine Learning: Memory, Knowledge, and Their Interactions Pengchao Han et.al. 2512.19972 null
2025-12-22 Fine-Tuned In-Context Learners for Efficient Adaptation Jorg Bornschein et.al. 2512.19879 null
2025-12-22 Quantization for sequences of blow-up solutions to an elliptic equation having nonlocal exponential nonlinearity Mathew Gluck et.al. 2512.19865 null
2025-12-22 Efficient Vision Mamba for MRI Super-Resolution via Hybrid Selective Scanning Mojtaba Safari et.al. 2512.19676 null
2025-12-22 Quantization of Random Homogeneous Self-Similar Measures Akash Banerjee et.al. 2512.19628 null
2025-12-22 Yang-Mills energy quantization over non-collapsed degenerating Einstein manifolds and applications Youmin Chen et.al. 2512.19552 null
2025-12-22 Lightweight Intrusion Detection in IoT via SHAP-Guided Feature Pruning and Knowledge-Distilled Kronecker Networks Hafsa Benaddi et.al. 2512.19488 null
2025-12-22 Sensitivity-Aware Mixed-Precision Quantization for ReRAM-based Computing-in-Memory Guan-Cheng Chen et.al. 2512.19445 null
2025-12-22 D2Pruner: Debiased Importance and Structural Diversity for MLLM Token Pruning Evelyn Zhang et.al. 2512.19443 null
2025-12-22 A Computationally Efficient Framework for Overlapping Community Detection in Large Bipartite Graphs Yue Zeng et.al. 2512.19426 null
2025-12-22 Sprecher Networks: A Parameter-Efficient Kolmogorov-Arnold Architecture Christian Hägg et.al. 2512.19367 null
2025-12-22 Are All Data Necessary? Efficient Data Pruning for Large-scale Autonomous Driving Dataset via Trajectory Entropy Maximization Zhaoyang Liu et.al. 2512.19270 null
2025-12-22 Small Language Models as Compiler Experts: Auto-Parallelization for Heterogeneous Systems Prathamesh Devadiga et.al. 2512.19250 null
2025-12-22 Towards Minimal Fine-Tuning of VLMs Tiange Luo et.al. 2512.19219 null
2025-12-22 MixKVQ: Query-Aware Mixed-Precision KV Cache Quantization for Long-Context Reasoning Tao Zhang et.al. 2512.19206 null
2025-12-22 SAP: Syntactic Attention Pruning for Transformer-based Language Models Tzu-Yun Lee et.al. 2512.19125 null
2025-12-22 GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting Tiantian Li et.al. 2512.19108 null
2025-12-22 Tool-Augmented Hybrid Ensemble Reasoning with Distillation for Bilingual Mathematical Problem Solving Peiqing Lu et.al. 2512.19093 null
2025-12-22 Can abstract concepts from LLM improve SLM performance? Siddharth Tandon et.al. 2512.19069 null
2025-12-22 When Less is More: 8-bit Quantization Improves Continual Learning in Large Language Models Michael S. Zhang et.al. 2512.18934 null
2025-12-21 Stochastic quantization of the weighted exponential QFT Seiichiro Kusuoka et.al. 2512.18927 null
2025-12-21 FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation Ziyuan Tao et.al. 2512.18809 null
2025-12-21 IPCV: Information-Preserving Compression for MLLM Visual Encoders Yuan Chen et.al. 2512.18747 null
2025-12-21 Uni-Neur2Img: Unified Neural Signal-Guided Image Generation, Editing, and Stylization via Diffusion Transformers Xiyue Bai et.al. 2512.18635 null
2025-12-21 A Multi-agent Text2SQL Framework using Small Language Models and Execution Feedback Thanh Dat Hoang et.al. 2512.18622 null
2025-12-21 Neologism Learning as a Parameter-Efficient Alternative to Fine-Tuning for Model Steering Sungjoon Park et.al. 2512.18551 null
2025-12-20 Position-Resolved Resonance Quantization for Lossy Cavities Lucas Weitzel et.al. 2512.18478 null
2025-12-20 Analog Quantum Image Representation with Qubit-Frugal Encoding Vikrant Sharma et.al. 2512.18451 null
2025-12-20 MoE Pathfinder: Trajectory-driven Expert Pruning Xican Yang et.al. 2512.18425 null
2025-12-20 Quantization for Vector Search under Streaming Updates Ishaq Aden-Ali et.al. 2512.18335 null
2025-12-20 Asynchronous Pipeline Parallelism for Real-Time Multilingual Lip Synchronization in Video Communication Systems Eren Caglar et.al. 2512.18318 null
2025-12-20 SG-RIFE: Semantic-Guided Real-Time Intermediate Flow Estimation with Diffusion-Competitive Perceptual Quality Pan Ben Wong et.al. 2512.18241 null
2025-12-19 ACE-Sync: An Adaptive Cloud-Edge Synchronization Framework for Communication-Efficient Large-Scale Distributed Model Training Yi Yang et.al. 2512.18127 null
2025-12-19 Efficient Mixture-of-Agents Serving via Tree-Structured Routing, Adaptive Pruning, and Dependency-Aware Prefill-Decode Overlap Zijun Wang et.al. 2512.18126 null
2025-12-19 YolovN-CBi: A Lightweight and Efficient Architecture for Real-Time Detection of Small UAVs Ami Pandat et.al. 2512.18046 null
2025-12-19 CoPE: A Small Language Model for Steerable and Scalable Content Labeling Samidh Chakrabarti et.al. 2512.18027 null
2025-12-19 On General Linearly Implicit Quantized State System Methods Mariana Bergonzi et.al. 2512.17855 null
2025-12-19 Two-photon light-sheet live imaging at kilohertz frame rate using birefringence-based pulse splitting Lei Zhu et.al. 2512.17783 null
2025-12-19 Easy Adaptation: An Efficient Task-Specific Knowledge Injection Method for Large Models in Resource-Constrained Environments Dong Chen et.al. 2512.17771 null
2025-12-19 AdaptPrompt: Parameter-Efficient Adaptation of VLMs for Generalizable Deepfake Detection Yichen Jiang et.al. 2512.17730 null
2025-12-19 Mitigating Forgetting in Low Rank Adaptation Joanna Sliwa et.al. 2512.17720 null
2025-12-19 Confidence-Credibility Aware Weighted Ensembles of Small LLMs Outperform Large LLMs in Emotion Detection Menna Elgabry et.al. 2512.17630 null
2025-12-19 Guided progressive reconstructive imaging: a new quantization-based framework for low-dose, high-throughput and real-time analytical ptychography Hoelen L. Lalandec Robert et.al. 2512.17561 null
2025-12-19 A 28nm 0.22 μJ/token memory-compute-intensity-aware CNN-Transformer accelerator with hybrid-attention-based layer-fusion and cascaded pruning for semantic segmentation Pingcheng Dong et.al. 2512.17555 null
2025-12-19 Voxel-GS: Quantized Scaffold Gaussian Splatting Compression with Run-Length Coding Chunyang Fu et.al. 2512.17528 null
2025-12-19 Resource-efficient medical image classification for edge devices Mahsa Lavaei et.al. 2512.17515 null
2025-12-19 A lightweight Spatial-Temporal Graph Neural Network for Long-term Time Series Forecasting Henok Tenaw Moges et.al. 2512.17453 null
2025-12-19 Adaptive Graph Pruning with Sudden-Events Evaluation for Traffic Prediction using Online Semi-Decentralized ST-GNNs Ivan Kralj et.al. 2512.17352 null
2025-12-19 Auxiliary Descriptive Knowledge for Few-Shot Adaptation of Vision-Language Model SuBeen Lee et.al. 2512.17313 null
2025-12-19 Warmer for Less: A Cost-Efficient Strategy for Cold-Start Recommendations at Pinterest Saeed Ebrahimi et.al. 2512.17277 null
2025-12-19 BumpNet: A Sparse Neural Network Framework for Learning PDE Solutions Shao-Ting Chiu et.al. 2512.17198 null
2025-12-18 Atom: Efficient On-Device Video-Language Pipelines Through Modular Reuse Kunjal Panchal et.al. 2512.17108 null
2025-12-18 Bandwidth-Efficient Adaptive Mixture-of-Experts via Low-Rank Compensation Zhenyu Liu et.al. 2512.17073 null
2025-12-18 Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL Khushboo Thaker et.al. 2512.17053 null
2025-12-18 UniRel-R1: RL-tuned LLM Reasoning for Knowledge Graph Relational Question Answering Yinxu Tang et.al. 2512.17043 null
2025-12-18 Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection Kaixin Ding et.al. 2512.16905 null
2025-12-18 TOGGLE: Temporal Logic-Guided Large Language Model Compression for Edge Khurram Khalil et.al. 2512.16855 null
2025-12-18 Simulation-based inference with neural posterior estimation applied to X-ray spectral fitting - III Deriving exact posteriors with dimension reduction and importance sampling Didier Barret et.al. 2512.16709 null
2025-12-18 Direct inversion of data-space Hessian for efficient time-domain extended-source waveform inversion using the multiplier method Mahdi Sonbolestan et.al. 2512.16642 null
2025-12-18 Efficient CPU-GPU Collaborative Inference for MoE-based LLMs on Memory-Limited Systems En-Ming Huang et.al. 2512.16473 null
2025-12-18 CKA-Guided Modular Quantization: Beyond Bit-Width to Algorithmic Diversity Jinhao Zhang et.al. 2512.16282 null
2025-12-19 Trustworthy and Controllable Professional Knowledge Utilization in Large Language Models with TEE-GPU Execution Yifeng Cai et.al. 2512.16238 null
2025-12-18 A Domain-Adapted Pipeline for Structured Information Extraction from Police Incident Announcements on Social Media Mengfan Shen et.al. 2512.16183 null
2025-12-18 Tunneling in double-well potentials within stochastic quantization: Application to ammonia inversion Danilo F. Schafaschek et.al. 2512.16168 null
2025-12-18 Antisymmetrization of composite fermionic states for quantum simulations of nuclear reactions in first-quantization mapping Ionel Stetcu et.al. 2512.16138 null
2025-12-18 A Tri-Dynamic Preprocessing Framework for UGC Video Compression Fei Zhao et.al. 2512.16101 null
2025-12-18 TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Jintao Zhang et.al. 2512.16093 null
2025-12-18 LAPX: Lightweight Hourglass Network with Global Context Haopeng Zhao et.al. 2512.16089 null
2025-12-17 AIE4ML: An End-to-End Framework for Compiling Neural Networks for the Next Generation of AMD AI Engines Dimitrios Danopoulos et.al. 2512.15946 null
2025-12-17 Small Language Models for Efficient Agentic Tool Calling: Outperforming Large Models with Targeted Fine-tuning Polaris Jhandi et.al. 2512.15943 null
2025-12-17 End-to-End Training for Autoregressive Video Diffusion via Self-Resampling Yuwei Guo et.al. 2512.15702 null
2025-12-17 How Much is Too Much? Exploring LoRA Rank Trade-offs for Retaining Knowledge and Domain Robustness Darshita Rathore et.al. 2512.15634 null
2025-12-17 Bolmo: Byteifying the Next Generation of Language Models Benjamin Minixhofer et.al. 2512.15586 null
2025-12-17 IMKD: Intensity-Aware Multi-Level Knowledge Distillation for Camera-Radar Fusion Shashank Mishra et.al. 2512.15581 null
2025-12-17 An Efficient and Effective Encoder Model for Vision and Language Tasks in the Remote Sensing Domain João Daniel Silva et.al. 2512.15531 null
2025-12-17 Photorealistic Phantom Roads in Real Scenes: Disentangling 3D Hallucinations from Physical Geometry Hoang Nguyen et.al. 2512.15423 null
2025-12-17 Dual-Density Inference for Efficient Language Model Reasoning Zhengyi Zhao et.al. 2512.15358 null
2025-12-17 Joint Activity Detection and Channel Estimation For Fluid Antenna System Exploiting Geographical and Angular Information Zhentian Zhang et.al. 2512.15342 null
2025-12-17 Bits for Privacy: Evaluating Post-Training Quantization via Membership Inference Chenxiang Zhang et.al. 2512.15335 null
2025-12-17 A Masked Reverse Knowledge Distillation Method Incorporating Global and Local Information for Image Anomaly Detection Yuxin Jiang et.al. 2512.15326 null
2025-12-17 KD360-VoxelBEV: LiDAR and 360-degree Camera Cross Modality Knowledge Distillation for Bird’s-Eye-View Segmentation Wenke E et.al. 2512.15311 null
2025-12-17 LLMQ: Efficient Lower-Precision Pretraining for Consumer GPUs Erik Schultheis et.al. 2512.15306 null
2025-12-17 Generative Preprocessing for Image Compression with Pre-trained Diffusion Models Mengxi Guo et.al. 2512.15270 null
2025-12-18 Null-LoRA: Low-Rank Adaptation on Null Space Yi Zhang et.al. 2512.15233 null
2025-12-17 ERIENet: An Efficient RAW Image Enhancement Network under Low-Light Environment Jianan Wang et.al. 2512.15186 null
2025-12-18 An updated efficient galaxy morphology classification model based on ConvNeXt encoding with UMAP dimensionality reduction Guanwen Fang et.al. 2512.15137 null
2025-12-17 Large Model Enabled Embodied Intelligence for 6G Integrated Perception, Communication, and Computation Network Zhuoran Li et.al. 2512.15109 null
2025-12-17 Quantization in mixed polarization via transverse Poincaré-Birkhoff-Witt theorem Dan Wang et.al. 2512.15060 null
2025-12-17 Fractional quantization by interaction of arbitrary strength in gapless flat bands with divergent quantum geometry Wenqi Yang et.al. 2512.15041 null
2025-12-16 Cross-Tokenizer Likelihood Scoring Algorithms for Language Model Distillation Buu Phan et.al. 2512.14954 null
2025-12-16 Parameter Efficient Multimodal Instruction Tuning for Romanian Vision Language Models George-Andrei Dima et.al. 2512.14926 null
2025-12-16 Compensation of Coarse Quantization Effects on Channel Estimation and BER in Massive MIMO Reza Mohammadkhani et.al. 2512.14893 null
2025-12-16 Spherical Leech Quantization for Visual Tokenization and Generation Yue Zhao et.al. 2512.14697 null
2025-12-16 Native and Compact Structured Latents for 3D Generation Jianfeng Xiang et.al. 2512.14692 null
2025-12-16 Focus: A Streaming Concentration Architecture for Efficient Vision-Language Models Chiyue Wei et.al. 2512.14661 null
2025-12-16 PruneX: A Hierarchical Communication-Efficient System for Distributed CNN Training with Structured Pruning Alireza Olama et.al. 2512.14628 null
2025-12-16 Distill Video Datasets into Images Zhenghao Zhao et.al. 2512.14621 null
2025-12-16 Polypersona: Persona-Grounded LLM for Synthetic Survey Responses Tejaswani Dash et.al. 2512.14562 null
2025-12-16 VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse Ying Nie et.al. 2512.14531 null
2025-12-16 SASQ: Static Activation Scaling for Quantization-Aware Training in Large Language Models Shizhuo Mao et.al. 2512.14481 null
2025-12-16 Context-Picker: Dynamic context selection using multi-stage reinforcement learning Siyuan Zhu et.al. 2512.14465 null
2025-12-16 HGS: Hybrid Gaussian Splatting with Static-Dynamic Decomposition for Compact Dynamic View Synthesis Kaizhe Zhang et.al. 2512.14352 null
2025-12-16 Ladder Up, Memory Down: Low-Cost Fine-Tuning With Side Nets Estelle Zheng et.al. 2512.14237 null
2025-12-16 Arithmetic-Intensity-Aware Quantization Taig Singh et.al. 2512.14090 null
2025-12-16 HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices HyperAI Team et.al. 2512.14052 null
2025-12-16 Evaluating Small Language Models for Agentic On-Farm Decision Support Systems Enhong Liu et.al. 2512.14043 null
2025-12-15 Ensemble-Guided Distillation for Compact and Robust Acoustic Scene Classification on Edge Devices Hossein Sharify et.al. 2512.13905 null
2025-12-15 OPTIMA: Optimal One-shot Pruning for LLMs via Quadratic Programming Reconstruction Mohammad Mozaffari et.al. 2512.13886 null
2025-12-15 Improvise, Adapt, Overcome – Telescopic Adapters for Efficient Fine-tuning of Vision Language Models in Medical Imaging Ujjwal Mishra et.al. 2512.13855 null
2025-12-15 Recurrent Video Masked Autoencoders Daniel Zoran et.al. 2512.13684 null
2025-12-15 SEDULity: A Proof-of-Learning Framework for Distributed and Secure Blockchains with Efficient Useful Work Weihang Cao et.al. 2512.13666 null
2025-12-15 Large-Language Memorization During the Classification of United States Supreme Court Cases John E. Ortega et.al. 2512.13654 null
2025-12-15 Performance Limits of Hardware-Constrained THz Inter-Satellite MIMO-ISAC Systems Haofan Dong et.al. 2512.13652 null
2025-12-16 MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning Haoyu Fu et.al. 2512.13636 null
2025-12-15 LightTopoGAT: Enhancing Graph Attention Networks with Topological Features for Efficient Graph Classification Ankit Sharma et.al. 2512.13617 null
2025-12-15 Null quantization, shadows and boost eigenfunctions in Lorentzian AdS Núria Navarro et.al. 2512.13541 null
2025-12-15 SkipCat: Rank-Maximized Low-Rank Compression of Large Language Models via Shared Projection and Block Skipping Yu-Chen Lu et.al. 2512.13494 null
2025-12-15 Element-wise Modulation of Random Matrices for Efficient Neural Layers Maksymilian Szorc et.al. 2512.13480 null
2025-12-15 Automated Information Flow Selection for Multi-scenario Multi-task Recommendation Chaohua Yang et.al. 2512.13396 null
2025-12-15 Space Efficient Algorithms for Parameterised Problems Sheikh Shakil Akhtar et.al. 2512.13342 null
2025-12-16 KD-PINN: Knowledge-Distilled PINNs for ultra-low-latency real-time neural PDE solvers Karim Bounja et.al. 2512.13336 null
2025-12-15 Distillation of continuous variable qudits from single photon sources: A cascaded approach Devibala Esakkimuthu et.al. 2512.13264 null
2025-12-15 Seeing the Whole Picture: Distribution-Guided Data-Free Distillation for Semantic Segmentation Hongxuan Sun et.al. 2512.13175 null
2025-12-15 An Optimal Alignment-Driven Iterative Closed-Loop Convergence Framework for High-Performance Ultra-Large Scale Layout Pattern Clustering Shuo Liu et.al. 2512.13133 null
2025-12-15 SliceMoE: Bit-Sliced Expert Caching under Miss-Rate Constraints for Efficient MoE Inference Yuseon Choi et.al. 2512.12990 null
2025-12-15 CoDeQ: End-to-End Joint Model Compression with Dead-Zone Quantizer for High-Sparsity and Low-Precision Networks Jonathan Wenshøj et.al. 2512.12981 null
2025-12-15 Application of Deep Learning in Biological Data Compression Chunyu Zou et.al. 2512.12975 null
2025-12-15 Investigating Data Pruning for Pretraining Biological Foundation Models at Scale Yifan Wu et.al. 2512.12932 null
2025-12-15 SeVeDo: A Heterogeneous Transformer Accelerator for Low-Bit Inference via Hierarchical Group Quantization and SVD-Guided Mixed Precision Yuseon Choi et.al. 2512.12930 null
2025-12-14 Improving Recursive Transformers with Mixture of LoRAs Mohammadmahdi Nouriborji et.al. 2512.12880 null
2025-12-14 KANELÉ: Kolmogorov-Arnold Networks for Efficient LUT-based Evaluation Duc Hoang et.al. 2512.12850 null
2025-12-14 HaShiFlex: A High-Throughput Hardened Shifter DNN Accelerator with Fine-Tuning Flexibility Jonathan Herbst et.al. 2512.12847 null
2025-12-14 Adapting Multimodal Foundation Models for Few-Shot Learning: A Comprehensive Study on Contrastive Captioners N. K. B. M. P. K. B. Narasinghe et.al. 2512.12824 null
2025-12-14 FuXi-$γ$: Efficient Sequential Recommendation with Exponential-Power Temporal Encoder and Diagonal-Sparse Positional Mechanism Dezhi Yi et.al. 2512.12740 null
2025-12-14 Self-Motivated Growing Neural Network for Adaptive Architecture via Local Structural Plasticity Yiyang Jia et.al. 2512.12713 null
2025-12-14 Efficient Vision-Language Reasoning via Adaptive Token Pruning Xue Li et.al. 2512.12701 null
2025-12-14 Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches Amirhossein Yousefiramandi et.al. 2512.12677 null
2025-12-14 Patch-wise Retrieval: A Bag of Practical Techniques for Instance-level Matching Wonseok Choi et.al. 2512.12610 null
2025-12-14 StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding Xinqi Jin et.al. 2512.12560 null
2025-12-14 Effective Fine-Tuning with Eigenvector Centrality Based Pruning Shaif Chowdhury et.al. 2512.12543 null
2025-12-13 Cross-Modal Representational Knowledge Distillation for Enhanced Spike-Informed LFP Modeling Eray Erturk et.al. 2512.12461 null
2025-12-13 Complete Topological Quantization of Higher Gauge Fields Hisham Sati et.al. 2512.12431 null
2025-12-13 Large and Small Model Collaboration for Air Interface Yiming Cui et.al. 2512.12170 null
2025-12-12 Instruction-Tuning Open-Weight Language Models for BPMN Model Generation Gökberk Çelikmasat et.al. 2512.12063 null
2025-12-12 HFS: Holistic Query-Aware Frame Selection for Efficient Video Reasoning Yiqing Yang et.al. 2512.11534 null
2025-12-12 Quantization for Semipositive Adjoint Line Bundles Yu-Chi Hou et.al. 2512.11523 null
2025-12-12 Enhanced Pruning for Distributed Closeness Centrality under Multi-Packet Messaging Patrick D. Manya et.al. 2512.11512 null
2025-12-12 qa-FLoRA: Data-free query-adaptive Fusion of LoRAs for LLMs Shreya Shukla et.al. 2512.11366 null
2025-12-12 Why cut-and-choose quantum state verification cannot be both efficient and secure Fabian Wiesner et.al. 2512.11358 null
2025-12-12 PhraseVAE and PhraseLDM: Latent Diffusion for Full-Song Multitrack Symbolic Music Generation Longshen Ou et.al. 2512.11348 null
2025-12-12 MLLM Machine Unlearning via Visual Knowledge Distillation Yuhang Wang et.al. 2512.11325 null
2025-12-12 AdaSD: Adaptive Speculative Decoding for Efficient Language Model Inference Kuan-Wei Lu et.al. 2512.11280 null
2025-12-12 Information-Theoretic Equivalences Across Rate-Distortion, Quantization, and Decoding Bruno Macchiavello et.al. 2512.11279 null
2025-12-11 Network and Compiler Optimizations for Efficient Linear Algebra Kernels in Private Transformer Inference Karthik Garimella et.al. 2512.11135 null
2025-12-11 Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching Bowen Wen et.al. 2512.11130 null
2025-12-11 Information-driven Fusion of Pathology Foundation Models for Enhanced Disease Characterization Brennan Flannery et.al. 2512.11104 null
2025-12-11 Q-BAR: Blogger Anomaly Recognition via Quantum-enhanced Manifold Learning Maida Wang et.al. 2512.11071 null
2025-12-11 Weakly Supervised Tuberculosis Localization in Chest X-rays through Knowledge Distillation Marshal Ashif Shawkat et.al. 2512.11057 null
2025-12-11 SparseSwaps: Tractable LLM Pruning Mask Refinement at Scale Max Zimmer et.al. 2512.10922 null
2025-12-11 Multi-Granular Node Pruning for Circuit Discovery Muhammad Umair Haider et.al. 2512.10903 null
2025-12-11 LDP: Parameter-Efficient Fine-Tuning of Multimodal LLM for Medical Report Generation Tianyu Zhou et.al. 2512.10750 null
2025-12-11 Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution Zouying Cao et.al. 2512.10696 null
2025-12-11 Master functions and hybrid quantization of perturbed nonrotating black hole interiors Michele Lenzi et.al. 2512.10692 null
2025-12-11 Deep Photonic Reservoir Computing with On-chip Nonlinearity Jinlong Xiang et.al. 2512.10626 null
2025-12-11 Phythesis: Physics-Guided Evolutionary Scene Synthesis for Energy-Efficient Data Center Design via LLMs Minghao LI et.al. 2512.10611 null
2025-12-11 Uncertainty-Preserving QBNNs: Multi-Level Quantization of SVI-Based Bayesian Neural Networks for Image Classification Hendrik Borras et.al. 2512.10602 null
2025-12-11 Quantization of massive Dirac neutrinos in external fields Maxim Dvornikov et.al. 2512.10587 null
2025-12-11 Disentangled and Distilled Encoder for Out-of-Distribution Reasoning with Rademacher Guarantees Zahra Rahiminasab et.al. 2512.10522 null
2025-12-11 Geometric quantization on big line bundles Siarhei Finski et.al. 2512.10466 null
2025-12-11 Error-Propagation-Free Learned Video Compression With Dual-Domain Progressive Temporal Alignment Han Li et.al. 2512.10450 null
2025-12-11 Clustered Federated Learning with Hierarchical Knowledge Distillation Sabtain Ahmad et.al. 2512.10443 null
2025-12-11 A Kernel-based Resource-efficient Neural Surrogate for Multi-fidelity Prediction of Aerodynamic Field Apurba Sarker et.al. 2512.10287 null
2025-12-11 An Efficient Graph-Transformer Operator for Learning Physical Dynamics with Manifolds Embedding Pengwei Liu et.al. 2512.10227 null
2025-12-10 Generate-Then-Validate: A Novel Question Generation Approach Using Small Language Models Yumou Wei et.al. 2512.10110 null
2025-12-10 Parallel Decoder Transformer: Model-Internal Parallel Decoding with Speculative Invariance via Note Conditioning Logan Robbins et.al. 2512.10054 null
2025-12-10 Spatial Spiking Neural Networks Enable Efficient and Robust Temporal Computation Lennart P. L. Landsmeer et.al. 2512.10011 null
2025-12-10 GoodSpeed: Optimizing Fair Goodput with Adaptive Speculative Decoding in Distributed Edge Inference Phuong Tran et.al. 2512.09963 null
2025-12-10 Token Expand-Merge: Training-Free Token Compression for Vision-Language-Action Models Yifan Ye et.al. 2512.09927 null
2025-12-10 Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach Salvador Carrión et.al. 2512.09910 null
2025-12-10 SCOPE: Language Models as One-Time Teacher for Hierarchical Planning in Text Environments Haoye Lu et.al. 2512.09897 null
2025-12-10 HPM-KD: Hierarchical Progressive Multi-Teacher Framework for Knowledge Distillation and Efficient Model Compression Gustavo Coelho Haase et.al. 2512.09886 null
2025-12-10 FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning Khurram Khalil et.al. 2512.09872 null
2025-12-10 RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning Khurram Khalil et.al. 2512.09829 null
2025-12-10 Energy-Efficient Federated Learning with Relay-Assisted Aggregation in IIoT Networks Hamid Reza Hashempour et.al. 2512.09827 null
2025-12-10 GLaD: Geometric Latent Distillation for Vision-Language-Action Models Minghao Guo et.al. 2512.09619 null
2025-12-10 LiePrune: Lie Group and Quantum Geometric Dual Representation for One-Shot Structured Pruning of Quantum Neural Networks Haijian Shao et.al. 2512.09469 null
2025-12-10 Black-Box Behavioral Distillation Breaks Safety Alignment in Medical LLMs Sohely Jahan et.al. 2512.09403 null
2025-12-10 Are Hypervectors Enough? Single-Call LLM Reasoning over Knowledge Graphs Yezi Liu et.al. 2512.09369 null
2025-12-10 NOC4SC: A Bandwidth-Efficient Multi-User Semantic Communication Framework for Interference-Resilient Transmission Yunhao Wang et.al. 2512.09356 null
2025-12-10 Training-free Context-adaptive Attention for Efficient Long Context Modeling Zeng You et.al. 2512.09238 null
2025-12-10 Tensor-Compressed and Fully-Quantized Training of Neural PDE Solvers Jinming Lu et.al. 2512.09202 null
2025-12-09 GS-KAN: Parameter-Efficient Kolmogorov-Arnold Networks via Sprecher-Type Shared Basis Functions Oscar Eliasson et.al. 2512.09084 null
2025-12-09 KD-OCT: Efficient Knowledge Distillation for Clinical-Grade Retinal OCT Classification Erfan Nourbakhsh et.al. 2512.09069 null
2025-12-09 Towards Lossless Ultimate Vision Token Compression for VLMs Dehua Zheng et.al. 2512.09010 null
2025-12-10 Efficiently Reconstructing Dynamic Scenes One D4RT at a Time Chuhan Zhang et.al. 2512.08924 null
2025-12-09 Fed-SE: Federated Self-Evolution for Privacy-Constrained Multi-Environment LLM Agents Xiang Chen et.al. 2512.08870 null
2025-12-09 PrivTune: Efficient and Privacy-Preserving Fine-Tuning of Large Language Models via Device-Cloud Collaboration Yi Liu et.al. 2512.08809 null
2025-12-09 Skewness-Guided Pruning of Multimodal Swin Transformers for Federated Skin Lesion Classification on Edge Devices Kuniko Paxton et.al. 2512.08751 null
2025-12-09 Scale-invariant and View-relational Representation Learning for Full Surround Monocular Depth Kyumin Hwang et.al. 2512.08700 null
2025-12-09 Beyond Real Weights: Hypercomplex Representations for Stable Quantization Jawad Ibn Ahad et.al. 2512.08524 null
2025-12-10 Solving Oversmoothing in GNNs via Nonlocal Message Passing: Algebraic Smoothing and Depth Scalability Weiqi Guan et.al. 2512.08475 null
2025-12-09 Quantization and Security Parameter Design for Overflow-Free Confidential FRIT Jungjin Park et.al. 2512.08464 null
2025-12-09 Nucleon Structure from Basis Light-Front Quantization: Status and Prospects James P. Vary et.al. 2512.08283 null
2025-12-09 SOFA-FL: Self-Organizing Hierarchical Federated Learning with Adaptive Clustered Data Sharing Yi Ni et.al. 2512.08267 null
2025-12-09 HybridToken-VLM: Hybrid Token Compression for Vision-Language Models Jusheng Zhang et.al. 2512.08240 null
2025-12-09 MobileFineTuner: A Unified End-to-End Framework for Fine-Tuning LLMs on Mobile Phones Jiaxiang Geng et.al. 2512.08211 null
2025-12-09 Animal Re-Identification on Microcontrollers Yubo Chen et.al. 2512.08198 null
2025-12-08 Skein-valued mirror curves for toric CY3 strips Mingyuan Hu et.al. 2512.07762 null
2025-12-08 PVeRA: Probabilistic Vector-Based Random Matrix Adaptation Leo Fillioux et.al. 2512.07703 null
2025-12-08 Sharp values for all dynamical variables via Anti-Wick quantization Simon Friederich et.al. 2512.07616 null
2025-12-08 Algorithm-hardware co-design of neuromorphic networks with dual memory pathways Pengfei Sun et.al. 2512.07602 null
2025-12-08 All You Need Are Random Visual Tokens? Demystifying Token Pruning in VLLMs Yahong Wang et.al. 2512.07580 null
2025-12-08 LIME: Making LLM Data More Efficient with Linguistic Metadata Embeddings Sebastian Sztwiertnia et.al. 2512.07522 null
2025-12-08 Dictionary-Based Contrastive Learning for GNSS Jamming Detection Zawar Hussain et.al. 2512.07512 null
2025-12-08 Persian-Phi: Efficient Cross-Lingual Adaptation of Compact LLMs via Curriculum Learning Amir Mohammad Akhlaghi et.al. 2512.07454 null
2025-12-08 Revolutionizing Mixed Precision Quantization: Towards Training-free Automatic Proxy Discovery via Large Language Models Haidong Kang et.al. 2512.07419 null
2025-12-08 GlimmerNet: A Lightweight Grouped Dilated Depthwise Convolutions for UAV-Based Emergency Monitoring Đorđe Nedeljković et.al. 2512.07391 null
2025-12-08 Recover-to-Forget: Gradient Reconstruction from LoRA for Efficient LLM Unlearning Yezi Liu et.al. 2512.07374 null
2025-12-08 Non-Intrusive Data-Free Parametric Reduced Order Model for Geometrically Nonlinear Structures Alexander Saccani et.al. 2512.07366 null
2025-12-08 ReLKD: Inter-Class Relation Learning with Knowledge Distillation for Generalized Category Discovery Fang Zhou et.al. 2512.07229 null
2025-12-08 Geometric Prior-Guided Federated Prompt Calibration Fei Luo et.al. 2512.07208 null
2025-12-08 SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting Seokhyun Youn et.al. 2512.07197 null
2025-12-08 HVQ-CGIC: Enabling Hyperprior Entropy Modeling for VQ-Based Controllable Generative Image Compression Niu Yi et.al. 2512.07192 null
2025-12-08 MuSASplat: Efficient Sparse-View 3D Gaussian Splats via Lightweight Multi-Scale Adaptation Muyu Xu et.al. 2512.07165 null
2025-12-08 Winning the Lottery by Preserving Network Training Dynamics with Concrete Ticket Search Tanay Arora et.al. 2512.07142 null
2025-12-08 FOAM: Blocked State Folding for Memory-Efficient LLM Training Ziqing Wen et.al. 2512.07112 null
2025-12-08 Leveraging KV Similarity for Online Structured Pruning in LLMs Jungmin Lee et.al. 2512.07090 null
2025-12-07 DAUNet: A Lightweight UNet Variant with Deformable Convolutions and Parameter-Free Attention for Medical Image Segmentation Adnan Munir et.al. 2512.07051 null
2025-12-07 PARIS: Pruning Algorithm via the Representer theorem for Imbalanced Scenarios Enrico Camporeale et.al. 2512.06950 null
2025-12-07 SceneMixer: Exploring Convolutional Mixing Networks for Remote Sensing Scene Classification Mohammed Q. Alkhatib et.al. 2512.06877 null
2025-12-07 Physics Informed Generative Machine Learning for Accelerated Quantum-centric Supercomputing Chayan Patra et.al. 2512.06858 null
2025-12-07 RMAdapter: Reconstruction-based Multi-Modal Adapter for Vision-Language Models Xiang Lin et.al. 2512.06811 null
2025-12-07 Parameter-Efficient Fine-Tuning with Differential Privacy for Robust Instruction Adaptation in Large Language Models Yulin Huang et.al. 2512.06711 null
2025-12-07 Towards Small Language Models for Security Query Generation in SOC Workflows Saleha Muzammil et.al. 2512.06660 null
2025-12-07 Quantum Temporal Convolutional Neural Networks for Cross-Sectional Equity Return Prediction: A Comparative Benchmark Study Chi-Sheng Chen et.al. 2512.06630 null
2025-12-07 Vector Quantization using Gaussian Variational Autoencoder Tongda Xu et.al. 2512.06609 null
2025-12-06 QL-LSTM: A Parameter-Efficient LSTM for Stable Long-Sequence Modeling Isaac Kofi Nti et.al. 2512.06582 null
2025-12-06 BEACON: A Unified Behavioral-Tactical Framework for Explainable Cybercrime Analysis with Large Language Models Arush Sachdeva et.al. 2512.06555 null
2025-12-06 ProSocialAlign: Preference Conditioned Test Time Alignment in Language Models Somnath Banerjee et.al. 2512.06515 null
2025-12-06 Small Language Models Can Use Nuanced Reasoning For Health Science Research Classification: A Microbial-Oncogenesis Case Study Muhammed Muaaz Dawood et.al. 2512.06502 null
2025-12-06 Optimizing LLMs Using Quantization for Mobile Execution Agatsya Yadav et.al. 2512.06490 null
2025-12-06 Neural expressiveness for beyond importance model compression Angelos-Christos Maroudis et.al. 2512.06440 null
2025-12-06 TreeQ: Pushing the Quantization Boundary of Diffusion Transformer via Tree-Structured Mixed-Precision Search Kaicheng Yang et.al. 2512.06353 null
2025-12-06 Theoretical Compression Bounds for Wide Multilayer Perceptrons Houssam El Cheairi et.al. 2512.06288 null
2025-12-06 Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models Chen Yang et.al. 2512.06266 null
2025-12-06 Quantization Blindspots: How Model Compression Breaks Backdoor Defenses Rohan Pandey et.al. 2512.06243 null
2025-12-06 LOCUS: A System and Method for Low-Cost Customization for Universal Specialization Dhanasekar Sundararaman et.al. 2512.06239 null
2025-12-06 GPU-GLMB: Assessing the Scalability of GPU-Accelerated Multi-Hypothesis Tracking Pranav Balakrishnan et.al. 2512.06230 null
2025-12-05 KQ-SVD: Compressing the KV Cache with Provable Guarantees on Attention Fidelity Damien Lesens et.al. 2512.05916 null
2025-12-05 Hadronic Emissions from the Microquasar V4641 Sgr, SS433, and its implications in the Diffuse Galactic Emission Basanti Paul et.al. 2512.05839 null
2025-12-05 HQ-DM: Single Hadamard Transformation-Based Quantization-Aware Training for Low-Bit Diffusion Models Shizhuo Mao et.al. 2512.05746 null
2025-12-05 Efficient Text Classification with Conformal In-Context Learning Ippokratis Pantelidis et.al. 2512.05732 null
2025-12-05 LeAD-M3D: Leveraging Asymmetric Distillation for Real-time Monocular 3D Detection Johannes Meier et.al. 2512.05663 null
2025-12-05 Efficient sequential Bayesian inference for state-space epidemic models using ensemble data assimilation Dhorasso Temfack et.al. 2512.05650 null
2025-12-05 DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model Pasquale De Marinis et.al. 2512.05613 null
2025-12-05 Fast SceneScript: Accurate and Efficient Structured Language Model via Multi-Token Prediction Ruihong Yin et.al. 2512.05597 null
2025-12-05 Rethinking Infrared Small Target Detection: A Foundation-Driven Efficient Paradigm Chuang Yu et.al. 2512.05511 null
2025-12-05 TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression Cheng-Yuan Ho et.al. 2512.05446 null
2025-12-05 BEAVER: An Efficient Deterministic LLM Verifier Tarun Suresh et.al. 2512.05439 null
2025-12-05 Performance Evaluation of Deep Learning for Tree Branch Segmentation in Autonomous Forestry Systems Yida Lin et.al. 2512.05418 null
2025-12-05 YOLO and SGBM Integration for Autonomous Tree Branch Detection and Depth Estimation in Radiata Pine Pruning Applications Yida Lin et.al. 2512.05412 null
2025-12-05 SQ-format: A Unified Sparse-Quantized Hardware-friendly Data Format for LLMs Ruixuan Huang et.al. 2512.05409 null
2025-12-05 LoC-Path: Learning to Compress for Pathology Multimodal Large Language Models Qingqiao Hu et.al. 2512.05391 null
2025-12-05 ShaRP: SHAllow-LayeR Pruning for Video Large Language Models Acceleration Yingjie Xia et.al. 2512.05385 null
2025-12-05 Group Orthogonal Low-Rank Adaptation for RGB-T Tracking Zekai Shao et.al. 2512.05359 null
2025-12-04 Uncertainty-Aware Data-Efficient AI: An Information-Theoretic Perspective Osvaldo Simeone et.al. 2512.05267 null
2025-12-04 Rethinking Tokenization for Clinical Time Series: When Less is More Rafi Al Attrach et.al. 2512.05217 null
2025-12-04 Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning Purbesh Mitra et.al. 2512.05105 null
2025-12-04 Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression Jung Yi et.al. 2512.05081 null
2025-12-04 David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design? Shashwat Shankar et.al. 2512.05073 null
2025-12-04 Meta-Learning for Quantum Optimization via Quantum Sequence Model Yu-Cheng Lin et.al. 2512.05058 null
2025-12-04 Arbitrage: Efficient Reasoning via Advantage-Aware Speculation Monishwaran Maheswaran et.al. 2512.05033 null
2025-12-04 Generative Neural Video Compression via Video Diffusion Prior Qi Mao et.al. 2512.05016 null
2025-12-04 Plug-and-Play Homeostatic Spark: Zero-Cost Acceleration for SNN Training Across Paradigms Rui Chen et.al. 2512.05015 null
2025-12-04 Efficient Generative Transformer Operators For Million-Point PDEs Armand Kassaï Koupaï et.al. 2512.04974 null
2025-12-04 FASTer: Toward Efficient Autoregressive Vision Language Action Modeling via neural Action Tokenization Yicheng Liu et.al. 2512.04952 null
2025-12-04 LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging Zhijian Shu et.al. 2512.04939 null
2025-12-04 Autoregressive Image Generation Needs Only a Few Lines of Cached Tokens Ziran Qin et.al. 2512.04857 null
2025-12-05 EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture Xin He et.al. 2512.04810 null
2025-12-04 MemLoRA: Distilling Expert Adapters for On-Device Memory Systems Massimo Bini et.al. 2512.04763 null
2025-12-04 Model Whisper: Steering Vectors Unlock Large Language Models’ Potential in Test-time Xinyue Kang et.al. 2512.04748 null
2025-12-04 SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs Wenhua Cheng et.al. 2512.04746 null
2025-12-04 TRINITY: An Evolved LLM Coordinator Jinglue Xu et.al. 2512.04695 null
2025-12-04 Towards Ethical Multi-Agent Systems of Large Language Models: A Mechanistic Interpretability Perspective Jae Hee Lee et.al. 2512.04691 null
2025-12-04 Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation Yunhong Lu et.al. 2512.04678 null
2025-12-04 Rethinking Decoupled Knowledge Distillation: A Predictive Distribution Perspective Bowen Zheng et.al. 2512.04625 null
2025-12-04 Metric dimension of Cartesian product of stars Akbar Davoodi et.al. 2512.04620 null
2025-12-04 Infrared UAV Target Tracking with Dynamic Feature Refinement and Global Contextual Attention Knowledge Distillation Houzhang Fang et.al. 2512.04581 null
2025-12-04 AdmTree: Compressing Lengthy Context with Adaptive Semantic Trees Yangning Li et.al. 2512.04550 null
2025-12-04 Boundary-Aware Test-Time Adaptation for Zero-Shot Medical Image Segmentation Chenlin Xu et.al. 2512.04520 null
2025-12-04 RapidUn: Influence-Driven Parameter Reweighting for Efficient Large Language Model Unlearning Guoshenghui Zhao et.al. 2512.04457 null
2025-12-04 MD-SNN: Membrane Potential-aware Distillation on Quantized Spiking Neural Network Donghyun Lee et.al. 2512.04443 null
2025-12-04 Dual-Stream Spectral Decoupling Distillation for Remote Sensing Object Detection Xiangyi Gao et.al. 2512.04413 null
2025-12-04 Efficient Reinforcement Learning with Semantic and Token Entropy for LLM Reasoning Hongye Cao et.al. 2512.04359 null
2025-12-03 GRASP: GRouped Activation Shared Parameterization for Parameter-Efficient Fine-Tuning and Robust Inference of Transformers Malyaban Bal et.al. 2512.04296 null
2025-12-03 Constructing Low-Redundancy Codes via Distributed Graph Coloring Yuting Li et.al. 2512.04197 null
2025-12-03 Quantum geometry and linear orbital response in arbitrary $SU(2)$ representation Rhonald Burgos Atencia et.al. 2512.04164 null
2025-12-03 Minuet: A Diffusion Autoencoder for Compact Semantic Compression of Multi-Band Galaxy Images Alexander T. Gagliano et.al. 2512.04145 null
2025-12-03 Solving N-Queen Problem using Las Vegas Algorithm with State Pruning Susmita Sharma et.al. 2512.04139 null
2025-12-03 RELIC: Interactive Video World Model with Long-Horizon Memory Yicong Hong et.al. 2512.04040 null
2025-12-03 Fast & Efficient Normalizing Flows and Applications of Image Generative Models Sandeep Nagar et.al. 2512.04039 null
2025-12-03 PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation Xiaolong Li et.al. 2512.04025 null
2025-12-03 Ultra-lightweight Neural Video Representation Compression Ho Man Kwan et.al. 2512.04019 null
2025-12-03 DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation Zexin Lin et.al. 2512.03992 null
2025-12-03 Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pre-trained Models Taido Purason et.al. 2512.03989 null
2025-12-03 Parameter efficient hybrid spiking-quantum convolutional neural network with surrogate gradient and quantum data-reupload Luu Trong Nhan et.al. 2512.03895 null
2025-12-03 Lean Unet: A Compact Model for Image Segmentation Ture Hassler et.al. 2512.03834 null
2025-12-03 AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition Zichuan Lin et.al. 2512.03794 null
2025-12-03 AR-Med: Automated Relevance Enhancement in Medical Search via LLM-Driven Information Augmentation Chuyue Wang et.al. 2512.03737 null
2025-12-03 PosA-VLA: Enhancing Action Generation via Pose-Conditioned Anchor Attention Ziwen Li et.al. 2512.03724 null
2025-12-03 ConvRot: Rotation-Based Plug-and-Play 4-bit Quantization for Diffusion Transformers Feice Huang et.al. 2512.03673 null
2025-12-03 Multi-Scale Visual Prompting for Lightweight Small-Image Classification Salim Khazem et.al. 2512.03663 null
2025-12-03 Optical Context Compression Is Just (Bad) Autoencoding Ivan Yee Lee et.al. 2512.03643 null
2025-12-03 SELF: A Robust Singular Value and Eigenvalue Approach for LLM Fingerprinting Hanxiu Zhang et.al. 2512.03620 null
2025-12-03 KVNAND: Efficient On-Device Large Language Model Inference Using DRAM-Free In-Flash Computing Lishuo Deng et.al. 2512.03608 null
2025-12-03 Federated Learning and Trajectory Compression for Enhanced AIS Coverage Thomas Gräupl et.al. 2512.03584 null
2025-12-03 Optimal Transportation and Alignment Between Gaussian Measures Sanjit Dandapanthula et.al. 2512.03579 null
2025-12-03 Parameter-Efficient Augment Plugin for Class-Incremental Learning Zhiming Xu et.al. 2512.03537 null
2025-12-03 NAS-LoRA: Empowering Parameter-Efficient Fine-Tuning for Visual Foundation Models with Searchable Adaptation Renqi Chen et.al. 2512.03499 null
2025-12-03 Quantum Encrypted Control of Networked Systems Zihao Ren et.al. 2512.03434 null
2025-12-03 Dual LoRA: Enhancing LoRA with Magnitude and Direction Updates Yixing Xu et.al. 2512.03402 null
2025-12-03 UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs Hung-Yueh Chiang et.al. 2512.03383 null
2025-12-03 Nexus: Higher-Order Attention Mechanisms in Transformers Hanting Chen et.al. 2512.03377 null
2025-12-03 Hierarchical Attention for Sparse Volumetric Anomaly Detection in Subclinical Keratoconus Lynn Kandakji et.al. 2512.03346 null
2025-12-03 Idea-Gated Transformers: Enforcing Semantic Coherence via Differentiable Vocabulary Pruning Darshan Fofadiya et.al. 2512.03343 null
2025-12-03 Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs Ngoc Bui et.al. 2512.03324 null
2025-12-02 InvertiTune: High-Quality Data Synthesis for Cost-Effective Single-Shot Text-to-Knowledge Graph Generation Faezeh Faez et.al. 2512.03197 null
2025-12-02 A Mathematical Introduction to Geometric Quantization Kadri İlker Berktav et.al. 2512.03171 null
2025-12-02 The Hilbert space of gauge theories: group averaging and the quantization of Jackiw-Teitelboim gravity Elba Alonso-Monsalve et.al. 2512.03030 null
2025-12-02 TokenPowerBench: Benchmarking the Power Consumption of LLM Inference Chenxu Niu et.al. 2512.03024 null
2025-12-02 Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks Matthew Dutson et.al. 2512.03014 null
2025-12-02 Pruning AMR: Efficient Visualization of Implicit Neural Representations via Weight Matrix Analysis Jennifer Zvonek et.al. 2512.02967 null
2025-12-02 A Lightweight Real-Time Low-Light Enhancement Network for Embedded Automotive Vision Systems Yuhan Chen et.al. 2512.02965 null
2025-12-02 AutoNeural: Co-Designing Vision-Language Models for NPU Inference Wei Chen et.al. 2512.02924 null
2025-12-02 FAIRY2I: Universal Extremely-Low Bit QAT framework via Widely-Linear Representation and Phase-Aware Quantization Feiyu Wang et.al. 2512.02901 null
2025-12-02 MindGPT-4ov: An Enhanced MLLM via a Multi-Stage Post-Training Paradigm Wei Chen et.al. 2512.02895 null
2025-12-02 Network Self-Configuration based on Fine-Tuned Small Language Models Oscar G. Lira et.al. 2512.02861 null
2025-12-02 LumiX: Structured and Coherent Text-to-Intrinsic Generation Xu Han et.al. 2512.02781 null
2025-12-02 PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models Robert Belanec et.al. 2512.02764 null
2025-12-02 Menta: A Small Language Model for On-Device Mental Health Prediction Tianyi Zhang et.al. 2512.02716 null
2025-12-02 G-PIFNN: A Generalizable Physics-informed Fourier Neural Network Framework for Electrical Circuits Ibrahim Shahbaz et.al. 2512.02712 null
2025-12-02 CREST: Universal Safety Guardrails Through Cluster-Guided Cross-Lingual Transfer Lavish Bansal et.al. 2512.02711 null
2025-12-02 VLM-Pruner: Buffering for Spatial Sparsity in an Efficient VLM Centrifugal Token Pruning Paradigm Zhenkai Wu et.al. 2512.02700 null
2025-12-02 PGP-DiffSR: Phase-Guided Progressive Pruning for Efficient Diffusion-based Image Super-Resolution Zhongbao Yang et.al. 2512.02681 null
2025-12-02 A Communication-Efficient Distributed Optimization Algorithm with Coupled Constraints Yuzhu Duan et.al. 2512.02634 null
2025-12-02 Adapting Tensor Kernel Machines to Enable Efficient Transfer Learning for Seizure Detection Seline J. S. de Rooij et.al. 2512.02626 null
2025-12-02 Stepwise Schema-Guided Prompting Framework with Parameter Efficient Instruction Tuning for Multimedia Event Extraction Xiang Yuan et.al. 2512.02584 null
2025-12-02 ADORE: Autonomous Domain-Oriented Relevance Engine for E-commerce Zheng Fang et.al. 2512.02555 null
2025-12-02 In-Context Distillation with Self-Consistency Cascades: A Simple, Training-Free Way to Reduce LLM Agent Costs Vishnu Sarukkai et.al. 2512.02543 null
2025-12-02 Improved Ising Meson Spectroscopy Simulation on a Noisy Digital Quantum Device Hao-Ti Hung et.al. 2512.02516 null
2025-12-02 TGDD: Trajectory Guided Dataset Distillation with Balanced Distribution Fengli Ran et.al. 2512.02469 null
2025-12-02 Artificial Noise Aided Physical Layer Security for Near-Field MIMO with Fluid Antenna Systems Peng Zhang et.al. 2512.02461 null
2025-12-02 Basis-Oriented Low-rank Transfer for Few-Shot and Test-Time Adaptation Junghwan Park et.al. 2512.02441 null
2025-12-02 Boosting Medical Vision-Language Pretraining via Momentum Self-Distillation under Limited Computing Resources Phuc Pham et.al. 2512.02438 null
2025-12-02 Generalizing Vision-Language Models with Dedicated Prompt Guidance Xinyao Li et.al. 2512.02421 null
2025-12-02 Data Curation Through the Lens of Spectral Dynamics: Static Limits, Dynamic Acceleration, and Practical Oracles Yizhou Zhang et.al. 2512.02409 null
2025-12-02 ESACT: An End-to-End Sparse Accelerator for Compute-Intensive Transformers via Local Similarity Hongxiang Liu et.al. 2512.02403 null
2025-12-02 Understanding and Harnessing Sparsity in Unified Multimodal Models Shwai He et.al. 2512.02351 null
2025-12-01 Fantasy: Efficient Large-scale Vector Search on GPU Clusters with GPUDirect Async Yi Liu et.al. 2512.02278 null
2025-12-01 Adversarial Robustness of Traffic Classification under Resource Constraints: Input Structure Matters Adel Chehade et.al. 2512.02276 null
2025-12-01 Lightweight Latent Reasoning for Narrative Tasks Alexander Gurung et.al. 2512.02240 null
2025-12-01 Thermodynamic Entropy as Information – A compression-based demonstration of the Shannon-Boltzmann equivalence in condensed matter Dallin Fisher et.al. 2512.02221 null
2025-12-01 Parameter-Efficient Subspace Optimization for LLM Fine-Tuning Yuchen Lou et.al. 2512.02216 null
2025-12-01 Think Before You Prune: Self-Reflective Structured Pruning for Reasoning Language Models Ziyan Wang et.al. 2512.02185 null
2025-12-01 TT-Stack: A Transformer-Based Tiered-Stacking Ensemble Framework with Meta-Learning for Automated Breast Cancer Detection in Mammography Showkat Osman et.al. 2512.02091 null
2025-12-01 Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling Jack Cook et.al. 2512.02010 null
2025-12-01 Feature-Based Semantics-Aware Scheduling for Energy-Harvesting Federated Learning Eunjeong Jeong et.al. 2512.01983 null
2025-12-01 Low-Rank Prehab: Preparing Neural Networks for SVD Compression Haoran Qin et.al. 2512.01980 null
2025-12-01 KV Pareto: Systems-Level Optimization of KV Cache and Model Compression for Long Context Inference Sai Gokhale et.al. 2512.01953 null
2025-12-01 Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models Zhongyu Yang et.al. 2512.01949 null
2025-12-01 SAM3-UNet: Simplified Adaptation of Segment Anything Model 3 Xinyu Xiong et.al. 2512.01789 null
2025-12-01 Learned Image Compression for Earth Observation: Implications for Downstream Segmentation Tasks Christian Mollière et.al. 2512.01788 null
2025-12-01 Resource Estimation for VQE on Small Molecules: Impact of Fermion Mappings and Hamiltonian Reductions Anurag K. S. V. et.al. 2512.01605 null
2025-12-01 Neural Network Perturbation Theory (NNPT): Learning Residual Corrections from Exact Solutions Zhenhao Chen et.al. 2512.01558 null
2025-12-01 LPCD: Unified Framework from Layer-Wise to Submodule Quantization Yuma Ichikawa et.al. 2512.01546 null
2025-12-01 FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention Zipeng Wang et.al. 2512.01540 null
2025-12-01 The Poisson-Fourier Transform for bicrossed products I: Abelian approximations and the quantum duality principle A. Massar et.al. 2512.01536 null
2025-12-01 Diffusion Fuzzy System: Fuzzy Rule Guided Latent Multi-Path Diffusion Modeling Hailong Yang et.al. 2512.01533 null
2025-12-01 MEGConformer: Conformer-Based MEG Decoder for Robust Speech and Phoneme Classification Xabier de Zuazo et.al. 2512.01443 null
2025-12-01 Fantastic Features and Where to Find Them: A Probing Method to combine Features from Multiple Foundation Models Benjamin Ramtoula et.al. 2512.01405 null
2025-12-01 Intrinsic Structure as a Proxy for Saliency: SVD-Based Weight Preservation for Mixed-Precision Quantization in Large Language Models Shashank Landge et.al. 2512.01343 null
2025-12-01 EGG-Fusion: Efficient 3D Reconstruction with Geometry-aware Gaussian Surfel on the Fly Xiaokun Pan et.al. 2512.01296 null
2025-12-01 Diffusion Model in Latent Space for Medical Image Segmentation Task Huynh Trinh Ngoc et.al. 2512.01292 null
2025-12-01 Efficient Training of Diffusion Mixture-of-Experts Models: A Practical Recipe Yahui Liu et.al. 2512.01252 null
2025-12-01 First On-Orbit Demonstration of a Geospatial Foundation Model Andrew Du et.al. 2512.01181 null
2025-11-30 Projection-Free CNN Pruning via Frank-Wolfe with Momentum: Sparser Models with Less Pretraining Hamza ElMokhtar Shili et.al. 2512.01147 null
2025-11-30 Structural Prognostic Event Modeling for Multimodal Cancer Survival Analysis Yilan Zhang et.al. 2512.01116 null
2025-11-30 Parameter Reduction Improves Vision Transformers: A Comparative Study of Sharing and Width Reduction Anantha Padmanaban Krishna Kumar et.al. 2512.01059 null
2025-11-30 A Provably Efficient Method for Tensor Ring Decomposition and Its Applications Han Chen et.al. 2512.01016 null
2025-11-30 WUSH: Near-Optimal Adaptive Transforms for LLM Quantization Jiale Chen et.al. 2512.00956 null
2025-11-28 Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction Bao Shu et.al. 2511.23476 null
2025-11-28 Visual Generation Tuning Jiahao Guo et.al. 2511.23469 null
2025-11-28 Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning Jiajun Guo et.al. 2511.23402 null
2025-11-28 FedSGT: Exact Federated Unlearning via Sequential Group-based Training Bokang Zhang et.al. 2511.23393 null
2025-11-28 VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction Sinan Du et.al. 2511.23386 null
2025-11-28 Optimizing Multimodal Language Models through Attention-based Interpretability Alexander Sergeev et.al. 2511.23375 null
2025-11-28 Chart2Code-MoLA: Efficient Multi-Modal Code Generation via Adaptive Expert Routing Yifei Wang et.al. 2511.23321 null
2025-11-28 Efficient Estimation of Sum-Parameters for Multi-Component Complex Exponential Signals with Theoretical Cramer-Rao Bound Analysis Huiguang Zhang et.al. 2511.23318 null
2025-11-28 Closing the Generalization Gap in Parameter-efficient Federated Edge Learning Xinnong Du et.al. 2511.23282 null
2025-11-28 Behavior-Equivalent Token: Single-Token Replacement for Long Prompts in LLMs Jiancheng Dong et.al. 2511.23271 null
2025-11-28 PointCNN++: Performant Convolution on Native Points Lihan Li et.al. 2511.23227 null
2025-11-28 TWEO: Transformers Without Extreme Outliers Enables FP8 Training And Quantization For Dummies Guang Liang et.al. 2511.23225 null
2025-11-28 Pathryoshka: Compressing Pathology Foundation Models via Multi-Teacher Knowledge Distillation with Nested Embeddings Christian Grashei et.al. 2511.23204 null
2025-11-28 InstanceV: Instance-Level Video Generation Yuheng Chen et.al. 2511.23146 null
2025-11-28 Evolutionary Discovery of Heuristic Policies for Traffic Signal Control Ruibing Wang et.al. 2511.23122 null
2025-11-28 Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM Mengjie Liu et.al. 2511.23119 null
2025-11-28 Accent Placement Models for Rigvedic Sanskrit Text Akhil Rajeev P et.al. 2511.23088 null
2025-11-28 EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model Yuhao Xu et.al. 2511.22935 null
2025-11-28 AgentShield: Make MAS more secure and efficient Kaixiang Wang et.al. 2511.22924 null
2025-11-28 ORION: Teaching Language Models to Reason Efficiently in the Language of Thought Kumar Tanmay et.al. 2511.22891 null
2025-11-28 Serving Heterogeneous LoRA Adapters in Distributed LLM Inference Systems Shashwat Jaiswal et.al. 2511.22880 null
2025-11-28 CNN-Based Framework for Pedestrian Age and Gender Classification Using Far-View Surveillance in Mixed-Traffic Intersections Shisir Shahriar Arif et.al. 2511.22873 null
2025-11-28 PerfMamba: Performance Analysis and Pruning of Selective State Space Models Abdullah Al Asif et.al. 2511.22849 null
2025-11-27 FPGA-Enabled Modulo ADC with x100 Dynamic-Range Expansion: Hardware Design and Performance Evaluation Zeyuan Li et.al. 2511.22752 null
2025-11-27 All Centers Are at most a Few Tokens Apart: Knowledge Distillation with Domain Invariant Prompt Tuning Amir Mohammad Ezzati et.al. 2511.22739 null
2025-11-27 Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Z-Image Team et.al. 2511.22699 null
2025-11-27 Smarter, not Bigger: Fine-Tuned RAG-Enhanced LLMs for Automotive HIL Testing Chao Feng et.al. 2511.22584 null
2025-11-27 Diff-ICMH: Harmonizing Machine and Human Vision in Image Compression with Generative Prior Ruoyu Feng et.al. 2511.22549 null
2025-11-27 Enhancing Trustworthiness with Mixed Precision: Benchmarks, Opportunities, and Challenges Guanxi Lu et.al. 2511.22483 null
2025-11-27 OmniInfer: System-Wide Acceleration Techniques for Optimizing LLM Serving Throughput and Latency Jun Wang et.al. 2511.22481 null
2025-11-27 RoadSceneBench: A Lightweight Benchmark for Mid-Level Road Scene Understanding Xiyan Liu et.al. 2511.22466 null
2025-11-27 An Efficient Embedding Based Ad Retrieval with GPU-Powered Feature Interaction Yifan Lei et.al. 2511.22460 null
2025-11-27 ITS3D: Inference-Time Scaling for Text-Guided 3D Diffusion Models Zhenglin Zhou et.al. 2511.22456 null
2025-11-27 Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation Weining Ren et.al. 2511.22429 null
2025-11-27 Efficient-Husformer: Efficient Multimodal Transformer Hyperparameter Optimization for Stress and Cognitive Loads Merey Orazaly et.al. 2511.22362 null
2025-11-26 ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Hongjin Su et.al. 2511.21689 null
2025-11-26 Continual Error Correction on Low-Resource Devices Kirill Paramonov et.al. 2511.21652 null
2025-11-26 Automated Protein Motif Localization using Concept Activation Vectors in Protein Language Model Embedding Space Ahmad Shamail et.al. 2511.21614 null
2025-11-26 Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining Dongyang Fan et.al. 2511.21613 null
2025-11-26 Multimodal Robust Prompt Distillation for 3D Point Cloud Models Xiang Gu et.al. 2511.21574 null
2025-11-26 EoS-FM: Can an Ensemble of Specialist Models act as a Generalist Feature Extractor? Pierre Adorni et.al. 2511.21523 null
2025-11-26 IntAttention: A Fully Integer Attention Pipeline for Efficient Edge Inference Wanli Zhong et.al. 2511.21513 null
2025-11-26 CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation Shizhe Sun et.al. 2511.21503 null
2025-11-26 MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices Shuai Zhang et.al. 2511.21475 null
2025-11-26 VibraWave: Sensing the Pulse of Polluted Waters Sagnik Ghosh et.al. 2511.21456 null
2025-11-26 Odin: Oriented Dual-module Integration for Text-rich Network Representation Learning Kaifeng Hong et.al. 2511.21416 null
2025-11-26 Knowledge Distillation for Continual Learning of Biomedical Neural Fields Wouter Visser et.al. 2511.21409 null
2025-11-26 Prune4Web: DOM Tree Pruning Programming for Web Agent Jiayuan Zhang et.al. 2511.21398 null
2025-11-26 FITRep: Attention-Guided Item Representation via MLLMs Guoxiao Zhang et.al. 2511.21389 null
2025-11-26 An octree-based sampling algorithm for analyzing big simulation data Janis Geise et.al. 2511.21352 null
2025-11-26 Helical Quasiperiodic Chains with Engineered Dissipation: Liouvillian Rapidity Diagnostics of Transport and Localization Mohammad Pouranvari et.al. 2511.21332 null
2025-11-26 PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark Robert Belanec et.al. 2511.21285 null
2025-11-26 DynamicAdaptiveClimb: Adaptive Cache Replacement with Dynamic Resizing Daniel Berend et.al. 2511.21235 null
2025-11-26 Data Exfiltration by Compression Attack: Definition and Evaluation on Medical Image Data Huiyu Li et.al. 2511.21227 null
2025-11-26 LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs Shichu Sun et.al. 2511.21150 null
2025-11-26 Which Layer Causes Distribution Deviation? Entropy-Guided Adaptive Pruning for Diffusion and Flow Models Changlin Li et.al. 2511.21122 null
2025-11-26 Quantum Hard Spheres with Affine Quantization Riccardo Fantoni et.al. 2511.21119 null
2025-11-26 EM-KD: Distilling Efficient Multimodal Large Language Model with Unbalanced Vision Tokens Ze Feng et.al. 2511.21106 null
2025-11-26 MLPMoE: Zero-Shot Architectural Metamorphosis of Dense LLM MLPs into Static Mixture-of-Experts Ivan Novikov et.al. 2511.21089 null
2025-11-26 5G Network Automation Using Local Large Language Models and Retrieval-Augmented Generation Ahmadreza Majlesara et.al. 2511.21084 null
2025-11-26 G-Net: A Provably Easy Construction of High-Accuracy Random Binary Neural Networks Alireza Aghasi et.al. 2511.21063 null
2025-11-26 RAVQ-HoloNet: Rate-Adaptive Vector-Quantized Hologram Compression Shima Rafiei et.al. 2511.21035 null
2025-11-26 Lightweight Model Editing for LLMs to Correct Deprecated API Recommendations Guancheng Lin et.al. 2511.21022 null
2025-11-26 ICPO: Intrinsic Confidence-Driven Group Relative Preference Optimization for Efficient Reinforcement Learning Jinpeng Wang et.al. 2511.21005 null
2025-11-25 $Δ$-NeRF: Incremental Refinement of Neural Radiance Fields through Residual Control and Knowledge Transfer Kriti Ghosh et.al. 2511.20804 null
2025-11-25 Unleashing the Power of Vision-Language Models for Long-Tailed Multi-Label Visual Recognition Wei Tang et.al. 2511.20641 null
2025-11-25 DiFR: Inference Verification Despite Nondeterminism Adam Karvonen et.al. 2511.20621 null
2025-11-25 NVIDIA Nemotron Parse 1.1 Kateryna Chumachenko et.al. 2511.20478 null
2025-11-25 Efficient Estimation of Multiple Temperatures via a Collisional Model Srijon Ghosh et.al. 2511.20448 null
2025-11-25 Object-Centric Vision Token Pruning for Vision Language Models Guangyuan Li et.al. 2511.20439 null
2025-11-25 BRIC: Bridging Kinematic Plans and Physical Control at Test Time Dohun Lim et.al. 2511.20431 null
2025-11-25 Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs Bao Tang et.al. 2511.20410 null
2025-11-25 MoRE: Batch-Robust Multi-Omics Representations from Frozen Pre-trained Transformers Audrey Pei-Hsuan Chen et.al. 2511.20382 null
2025-11-25 From Passive Perception to Active Memory: A Weakly Supervised Image Manipulation Localization Framework Driven by Coarse-Grained Annotations Zhiqing Guo et.al. 2511.20359 null
2025-11-25 Resistive switching and long-range filaments in metal/DMSO liquid systems for three-dimensional, multi-terminal connection schemes with on demand dynamic reconfigurability Roshani Madurawala et.al. 2511.20314 null
2025-11-25 CrossEarth-Gate: Fisher-Guided Adaptive Tuning Engine for Efficient Adaptation of Cross-Domain Remote Sensing Semantic Segmentation Shilei Cao et.al. 2511.20302 null
2025-11-25 Forgetting by Pruning: Data Deletion in Join Cardinality Estimation Chaowei He et.al. 2511.20293 null
2025-11-25 Modality-Balanced Collaborative Distillation for Multi-Modal Domain Generalization Xiaohan Wang et.al. 2511.20258 null
2025-11-25 Communication-Efficient Learning for Satellite Constellations Ruxandra-Stefania Tudose et.al. 2511.20220 null
2025-11-25 Interactive AI NPCs Powered by LLMs: Technical Report for the CPDC Challenge 2025 Yitian Huang et.al. 2511.20200 null
2025-11-25 Efficient multi-fidelity Gaussian process regression for noisy outputs and non-nested experimental designs Nils Baillie et.al. 2511.20183 null
2025-11-25 KyrgyzBERT: A Compact, Efficient Language Model for Kyrgyz NLP Adilet Metinov et.al. 2511.20182 null
2025-11-25 Hybrid Convolution and Frequency State Space Network for Image Compression Haodong Pan et.al. 2511.20151 null
2025-11-25 Fusion of Simulation and Experiment Data for Hypersonic Flow Field Prediction via Pre-Training and Fine-Tuning Yuan Jia et.al. 2511.20149 null
2025-11-25 IDAP++: Advancing Divergence-Based Pruning via Filter-Level and Layer-Level Optimization Aleksei Samarin et.al. 2511.20141 null
2025-11-25 WPT: World-to-Policy Transfer via Online World Model Distillation Guangfeng Jiang et.al. 2511.20095 null
2025-11-25 VICoT-Agent: A Vision-Interleaved Chain-of-Thought Framework for Interpretable Multimodal Reasoning and Scalable Remote Sensing Analysis Chujie Wang et.al. 2511.20085 null
2025-11-25 FLaTEC: Frequency-Disentangled Latent Triplanes for Efficient Compression of LiDAR Point Clouds Xiaoge Zhang et.al. 2511.20065 null
2025-11-25 On-Demand Multi-Task Sparsity for Efficient Large-Model Deployment on Edge Devices Lianming Huang et.al. 2511.19986 null
2025-11-25 Error-structure-tailored early fault-tolerant quantum computing Pei Zeng et.al. 2511.19983 null
2025-11-25 M$^3$Prune: Hierarchical Communication Graph Pruning for Efficient Multi-Modal Multi-Agent Retrieval-Augmented Generation Weizi Shao et.al. 2511.19969 null
2025-11-25 Towards Edge General Intelligence: Knowledge Distillation for Mobile Agentic AI Yuxuan Wu et.al. 2511.19947 null
2025-11-25 EfficientXpert: Efficient Domain Adaptation for Large Language Models via Propagation-Aware Pruning Songlin Zhao et.al. 2511.19935 null
2025-11-25 Context-Aware Token Pruning and Discriminative Selective Attention for Transformer Tracking Janani Kugarajeevan et.al. 2511.19928 null
2025-11-25 Efficient Importance Sampling under Heston Model: Short Maturity and Deep Out-of-the-Money Options Yun-Feng Tu et.al. 2511.19826 null
2025-11-25 Mosaic Pruning: A Hierarchical Framework for Generalizable Pruning of Mixture-of-Experts Models Wentao Hu et.al. 2511.19822 null
2025-11-24 NOEM$^{3}$A: A Neuro-Symbolic Ontology-Enhanced Method for Multi-Intent Understanding in Mobile Agents Ioannis Tzachristas et.al. 2511.19780 null
2025-11-24 A Storage-Efficient Feature for 3D Concrete Defect Segmentation to Replace Normal Vector Linxin Hua et.al. 2511.19760 null
2025-11-24 Leveraging Foundation Models for Histological Grading in Cutaneous Squamous Cell Carcinoma using PathFMTools Abdul Rahman Diab et.al. 2511.19751 null
2025-11-24 Rethinking Vision Transformer Depth via Structural Reparameterization Chengwei Zhou et.al. 2511.19718 null
2025-11-24 CafeQ: Calibration-free Quantization via Learned Transformations and Adaptive Rounding Ziteng Sun et.al. 2511.19705 null
2025-11-24 RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models Omar Alama et.al. 2511.19704 null
2025-11-24 INTERLACE: Interleaved Layer Pruning and Efficient Adaptation in Large Vision-Language Models Parsa Madinei et.al. 2511.19676 null
2025-11-24 FISCAL: Financial Synthetic Claim-document Augmented Learning for Efficient Fact-Checking Rishab Sharma et.al. 2511.19671 null
2025-11-24 SLMFix: Leveraging Small Language Models for Error Fixing with Reinforcement Learning David Jiahao Fu et.al. 2511.19422 null
2025-11-24 Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens Yiming Qin et.al. 2511.19418 null
2025-11-24 Be My Eyes: Extending Large Language Models to New Modalities Through Multi-Agent Collaboration James Y. Huang et.al. 2511.19417 null
2025-11-24 Learning Plug-and-play Memory for Guiding Video Diffusion Models Selena Song et.al. 2511.19229 null
2025-11-24 Communication: Modeling layered mosaic perovskite alloy microstructures across length scales via a packing algorithm Murray Skolnick et.al. 2511.19228 null
2025-11-24 UMCL: Unimodal-generated Multimodal Contrastive Learning for Cross-compression-rate Deepfake Detection Ching-Yi Lai et.al. 2511.18983 null
2025-11-24 FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning Xin Yuan et.al. 2511.18977 null
2025-11-24 Compressor-VLA: Instruction-Guided Visual Token Compression for Efficient Robotic Manipulation Juntao Gao et.al. 2511.18950 null
2025-11-24 SWAN: Sparse Winnowed Attention for Reduced Inference Memory via Decompression-Free KV-Cache Compression Santhosh G S et.al. 2511.18936 null
2025-11-24 EventSTU: Event-Guided Efficient Spatio-Temporal Understanding for Video Large Language Models Wenhao Xu et.al. 2511.18920 null
2025-11-24 Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models Yonggan Fu et.al. 2511.18890 null
2025-11-24 HunyuanVideo 1.5 Technical Report Bing Wu et.al. 2511.18870 null
2025-11-24 Think Before You Prune: Selective Self-Generated Calibration for Pruning Large Reasoning Models Yang Xiang et.al. 2511.18864 null
2025-11-24 Optimizing LLM Code Suggestions: Feedback-Driven Timing with Lightweight State Bounds Mohammad Nour Al Awad et.al. 2511.18842 null
2025-11-24 Auto-ML Graph Neural Network Hypermodels for Outcome Prediction in Event-Sequence Data Fang Wang et.al. 2511.18835 null
2025-11-24 Concept than Document: Context Compression via AMR-based Conceptual Entropy Kaize Shi et.al. 2511.18832 null
2025-11-24 VideoCompressa: Data-Efficient Video Understanding via Joint Temporal Compression and Spatial Reconstruction Shaobo Wang et.al. 2511.18831 null
2025-11-24 Towards Characterizing Knowledge Distillation of PPG Heart Rate Estimation Models Kanav Arora et.al. 2511.18829 null
2025-11-24 Uncertainty-Aware Dual-Student Knowledge Distillation for Efficient Image Classification Aakash Gore et.al. 2511.18826 null
2025-11-24 DiP: Taming Diffusion Models in Pixel Space Zhennan Chen et.al. 2511.18822 null
2025-11-24 HERMES: Towards Efficient and Verifiable Mathematical Reasoning in LLMs Azim Ospanov et.al. 2511.18760 null
2025-11-24 CoD: A Diffusion Foundation Model for Image Compression Zhaoyang Jia et.al. 2511.18706 null
2025-11-24 VLM in a flash: I/O-Efficient Sparsification of Vision-Language Model via Neuron Chunking Kichang Yang et.al. 2511.18692 null
2025-11-24 EVCC: Enhanced Vision Transformer-ConvNeXt-CoAtNet Fusion for Classification Kazi Reyazul Hasan et.al. 2511.18691 null
2025-11-24 QuantKAN: A Unified Quantization Framework for Kolmogorov Arnold Networks Kazi Ahmed Asif Fuad et.al. 2511.18689 null
2025-11-23 Kitty: Accurate and Efficient 2-bit KV Cache Quantization with Dynamic Channel-wise Precision Boost Haojun Xia et.al. 2511.18643 null
2025-11-23 AutoFocus-IL: VLM-based Saliency Maps for Data-Efficient Visual Imitation Learning without Extra Human Annotations Litian Gong et.al. 2511.18617 null
2025-11-23 Quantum machine learning for efficient reduced order modelling of turbulent flows Han Li et.al. 2511.18552 null
2025-11-21 Native 3D Editing with Full Attention Weiwei Cai et.al. 2511.17501 null
2025-11-21 Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift Björn Michele et.al. 2511.17455 null
2025-11-21 MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Models Yuqi Li et.al. 2511.17448 null
2025-11-21 Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers Christopher Boland et.al. 2511.17421 null
2025-11-21 DS-Span: Single-Phase Discriminative Subgraph Mining for Efficient Graph Embeddings Yeamin Kaiser et.al. 2511.17419 null
2025-11-21 METIS: Multi-Source Egocentric Training for Integrated Dexterous Vision-Language-Action Model Yankai Fu et.al. 2511.17366 null
2025-11-21 Efficient calculation of magnetic fields from ferromagnetic materials near strong electromagnets, and application to stellarator coil optimization Matt Landreman et.al. 2511.17305 null
2025-11-21 Equivariant-Aware Structured Pruning for Efficient Edge Deployment: A Comprehensive Framework with Adaptive Fine-Tuning Mohammed Alnemari et.al. 2511.17242 null
2025-11-21 E$^3$-Pruner: Towards Efficient, Economical, and Effective Layer Pruning for Large Language Models Tao Yuan et.al. 2511.17205 null
2025-11-21 Efficient Robot Design with Multi-Objective Black-Box Optimization and Large Language Models Kento Kawaharazuka et.al. 2511.17178 null
2025-11-21 Magnetized particle motion and accretion process with shock cone morphology around a decoupled hairy black holes G. Mustafa et.al. 2511.17137 null
2025-11-21 A Multi-Stage Optimization Framework for Deploying Learned Image Compression on FPGAs Jiaxun Fang et.al. 2511.17135 null
2025-11-21 Learning to Compress: Unlocking the Potential of Large Language Models for Text Representation Yeqin Zhang et.al. 2511.17129 null
2025-11-21 Layer-wise Weight Selection for Power-Efficient Neural Network Acceleration Jiaxun Fang et.al. 2511.17123 null
2025-11-21 CLLMRec: LLM-powered Cognitive-Aware Concept Recommendation via Semantic Alignment and Prerequisite Knowledge Distillation Xiangrui Xiong et.al. 2511.17041 null
2025-11-21 Supervised Fine Tuning of Large Language Models for Domain Specific Knowledge Graph Construction:A Case Study on Hunan’s Historical Celebrities Junjie Hao et.al. 2511.17012 null
2025-11-21 Gradient-Driven Natural Selection for Compact 3D Gaussian Splatting Xiaobin Deng et.al. 2511.16980 null
2025-11-21 RASTP: Representation-Aware Semantic Token Pruning for Generative Recommendation with Semantic Identifiers Tianyu Zhan et.al. 2511.16943 null
2025-11-21 Berezin-Toeplitz quantization revisited Kwokwai Chan et.al. 2511.16889 null
2025-11-21 Avoiding Quality Saturation in UGC Compression Using Denoised References Xin Xiong et.al. 2511.16876 null
2025-11-20 Efficient Penalty-Based Bilevel Methods: Improved Analysis, Novel Updates, and Flatness Condition Liuyuan Jiang et.al. 2511.16796 null
2025-11-20 Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach Yaoxin Yang et.al. 2511.16786 null
2025-11-20 RampoNN: A Reachability-Guided System Falsification for Efficient Cyber-Kinetic Vulnerability Detection Kohei Tsujio et.al. 2511.16765 null
2025-11-20 Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter Qinghao Hu et.al. 2511.16665 null
2025-11-20 Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs Ali Taghibakhshi et.al. 2511.16664 null
2025-11-20 Teacher-Guided One-Shot Pruning via Context-Aware Knowledge Distillation Md. Samiul Alim et.al. 2511.16653 null
2025-11-21 You Only Forward Once: An Efficient Compositional Judging Paradigm Tianlong Zhang et.al. 2511.16600 null
2025-11-20 TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding Boshen Xu et.al. 2511.16595 null
2025-11-20 The Oracle and The Prism: A Decoupled and Efficient Framework for Generative Recommendation Explanation Jiaheng Zhang et.al. 2511.16543 null
2025-11-20 Optimizing Federated Learning in the Era of LLMs: Message Quantization and Streaming Ziyue Xu et.al. 2511.16450 null
2025-11-20 VLA-Pruner: Temporal-Aware Dual-Level Visual Token Pruning for Efficient Vision-Language-Action Inference Ziyan Liu et.al. 2511.16449 null
2025-11-20 FreqFlow: Long-term forecasting using lightweight flow matching Seyed Mohamad Moghadas et.al. 2511.16426 null
2025-11-20 TOFA: Training-Free One-Shot Federated Adaptation for Vision-Language Models Li Zhang et.al. 2511.16423 null
2025-11-20 An Efficient LLM-based Evolutional Recommendation with Locate-Forget-Update Paradigm Hao Liu et.al. 2511.16414 null
2025-11-20 VersaPants: A Loose-Fitting Textile Capacitive Sensing System for Lower-Body Motion Capture Deniz Kasap et.al. 2511.16346 null
2025-11-20 Aerial View River Landform Video segmentation: A Weakly Supervised Context-aware Temporal Consistency Distillation Approach Chi-Han Chen et.al. 2511.16343 null
2025-11-20 SDA: Steering-Driven Distribution Alignment for Open LLMs without Fine-Tuning Wei Xia et.al. 2511.16324 null
2025-11-20 WWE-UIE: A Wavelet & White Balance Efficient Network for Underwater Image Enhancement Ching-Heng Cheng et.al. 2511.16321 null
2025-11-20 SeSE: A Structural Information-Guided Uncertainty Quantification Framework for Hallucination Detection in LLMs Xingtao Zhao et.al. 2511.16275 null
2025-11-20 Accelerating Reionization Constraints: An ANN-Emulator Framework for the SCRIPT Semi-numerical Model Saptarshi Sarkar et.al. 2511.16256 null
2025-11-20 FT-NCFM: An Influence-Aware Data Distillation Framework for Efficient VLA Models Kewei Chen et.al. 2511.16233 null
2025-11-20 Q-MLLM: Vector Quantization for Robust Multimodal Large Language Model Security Wei Zhao et.al. 2511.16229 null
2025-11-20 Optical Waveguide-Pair Design for CMOS-Compatible Hybrid III-V-on-Silicon Quantum Dot Lasers Peter Raymond Smith et.al. 2511.16222 null
2025-11-20 PIPHEN: Physical Interaction Prediction with Hamiltonian Energy Networks Kewei Chen et.al. 2511.16200 null
2025-11-20 Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers Jian Ma et.al. 2511.16156 null
2025-11-20 TS-PEFT: Token-Selective Parameter-Efficient Fine-Tuning with Learnable Threshold Gating Dabiao Ma et.al. 2511.16147 null
2025-11-20 LEGO-SLAM: Language-Embedded Gaussian Optimization SLAM Sibaek Lee et.al. 2511.16144 null
2025-11-20 Degradation-Aware Hierarchical Termination for Blind Quality Enhancement of Compressed Video Li Yu et.al. 2511.16137 null
2025-11-20 Change-of-Basis Pruning via Rotational Invariance Alex Ning et.al. 2511.16061 null
2025-11-20 LiSTAR: Ray-Centric World Models for 4D LiDAR Sequences in Autonomous Driving Pei Liu et.al. 2511.16049 null
2025-11-20 Fairness in Multi-modal Medical Diagnosis with Demonstration Selection Dawei Li et.al. 2511.15986 null
2025-11-20 JudgeBoard: Benchmarking and Enhancing Small Language Models for Reasoning Evaluation Zhenyu Bi et.al. 2511.15958 null
2025-11-20 A Scalable NorthPole System with End-to-End Vertical Integration for Low-Latency and Energy-Efficient LLM Inference Michael V. DeBole et.al. 2511.15950 null
2025-11-19 discretize_distributions: Efficient Quantization of Gaussian Mixtures with Guarantees in Wasserstein Distance Steven Adams et.al. 2511.15854 null
2025-11-19 EfficientSAM3: Progressive Hierarchical Distillation for Video Concept Segmentation from SAM1, 2, and 3 Chengxi Zeng et.al. 2511.15833 null
2025-11-19 Dimensional Phenomenology in Polymeric Quantization Framework Kourosh Nozari et.al. 2511.15826 null
2025-11-19 UniUltra: Interactive Parameter-Efficient SAM2 for Universal Ultrasound Segmentation Yue Li et.al. 2511.15771 null
2025-11-19 Joint Semantic-Channel Coding and Modulation for Token Communications Jingkai Ying et.al. 2511.15699 null
2025-11-19 The Impact of Quantization on Large Reasoning Model Reinforcement Learning Medha Kumar et.al. 2511.15694 null
2025-11-19 From Low-Rank Features to Encoding Mismatch: Rethinking Feature Distillation in Vision Transformers Huiyuan Tian et.al. 2511.15572 null
2025-11-19 Learning to Expand Images for Efficient Visual Autoregressive Modeling Ruiqing Yang et.al. 2511.15499 null
2025-11-19 Batalin-Fradkin-Vilkovisky Quantization of Quadratic Gravity Jorge Bellorin et.al. 2511.15474 null
2025-11-19 Small Language Models for Phishing Website Detection: Cost, Performance, and Privacy Trade-Offs Georg Goldenits et.al. 2511.15434 null
2025-11-19 D4C: Data-free Quantization for Contrastive Language-Image Pre-training Models Wenlun Zhang et.al. 2511.15411 null
2025-11-19 Breaking Expert Knowledge Limits: Self-Pruning for Large Language Models Haidong Kang et.al. 2511.15390 null
2025-11-19 Parameter Importance-Driven Continual Learning for Foundation Models Lingxiang Wang et.al. 2511.15375 null
2025-11-19 IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers Gihwan Kim et.al. 2511.15369 null
2025-11-19 Fidelity-Preserving Quantum Encoding for Quantum Neural Networks Yuhu Lu et.al. 2511.15363 null
2025-11-19 Quant-Trim in Practice: Improved Cross-Platform Low-Bit Deployment on Edge NPUs Rayen Dhahri et.al. 2511.15300 null
2025-11-19 Context Cascade Compression: Exploring the Upper Limits of Text Compression Fanfan Liu et.al. 2511.15244 null
2025-11-19 SkinGPT-R1: Adapter-Only Dual Distillation for Efficient Dermatology Reasoning Yuhao Shen et.al. 2511.15242 null
2025-11-19 Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning Yuxuan Gu et.al. 2511.15190 null
2025-11-19 Efficient RF Passive Components Modeling with Bayesian Online Learning and Uncertainty Aware Sampling Huifan Zhang et.al. 2511.15125 null
2025-11-19 Multi-Aspect Cross-modal Quantization for Generative Recommendation Fuwei Zhang et.al. 2511.15122 null
2025-11-19 A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models Duo Li et.al. 2511.15098 null
2025-11-19 Cement2: Temporal Hardware Transactions for High-Level and Efficient FPGA Programming Youwei Xiao et.al. 2511.15073 null
2025-11-19 Learning Human-Like RL Agents Through Trajectory Optimization With Action Quantization Jian-Ting Guo et.al. 2511.15055 null
2025-11-19 Dynamic Expert Quantization for Scalable Mixture-of-Experts Inference Kexin Chu et.al. 2511.15015 null
2025-11-19 Compiling Set Queries into Work-Efficient Tree Traversals Alexander J Root et.al. 2511.15000 null
2025-11-18 Logit-Based Losses Limit the Effectiveness of Feature Knowledge Distillation Nicholas Cooper et.al. 2511.14981 null
2025-11-18 SparseST: Exploiting Data Sparsity in Spatiotemporal Modeling and Prediction Junfeng Wu et.al. 2511.14753 null
2025-11-18 AdamHD: Decoupled Huber Decay Regularization for Language Model Pre-Training Fu-Ming Guo et.al. 2511.14721 null
2025-11-18 Near-Lossless Model Compression Enables Longer Context Inference in DNA Large Language Models Rui Zhu et.al. 2511.14694 null
2025-11-18 AutoTool: Efficient Tool Selection for Large Language Model Agents Jingyi Jia et.al. 2511.14650 null
2025-11-18 Expert-Guided POMDP Learning for Data-Efficient Modeling in Healthcare Marco Locatelli et.al. 2511.14619 null
2025-11-18 CCSD: Cross-Modal Compositional Self-Distillation for Robust Brain Tumor Segmentation with Missing Modalities Dongqing Xie et.al. 2511.14599 null
2025-11-18 IMSE: Efficient U-Net-based Speech Enhancement using Inception Depthwise Convolution and Amplitude-Aware Linear Attention Xinxin Tang et.al. 2511.14515 null
2025-11-18 Watch Out for the Lifespan: Evaluating Backdoor Attacks Against Federated Model Adaptation Bastien Vuillod et.al. 2511.14406 null
2025-11-18 Jasper-Token-Compression-600M Technical Report Dun Zhang et.al. 2511.14405 null
2025-11-18 SAM-Fed: SAM-Guided Federated Semi-Supervised Learning for Medical Image Segmentation Sahar Nasirihaghighi et.al. 2511.14302 null
2025-11-18 Weight Variance Amplifier Improves Accuracy in High-Sparsity One-Shot Pruning Vincent-Daniel Yun et.al. 2511.14282 null
2025-11-18 Entropy-Guided Reasoning Compression Hourun Zhu et.al. 2511.14258 null
2025-11-18 Enhancing Generalization of Depth Estimation Foundation Model via Weakly-Supervised Adaptation with Regularization Yan Huang et.al. 2511.14238 null
2025-11-18 Online Data Curation for Object Detection via Marginal Contributions to Dataset-level Average Precision Zitang Sun et.al. 2511.14197 null
2025-11-18 Few-Shot Precise Event Spotting via Unified Multi-Entity Graph and Distillation Zhaoyu Liu et.al. 2511.14186 null
2025-11-18 AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs Xinliang Zhang et.al. 2511.14169 null
2025-11-18 Run, Ruminate, and Regulate: A Dual-process Thinking System for Vision-and-Language Navigation Yu Zhong et.al. 2511.14131 null
2025-11-18 Canonical quantization for Equilibrium Thermodynamics Luis F. Santos et.al. 2511.14121 null
2025-11-18 FailSafe: High-performance Resilient Serving Ziyi Xu et.al. 2511.14116 null
2025-11-18 CascadedViT: Cascaded Chunk-FeedForward and Cascaded Group Attention Vision Transformer Srivathsan Sivakumar et.al. 2511.14111 null
2025-11-18 RTS-Mono: A Real-Time Self-Supervised Monocular Depth Estimation Method for Real-World Deployment Zeyu Cheng et.al. 2511.14107 null
2025-11-18 Lightweight Multi-task CNN for ECG Diagnosis with GRU-Diffusion Lehuai Xu et.al. 2511.14104 null
2025-11-18 Zero-Training Task-Specific Model Synthesis for Few-Shot Medical Image Classification Yao Qin et.al. 2511.14082 null
2025-11-18 CORE: Compact Object-centric REpresentations as a New Paradigm for Token Merging in LVLMs Jingyu Lei et.al. 2511.14072 null
2025-11-18 ELiC: Efficient LiDAR Geometry Compression via Cross-Bit-depth Feature Propagation and Bag-of-Encoders Junsik Kim et.al. 2511.14070 null
2025-11-18 Semantic Context Matters: Improving Conditioning for Autoregressive Models Dongyang Jin et.al. 2511.14063 null
2025-11-18 ALEX:A Light Editing-knowledge Extractor Minghu Wang et.al. 2511.14018 null
2025-11-17 T-SAR: A Full-Stack Co-design for CPU-Only Ternary LLM Inference via In-Place SIMD ALU Reorganization Hyunwoo Oh et.al. 2511.13676 null
2025-11-17 CacheFlow: Compressive Streaming Memory for Efficient Long-Form Video Understanding Shrenik Patel et.al. 2511.13644 null
2025-11-17 Compact Multimodal Language Models as Robust OCR Alternatives for Noisy Textual Clinical Reports Nikita Neveditsin et.al. 2511.13523 null
2025-11-17 Spin-Adapted Fermionic Unitaries: From Lie Algebras to Compact Quantum Circuits Ilias Magoulas et.al. 2511.13485 null
2025-11-17 A Novel Hierarchical Integration Method for Efficient Model Merging in Medical LLMs Prakrit Timilsina et.al. 2511.13373 null
2025-11-17 Donors and Recipients: On Asymmetric Transfer Across Tasks and Languages with Parameter-Efficient Fine-Tuning Kajetan Dymkiewicz et.al. 2511.13368 null
2025-11-17 TabFlash: Efficient Table Understanding with Progressive Question Conditioning and Token Focusing Jongha Kim et.al. 2511.13283 null
2025-11-17 SF-Recon: Simplification-Free Lightweight Building Reconstruction via 3D Gaussian Splatting Zihan Li et.al. 2511.13278 null
2025-11-17 SymGS : Leveraging Local Symmetries for 3D Gaussian Splatting Compression Keshav Gupta et.al. 2511.13264 null
2025-11-17 TokenSqueeze: Performance-Preserving Compression for Reasoning LLMs Yuxiang Zhang et.al. 2511.13223 null
2025-11-17 Personalized Federated Learning with Bidirectional Communication Compression via One-Bit Random Sketching Jiacheng Cheng et.al. 2511.13144 null
2025-11-17 Low-Level Dataset Distillation for Medical Image Enhancement Fengzhi Xu et.al. 2511.13106 null
2025-11-17 Self-Adaptive Graph Mixture of Models Mohit Meena et.al. 2511.13062 null
2025-11-17 MACKO: Sparse Matrix-Vector Multiplication for Low Sparsity Vladimír Macko et.al. 2511.13061 null
2025-11-17 Dimension vs. Precision: A Comparative Analysis of Autoencoders and Quantization for Efficient Vector Retrieval on BEIR SciFact Satyanarayan Pati et.al. 2511.13057 null
2025-11-17 uCLIP: Parameter-Efficient Multilingual Extension of Vision-Language Models with Unpaired Data Dahyun Chung et.al. 2511.13036 null
2025-11-17 SLMQuant:Benchmarking Small Language Model Quantization for Practical Deployment Jiacheng Wang et.al. 2511.13023 null
2025-11-17 Fine-Tuned LLMs Know They Don’t Know: A Parameter-Efficient Approach to Recovering Honesty Zeyu Shi et.al. 2511.12991 null
2025-11-17 UNSEEN: Enhancing Dataset Pruning from a Generalization Perspective Furui Xu et.al. 2511.12988 null
2025-11-17 MCAQ-YOLO: Morphological Complexity-Aware Quantization for Efficient Object Detection with Curriculum Learning Yoonjae Seo et.al. 2511.12976 null
2025-11-17 MedRule-KG: A Knowledge-Graph–Steered Scaffold for Reliable Mathematical and Biomedical Reasoning Crystal Su et.al. 2511.12963 null
2025-11-17 CoS: Towards Optimal Event Scheduling via Chain-of-Scheduling Yiming Zhao et.al. 2511.12913 null
2025-11-17 ActVAR: Activating Mixtures of Weights and Tokens for Efficient Visual Autoregressive Generation Kaixin Zhang et.al. 2511.12893 null
2025-11-17 Quantization and Algebraic Index Si Li et.al. 2511.12875 null
2025-11-17 View-aware Cross-modal Distillation for Multi-view Action Recognition Trung Thanh Nguyen et.al. 2511.12870 null
2025-11-17 NeuroLex: A Lightweight Domain Language Model for EEG Report Understanding and Generation Kang Yin et.al. 2511.12851 null
2025-11-16 Catastrophic Forgetting in Kolmogorov-Arnold Networks Mohammad Marufur Rahman et.al. 2511.12828 null
2025-11-16 LoRA-Enhanced Vision Transformer for Single Image based Morphing Attack Detection via Knowledge Distillation from EfficientNet Ria Shekhawat et.al. 2511.12602 null
2025-11-14 Data-efficient U-Net for Segmentation of Carbide Microstructures in SEM Images of Steel Alloys Alinda Ezgi Gerçek et.al. 2511.11485 null
2025-11-14 Rethinking Efficient Mixture-of-Experts for Remote Sensing Modality-Missing Classification Qinghao Gao et.al. 2511.11460 null
2025-11-14 DiffPro: Joint Timestep and Layer-Wise Precision Optimization for Efficient Diffusion Inference Farhana Amin et.al. 2511.11446 null
2025-11-14 CURENet: Combining Unified Representations for Efficient Chronic Disease Prediction Cong-Tinh Dao et.al. 2511.11423 null
2025-11-14 Low-Bit, High-Fidelity: Optimal Transport Quantization for Flow Matching Dara Varam et.al. 2511.11418 null
2025-11-14 Coupled Proca theories: Green-hyperbolicity, quantization and applications to polarization measurement Christopher J. Fewster et.al. 2511.11348 null
2025-11-14 DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding Tanveer Hannan et.al. 2511.11313 null
2025-11-14 iMAD: Intelligent Multi-Agent Debate for Efficient and Accurate LLM Inference Wei Fan et.al. 2511.11306 null
2025-11-14 EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment Ruoxi Cheng et.al. 2511.11301 null
2025-11-14 Parameter-Efficient MoE LoRA for Few-Shot Multi-Style Editing Cong Cao et.al. 2511.11236 null
2025-11-14 A Comparison of Lightweight Deep Learning Models for Particulate-Matter Nowcasting in the Indian Subcontinent & Surrounding Regions Ansh Kushwaha et.al. 2511.11185 null
2025-11-14 Viper-F1: Fast and Fine-Grained Multimodal Understanding with Cross-Modal State-Space Modulation Quoc-Huy Trinh et.al. 2511.11177 null
2025-11-14 Hindsight Distillation Reasoning with Knowledge Encouragement Preference for Knowledge-based Visual Question Answering Yu Zhao et.al. 2511.11132 null
2025-11-14 SemanticNN: Compressive and Error-Resilient Semantic Offloading for Extremely Weak Devices Jiaming Huang et.al. 2511.11038 null
2025-11-14 Rethinking Autoregressive Models for Lossless Image Compression via Hierarchical Parallelism and Progressive Adaptation Daxin Li et.al. 2511.10991 null
2025-11-14 Heterogeneous Complementary Distillation Liuchi Xu et.al. 2511.10942 null
2025-11-14 PhaseWin Search Framework Enable Efficient Object-Level Interpretation Zihan Gu et.al. 2511.10914 null
2025-11-13 Accuracy-Preserving CNN Pruning Method under Limited Data Availability Daisuke Yasui et.al. 2511.10861 null
2025-11-13 GFT: Graph Feature Tuning for Efficient Point Cloud Analysis Manish Dhakal et.al. 2511.10799 null
2025-11-13 Structure-Aware Encodings of Argumentation Properties for Clique-width Yasir Mahmood et.al. 2511.10767 null
2025-11-13 ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference Yesheng Liang et.al. 2511.10645 null
2025-11-13 Black-Box On-Policy Distillation of Large Language Models Tianzhu Ye et.al. 2511.10643 null
2025-11-13 Know Your Limits: Entropy Estimation Modeling for Compression and Generalization Benjamin L. Badger et.al. 2511.10618 null
2025-11-13 Maximizing Efficiency of Dataset Compression for Machine Learning Potentials With Information Theory Benjamin Yu et.al. 2511.10561 null
2025-11-13 A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space Huijie Liu et.al. 2511.10555 null
2025-11-13 URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding Yongxin Shi et.al. 2511.10552 null
2025-11-13 Learning Post-Newtonian Corrections from Numerical Relativity Jooheon Yoo et.al. 2511.10522 null
2025-11-13 SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation Wei Li et.al. 2511.10518 null
2025-11-13 Analogical Structure, Minimal Contextual Cues and Contrastive Distractors: Input Design for Sample-Efficient Linguistic Rule Induction Chunyang Jiang et.al. 2511.10441 null
2025-11-13 AgentEvolver: Towards Efficient Self-Evolving Agent System Yunpeng Zhai et.al. 2511.10395 null
2025-11-13 EDGC: Entropy-driven Dynamic Gradient Compression for Efficient LLM Training Qingao Yi et.al. 2511.10333 null
2025-11-13 Semantic Communication with Hopfield Memories Karim Nasreddine et.al. 2511.10302 null
2025-11-13 HeatV2X: Scalable Heterogeneous Collaborative Perception via Efficient Alignment and Interaction Yueran Zhao et.al. 2511.10211 null
2025-11-13 LiNeXt: Revisiting LiDAR Completion with Efficient Non-Diffusion Architectures Wenzhe He et.al. 2511.10209 null
2025-11-13 EffiReason-Bench: A Unified Benchmark for Evaluating and Advancing Efficient Reasoning in Large Language Models Junquan Huang et.al. 2511.10201 null
2025-11-13 Microscopy X-ray Imaging enriched with Small Angle X-ray Scattering for few nanometer resolution reveals shock waves and compression in intense short pulse laser irradiation of solids Thomas Kluge et.al. 2511.10127 null
2025-11-13 RobIA: Robust Instance-aware Continual Test-time Adaptation for Deep Stereo Jueun Ko et.al. 2511.10107 null
2025-11-13 Balancing Centralized Learning and Distributed Self-Organization: A Hybrid Model for Embodied Morphogenesis Takehiro Ishikawa et.al. 2511.10101 null
2025-11-13 GridPrune: From “Where to Look” to “What to Select” in Visual Token Pruning for MLLMs Yuxiang Duan et.al. 2511.10081 null
2025-11-13 Image Aesthetic Reasoning via HCM-GRPO: Empowering Compact Model for Superior Performance Zhiyuan Hu et.al. 2511.10055 null
2025-11-13 Efficient Thought Space Exploration through Strategic Intervention Ziheng Li et.al. 2511.10038 null
2025-11-13 LampQ: Towards Accurate Layer-wise Mixed Precision Quantization for Vision Transformers Minjun Kim et.al. 2511.10004 link
2025-11-13 Explore and Establish Synergistic Effects Between Weight Pruning and Coreset Selection in Neural Network Training Weilin Wan et.al. 2511.09901 null
2025-11-13 Regional Attention-Enhanced Swin Transformer for Clinically Relevant Medical Image Captioning Zubia Naz et.al. 2511.09893 null
2025-11-13 HCC-3D: Hierarchical Compensatory Compression for 98% 3D Token Reduction in Vision-Language Models Liheng Zhang et.al. 2511.09883 null
2025-11-13 RWKV-PCSSC: Exploring RWKV Model for Point Cloud Semantic Scene Completion Wenzhe He et.al. 2511.09878 null
2025-11-13 DP-GENG: Differentially Private Dataset Distillation Guided by DP-Generated Data Shuo Shi et.al. 2511.09876 null
2025-11-13 HierRouter: Coordinated Routing of Specialized Large Language Models via Reinforcement Learning Nikunj Gupta et.al. 2511.09873 null
2025-11-13 Steering Pretrained Drafters during Speculative Decoding Frédéric Berdoz et.al. 2511.09844 null
2025-11-12 TARG: Training-Free Adaptive Retrieval Gating for Efficient RAG Yufeng Wang et.al. 2511.09803 null
2025-11-12 How Small Can You Go? Compact Language Models for On-Device Critical Error Detection in Machine Translation Muskaan Chopra et.al. 2511.09748 null
2025-11-12 Separating QMA from QCMA with a classical oracle John Bostanci et.al. 2511.09551 null
2025-11-10 StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation Tianrui Feng et.al. 2511.07399 null
2025-11-10 LeCoT: revisiting network architecture for two-view correspondence pruning Luanyuan Dai et.al. 2511.07078 null
2025-11-10 GFix: Perceptually Enhanced Gaussian Splatting Video Compression Siyue Teng et.al. 2511.06953 null
2025-11-10 A Closer Look at Knowledge Distillation in Spiking Neural Network Training Xu Liu et.al. 2511.06902 null
2025-11-10 Joint Access Point Selection and Beamforming Design for Bistatic Backscatter Communication Ahmet Kaplan et.al. 2511.06866 null
2025-11-10 Distillation Dynamics: Towards Understanding Feature-Based Distillation in Vision Transformers Huiyuan Tian et.al. 2511.06848 null
2025-11-10 MI-to-Mid Distilled Compression (M2M-DC): An Hybrid-Information-Guided-Block Pruning with Progressive Inner Slicing Approach to Model Compression Lionel Levine et.al. 2511.06842 null
2025-11-10 P3-LLM: An Integrated NPU-PIM Accelerator for LLM Inference Using Hybrid Numerical Formats Yuzong Chen et.al. 2511.06838 null
2025-11-10 QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations Zhixiong Zhao et.al. 2511.06767 null
2025-11-10 Sensitivity of Small Language Models to Fine-tuning Data Contamination Nicy Scaria et.al. 2511.06763 null
2025-11-10 MobileLLM-Pro Technical Report Patrick Huber et.al. 2511.06719 null
2025-11-09 You Had One Job: Per-Task Quantization Using LLMs’ Hidden Representations Amit LeVi et.al. 2511.06516 null
2025-11-09 EASE: Practical and Efficient Safety Alignment for Small Language Models Haonan Shi et.al. 2511.06512 null
2025-11-09 GHOST: Solving the Traveling Salesman Problem on Graphs of Convex Sets Jingtao Tang et.al. 2511.06471 null
2025-11-09 Efficient LLM Safety Evaluation through Multi-Agent Debate Dachuan Lin et.al. 2511.06396 null
2025-11-09 Ghost in the Transformer: Tracing LLM Lineage with SVD-Fingerprint Suqing Wang et.al. 2511.06390 null
2025-11-09 Precision-Scalable Microscaling Datapaths with Optimized Reduction Tree for Efficient NPU Integration Stef Cuyckens et.al. 2511.06313 null
2025-11-09 CAMP-HiVe: Cyclic Pair Merging based Efficient DNN Pruning with Hessian-Vector Approximation for Resource-Constrained Systems Mohammad Helal Uddin et.al. 2511.06265 null
2025-11-09 VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving Ruifei Zhang et.al. 2511.06256 null
2025-11-09 Explicit Knowledge-Guided In-Context Learning for Early Detection of Alzheimer’s Disease Puzhen Su et.al. 2511.06215 null
2025-11-09 LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs Zifan He et.al. 2511.06174 null
2025-11-08 Neodragon: Mobile Video Generation using Diffusion Transformer Animesh Karnewar et.al. 2511.06055 null
2025-11-08 Lethe: Layer- and Time-Adaptive KV Cache Pruning for Reasoning-Intensive LLM Serving Hui Zeng et.al. 2511.06029 null
2025-11-08 MoSKA: Mixture of Shared KV Attention for Efficient Long-Sequence LLM Inference Myunghyun Rhee et.al. 2511.06010 null
2025-11-08 GABFusion: Rethinking Feature Fusion for Low-Bit Quantization of Multi-Task Networks Zhaoyang Wang et.al. 2511.05898 null
2025-11-08 HarmoQ: Harmonized Post-Training Quantization for High-Fidelity Image Hongjun Wang et.al. 2511.05868 null
2025-11-08 EGG-SR: Embedding Symbolic Equivalence into Symbolic Regression via Equality Graph Nan Jiang et.al. 2511.05849 null
2025-11-08 Training-Free Adaptive Quantization for Variable Rate Image Coding for Machines Yui Tatsumi et.al. 2511.05836 null
2025-11-08 MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling Yu Zhang et.al. 2511.05811 null
2025-11-07 An Efficient Gradient-Aware Error-Bounded Lossy Compressor for Federated Learning Zhijing Ye et.al. 2511.05770 null
2025-11-11 Compressing Multi-Task Model for Autonomous Driving via Pruning and Knowledge Distillation Jiayuan Wang et.al. 2511.05557 null
2025-11-07 A Metamorphic Testing Perspective on Knowledge Distillation for Language Models of Code: Does the Student Deeply Mimic the Teacher? Md. Abdul Awal et.al. 2511.05476 null
2025-11-07 APP: Accelerated Path Patching with Task-Specific Pruning Frauke Andersen et.al. 2511.05442 null
2025-11-07 Efficient CNN Inference on Ultra-Low-Power MCUs via Saturation-Aware Convolution Shiming Li et.al. 2511.05347 null
2025-11-07 Attention and Compression is all you need for Controllably Efficient Language Models Jatin Prakash et.al. 2511.05313 null
2025-11-07 Optimal Quantization on Spherical Surfaces: Continuous and Discrete Models - A Beginner-Friendly Expository Study Mrinal Kanti Roychowdhury et.al. 2511.05099 null
2025-11-07 An Efficient Proximity Graph-based Approach to Table Union Search Yiming Xie et.al. 2511.05082 null
2025-11-07 Representational power of selected neural network quantum states in second quantization Zhendong Li et.al. 2511.04932 null
2025-11-06 DMA: Online RAG Alignment with Human Feedback Yu Bai et.al. 2511.04880 null
2025-11-06 Data Efficiency and Transfer Robustness in Biomedical Image Segmentation: A Study of Redundancy and Forgetting with Cellpose Shuo Zhao et.al. 2511.04803 null
2025-11-06 Hardware-Accelerated GNN-based Hit Filtering for the Belle II Level-1 Trigger Greta Heine et.al. 2511.04731 null
2025-11-06 Benchmark Designers Should “Train on the Test Set” to Expose Exploitable Non-Visual Shortcuts Ellis Brown et.al. 2511.04655 null
2025-11-06 TT-Prune: Joint Model Pruning and Resource Allocation for Communication-efficient Time-triggered Federated Learning Xinlu Zhang et.al. 2511.04653 null
2025-11-06 Enabling Dynamic Sparsity in Quantized LLM Inference Rongxiang Wang et.al. 2511.04477 null
2025-11-06 Block Rotation is All You Need for MXFP4 Quantization Yuantian Shao et.al. 2511.04214 null
2025-11-06 DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization Yuantian Shao et.al. 2511.04063 null
2025-11-06 Tiny-WiFo: A Lightweight Wireless Foundation Model for Channel Prediction via Multi-Component Adaptive Knowledge Distillation Haotian Zhang et.al. 2511.04015 null
2025-11-06 Memory- and Latency-Constrained Inference of Large Language Models via Adaptive Split Computing Mingyu Sung et.al. 2511.04002 null
2025-11-06 TwIST: Rigging the Lottery in Transformers with Independent Subnetwork Training Michael Menezes et.al. 2511.03983 null
2025-11-09 Temporal Zoom Networks: Distance Regression and Continuous Depth for Efficient Action Localization Ibne Farabi Shihab et.al. 2511.03943 null
2025-11-05 Desert Waste Detection and Classification Using Data-Based and Model-Based Enhanced YOLOv12 DL Model Abdulmumin Sa’ad et.al. 2511.03888 null
2025-11-05 Unconventional quantization of 2D plasmons in cavities formed by gate slots Ilia Moiseenko et.al. 2511.03829 null
2025-11-05 Efficient Neural Networks with Discrete Cosine Transform Activations Marc Martinez-Gost et.al. 2511.03531 null
2025-11-05 Kastor: Fine-tuned Small Language Models for Shape-based Active Relation Extraction Ringwald Celian et.al. 2511.03466 null
2025-11-05 EQ-Negotiator: Dynamic Emotional Personas Empower Small Language Models for Edge-Deployable Credit Negotiation Yunbo Long et.al. 2511.03370 null
2025-11-05 Incorporating QM/MM molecular dynamics into the few-mode quantization approach for light-matter interactions in nanophotonic structures Ruth H. Tichauer et.al. 2511.03303 null
2025-11-07 Provable Separations between Memorization and Generalization in Diffusion Models Zeqi Ye et.al. 2511.03202 null
2025-11-05 A Quantized VAE-MLP Botnet Detection Model: A Systematic Evaluation of Quantization-Aware Training and Post-Training Quantization Strategies Hassan Wasswa et.al. 2511.03201 null
2025-11-05 LogicSparse: Enabling Engine-Free Unstructured Sparsity for Quantised Deep-learning Accelerators Changhong Li et.al. 2511.03079 null
2025-11-04 Targeted Error Correction in Knowledge Distillation: Small Language Models Surpass GPT Hee-Jin Lee et.al. 2511.03005 null
2025-11-04 Analog-to-Digital Converter Based on Voltage-controlled Superconducting Device Md Mazharul Islam et.al. 2511.02968 null
2025-11-04 In Good GRACEs: Principled Teacher Selection for Knowledge Distillation Abhishek Panigrahi et.al. 2511.02833 null
2025-11-04 A Non-Uniform Quantization Framework for Time-Encoding Machines Kaluguri Yashaswini et.al. 2511.02728 null
2025-11-04 Can Visual Input Be Compressed? A Visual Token Compression Benchmark for Large Multimodal Models Tianfan Peng et.al. 2511.02650 null
2025-11-04 LiteVoxel: Low-memory Intelligent Thresholding for Efficient Voxel Rasterization Jee Won Lee et.al. 2511.02510 null
2025-11-04 FP8-Flow-MoE: A Casting-Free FP8 Recipe without Double Quantization Error Fengjuan Wang et.al. 2511.02302 null
2025-11-05 IG-Pruning: Input-Guided Block Pruning for Large Language Models Kangyu Qiao et.al. 2511.02213 null
2025-11-03 Testing Quantum Gravity with Gravitational Waves from the ringdown of binary Black Holes coalescences: A New Frontier in Fundamental Physics Marco Danilo Claudio Torri et.al. 2511.02056 null
2025-11-01 Fibbinary-Based Compression and Quantization for Efficient Neural Radio Receivers Roberta Fiandaca et.al. 2511.01921 null
2025-11-03 KV Cache Transform Coding for Compact Storage in LLM Inference Konrad Staniszewski et.al. 2511.01815 null
2025-11-03 Random Initialization of Gated Sparse Adapters Vi Retault et.al. 2511.01794 null
2025-11-03 Optimizing Movable Antenna Position and Transmissive RIS Phase for Efficient Base Station Design Marjan Boloori et.al. 2511.01575 null
2025-11-03 Luminance-Aware Statistical Quantization: Unsupervised Hierarchical Learning for Illumination Enhancement Derong Kong et.al. 2511.01510 null
2025-11-03 Thinking with DistilQwen: A Tale of Four Distilled Reasoning and Reward Model Series Wenrui Cai et.al. 2511.01354 null
2025-11-03 FirstAidQA: A Synthetic Dataset for First Aid and Emergency Response in Low-Connectivity Settings Saiyma Sittul Muna et.al. 2511.01289 null
2025-11-03 MoSa: Motion Generation with Scalable Autoregressive Modeling Mengyuan Liu et.al. 2511.01200 null
2025-11-03 MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image Segmentation Ziyi Wang et.al. 2511.01143 null
2025-11-02 All-in-one Graph-based Indexing for Hybrid Search on GPUs Zhonggen Li et.al. 2511.00855 null
2025-11-02 Towards Ultra-Low Latency: Binarized Neural Network Architectures for In-Vehicle Network Intrusion Detection Huiyao Dong et.al. 2511.00828 null
2025-11-02 Efficient Query Repair for Aggregate Constraints Shatha Algarni et.al. 2511.00826 link
2025-11-02 REaR: Retrieve, Expand and Refine for Effective Multitable Retrieval Rishita Agarwal et.al. 2511.00805 null
2025-11-01 Predicting Encoding Energy from Low-Pass Anchors for Green Video Streaming Zoha Azimi et.al. 2511.00707 null
2025-11-01 Privacy-Aware Time Series Synthesis via Public Knowledge Distillation Penghang Liu et.al. 2511.00700 null
2025-11-04 Inference-Time Chain-of-Thought Pruning with Latent Informativeness Signals Sophie Li et.al. 2511.00699 null
2025-11-01 Outlier-Aware Post-Training Quantization for Image Super-Resolution Hailing Wang et.al. 2511.00682 null
2025-11-01 Reviving Stale Updates: Data-Free Knowledge Distillation for Asynchronous Federated Learning Baris Askin et.al. 2511.00655 null
2025-11-01 Leveraging Multi-Agent System (MAS) and Fine-Tuned Small Language Models (SLMs) for Automated Telecom Network Troubleshooting Chenhua Shi et.al. 2511.00651 null
2025-11-01 Diluting Restricted Boltzmann Machines C. Díaz-Faloh et.al. 2511.00648 null
2025-11-05 Towards 1000-fold Electron Microscopy Image Compression for Connectomics via VQ-VAE with Transformer Prior Fuming Yang et.al. 2511.00231 null
2025-10-31 Vision Transformer for Robust Occluded Person Reidentification in Complex Surveillance Scenes Bo Li et.al. 2510.27677 null
2025-10-31 SpecAttn: Speculating Sparse Attention Harsh Shah et.al. 2510.27641 null
2025-10-31 Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications Zixuan Hu et.al. 2510.27186 null
2025-10-30 Elastic Architecture Search for Efficient Language Models Shang Wang et.al. 2510.27037 null
2025-10-30 LightPro: A Linear Photonic Processor with Full Programmability Amin Shafiee et.al. 2510.27013 null
2025-10-30 STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization Marco Federici et.al. 2510.26771 null
2025-10-30 LoRAQuant: Mixed-Precision Quantization of LoRA to Ultra-Low Bits Amir Reza Mirzaei et.al. 2510.26690 null
2025-10-30 Knowledge Distillation of Noisy Force Labels for Improved Coarse-Grained Force Fields Feranmi V. Olowookere et.al. 2510.26650 null
2025-10-30 ReSpec: Towards Optimizing Speculative Decoding in Reinforcement Learning Systems Qiaoling Chen et.al. 2510.26475 null
2025-10-30 1+1>2: A Synergistic Sparse and Low-Rank Compression Method for Large Language Models Zeliang Zong et.al. 2510.26446 null
2025-10-30 Personalized Treatment Outcome Prediction from Scarce Data via Dual-Channel Knowledge Distillation and Adaptive Fusion Wenjie Chen et.al. 2510.26444 null
2025-10-30 Discovering State Equivalences in UCT Search Trees By Action Pruning Robin Schmöcker et.al. 2510.26346 null
2025-10-30 Do LLMs Signal When They’re Right? Evidence from Neuron Agreement Kang Chen et.al. 2510.26277 null
2025-10-30 Distilling Multilingual Vision-Language Models: When Smaller Models Stay Multilingual Sukrit Sriratanawilai et.al. 2510.26271 null
2025-10-30 BitSemCom: A Bit-Level Semantic Communication Framework with Learnable Probabilistic Mapping Haoshuo Zhang et.al. 2510.26225 null
2025-10-30 STAR: A Privacy-Preserving, Energy-Efficient Edge AI Framework for Human Activity Recognition via Wi-Fi CSI in Mobile and Pervasive Computing Environments Kexing Liu et.al. 2510.26148 null
2025-10-30 Do Students Debias Like Teachers? On the Distillability of Bias Mitigation Methods Jiali Cheng et.al. 2510.26038 null
2025-10-29 Robust GNN Watermarking via Implicit Perception of Topological Invariants Jipeng Li et.al. 2510.25934 null
2025-10-29 Humains-Junior: A 3.8B Language Model Achieving GPT-4o-Level Factual Accuracy by Directed Exoskeleton Reasoning Nissan Yaron et.al. 2510.25933 null
2025-10-28 Group theoretic quantization of punctured plane Manvendra Somvanshi et.al. 2510.25794 null
2025-10-29 INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats Mengzhao Chen et.al. 2510.25602 null
2025-10-30 PureKV: Plug-and-Play KV Cache Optimization with Spatial-Temporal Sparse Attention for Vision-Language Large Models Zhonghua Jiang et.al. 2510.25600 null
2025-10-29 Feedback Alignment Meets Low-Rank Manifolds: A Structured Recipe for Local Learning Arani Roy et.al. 2510.25594 null
2025-10-29 Lightweight Federated Learning in Mobile Edge Computing with Statistical and Device Heterogeneity Awareness Jinghong Tan et.al. 2510.25342 null
2025-10-29 Adapting Small Language Models to Low-Resource Domains: A Case Study in Hindi Tourism QA Sandipan Majhi et.al. 2510.25273 null
2025-10-29 Energy-Efficient Autonomous Driving with Adaptive Perception and Robust Decision Yuyang Xia et.al. 2510.25205 null
2025-10-29 Machine Learning and CPU (Central Processing Unit) Scheduling Co-Optimization over a Network of Computing Centers Mohammadreza Doostmohammadian et.al. 2510.25176 null
2025-10-28 Resource-Efficient and Robust Inference of Deep and Bayesian Neural Networks on Embedded and Analog Computing Platforms Bernhard Klein et.al. 2510.24951 null
2025-10-28 SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens Yinhan He et.al. 2510.24940 null
2025-10-28 Send Less, Save More: Energy-Efficiency Benchmark of Embedded CNN Inference vs. Data Transmission in IoT Benjamin Karic et.al. 2510.24829 null
2025-10-27 A Survey on Efficient Vision-Language-Action Models Zhaoshu Yu et.al. 2510.24795 null
2025-10-27 ESCA: Enabling Seamless Codec Avatar Execution through Algorithm and Hardware Co-Optimization for Virtual Reality Mingzhi Zhu et.al. 2510.24787 null
2025-10-28 All in one timestep: Enhancing Sparsity and Energy efficiency in Multi-level Spiking Neural Networks Andrea Castagnetti et.al. 2510.24637 null
2025-10-28 Fast and accurate neural reflectance transformation imaging through knowledge distillation Tinsae G. Dulecha et.al. 2510.24486 null
2025-10-28 MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation Xiaoyu Kong et.al. 2510.24431 null
2025-11-01 Comprehensive and Efficient Distillation for Lightweight Sentiment Analysis Models Guangyu Xie et.al. 2510.24425 null
2025-10-29 Odyssey: An End-to-End System for Pareto-Optimal Serverless Query Processing Shyam Jesalpura et.al. 2510.24307 null
2025-10-28 SCOPE: Saliency-Coverage Oriented Token Pruning for Efficient Multimodel LLMs Jinhong Deng et.al. 2510.24214 null
2025-11-01 Spectral-Geometric Deformations of Function Algebras on Manifolds Amandip Sangha et.al. 2510.24184 null
2025-10-28 UHKD: A Unified Framework for Heterogeneous Knowledge Distillation via Frequency-Domain Representations Fengming Yu et.al. 2510.24116 null
2025-10-28 FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic Kanghyun Choi et.al. 2510.24061 null
2025-10-28 SpecKD: Speculative Decoding for Effective Knowledge Distillation of LLMs Haiduo Huang et.al. 2510.24021 null
2025-10-27 Adaptive Training of INRs via Pruning and Densification Diana Aldana et.al. 2510.23943 null
2025-10-27 BitSkip: An Empirical Analysis of Quantization and Early Exit Composition Ramshankar Bhuvaneswaran et.al. 2510.23766 null
2025-10-25 The Structural Scalpel: Automated Contiguous Layer Pruning for Large Language Models Yao Lu et.al. 2510.23652 null
2025-10-25 Efficient Low Rank Attention for Long-Context Inference in Large Language Models Tenghui Li et.al. 2510.23649 null
2025-10-24 LLMComp: A Language Modeling Paradigm for Error-Bounded Scientific Data Compression Guozhong Li et.al. 2510.23632 null
2025-10-27 Tighter CMI-Based Generalization Bounds via Stochastic Projection and Quantization Milad Sefidgaran et.al. 2510.23485 null
2025-10-27 Enabling Vibration-Based Gesture Recognition on Everyday Furniture via Energy-Efficient FPGA Implementation of 1D Convolutional Networks Koki Shibata et.al. 2510.23156 null
2025-10-27 DeepSalt: Bridging Laboratory and Satellite Spectra through Domain Adaptation and Knowledge Distillation for Large-Scale Soil Salinity Estimation Rupasree Dey et.al. 2510.23124 null
2025-10-27 LightPFP: A Lightweight Route to Ab Initio Accuracy at Scale Wenwen Li et.al. 2510.23064 null
2025-10-27 AirFed: Federated Graph-Enhanced Multi-Agent Reinforcement Learning for Multi-UAV Cooperative Mobile Edge Computing Zhiyu Wang et.al. 2510.23053 null
2025-10-27 Sentinel: Dynamic Knowledge Distillation for Personalized Federated Intrusion Detection in Heterogeneous IoT Networks Gurpreet Singh et.al. 2510.23019 null
2025-10-28 Switchable Token-Specific Codebook Quantization For Face Image Compression Yongbo Wang et.al. 2510.22943 null
2025-10-27 Rethinking Inference Placement for Deep Learning across Edge and Cloud Platforms: A Multi-Objective Optimization Perspective and Future Directions Zongshun Zhang et.al. 2510.22909 null
2025-10-26 TELL-TALE: Task Efficient LLMs with Task Aware Layer Elimination Omar Naim et.al. 2510.22767 null
2025-10-26 Iterative Layer Pruning for Efficient Translation Inference Yasmin Moslem et.al. 2510.22763 null
2025-10-26 TVMC: Time-Varying Mesh Compression via Multi-Stage Anchor Mesh Generation He Huang et.al. 2510.22646 null
2025-10-26 Bag-of-Word-Groups (BoWG): A Robust and Efficient Loop Closure Detection Method Under Perceptual Aliasing Xiang Fei et.al. 2510.22529 null
2025-10-26 Frustratingly Easy Task-aware Pruning for Large Language Models Yuanhe Tian et.al. 2510.22489 null
2025-10-26 Single-Teacher View Augmentation: Boosting Knowledge Distillation via Angular Diversity Seonghoon Yu et.al. 2510.22480 null
2025-10-25 GigaEmbeddings: Efficient Russian Language Embedding Model Egor Kolodin et.al. 2510.22369 null
2025-10-25 Real-Time Semantic Segmentation on FPGA for Autonomous Vehicles Using LMIINet with the CGRA4ML Framework Amir Mohammad Khadem Hosseini et.al. 2510.22243 null
2025-10-25 Synthetic-to-Real Transfer Learning for Chromatin-Sensitive PWS Microscopy Jahidul Arafat et.al. 2510.22239 null
2025-10-25 When Fewer Layers Break More Chains: Layer Pruning Harms Test-Time Scaling in LLMs Keyu Wang et.al. 2510.22228 null
2025-10-25 Scaling Up Efficient Small Language Models Serving and Deployment for Semantic Job Search Kayhan Behdin et.al. 2510.22101 null
2025-10-24 Pruning and Quantization Impact on Graph Neural Networks Khatoon Khedri et.al. 2510.22058 null
2025-10-24 Performance Trade-offs of Optimizing Small Language Models for E-Commerce Josip Tomo Licardo et.al. 2510.21970 null
2025-10-23 TernaryCLIP: Efficiently Compressing Vision-Language Models with Ternary Weights and Distilled Knowledge Shu-Hao Zhang et.al. 2510.21879 null
2025-10-22 KARIPAP: Quantum-Inspired Tensor Network Compression of Large Language Models Using Infinite Projected Entangled Pair States and Tensor Renormalization Group Azree Nazri et.al. 2510.21844 null
2025-10-22 Restoring Pruned Large Language Models via Lost Component Compensation Zijian Feng et.al. 2510.21834 null
2025-10-24 A Dynamic Knowledge Distillation Method Based on the Gompertz Curve Han Yang et.al. 2510.21649 null
2025-10-24 Few-Shot Knowledge Distillation of LLMs With Counterfactual Explanations Faisal Hamman et.al. 2510.21631 null
2025-10-24 Does Model Size Matter? A Comparison of Small and Large Language Models for Requirements Classification Mohammad Amin Zadenoori et.al. 2510.21443 null
2025-10-24 A Convergence Analysis of Adaptive Optimizers under Floating-point Quantization Xuan Tang et.al. 2510.21314 null
2025-10-24 Correlation Dimension of Auto-Regressive Large Language Models Xin Du et.al. 2510.21258 null
2025-10-24 DictPFL: Efficient and Private Federated Learning on Encrypted Gradients Jiaqi Xue et.al. 2510.21086 null
2025-10-23 Learning Grouped Lattice Vector Quantizers for Low-Bit LLM Compression Xi Zhang et.al. 2510.20984 link
2025-10-23 Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples Shiva Sreeram et.al. 2510.20800 null
2025-10-23 Efficient Multi-bit Quantization Network Training via Weight Bias Correction and Bit-wise Coreset Sampling Jinhee Kim et.al. 2510.20673 null
2025-10-23 xTime: Extreme Event Prediction with Hierarchical Knowledge Distillation and Expert Fusion Quan Li et.al. 2510.20651 null
2025-10-23 Dynamic Weight Adjustment for Knowledge Distillation: Leveraging Vision Transformer for High-Accuracy Lung Cancer Detection and Real-Time Deployment Saif Ur Rehman Khan et.al. 2510.20438 null
2025-10-23 AccuQuant: Simulating Multiple Denoising Steps for Quantizing Diffusion Models Seunghoon Lee et.al. 2510.20348 null
2025-10-24 EditInfinity: Image Editing with Binary-Quantized Generative Models Jiahuan Wang et.al. 2510.20217 null
2025-10-23 BoundRL: Efficient Structured Text Segmentation through Reinforced Boundary Generation Haoyuan Li et.al. 2510.20151 null
2025-10-22 Improving Predictive Confidence in Medical Imaging via Online Label Smoothing Kushan Choudhury et.al. 2510.20011 null
2025-10-22 From Large to Small: Transferring CUDA Optimization Expertise via Reasoning Graph Junfeng Gong et.al. 2510.19873 null
2025-10-21 Foveated Compression for Immersive Telepresence Visualization Max Schwarz et.al. 2510.19848 null
2025-10-20 Mechanics as a general-relativistic gauge field theory, and Relational Quantization J. François et.al. 2510.19845 null
2025-10-22 AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders Yuezhou Hu et.al. 2510.19779 null
2025-10-22 A flexible framework for structural plasticity in GPU-accelerated sparse spiking neural networks James C. Knight et.al. 2510.19764 null
2025-10-22 Adaptive Distribution-aware Quantization for Mixed-Precision Neural Networks Shaohang Jia et.al. 2510.19760 null
2025-10-22 Accelerating Moment Tensor Potentials through Post-Training Pruning Zijian Meng et.al. 2510.19737 null
2025-10-22 Single-Scale Magnetoelastic Landau Quantization: Thermodynamics, Quantum Oscillations, and Metrology Denise Assafrão et.al. 2510.19637 null
2025-10-22 HAD: Hierarchical Asymmetric Distillation to Bridge Spatio-Temporal Gaps in Event-Based Object Tracking Yao Deng et.al. 2510.19560 null
2025-10-22 Energy-Efficient and Dequantization-Free Q-LLMs: A Spiking Neural Network Approach to Salient Value Mitigation Chenyu Wang et.al. 2510.19498 null
2025-10-22 ELUTQ: Efficient LUT-Aware Quantization for Deploying Large Language Models on Edge Devices Xin Nie et.al. 2510.19482 null
2025-10-22 BLiSS 1.0: Evaluating Bilingual Learner Competence in Second Language Small Language Models Yuan Gao et.al. 2510.19419 null
2025-10-22 CPSVD: Enhancing Large Language Model Compression via Column-Preserving Singular Value Decomposition Lin Xv et.al. 2510.19385 null
2025-10-27 Multi-Rate Task-Oriented Communication for Multi-Edge Cooperative Inference Dongwon Kim et.al. 2510.19360 null
2025-10-24 Knowledge Distillation of Uncertainty using Deep Latent Factor Model Sehyun Park et.al. 2510.19290 null
2025-10-22 MobiAct: Efficient MAV Action Recognition Using MobileNetV4 with Contrastive Learning and Knowledge Distillation Zhang Nengbo et.al. 2510.19273 null
2025-10-23 Data Efficient Any Transformer-to-Mamba Distillation via Attention Bridge Penghao Wang et.al. 2510.19266 null
2025-10-22 Res-DPU: Resource-shared Digital Processing-in-memory Unit for Edge-AI Workloads Mukul Lokhande et.al. 2510.19260 null
2025-10-22 Background Fades, Foreground Leads: Curriculum-Guided Background Pruning for Efficient Foreground-Centric Collaborative Perception Yuheng Wu et.al. 2510.19250 null
2025-10-22 TinyUSFM: Towards Compact and Efficient Ultrasound Foundation Models Chen Ma et.al. 2510.19239 null
2025-10-22 Enhancing Graph Neural Networks: A Mutual Learning Approach Paul Agbaje et.al. 2510.19223 null
2025-10-22 MoE-GS: Mixture of Experts for Dynamic Gaussian Splatting In-Hwan Jin et.al. 2510.19210 null
2025-10-22 PruneHal: Reducing Hallucinations in Multi-modal Large Language Models through Adaptive KV Cache Pruning Fengyuan Sun et.al. 2510.19183 null
2025-10-21 Towards Universal Solvers: Using PGD Attack in Active Learning to Increase Generalizability of Neural Operators as Knowledge Distillation from Numerical PDE Solvers Yifei Sun et.al. 2510.18989 null
2025-10-21 DuoLens: A Framework for Robust Detection of Machine-Generated Multilingual Text and Code Shriyansh Agrawal et.al. 2510.18904 null
2025-10-20 CosmoCore Affective Dream-Replay Reinforcement Learning for Code Generation Santhosh Kumar Ravindran et.al. 2510.18895 null
2025-10-21 Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health Monitoring Shuxin Lin et.al. 2510.18817 null
2025-10-21 CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training Soroush Tabesh et.al. 2510.18784 null
2025-10-21 Binary Quadratic Quantization: Beyond First-Order Quantization for Real-Valued Matrix Compression Kyo Kuroki et.al. 2510.18650 null
2025-10-21 C-SWAP: Explainability-Aware Structured Pruning for Efficient Neural Networks Compression Baptiste Bauvin et.al. 2510.18636 null
2025-10-21 Channel-Aware Vector Quantization for Robust Semantic Communication on Discrete Channels Zian Meng et.al. 2510.18604 null
2025-10-21 Pay Attention to the Triggers: Constructing Backdoors That Survive Distillation Giovanni De Muri et.al. 2510.18541 null
2025-10-21 From Quarter to All: Accelerating Speculative LLM Decoding via Floating-Point Exponent Remapping and Parameter Sharing Yushu Zhao et.al. 2510.18525 null
2025-10-21 DWaste: Greener AI for Waste Sorting using Mobile and Edge Devices Suman Kunwar et.al. 2510.18513 null
2025-10-21 How2Compress: Scalable and Efficient Edge Video Analytics via Adaptive Granular Video Compression Yuheng Wu et.al. 2510.18409 null
2025-10-21 MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models ChangSu Choi et.al. 2510.18383 null
2025-10-21 S2AP: Score-space Sharpness Minimization for Adversarial Pruning Giorgio Piras et.al. 2510.18381 null
2025-10-21 Ensembling Pruned Attention Heads For Uncertainty-Aware Efficient Transformers Firas Gabetni et.al. 2510.18358 null
2025-10-21 StreamingTOM: Streaming Token Compression for Efficient Video Understanding Xueyi Chen et.al. 2510.18269 null
2025-10-21 Learning under Quantization for High-Dimensional Linear Regression Dechen Zhang et.al. 2510.18259 null
2025-10-21 DualHash: A Stochastic Primal-Dual Algorithm with Theoretical Guarantee for Deep Hashing Luxuan Li et.al. 2510.18218 null
2025-10-20 Learning from Generalization Patterns: An Evaluation-Driven Approach to Enhanced Data Augmentation for Fine-Tuning Small Language Models Huan Song et.al. 2510.18143 null
2025-10-20 CompactPrompt: A Unified Pipeline for Prompt Data Compression in LLM Workflows Joong Ho Choi et.al. 2510.18043 null
2025-10-20 From Local to Global: Revisiting Structured Pruning Paradigms for Large Language Models Ziyan Wang et.al. 2510.18030 null
2025-10-20 Quantum Computing Approach to Atomic and Molecular Three-Body Systems Mohammad Haidar et.al. 2510.18005 null
2025-10-20 SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference Samir Khaki et.al. 2510.17777 null
2025-10-21 Efficient Tensor Completion Algorithms for Highly Oscillatory Operators Navjot Singh et.al. 2510.17734 null
2025-10-20 Elastic ViTs from Pretrained Models without Retraining Walter Simoncini et.al. 2510.17700 null
2025-10-20 Deparametrization and Quantization of Scalar-Tensor Gravity and Its Cosmological Model Faqiang Yuan et.al. 2510.17663 null
2025-10-21 TrajMamba: An Efficient and Semantic-rich Vehicle Trajectory Pre-training Model Yichen Liu et.al. 2510.17545 null
2025-10-20 The Graphon Limit Hypothesis: Understanding Neural Network Pruning via Infinite Width Analysis Hoang Pham et.al. 2510.17515 null
2025-10-20 $\mathcal{V}isi\mathcal{P}runer$: Decoding Discontinuous Cross-Modal Dynamics for Efficient Multimodal LLMs Yingqi Fan et.al. 2510.17205 null
2025-10-20 ZSPAPrune: Zero-Shot Prompt-Aware Token Pruning for Vision-Language Models Pu Zhang et.al. 2510.17197 null
2025-10-20 SOLE: Hardware-Software Co-design of Softmax and LayerNorm for Efficient Transformer Inference Wenxun Wang et.al. 2510.17189 null
2025-10-20 HyperSearch: Prediction of New Hyperedges through Unconstrained yet Efficient Search Hyunjin Choo et.al. 2510.17153 null
2025-10-19 Foundation Models in Medical Image Analysis: A Systematic Review and Meta-Analysis Praveenbalaji Rajendran et.al. 2510.16973 null
2025-10-19 Leave It to the Experts: Detecting Knowledge Distillation via MoE Expert Signatures Pingzhi Li et.al. 2510.16968 null
2025-10-19 SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization Wenxi Chen et.al. 2510.16841 null
2025-10-19 Mixed-Precision Quantization for Language Models: Techniques and Prospects Mariam Rakka et.al. 2510.16805 null
2025-10-19 ELMM: Efficient Lightweight Multimodal Large Language Models for Multimodal Knowledge Graph Completion Wei Huang et.al. 2510.16753 null
2025-10-19 DistilLock: Safeguarding LLMs from Unauthorized Knowledge Distillation on the Edge Asmita Mohanty et.al. 2510.16716 null
2025-10-19 CLIP: Client-Side Invariant Pruning for Mitigating Stragglers in Secure Federated Learning Anthony DiMaggio et.al. 2510.16694 null
2025-10-19 Pursuing Minimal Sufficiency in Spatial Reasoning Yejie Guo et.al. 2510.16688 null
2025-10-18 HYDRA: HYbrid knowledge Distillation and spectral Reconstruction Algorithm for high channel hyperspectral camera applications Christopher Thirgood et.al. 2510.16664 null
2025-10-18 Self-Supervised Learning to Fly using Efficient Semantic Segmentation and Metric Depth Estimation for Low-Cost Autonomous UAVs Sebastian Mocanu et.al. 2510.16624 null
2025-10-18 A Deep Learning Framework for Real-Time Image Processing in Medical Diagnostics: Enhancing Accuracy and Speed in Clinical Applications Melika Filvantorkaman et.al. 2510.16611 null
2025-10-18 SPLite Hand: Sparsity-Aware Lightweight 3D Hand Pose Estimation Yeh Keng Hao et.al. 2510.16396 null
2025-10-18 QSVD: Efficient Low-rank Approximation for Unified Query-Key-Value Weight Compression in Low-Precision Vision-Language Models Yutong Wang et.al. 2510.16292 null
2025-10-17 One-Bit Quantization for Random Features Models Danil Akhtiamov et.al. 2510.16250 null
2025-10-18 Differentiable, Bit-shifting, and Scalable Quantization without training neural network from scratch Zia Badar et.al. 2510.16088 null
2025-10-17 Optimization of the quantization of dense neural networks from an exact QUBO formulation Sergio Muñiz Subiñas et.al. 2510.16075 null
2025-10-16 AMS-QUANT: Adaptive Mantissa Sharing for Floating-point Quantization Mengtao Lv et.al. 2510.16045 null
2025-10-16 Vector Quantization in the Brain: Grid-like Codes in World Models Xiangyuan Peng et.al. 2510.16039 null
2025-10-17 SANR: Scene-Aware Neural Representation for Light Field Image Compression with Rate-Distortion Optimization Gai Zhang et.al. 2510.15775 null
2025-10-17 Evaluation of Novel Fast Machine Learning Algorithms for Knowledge-Distillation-Based Anomaly Detection at CMS Lino Gerlach et.al. 2510.15672 null
2025-10-17 Time evolution of the Husimi and Glauber-Sudarshan functions in terms of complementary Hamiltonian symbols Mritunjay Tyagi et.al. 2510.15628 null
2025-10-17 GRATING: Low-Latency and Memory-Efficient Semantic Selection on Device Jiahao Zhou et.al. 2510.15620 null
2025-10-17 Quantized FCA: Efficient Zero-Shot Texture Anomaly Detection Andrei-Timotei Ardelean et.al. 2510.15602 null
2025-10-17 SpikeFit: Towards Optimal Deployment of Spiking Networks on Neuromorphic Hardware Ivan Kartashov et.al. 2510.15542 null
2025-10-17 Revisiting Knowledge Distillation: The Hidden Role of Dataset Size Giulia Lanzillotta et.al. 2510.15516 null
2025-10-17 Quantization-Based Score Calibration for Few-Shot Keyword Spotting with Dynamic Time Warping in Noisy Environments Kevin Wilkinghoff et.al. 2510.15432 null
2025-10-17 ParaFormer: Shallow Parallel Transformers with Progressive Approximation Wei Wang et.al. 2510.15425 null
2025-10-17 Fine-Tuning MedGemma for Clinical Captioning to Enhance Multimodal RAG over Malaysia CPGs Lee Qi Zun et.al. 2510.15418 null
2025-10-17 Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation Fei Wang et.al. 2510.15304 null
2025-10-17 GRank: Towards Target-Aware and Streamlined Industrial Retrieval with a Generate-Rank Framework Yijia Sun et.al. 2510.15299 null
2025-10-17 Exemplar-Guided Planing: Enhanced LLM Agent for KGQA Jingao Xu et.al. 2510.15283 null
2025-10-16 Dyadic microlocal partition for anisotropic metrics and uniform Weyl quantization Vicente Vergara et.al. 2510.15183 null
2025-10-16 SaLon3R: Structure-aware Long-term Generalizable 3D Reconstruction from Unposed Images Jiaxin Guo et.al. 2510.15072 link
2025-10-16 MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning Mattia Segu et.al. 2510.15026 null
2025-10-16 TASLA: Text-Aligned Speech Tokens with Multiple Layer-Aggregation Ming-Hao Hsu et.al. 2510.14934 null
2025-10-16 Efficient and Robust Carathéodory-Steinitz Pruning of Positive Discrete Measures Filip Bělík et.al. 2510.14916 null
2025-10-16 Dynamic-Key-Aware Co-Simulation Framework for Next Generation of SCADA Systems Encrypted by Quantum-Key-Distribution Techniques Ziqing Zhu et.al. 2510.14838 null
2025-10-16 FraQAT: Quantization Aware Training with Fractional bits Luca Morreale et.al. 2510.14823 null
2025-10-16 Dataset Pruning in RecSys and ML: Best Practice or Mal-Practice? Leonie Winter et.al. 2510.14704 null
2025-10-16 WeCKD: Weakly-supervised Chained Distillation Network for Efficient Multimodal Medical Imaging Md. Abdur Rahman et.al. 2510.14668 null
2025-10-16 Task-Based Quantization for Channel Estimation in RIS Empowered MmWave Systems Gyoseung Lee et.al. 2510.14649 null
2025-10-16 GemiRec: Interest Quantization and Generation for Multi-Interest Recommendation Zhibo Wu et.al. 2510.14626 null
2025-10-16 Efficient Video Sampling: Pruning Temporally Redundant Tokens for Faster VLM Inference Natan Bagrov et.al. 2510.14624 null
2025-10-16 A Deep State-Space Model Compression Method using Upper Bound on Output Error Hiroki Sakamoto et.al. 2510.14542 null
2025-10-16 Pruning Overparameterized Multi-Task Networks for Degraded Web Image Restoration Thomas Katraouras et.al. 2510.14463 null
2025-10-16 A Free Lunch in LLM Compression: Revisiting Retraining after Pruning Moritz Wagner et.al. 2510.14444 null
2025-10-16 Low Power Vision Transformer Accelerator with Hardware-Aware Pruning and Optimized Dataflow Ching-Lin Hsiung et.al. 2510.14393 null
2025-10-16 DRBD-Mamba for Robust and Efficient Brain Tumor Segmentation with Analytical Insights Danish Ali et.al. 2510.14383 null
2025-10-16 Computing-In-Memory Aware Model Adaption For Edge Devices Ming-Han Lin et.al. 2510.14379 null
2025-10-16 Constraint-Driven Small Language Models Based on Agent and OpenAlex Knowledge Graph: Mining Conceptual Pathways and Discovering Innovation Points in Academic Papers Ziye Xia et.al. 2510.14303 null
2025-10-15 Toward Cybersecurity-Expert Small Language Models Matan Levi et.al. 2510.14113 null
2025-10-15 REAP the Experts: Why Pruning Prevails for One-Shot MoE compression Mike Lasby et.al. 2510.13999 null
2025-10-15 Readability $\ne$ Learnability: Rethinking the Role of Simplicity in Training Small Language Models Ivan Lee et.al. 2510.13915 null
2025-10-14 A Survey on Collaborating Small and Large Language Models for Performance, Cost-effectiveness, Cloud-edge Privacy, and Trustworthiness Fali Wang et.al. 2510.13890 null
2025-10-13 What Layers When: Learning to Skip Compute in LLMs with Residual Gates Filipe Laitenberger et.al. 2510.13876 null
2025-10-13 ShishuLM: Lightweight Language Model with Hybrid Decoder-MLP Architecture and Paired Weight Sharing Shivanshu Kumar et.al. 2510.13860 null
2025-10-15 Invited Paper: BitMedViT: Ternary-Quantized Vision Transformer for Medical AI Assistants on the Edge Mikolaj Walczak et.al. 2510.13760 null
2025-10-15 Don’t Be Greedy, Just Relax! Pruning LLMs via Frank-Wolfe Christophe Roux et.al. 2510.13713 null
2025-10-15 XD-RCDepth: Lightweight Radar-Camera Depth Estimation with Explainability-Aligned and Distribution-Aware Distillation Huawei Sun et.al. 2510.13565 null
2025-10-15 DistilCLIP-EEG: Enhancing Epileptic Seizure Detection Through Multi-modal Learning and Knowledge Distillation Zexin Wang et.al. 2510.13497 null
2025-10-15 F-BFQ: Flexible Block Floating-Point Quantization Accelerator for LLMs Jude Haris et.al. 2510.13401 null
2025-10-15 Energy-Efficient FPGA Framework for Non-Quantized Convolutional Neural Networks Angelos Athanasiadis et.al. 2510.13362 null
2025-10-15 Behavioral Embeddings of Programs: A Quasi-Dynamic Approach for Optimization Prediction Haolin Pan et.al. 2510.13158 null
2025-10-15 NeuroRVQ: Multi-Scale EEG Tokenization for Generative Large Brainwave Models Konstantinos Barmpas et.al. 2510.13068 null
2025-10-14 Data to Certificate: Guaranteed Cost Control with Quantization-Aware System Identification Shahab Ataei et.al. 2510.13024 null
2025-10-14 Pruning Cannot Hurt Robustness: Certified Trade-offs in Reinforcement Learning James Pedley et.al. 2510.12939 null
2025-10-14 Emergent spin Hall quantization and high-order van Hove singularities in square-octagonal MA$_2$Z$_4$ Rahul Verma et.al. 2510.12935 null
2025-10-14 Learning at the Speed of Physics: Equilibrium Propagation on Oscillator Ising Machines Alex Gower et.al. 2510.12934 null
2025-10-14 Efficient Adaptive Transformer: An Empirical Study and Reproducible Framework Jan Miller et.al. 2510.12856 null
2025-10-14 CARVQ: Corrective Adaptor with Group Residual Vector Quantization for LLM Embedding Compression Dayin Gou et.al. 2510.12721 null
2025-10-14 Rethinking Knowledge Distillation: A Data Dependent Regulariser With a Negative Asymmetric Payoff Israel Mason-Williams et.al. 2510.12615 null
2025-10-14 Automated Behavior Planning for Fruit Tree Pruning via Redundant Robot Manipulators: Addressing the Behavior Planning Challenge Gaoyuan Liu et.al. 2510.12509 null
2025-10-14 SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression Biao Zhang et.al. 2510.12474 null
2025-10-14 A Hierarchical Quantized Tokenization Framework for Task-Adaptive Graph Representation Learning Yang Xiang et.al. 2510.12369 null
2025-10-14 Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval Jianfeng Dong et.al. 2510.12283 null
2025-10-14 CompoDistill: Attention Distillation for Compositional Reasoning in Multimodal LLMs Jiwan Kim et.al. 2510.12184 null
2025-10-14 Evolution of meta’s llama models and parameter-efficient fine-tuning of large language models: a survey Abdulhady Abas Abdullah et.al. 2510.12178 null
2025-10-14 Compressibility Measures Complexity: Minimum Description Length Meets Singular Learning Theory Einar Urdshals et.al. 2510.12077 null
2025-10-14 Multi-stage Prompt Refinement for Mitigating Hallucinations in Large Language Models Jung-Woo Shim et.al. 2510.12032 null
2025-10-13 MosaicDiff: Training-free Structural Pruning for Diffusion Model Acceleration Reflecting Pretraining Dynamics Bowei Guo et.al. 2510.11962 null
2025-10-13 Topological Vibration Analysis of Elastic Lattices via Bloch Sphere Mapping Kazi Tahsin Mahmood et.al. 2510.11930 null
2025-10-13 QeRL: Beyond Efficiency – Quantization-enhanced Reinforcement Learning for LLMs Wei Huang et.al. 2510.11696 null
2025-10-13 LLM-Oriented Token-Adaptive Knowledge Distillation Xurong Xie et.al. 2510.11615 null
2025-10-14 AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model Zhiwei Jin et.al. 2510.11496 null
2025-10-13 Rescaling-Aware Training for Efficient Deployment of Deep Learning Models on Full-Integer Hardware Lion Mueller et.al. 2510.11484 null
2025-10-13 XQuant: Achieving Ultra-Low Bit KV Cache Quantization with Cross-Layer Compression Haoqi Yang et.al. 2510.11236 null
2025-10-13 G2L: From Giga-Scale to Cancer-Specific Large-Scale Pathology Foundation Models via Knowledge Distillation Yesung Cho et.al. 2510.11176 null
2025-10-13 Lightweight Facial Landmark Detection in Thermal Images via Multi-Level Cross-Modal Knowledge Transfer Qiyi Tong et.al. 2510.11128 null
2025-10-13 DITTO: A Spoofing Attack Framework on Watermarked LLMs via Knowledge Distillation Hyeseon Ahn et.al. 2510.10987 null
2025-10-15 Bit Allocation Transfer for Perceptual Quality Enhancement of VVC Intra Coding Runyu Yang et.al. 2510.10970 null
2025-10-13 Not All Bits Are Equal: Scale-Dependent Memory Optimization Strategies for Reasoning Models Junhyuck Kim et.al. 2510.10964 null
2025-10-13 MC#: Mixture Compressor for Mixture-of-Experts Large Models Wei Huang et.al. 2510.10962 null
2025-10-12 PruneGCRN: Minimizing and explaining spatio-temporal problems through node pruning Javier García-Sigüenza et.al. 2510.10803 null
2025-10-12 Bhasha-Rupantarika: Algorithm-Hardware Co-design approach for Multilingual Neural Machine Translation Mukul Lokhande et.al. 2510.10676 null
2025-10-12 ADiP: Adaptive Precision Systolic Array for Matrix Multiplication Acceleration Ahmed J. Abdelmaksoud et.al. 2510.10623 null
2025-10-12 Preserving LLM Capabilities through Calibration Data Curation: From Analysis to Optimization Bowei He et.al. 2510.10618 link
2025-10-12 BitMar: Low-Bit Multimodal Fusion with Episodic Memory for Edge Devices Euhid Aman et.al. 2510.10560 null
2025-10-12 MRS-YOLO Railroad Transmission Line Foreign Object Detection Based on Improved YOLO11 and Channel Pruning Siyuan Liu et.al. 2510.10553 null
2025-10-12 Preserving Core Structures of Social Networks via Information Guided Multi-Step Graph Pruning Yutong Hu et.al. 2510.10499 null
2025-10-12 AnyBCQ: Hardware Efficient Flexible Binary-Coded Quantization for Multi-Precision LLMs Gunho Park et.al. 2510.10467 null
2025-10-14 Multi-View Graph Learning with Graph-Tuple Shiyu Chen et.al. 2510.10341 null
2025-10-11 Grounded AI for Code Review: Resource-Efficient Large-Model Serving in Enterprise Pipelines Sayan Mandal et.al. 2510.10290 null
2025-10-11 Opacity-Gradient Driven Density Control for Compact and Efficient Few-Shot 3D Gaussian Splatting Abdelrhman Elrawy et.al. 2510.10257 null
2025-10-11 Efficient Mining of Low-Utility Sequential Patterns Jian Zhu et.al. 2510.10243 null
2025-10-11 ImCoref-CeS: An Improved Lightweight Pipeline for Coreference Resolution with LLM-based Checker-Splitter Refinement Kangyang Luo et.al. 2510.10241 null
2025-10-11 PermLLM: Learnable Channel Permutation for N:M Sparse Large Language Models Lancheng Zou et.al. 2510.10136 null
2025-10-11 Preference-driven Knowledge Distillation for Few-shot Node Classification Xing Wei et.al. 2510.10116 null
2025-10-11 Targeted Sequential Pattern Mining with High Average Utility Kai Cao et.al. 2510.10115 null
2025-10-11 P-4DGS: Predictive 4D Gaussian Splatting with 90$\times$ Compression Henan Wang et.al. 2510.10030 null
2025-10-11 Conformal Sparsification for Bandwidth-Efficient Edge-Cloud Speculative Decoding Payel Bhattacharjee et.al. 2510.09942 null
2025-10-10 DELTA: Dynamic Layer-Aware Token Attention for Efficient Long-Context Reasoning Hossein Entezari Zarch et.al. 2510.09883 null
2025-10-10 Tensor-based compression of the sea temperature data Ilya Kosolapov et.al. 2510.09778 null
2025-10-10 Secret-Key Agreement Through Hidden Markov Modeling of Wavelet Scattering Embeddings Nora Basha et.al. 2510.09773 null
2025-10-10 ReaLM: Residual Quantization Bridging Knowledge Graph Embeddings and Large Language Models Wenbin Guo et.al. 2510.09711 null
2025-10-09 Vanishing Contributions: A Unified Approach to Smoothly Transition Neural Models into Compressed Form Lorenzo Nikiforos et.al. 2510.09696 null
2025-10-10 Automated Evolutionary Optimization for Resource-Efficient Neural Network Training Ilia Revin et.al. 2510.09566 null
2025-10-10 Hierarchical Indexing with Knowledge Enrichment for Multilingual Video Corpus Retrieval Yu Wang et.al. 2510.09553 null
2025-10-10 Quantization of charged fields in the presence of intense electromagnetic fields Álvaro Álvarez-Domínguez et.al. 2510.09447 null
2025-10-10 ReTraceQA: Evaluating Reasoning Traces of Small Language Models in Commonsense Question Answering Francesco Maria Molfese et.al. 2510.09351 null
2025-10-10 Serial Polar Automorphism Ensemble Decoders for Physical Unclonable Functions Marvin Rübenacke et.al. 2510.09220 null
2025-10-10 DICE: Structured Reasoning in LLMs through SLM-Guided Chain-of-Thought Correction Yiqi Li et.al. 2510.09211 null
2025-10-10 Co-designing a Programmable RISC-V Accelerator for MPC-based Energy and Thermal Management of Many-Core HPC Processors Alessandro Ottaviano et.al. 2510.09163 null
2025-10-10 Cross-Representation Benchmarking in Time-Series Electronic Health Records for Clinical Outcome Prediction Tianyi Chen et.al. 2510.09159 null
2025-10-10 Dense2MoE: Restructuring Diffusion Transformer to MoE for Efficient Text-to-Image Generation Youwei Zheng et.al. 2510.09094 null
2025-10-10 HERO: Hardware-Efficient RL-based Optimization Framework for NeRF Quantization Yipu Zhang et.al. 2510.09010 null
2025-10-10 SQS: Bayesian DNN Compression through Sparse Quantized Sub-distributions Ziyi Wang et.al. 2510.08999 null
2025-10-10 FedL2T: Personalized Federated Learning with Two-Teacher Distillation for Seizure Prediction Jionghao Lou et.al. 2510.08984 null
2025-10-10 Defense against Unauthorized Distillation in Image Restoration via Feature Space Perturbation Han Hu et.al. 2510.08925 null
2025-10-09 FOLK: Fast Open-Vocabulary 3D Instance Segmentation via Label-guided Knowledge Distillation Hongrui Wu et.al. 2510.08849 null
2025-10-13 TinyGraphEstimator: Adapting Lightweight Language Models for Graph Structure Inference Michal Podstawski et.al. 2510.08808 null
2025-10-09 Learning What to Remember: Adaptive Probabilistic Memory Retention for Memory-Efficient Language Models S M Rafiuddin et.al. 2510.08798 null
2025-10-08 From What to Why: Thought-Space Recommendation with Small Language Models Prosenjit Biswas et.al. 2510.08626 null
2025-10-09 DeepPrune: Parallel Scaling without Inter-trace Redundancy Shangqing Tu et.al. 2510.08483 null
2025-10-09 Don’t Run with Scissors: Pruning Breaks VLA Models but They Can Be Recovered Jason Jabbour et.al. 2510.08464 null
2025-10-09 Hierarchical Spatial Algorithms for High-Resolution Image Quantization and Feature Extraction Noor Islam S. Mohammad et.al. 2510.08449 null
2025-10-09 Continuous Variable Hamiltonian Learning at Heisenberg Limit via Displacement-Random Unitary Transformation Xi Huang et.al. 2510.08419 null
2025-10-10 Fewer Weights, More Problems: A Practical Attack on LLM Pruning Kazuki Egashira et.al. 2510.07985 null
2025-10-09 LightReasoner: Can Small Language Models Teach Large Language Models Reasoning? Jingyuan Wang et.al. 2510.07962 null
2025-10-09 SimCast: Enhancing Precipitation Nowcasting with Short-to-Long Term Knowledge Distillation Yifang Yin et.al. 2510.07953 null
2025-10-09 Synergy Between the Strong and the Weak: Spiking Neural Networks are Inherently Self-Distillers Yongqi Ding et.al. 2510.07924 null
2025-10-09 STEPER: Step-wise Knowledge Distillation for Enhancing Reasoning Ability in Multi-Step Retrieval-Augmented Language Models Kyumin Lee et.al. 2510.07923 null
2025-10-09 Balanced ternary formalism of second quantization Yao Yao et.al. 2510.07863 null
2025-10-09 AdaSwitch: Adaptive Switching Generation for Knowledge Distillation Jingyu Peng et.al. 2510.07842 null
2025-10-09 RCPU: Rotation-Constrained Error Compensation for Structured Pruning of a Large Language Model Shuichiro Haruta et.al. 2510.07782 null
2025-10-09 From Noisy to Native: LLM-driven Graph Restoration for Test-Time Graph Domain Adaptation Xiangwei Lv et.al. 2510.07762 null
2025-10-09 OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference Yuzhe Gu et.al. 2510.07651 null
2025-10-08 Don’t Adapt Small Language Models for Tools; Adapt Tool Schemas to the Models Jonggeun Lee et.al. 2510.07248 null
2025-10-08 Where to Begin: Efficient Pretraining via Subnetwork Selection and Distillation Arjun Krishnakumar et.al. 2510.07227 null
2025-10-08 A Theoretically-Grounded Codebook for Digital Semantic Communications Lingyi Wang et.al. 2510.07108 null
2025-10-08 Sharpness-Aware Data Generation for Zero-shot Quantization Dung Hoang-Anh et.al. 2510.07018 null
2025-10-08 Efficient numeracy in language models through single-token number embeddings Linus Kreitner et.al. 2510.06824 null
2025-10-08 OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot Junhan Zhu et.al. 2510.06751 null
2025-10-08 Optimizing Fronthaul Quantization for Flexible User Load in Cell-Free Massive MIMO Fabian Göttsch et.al. 2510.06734 null
2025-10-08 Distilling Lightweight Language Models for C/C++ Vulnerabilities Zhiyuan Wei et.al. 2510.06645 null
2025-10-08 Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer Ziyuan Huang et.al. 2510.06590 null
2025-10-07 GUIDE: Guided Initialization and Distillation of Embeddings Khoa Trinh et.al. 2510.06502 null
2025-10-05 Dual-stage and Lightweight Patient Chart Summarization for Emergency Physicians Jiajun Wu et.al. 2510.06263 null
2025-10-07 Training Dynamics Impact Post-Training Quantization Robustness Albert Catalan-Tatjer et.al. 2510.06213 null
2025-10-07 Latent Speech-Text Transformer Yen-Ju Lu et.al. 2510.06195 null
2025-10-07 VecInfer: Efficient LLM Inference with Low-Bit KV Cache via Outlier-Suppressed Vector Quantization Dingyu Yao et.al. 2510.06175 null
2025-10-07 Downsized and Compromised?: Assessing the Faithfulness of Model Compression Moumita Kamal et.al. 2510.06125 null
2025-10-07 Influence Functions for Efficient Data Selection in Reasoning Prateek Humane et.al. 2510.06108 null
2025-10-07 The Valley of Code Reasoning: Scaling Knowledge Distillation of Large Language Models Muyu He et.al. 2510.06101 null
2025-10-07 Adaptive Pruning for Increased Robustness and Reduced Computational Overhead in Gaussian Process Accelerated Saddle Point Searches Rohit Goswami et.al. 2510.06030 null
2025-10-07 Distributed Platoon Control Under Quantization: Stability Analysis and Privacy Preservation Kaixiang Zhang et.al. 2510.05959 null
2025-10-07 $\bf{D^3}$QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection Yanran Zhang et.al. 2510.05891 null
2025-10-07 Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer Maxence Lasbordes et.al. 2510.05846 null
2025-10-08 OneVision: An End-to-End Generative Framework for Multi-view E-commerce Vision Search Zexin Zheng et.al. 2510.05759 null
2025-10-07 Syn-Diag: An LLM-based Synergistic Framework for Generalizable Few-shot Fault Diagnosis on the Edge Zijun Jia et.al. 2510.05733 null
2025-10-07 DecEx-RAG: Boosting Agentic Retrieval-Augmented Generation with Decision and Execution Optimization via Process Supervision Yongqi Leng et.al. 2510.05691 null
2025-10-07 InstaGeo: Compute-Efficient Geospatial Machine Learning from Data to Deployment Ibrahim Salihu Yusuf et.al. 2510.05617 null
2025-10-07 Deciphering Invariant Feature Decoupling in Source-free Time Series Forecasting with Proxy Denoising Kangjia Yan et.al. 2510.05589 null
2025-10-07 Activation-Informed Pareto-Guided Low-Rank Compression for Efficient LLM/VLM Ryan Solgi et.al. 2510.05544 null
2025-10-07 H1B-KV: Hybrid One-Bit Caches for Memory-Efficient Large Language Model Inference Harshil Vejendla et.al. 2510.05529 null
2025-10-07 ARMOR: High-Performance Semi-Structured Pruning via Adaptive Matrix Factorization Lawrence Liu et.al. 2510.05528 null
2025-10-07 LANTERN: Scalable Distillation of Large Language Models for Job-Person Fit and Explanation Zhoutong Fu et.al. 2510.05490 null
2025-10-07 AMAQ: Adaptive Mixed-bit Activation Quantization for Collaborative Parameter Efficient Fine-tuning Yurun Song et.al. 2510.05468 null
2025-10-06 KVLinC: KV Cache Quantization with Hadamard Rotation and Linear Correction Utkarsh Saxena et.al. 2510.05373 null
2025-10-06 Gamma Mixture Modeling for Cosine Similarity in Small Language Models Kevin Player et.al. 2510.05309 null
2025-10-06 DP-Adam-AC: Privacy-preserving Fine-Tuning of Localizable Language Models Using Adam Optimization with Adaptive Clipping Ruoxing Yang et.al. 2510.05288 null
2025-10-05 OptiFLIDS: Optimized Federated Learning for Energy-Efficient Intrusion Detection in IoT Saida Elouardi et.al. 2510.05180 null
2025-10-05 PatternKV: Flattening KV Representation Expands Quantization Headroom Ji Zhang et.al. 2510.05176 null
2025-10-04 SATER: A Self-Aware and Token-Efficient Approach to Routing and Cascading Yuanzhe Shen et.al. 2510.05164 null
2025-10-06 Slm-mux: Orchestrating small language models for reasoning Chenyu Wang et.al. 2510.05077 null
2025-10-06 Boomerang Distillation Enables Zero-Shot Model Size Interpolation Sara Kangaslahti et.al. 2510.05064 null
2025-10-06 ERDE: Entropy-Regularized Distillation for Early-exit Martial Guidez et.al. 2510.04856 null
2025-10-06 Natural Language Edge Labelling: Decoupling Intent from Execution in Structured LM Reasoning Abhinav Madahar et.al. 2510.04817 null
2025-10-08 Are BabyLMs Deaf to Gricean Maxims? A Pragmatic Evaluation of Sample-efficient Language Models Raha Askari et.al. 2510.04764 null
2025-10-06 Dimensionally-Efficient Transmission and Storage of Unitary Matrices Juan Vidal Alegría et.al. 2510.04734 null
2025-10-06 TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA Chanjoo Jung et.al. 2510.04682 null
2025-10-06 FT-MDT: Extracting Decision Trees from Medical Texts via a Novel Low-rank Adaptation Method Yuheng Li et.al. 2510.04655 null
2025-10-06 Compressed Concatenation of Small Embedding Models Mohamed Ayoub Ben Ayad et.al. 2510.04626 null
2025-10-06 SpikingMamba: Towards Energy-Efficient Large Language Models via Knowledge Distillation from Mamba Yulong Huang et.al. 2510.04595 null
2025-10-06 Post-training quantization of vision encoders needs prefixing registers Seunghyeon Kim et.al. 2510.04547 null
2025-10-05 Diffusion-Assisted Distillation for Self-Supervised Graph Representation Learning with MLPs Seong Jin Ahn et.al. 2510.04241 null
2025-10-05 Enhancing Speaker Verification with w2v-BERT 2.0 and Knowledge Distillation guided Structured Pruning Ze Li et.al. 2510.04213 null
2025-10-05 Learning from All: Concept Alignment for Autonomous Distillation from Multiple Drifting MLLMs Xiaoyu Yang et.al. 2510.04142 null
2025-10-05 Efficient Training of Spiking Neural Networks by Spike-aware Data Pruning Chenxiang Ma et.al. 2510.04098 null
2025-10-05 QuantDemoire: Quantization with Outlier Aware for Image Demoiréing Zheng Chen et.al. 2510.04066 null
2025-10-05 Quantization Range Estimation for Convolutional Neural Networks Bingtao Yang et.al. 2510.04044 null
2025-10-05 Small Language Models for Emergency Departments Decision Support: A Benchmark Study Zirui Wang et.al. 2510.04032 null
2025-10-05 Dual Pruning and Sorting-Free Overestimation for Average-Utility Sequential Pattern Mining Kai Cao et.al. 2510.04014 null
2025-10-04 PsycholexTherapy: Simulating Reasoning in Psychotherapy with Small Language Models in Persian Mohammad Amin Abbasi et.al. 2510.03913 null
2025-10-04 NoTVLA: Narrowing of Dense Action Trajectories for Generalizable Robot Manipulation Zheng Huang et.al. 2510.03895 null
2025-10-04 SDAKD: Student Discriminator Assisted Knowledge Distillation for Super-Resolution Generative Adversarial Networks Nikolaos Kaparinos et.al. 2510.03870 null
2025-10-04 Optimized Minimal 4D Gaussian Splatting Minseo Lee et.al. 2510.03857 null
2025-10-04 Small Language Models for Agentic Systems: A Survey of Architectures, Capabilities, and Deployment Trade offs Raghav Sharma et.al. 2510.03847 null
2025-10-04 MECKD: Deep Learning-Based Fall Detection in Multilayer Mobile Edge Computing With Knowledge Distillation Wei-Lung Mao et.al. 2510.03601 null
2025-10-04 Decoupling Task-Solving and Output Formatting in LLM Generation Haikang Deng et.al. 2510.03595 null
2025-10-03 RAPID: An Efficient Reinforcement Learning Algorithm for Small Language Models Lianghuan Huang et.al. 2510.03515 null
2025-10-03 Conditional Pseudo-Supervised Contrast for Data-Free Knowledge Distillation Renrong Shao et.al. 2510.03375 null
2025-10-03 FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents Imene Kerboua et.al. 2510.03204 null
2025-10-03 Mixture of Many Zero-Compute Experts: A High-Rate Quantization Theory Perspective Yehuda Dar et.al. 2510.03151 null
2025-10-03 Enhancing XAI Narratives through Multi-Narrative Refinement and Knowledge Distillation Flavio Giorgi et.al. 2510.03134 null
2025-10-03 Studying $\textrm{QED}_3$ with radial quantization on the lattice – I. Free limit Peter A. Boyle et.al. 2510.03085 null
2025-10-03 CHORD: Customizing Hybrid-precision On-device Model for Sequential Recommendation with Device-cloud Collaboration Tianqi Liu et.al. 2510.03038 null
2025-10-03 PocketSR: The Super-Resolution Expert in Your Pocket Mobiles Haoze Sun et.al. 2510.03012 null
2025-10-03 Don’t Just Chase “Highlighted Tokens” in MLLMs: Revisiting Visual Holistic Context Retention Xin Zou et.al. 2510.02912 null
2025-10-03 FlexiQ: Adaptive Mixed-Precision Quantization for Latency/Accuracy Trade-Offs in Deep Neural Networks Jaemin Kim et.al. 2510.02822 null
2025-10-03 Using Landau quantization to probe disorder in semiconductor heterostructures Asser Elsayed et.al. 2510.02794 null
2025-10-03 GRNND: A GPU-Parallel Relative NN-Descent Algorithm for Efficient Approximate Nearest Neighbor Graph Construction Xiang Li et.al. 2510.02774 null
2025-10-03 Rate-Adaptive Semantic Communication via Multi-Stage Vector Quantization Jinsung Park et.al. 2510.02646 null
2025-10-03 HyperAdaLoRA: Accelerating LoRA Rank Allocation During Training via Hypernetworks without Sacrificing Performance Hao Zhang et.al. 2510.02630 null
2025-10-02 SAGE: Streaming Agreement-Driven Gradient Sketches for Representative Subset Selection Ashish Jha et.al. 2510.02470 null
2025-10-02 Assessing the Potential for Catastrophic Failure in Dynamic Post-Training Quantization Logan Frank et.al. 2510.02457 null
2025-10-02 Knowledge Distillation Detection for Open-weights Models Qin Shi et.al. 2510.02302 null
2025-10-02 BioX-Bridge: Model Bridging for Unsupervised Cross-Modal Knowledge Transfer across Biosignals Chenqi Li et.al. 2510.02276 null
2025-10-02 More Than One Teacher: Adaptive Multi-Guidance Policy Optimization for Diverse Exploration Xiaoyang Yuan et.al. 2510.02227 null
2025-10-02 Collaborative Edge Inference via Semantic Grouping under Wireless Channel Constraints Mateus P. Mota et.al. 2510.02222 null
2025-10-02 Demystifying the Roles of LLM Layers in Retrieval, Knowledge, and Reasoning Xinyuan Song et.al. 2510.02091 null
2025-10-02 Parallelism Empowered Guessing Random Additive Noise Decoding Li Wan et.al. 2510.01813 null
2025-10-02 $C^0$-rigidity of Legendrians and coisotropics via sheaf quantization Tomohiro Asano et.al. 2510.01746 null
2025-10-02 ENLighten: Lighten the Transformer, Enable Efficient Optical Acceleration Hanqing Zhu et.al. 2510.01673 null
2025-10-02 Shift-Invariant Attribute Scoring for Kolmogorov-Arnold Networks via Shapley Value Wangxuan Fan et.al. 2510.01663 null
2025-10-02 Efficient Training of Robust Traditional Chinese LLaMA-1B on a Single Consumer GPU: Continual Pre-training, SFT, and DPO Yu-Cheng Chih et.al. 2510.01616 null
2025-10-02 Think Right: Learning to Mitigate Under-Over Thinking via Adaptive, Attentive Compression Joykirat Singh et.al. 2510.01581 null
2025-10-03 Ultra-Efficient Decoding for End-to-End Neural Compression and Reconstruction Ethan G. Rogers et.al. 2510.01407 null
2025-10-01 ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models Akshat Ramachandran et.al. 2510.01290 null
2025-10-01 Adaptive Event Stream Slicing for Open-Vocabulary Event-Based Object Detection via Vision-Language Knowledge Distillation Jinchang Zhang et.al. 2510.00681 null
2025-10-01 Expected Attention: KV Cache Compression by Estimating Attention from Future Queries Distribution Alessio Devoto et.al. 2510.00636 null
2025-10-01 Panorama: Fast-Track Nearest Neighbors Vansh Ramani et.al. 2510.00566 null
2025-10-01 GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness Kung-Hsiang Huang et.al. 2510.00536 null
2025-10-01 Has the Two-Decade-Old Prophecy Come True? Artificial Bad Intelligence Triggered by Merely a Single-Bit Flip in Large Language Models Yu Yan et.al. 2510.00490 null
2025-10-01 LongCodeZip: Compress Long Context for Code Language Models Yuling Shi et.al. 2510.00446 null
2025-10-01 Semantic-Driven AI Agent Communications: Challenges and Solutions Kaiwen Yu et.al. 2510.00381 null
2025-09-30 DiSC-AMC: Token- and Parameter-Efficient Discretized Statistics In-Context Automatic Modulation Classification Mohammad Rostami et.al. 2510.00316 null
2025-09-30 PrunedLoRA: Robust Gradient-Based structured pruning for Low-rank Adaptation in Fine-tuning Xin Yu et.al. 2510.00192 null
2025-09-30 Continuum Fractons: Quantization and the Many Body Problem Ylias Sadki et.al. 2510.00110 null
2025-09-30 Enhancing Certifiable Semantic Robustness via Robust Pruning of Deep Neural Networks Hanjiang Hu et.al. 2510.00083 null
2025-09-30 Fairness Testing in Retrieval-Augmented Generation: How Small Perturbations Reveal Bias in Small Language Models Matheus Vinicius da Silva de Oliveira et.al. 2509.26584 null
2025-10-01 Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation Miao Rang et.al. 2509.26497 null
2025-09-30 DiVeQ: Differentiable Vector Quantization Using the Reparameterization Trick Mohammad Hassan Vali et.al. 2509.26469 null
2025-09-30 Post-Training Quantization via Residual Truncation and Zero Suppression for Diffusion Models Donghoon Kim et.al. 2509.26436 null
2025-09-30 Cat: Post-training quantization error reduction via cluster-based affine transformation Ali Zoljodi et.al. 2509.26277 null
2025-09-30 Interpret, prune and distill Donut: towards lightweight VLMs for VQA on document Adnan Ben Mansour et.al. 2509.26235 null
2025-09-30 CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models Weiyu Huang et.al. 2509.25996 null
2025-09-30 Iterative Hypothesis Pruning and Distribution-based Early Labeling for Sequential Hypothesis Testing George Vershinin et.al. 2509.25908 null
2025-09-30 PerQ: Efficient Evaluation of Multilingual Text Personalization Quality Dominik Macko et.al. 2509.25903 null
2025-09-30 SAIL: SRAM-Accelerated LLM Inference System with Lookup-Table-based GEMV Jingyao Zhang et.al. 2509.25853 null
2025-09-30 Distillation of Large Language Models via Concrete Score Matching Yeongmin Kim et.al. 2509.25837 null
2025-10-03 Learning to Reason as Action Abstractions with Scalable Mid-Training RL Shenao Zhang et.al. 2509.25810 null
2025-09-30 Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training Yein Park et.al. 2509.25758 null
2025-09-30 Collaborative Compression for Large-Scale MoE Deployment on Edge Yixiao Chen et.al. 2509.25689 link
2025-09-30 Growing Winning Subnetworks, Not Pruning Them: A Paradigm for Density Discovery in Sparse Neural Networks Qihang Yao et.al. 2509.25665 null
2025-09-30 Effective Model Pruning Yixuan Wang et.al. 2509.25606 null
2025-09-29 On-Premise AI for the Newsroom: Evaluating Small Language Models for Investigative Document Search Nick Hagar et.al. 2509.25494 null
2025-09-29 Norm-Q: Effective Compression Method for Hidden Markov Models in Neuro-Symbolic Applications Hanyuan Gao et.al. 2509.25439 null
2025-09-29 Renormalization of Chern-Simons Wilson Loops via Flux Quantization in Cohomotopy Hisham Sati et.al. 2509.25336 null
2025-09-27 Knowledge distillation through geometry-aware representational alignment Prajjwal Bhattarai et.al. 2509.25253 null
2025-09-29 BALF: Budgeted Activation-Aware Low-Rank Factorization for Fine-Tuning-Free Model Compression David González-Martínez et.al. 2509.25136 null
2025-09-29 Towards Trustworthy Lexical Simplification: Exploring Safety and Efficiency with Small LLMs Akio Hayakawa et.al. 2509.25086 null
2025-09-29 Light-SQ: Structure-aware Shape Abstraction with Superquadrics for Generated Meshes Yuhan Wang et.al. 2509.24986 null
2025-09-29 Training-Free Token Pruning via Zeroth-Order Gradient Estimation in Vision-Language Models Youngeun Kim et.al. 2509.24837 null
2025-10-03 ExGS: Extreme 3D Gaussian Compression with Diffusion Priors Jiaqi Chen et.al. 2509.24758 null
2025-09-29 An asymptotic field approach for the control of dipole emission in integrated structures Vincenzo Macrì et.al. 2509.24717 null
2025-09-29 Discrete Variational Autoencoding via Policy Search Michael Drolet et.al. 2509.24716 null
2025-09-29 Performance-Efficiency Trade-off for Fashion Image Retrieval Julio Hurtado et.al. 2509.24477 null
2025-09-29 Generalist Multi-Class Anomaly Detection via Distillation to Two Heterogeneous Student Networks Hangil Park et.al. 2509.24448 null
2025-10-01 Proxy-GS: Efficient 3D Gaussian Splatting via Proxy Mesh Yuanyuan Gao et.al. 2509.24421 null
2025-09-29 CLQ: Cross-Layer Guided Orthogonal-based Quantization for Diffusion Transformers Kai Liu et.al. 2509.24416 null
2025-09-29 S$^2$NN: Sub-bit Spiking Neural Networks Wenjie Wei et.al. 2509.24266 null
2025-09-28 A Second-Order Perspective on Pruning at Initialization and Knowledge Transfer Leonardo Iurada et.al. 2509.24066 null
2025-09-28 The Hidden Costs of Translation Accuracy: Distillation, Quantization, and Environmental Impact Dhaathri Vijay et.al. 2509.23990 null
2025-09-28 AutoPrune: Each Complexity Deserves a Pruning Policy Hanshi Wang et.al. 2509.23931 null
2025-09-30 Differentiable Sparsity via $D$-Gating: Simple and Versatile Structured Penalization Chris Kolb et.al. 2509.23898 null
2025-09-28 DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning Yibo Yan et.al. 2509.23883 null
2025-09-28 Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning Shaobo Wang et.al. 2509.23873 null
2025-09-28 Taught Well Learned Ill: Towards Distillation-conditional Backdoor Attack Yukun Chen et.al. 2509.23871 null
2025-09-28 Tequila: Trapping-free Ternary Quantization for Large Language Models Hong Huang et.al. 2509.23809 null
2025-09-30 Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-Resolution Qifan Li et.al. 2509.23774 null
2025-09-28 LUQ: Layerwise Ultra-Low Bit Quantization for Multimodal Large Language Models Shubhang Bhatnagar et.al. 2509.23729 null
2025-09-30 QuantSparse: Comprehensively Compressing Video Diffusion Transformer with Model Quantization and Attention Sparsification Weilun Feng et.al. 2509.23681 null
2025-09-28 Why Alignment Must Precede Distillation: A Minimal Working Explanation Sungmin Cha et.al. 2509.23667 null
2025-09-28 HIVTP: A Training-Free Method to Improve VLMs Efficiency via Hierarchical Visual Token Pruning Using Middle-Layer-Based Importance Score Jingqi Xu et.al. 2509.23663 null
2025-10-01 Reasoning Scaffolding: Distilling the Flow of Thought from LLMs Xiangyu Wen et.al. 2509.23619 null
2025-09-28 RobuQ: Pushing DiTs to W1.58A2 via Robust Activation Quantization Kaicheng Yang et.al. 2509.23582 null
2025-09-28 Towards Efficient CoT Distillation: Self-Guided Rationale Selector for Better Performance with Fewer Rationales Jianzhi Yan et.al. 2509.23574 null
2025-09-27 Bohr-Sommerfeld quantization conditions for Schrodinger operator: the Method of Microlocal Wronskian and Gram Matrix Abdelwaheb Ifa et.al. 2509.23514 null
2025-09-27 Beyond Outliers: A Study of Optimizers Under Quantization Georgios Vlassis et.al. 2509.23500 null
2025-09-27 RestoRect: Degraded Image Restoration via Latent Rectified Flow & Feature Distillation Shourya Verma et.al. 2509.23480 null
2025-09-27 Data-Efficient Training by Evolved Sampling Ziheng Cheng et.al. 2509.23461 null
2025-09-27 Enhancing Communication Efficiency in FL with Adaptive Gradient Quantization and Communication Frequency Optimization Asadullah Tariq et.al. 2509.23419 null
2025-09-27 CasPoinTr: Point Cloud Completion with Cascaded Networks and Knowledge Distillation Yifan Yang et.al. 2509.23375 null
2025-09-27 MedCritical: Enhancing Medical Reasoning in Small Language Models via Self-Collaborative Correction Xinchun Su et.al. 2509.23368 null
2025-09-27 Using AI on FPGAs for the CMS Overlap Muon Track Finder for the HL-LHC Pelayo Leguina et.al. 2509.23347 null
2025-09-27 Scaling LLM Test-Time Compute with Mobile NPU on Smartphones Zixu Hao et.al. 2509.23324 null
2025-09-27 Deformation quantization of a hessian KV-structure on $\mathbb{R}^2$ Herguey Mopeng et.al. 2509.23228 null
2025-09-27 Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization Vage Egiazarian et.al. 2509.23202 null
2025-09-27 Effective Quantization of Muon Optimizer States Aman Gupta et.al. 2509.23106 null
2025-09-27 Desensitizing for Improving Corruption Robustness in Point Cloud Classification through Adversarial Training Zhiqiang Tian et.al. 2509.23010 null
2025-09-26 SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights Lorenz K. Müller et.al. 2509.22944 null
2025-09-26 Compute-Optimal Quantization-Aware Training Aleksandr Dremov et.al. 2509.22935 null
2025-09-26 Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting Yasmine Omri et.al. 2509.22615 null
2025-09-26 Linear Causal Representation Learning by Topological Ordering, Pruning, and Disentanglement Hao Chen et.al. 2509.22553 null
2025-09-26 AxLLM: accelerator architecture for large language models with computation reuse capability Soroush Ahadi et.al. 2509.22512 null
2025-09-26 IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method Xinyu Liu et.al. 2509.22463 null
2025-09-26 $γ$-Quant: Towards Learnable Quantization for Low-bit Pattern Recognition Mishal Fatima et.al. 2509.22448 null
2025-09-26 Progressive Weight Loading: Accelerating Initial Inference and Gradually Boosting Performance on Resource-Constrained Environments Hyunwoo Kim et.al. 2509.22319 null
2025-09-26 HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space Ke Li et.al. 2509.22299 link
2025-09-26 A Multi-Level Framework for Multi-Objective Hypergraph Partitioning: Combining Minimum Spanning Tree and Proximal Gradient Yingying Li et.al. 2509.22294 null
2025-09-26 InfiMed-Foundation: Pioneering Advanced Multimodal Medical Models with Compute-Efficient Pre-Training and Multi-Stage Fine-Tuning Guanghao Zhu et.al. 2509.22261 null
2025-09-26 Lightweight error mitigation strategies for post-training N:M activation sparsity in LLMs Shirin Alanova et.al. 2509.22166 null
2025-09-26 Pushing Toward the Simplex Vertices: A Simple Remedy for Code Collapse in Smoothed Vector Quantization Takashi Morita et.al. 2509.22161 null
2025-09-26 Joint graph entropy knowledge distillation for point cloud classification and robustness against corruptions Zhiqiang Tian et.al. 2509.22150 null
2025-09-26 Action-aware Dynamic Pruning for Efficient Vision-Language-Action Manipulation Xiaohuan Pei et.al. 2509.22093 null
2025-09-26 COSPADI: Compressing LLMs via Calibration-Guided Sparse Dictionary Learning Dmitriy Shopkhoev et.al. 2509.22075 null
2025-09-26 Enriching Knowledge Distillation with Intra-Class Contrastive Learning Hua Yuan et.al. 2509.22053 null
2025-09-26 Multicollinearity-Aware Parameter-Free Strategy for Hyperspectral Band Selection: A Dependence Measures-Based Approach Dibyabha Deb et.al. 2509.21973 null
2025-09-26 Real-time Anomaly Detection for Liquid Argon Time Projection Chambers Seokju Chung et.al. 2509.21817 null
2025-09-26 SubZeroCore: A Submodular Approach with Zero Training for Coreset Selection Brian B. Moser et.al. 2509.21748 null
2025-09-26 HyperCore: Coreset Selection under Noise via Hypersphere Models Brian B. Moser et.al. 2509.21746 null
2025-09-26 Brain PathoGraph Learning Ciyuan Peng et.al. 2509.21742 null
2025-09-26 Optimizing the non-Clifford-count in unitary synthesis using Reinforcement Learning David Kremer et.al. 2509.21709 null
2025-09-25 Scalable Foundation Interatomic Potentials via Message-Passing Pruning and Graph Partitioning Lingyu Kong et.al. 2509.21694 null
2025-09-25 General Pruning Criteria for Fast SBL Jakob Möderl et.al. 2509.21572 null
2025-09-25 SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models Arani Roy et.al. 2509.21498 null
2025-09-25 Residual Vector Quantization For Communication-Efficient Multi-Agent Perception Dereje Shenkut et.al. 2509.21464 null
2025-09-24 Skeleton Sparsification and Densification Scale-Spaces Julia Gierke et.al. 2509.21398 null
2025-09-24 Large AI Model-Enabled Generative Semantic Communications for Image Transmission Qiyu Ma et.al. 2509.21394 null
2025-09-23 Do Sparse Subnetworks Exhibit Cognitively Aligned Attention? Effects of Pruning on Saliency Map Fidelity, Sparsity, and Concept Coherence Sanish Suwal et.al. 2509.21387 null
2025-09-25 SD3.5-Flash: Distribution-Guided Distillation of Generative Flows Hmrishav Bandyopadhyay et.al. 2509.21318 null
2025-09-25 Interactive Recommendation Agent with Active User Commands Jiakai Tang et.al. 2509.21317 null
2025-09-25 Efficient Digital Methods to Quantify Sensor Output Uncertainty Orestis Kaparounakis et.al. 2509.21311 null
2025-09-25 Hybrid RIS-Aided Digital Over-the-Air Computing for Edge AI Inference: Joint Feature Quantization and Active-Passive Beamforming Design Yang Fu et.al. 2509.21201 null
2025-09-26 GEP: A GCG-Based method for extracting personally identifiable information from chatbots built on small language models Jieli Zhu et.al. 2509.21192 null
2025-09-25 Can Less Precise Be More Reliable? A Systematic Evaluation of Quantization’s Impact on CLIP Beyond Accuracy Aymen Bouguerra et.al. 2509.21173 null
2025-09-25 On the geometric quantization of $θ$-almost twisted Poisson manifold Nasser Saipele Nansidi et.al. 2509.21168 null
2025-09-25 Fast-SEnSeI: Lightweight Sensor-Independent Cloud Masking for On-board Multispectral Sensors Jan Kněžík et.al. 2509.20991 null
2025-09-25 Rejuvenating Cross-Entropy Loss in Knowledge Distillation for Recommender Systems Zhangchi Zhu et.al. 2509.20989 null
2025-09-25 Punching Above Precision: Small Quantized Model Distillation with Learnable Regularizer Abdur Rehman et.al. 2509.20854 null
2025-09-26 Real-Time Object Detection Meets DINOv3 Shihua Huang et.al. 2509.20787 null
2025-09-25 RAM-NAS: Resource-aware Multiobjective Neural Architecture Search Method for Robot Vision Tasks Shouren Mao et.al. 2509.20688 null
2025-09-24 Function Spaces Without Kernels: Learning Compact Hilbert Space Representations Su Ann Low et.al. 2509.20605 null
2025-09-24 Seedream 4.0: Toward Next-generation Multimodal Image Generation Team Seedream et.al. 2509.20427 null
2025-09-24 EmbeddingGemma: Powerful and Lightweight Text Representations Henrique Schechter Vera et.al. 2509.20354 null
2025-09-24 Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM Deployment Deokjae Lee et.al. 2509.20214 null
2025-09-24 Play by the Type Rules: Inferring Constraints for LLM Functions in Declarative Programs Parker Glenn et.al. 2509.20208 null
2025-09-24 Smaller is Better: Enhancing Transparency in Vehicle AI Systems via Pruning Sanish Suwal et.al. 2509.20148 null
2025-09-23 Nano Bio-Agents (NBA): Small Language Model Agents for Genomics George Hong et.al. 2509.19566 null
2025-09-23 Adversarially-Refined VQ-GAN with Dense Motion Tokenization for Spatio-Temporal Heatmaps Gabriel Maldonado et.al. 2509.19252 null
2025-09-23 PPG-Distill: Efficient Photoplethysmography Signals Analysis via Foundation Model Distillation Juntong Ni et.al. 2509.19215 null
2025-09-23 Exact WKB Formulation of Quantization and Particle Production in Time-Dependent Backgrounds Ryo Namba et.al. 2509.19194 null
2025-09-23 Data-Free Knowledge Distillation for LiDAR-Aided Beam Tracking in MmWave Systems Abolfazl Zakeri et.al. 2509.19092 null
2025-09-23 Enhancing Noise Robustness for Neural Speech Codecs through Resource-Efficient Progressive Quantization Perturbation Simulation Rui-Chen Zheng et.al. 2509.19025 null
2025-09-23 Otters: An Energy-Efficient SpikingTransformer via Optical Time-to-First-Spike Encoding Zhanglu Yan et.al. 2509.18968 null
2025-09-23 VGGT-DP: Generalizable Robot Control via Vision Foundation Models Shijia Ge et.al. 2509.18778 null
2025-09-23 DiSSECT: Structuring Transfer-Ready Medical Image Representations through Discrete Self-Supervision Azad Singh et.al. 2509.18765 null
2025-09-23 Bi-VLM: Pushing Ultra-Low Precision Post-Training Quantization Boundaries in Vision-Language Models Xijun Wang et.al. 2509.18763 null
2025-09-23 Enhanced Survival Trees Ruiwen Zhou et.al. 2509.18494 null
2025-09-23 Codebook-Based Adaptive Feature Compression With Semantic Enhancement for Edge-Cloud Systems Xinyu Wang et.al. 2509.18481 null
2025-09-22 Individualized non-uniform quantization for vector search Mariano Tepper et.al. 2509.18471 null
2025-09-22 TinyBEV: Cross Modal Knowledge Distillation for Efficient Multi Task Bird’s Eye View Perception and Planning Reeshad Khan et.al. 2509.18372 null
2025-09-21 nDNA – the Semantic Helix of Artificial Cognition Amitava Das et.al. 2509.18216 null
2025-09-19 MMCD: Multi-Modal Collaborative Decision-Making for Connected Autonomy with Knowledge Distillation Rui Liu et.al. 2509.18198 null
2025-09-19 TinyEcoWeedNet: Edge Efficient Real-Time Aerial Agricultural Weed Detection Omar H. Khater et.al. 2509.18193 null
2025-09-22 Visual Detector Compression via Location-Aware Discriminant Analysis Qizhen Lan et.al. 2509.17968 null
2025-09-23 Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark Siu Hang Ho et.al. 2509.17894 null
2025-09-23 Breaking Token Into Concepts: Exploring Extreme Compression in Token Representation Via Compositional Shared Semantics Kavin R V et.al. 2509.17737 null
2025-09-22 RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion Geonho Bang et.al. 2509.17712 null
2025-09-22 Stratification of the half-density quantization of the Jeffrey-Weitsman-Witten invariants Adrian Chitan et.al. 2509.17656 null
2025-09-22 Evaluating the Energy Efficiency of NPU-Accelerated Machine Learning Inference on Embedded Microcontrollers Anastasios Fanariotis et.al. 2509.17533 null
2025-09-22 MapCoder-Lite: Squeezing Multi-Agent Coding into a Single Small LLM Woongkyu Lee et.al. 2509.17489 null
2025-09-22 Learning Dexterous Manipulation with Quantized Hand State Ying Feng et.al. 2509.17450 null
2025-09-23 QWHA: Quantization-Aware Walsh-Hadamard Adaptation for Parameter-Efficient Fine-Tuning on Large Language Models Hyesung Jeon et.al. 2509.17428 null
2025-09-22 Physics-Informed Operator Learning for Hemodynamic Modeling Ryan Chappell et.al. 2509.17293 null
2025-09-25 On the Quantization of the Electromagnetic Field with Magnetic Monopoles Kanan Anwar et.al. 2509.17284 null
2025-09-21 PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models He Xiao et.al. 2509.16989 null
2025-09-24 Equip Pre-ranking with Target Attention by Residual Quantization Yutong Li et.al. 2509.16931 null
2025-09-21 PRISM: Precision-Recall Informed Data-Free Knowledge Distillation via Generative Diffusion Xuewan He et.al. 2509.16897 null
2025-09-20 Knowledge Distillation for Variational Quantum Convolutional Neural Networks on Heterogeneous Data Kai Yu et.al. 2509.16699 null
2025-09-20 When Big Models Train Small Ones: Label-Free Model Parity Alignment for Efficient Visual Question Answering using Small VLMs Abhirama Subramanyam Penamakuri et.al. 2509.16633 null
2025-09-20 The Role of Vocabularies in Learning Sparse Representations for Ranking Hiun Kim et.al. 2509.16621 null
2025-09-20 Federated Learning with Ad-hoc Adapter Insertions: The Case of Soft-Embeddings for Training Classifier-as-Retriever Marijan Fofonjka et.al. 2509.16508 null
2025-09-20 PrediPrune: Reducing Verification Overhead in Souper with Machine Learning Driven Pruning Ange-Thierry Ishimwe et.al. 2509.16497 null
2025-09-20 Eye Gaze Tells You Where to Compute: Gaze-Driven Efficient VLMs Qinyu Chen et.al. 2509.16476 null
2025-09-19 Locally Purified Maximally Mixed States At Scale: Entanglement Pruning and Symmetries Amit Jamadagni et.al. 2509.16439 null
2025-09-19 Pico: A Modular Framework for Hypothesis-Driven Small Language Model Research Richard Diehl Martinez et.al. 2509.16413 null
2025-09-19 A Unified AI Approach for Continuous Monitoring of Human Health and Diseases from Intensive Care Unit to Home with Physiological Foundation Models (UNIPHY+) Minxiao Wang et.al. 2509.16348 null
2025-09-24 The Role of High-Performance GPU Resources in Large Language Model Based Radiology Imaging Diagnosis Jyun-Ping Kao et.al. 2509.16328 null
2025-09-18 Language Modeling with Learned Meta-Tokens Alok N. Shah et.al. 2509.16278 null
2025-09-19 DiEP: Adaptive Mixture-of-Experts Compression through Differentiable Expert Pruning Sikai Bai et.al. 2509.16105 null
2025-09-19 DistillMatch: Leveraging Knowledge Distillation from Vision Foundation Model for Multimodal Image Matching Meng Yang et.al. 2509.16017 null
2025-09-19 DISPATCH: Distilling Selective Patches for Speech Enhancement Dohwan Kim et.al. 2509.15922 null
2025-09-19 RMT-KD: Random Matrix Theoretic Causal Knowledge Distillation Davide Ettori et.al. 2509.15724 null
2025-09-19 Once Upon a Time: Interactive Learning for Storytelling with Small Language Models Jonas Mayer Martins et.al. 2509.15714 null
2025-09-19 Training-Free Pyramid Token Pruning for Efficient Large Vision-Language Models via Region, Token, and Instruction-Guided Importance Yuxuan Liang et.al. 2509.15704 null
2025-09-19 pFedSAM: Personalized Federated Learning of Segment Anything Model for Medical Image Segmentation Tong Wang et.al. 2509.15638 null
2025-09-19 MEC-Quant: Maximum Entropy Coding for Extremely Low Bit Quantization-Aware Training Junbiao Pang et.al. 2509.15514 null
2025-09-19 Mental Accounts for Actions: EWA-Inspired Attention in Decision Transformers Zahra Aref et.al. 2509.15498 null
2025-09-19 Backdoor Mitigation via Invertible Pruning Masks Kealan Dunnett et.al. 2509.15497 null
2025-09-18 IMPQ: Interaction-Aware Layerwise Mixed Precision Quantization for LLMs Junchen Zhao et.al. 2509.15455 null
2025-09-18 Fair-GPTQ: Bias-Aware Quantization for Large Language Models Irina Proskurina et.al. 2509.15206 null
2025-09-18 MaRVIn: A Cross-Layer Mixed-Precision RISC-V Framework for DNN Inference, from ISA Extension to Hardware Acceleration Giorgos Armeniakos et.al. 2509.15187 null
2025-09-18 No Modality Left Behind: Adapting to Missing Modalities via Knowledge Distillation for Brain Tumor Segmentation Shenghao Zhu et.al. 2509.15017 null
2025-09-19 MeanFlowSE: one-step generative speech enhancement via conditional mean flow Duojia Li et.al. 2509.14858 null
2025-09-18 Delta Knowledge Distillation for Large Language Models Yihan Cao et.al. 2509.14526 null
2025-09-17 NIRVANA: Structured pruning reimagined for large language models compression Mengting Ai et.al. 2509.14230 null
2025-09-17 Where Do Tokens Go? Understanding Pruning Behaviors in STEP at High Resolutions Michal Szczepanski et.al. 2509.14165 null
2025-09-17 SV-Mixer: Replacing the Transformer Encoder with Lightweight MLPs for Self-Supervised Model Compression in Speaker Verification Jungwoo Heo et.al. 2509.14136 null
2025-09-17 MOCHA: Multi-modal Objects-aware Cross-arcHitecture Alignment Elena Camuffo et.al. 2509.14001 null
2025-09-17 Asymptotic Analysis of Nonlinear One-Bit Precoding in Massive MIMO Systems via Approximate Message Passing Zheyu Wu et.al. 2509.13955 null
2025-09-19 Efficient Quantization-Aware Neural Receivers: Beyond Post-Training Quantization SaiKrishna Saketh Yellapragada et.al. 2509.13786 null
2025-09-17 TENET: An Efficient Sparsity-Aware LUT-Centric Architecture for Ternary LLM Inference On Edge Zhirui Huang et.al. 2509.13765 null
2025-09-18 DSPC: Dual-Stage Progressive Compression Framework for Efficient Long-Context Reasoning Yaxin Gao et.al. 2509.13723 null
2025-09-17 InfraMind: A Novel Exploration-based GUI Agentic Framework for Mission-critical Industrial Management Liangtao Lin et.al. 2509.13704 null
2025-09-17 A High-Quality and Low-Complexity Streamable Neural Speech Codec with Knowledge Distillation En-Wei Zhang et.al. 2509.13670 null
2025-09-16 AQUA-LLM: Evaluating Accuracy, Quantization, and Adversarial Robustness Trade-offs in LLMs for Cybersecurity Question Answering Onat Gungor et.al. 2509.13514 null
2025-09-16 Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization Hao Xu et.al. 2509.13482 null
2025-09-16 LLMs for energy and macronutrients estimation using only text data from 24-hour dietary recalls: a parameter-efficient fine-tuning experiment using a 10-shot prompt Rodrigo M Carrillo-Larco et.al. 2509.13268 null
2025-09-18 HAM: Hierarchical Adapter Merging for Scalable Continual Learning Eric Nuertey Coleman et.al. 2509.13211 null
2025-09-16 Vi-SAFE: A Spatial-Temporal Framework for Efficient Violence Detection in Public Surveillance Ligang Chang et.al. 2509.13210 null
2025-09-16 Multi-Model Synthetic Training for Mission-Critical Small Language Models Nolan Platt et.al. 2509.13047 null
2025-09-16 Investigating ReLoRA: Effects on the Learning Dynamics of Small Language Models Yuval Weiss et.al. 2509.12960 null
2025-09-17 A Novel Compression Framework for YOLOv8: Achieving Real-Time Aerial Object Detection on Edge Devices via Structured Pruning and Channel-Wise Distillation Melika Sabaghian et.al. 2509.12918 null
2025-09-16 Energy-Efficient Quantized Federated Learning for Resource-constrained IoT devices Wilfrid Sougrinoma Compaoré et.al. 2509.12814 null
2025-09-16 NEFT: A Unified Transformer Framework for Efficient Near-Field CSI Feedback in XL-MIMO Systems Haiyang Li et.al. 2509.12748 null
2025-09-16 Effective Gaussian Management for High-fidelity Object Reconstruction Jiateng Liu et.al. 2509.12742 null
2025-09-16 ZTree: A Subgroup Identification Based Decision Tree Learning Framework Eric Cheng et.al. 2509.12688 null
2025-09-16 The Better You Learn, The Smarter You Prune: Towards Efficient Vision-language-action Models via Differentiable Token Pruning Titong Jiang et.al. 2509.12594 null
2025-09-16 iCD: A Implicit Clustering Distillation Mathod for Structural Information Mining Xiang Xue et.al. 2509.12553 null
2025-09-16 LEAF: Knowledge Distillation of Text Embedding Models with Teacher-Aligned Representations Robin Vujanic et.al. 2509.12539 null
2025-09-15 Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction Ryan Lucas et.al. 2509.12464 null
2025-09-15 GhostNetV3-Small: A Tailored Architecture and Comparative Study of Distillation Strategies for Tiny Images Florian Zager et.al. 2509.12380 null
2025-09-15 Unsupervised Atomic Data Mining via Multi-Kernel Graph Autoencoders for Machine Learning Force Fields Hong Sun et.al. 2509.12358 null
2025-09-15 SAQ: Pushing the Limits of Vector Quantization through Code Adjustment and Dimension Segmentation Hui Li et.al. 2509.12086 null
2025-09-15 AMQ: Enabling AutoML for Mixed-precision Weight-Only Quantization of Large Language Models Sangjun Lee et.al. 2509.12019 null
2025-09-15 CLAIRE: A Dual Encoder Network with RIFT Loss and Phi-3 Small Language Model Based Interpretability for Cross-Modality Synthetic Aperture Radar and Optical Land Cover Segmentation Debopom Sutradhar et.al. 2509.11952 null
2025-09-16 Enriched text-guided variational multimodal knowledge distillation network (VMD) for automated diagnosis of plaque vulnerability in 3D carotid artery MRI Bo Cao et.al. 2509.11924 null
2025-09-15 SpecVLM: Fast Speculative Decoding in Vision-Language Models Haiduo Huang et.al. 2509.11815 null
2025-09-15 Visualization and Analysis of the Loss Landscape in Graph Neural Networks Samir Moustafa et.al. 2509.11792 null
2025-09-15 Quantization Errors, Human–AI Interaction, and Approximate Fixed Points in $L^1(μ)$ Faruk Alpay et.al. 2509.11700 null
2025-09-15 DARD: Dice Adversarial Robustness Distillation against Adversarial Attacks Jing Zou et.al. 2509.11525 null
2025-09-14 Knowledge Distillation for Sensing-Assisted Long-Term Beam Tracking in mmWave Communications Mengyuan Ma et.al. 2509.11419 null
2025-09-14 Investigating the Lottery Ticket Hypothesis for Variational Quantum Circuits Michael Kölle et.al. 2509.11190 null
2025-09-16 Optimal Brain Restoration for Joint Quantization and Sparsification of LLMs Hang Guo et.al. 2509.11177 null
2025-09-14 SVR-GS: Spatially Variant Regularization for Probabilistic Masks in 3D Gaussian Splatting Ashkan Taghipour et.al. 2509.11116 null
2025-09-13 GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings Yixuan Tang et.al. 2509.10844 null
2025-09-12 Automated MCQA Benchmarking at Scale: Evaluating Reasoning Traces as Retrieval Sources for Domain Adaptation of Small Language Models Ozan Gokdemir et.al. 2509.10744 null
2025-09-12 Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs Yixiao Zhou et.al. 2509.10377 null
2025-09-12 Efficient Learned Image Compression Through Knowledge Distillation Fabien Allemand et.al. 2509.10366 null
2025-09-12 I-Segmenter: Integer-Only Vision Transformer for Efficient Semantic Segmentation Jordan Sassoon et.al. 2509.10334 null
2025-09-12 Investigating Language Model Capabilities to Represent and Process Formal Knowledge: A Preliminary Study to Assist Ontology Engineering Hanna Abi Akl et.al. 2509.10249 null
2025-09-12 FedBiF: Communication-Efficient Federated Learning via Bits Freezing Shiwei Li et.al. 2509.10161 null
2025-09-12 Scalable Training for Vector-Quantized Networks with 100% Codebook Utilization Yifan Chang et.al. 2509.10140 null
2025-09-12 Efficient and Accurate Downfacing Visual Inertial Odometry Jonas Kühne et.al. 2509.10021 null
2025-09-12 Toward Green Code: Prompting Small Language Models for Energy-Efficient Code Generation Humza Ashraf et.al. 2509.09947 null
2025-09-12 Acoustic Scene Classification Using CNN-GRU Model Without Knowledge Distillation Ee-Leng Tan et.al. 2509.09931 null
2025-09-11 ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms Bingxin Xu et.al. 2509.09679 null
2025-09-11 ReBaNO: Reduced Basis Neural Operator Mitigating Generalization Gaps and Achieving Discretization Invariance Haolan Zheng et.al. 2509.09611 null
2025-09-11 Combating the Memory Walls: Optimization Pathways for Long-Context Agentic LLM Inference Haoran Wu et.al. 2509.09505 null
2025-09-11 Unified Start, Personalized End: Progressive Pruning for Efficient 3D Medical Image Segmentation Linhao Li et.al. 2509.09267 link
2025-09-11 Adaptive Knowledge Distillation using a Device-Aware Teacher for Low-Complexity Acoustic Scene Classification Seung Gyu Jeong et.al. 2509.09262 null
2025-09-11 SQAP-VLA: A Synergistic Quantization-Aware Pruning Framework for High-Performance Vision-Language-Action Models Hengyu Fang et.al. 2509.09090 null
2025-09-10 CSI Compression Beyond Latents: End-to-End Hybrid Attention-CNN Networks with Entropy Regularization Maryam Ansarifard et.al. 2509.08776 null
2025-09-10 Compressing CNN models for resource-constrained systems by channel and layer pruning Ahmed Sadaqa et.al. 2509.08714 null
2025-09-10 BitROM: Weight Reload-Free CiROM Architecture Towards Billion-Parameter 1.58-bit LLM Inference Wenlun Zhang et.al. 2509.08542 null
2025-09-12 SINDI: an Efficient Index for Approximate Maximum Inner Product Search on Sparse Vectors Ruoxuan Li et.al. 2509.08395 null
2025-09-10 Mitigating Catastrophic Forgetting in Large Language Models with Forgetting-aware Pruning Wei Huang et.al. 2509.08255 null
2025-09-10 Strategies for Improving Communication Efficiency in Distributed and Federated Learning: Compression, Local Training, and Personalization Kai Yi et.al. 2509.08233 null
2025-09-09 Risk-Bounded Multi-Agent Visual Navigation via Dynamic Budget Allocation Viraj Parimi et.al. 2509.08157 null
2025-09-09 Tensor-Train Operator Inference Engin Danis et.al. 2509.08071 null
2025-09-09 SA-OOSC: A Multimodal LLM-Distilled Semantic Communication Framework for Enhanced Coding Efficiency with Scenario Understanding Feifan Zhang et.al. 2509.07436 null
2025-09-09 The Role of Exploration Modules in Small Language Models for Knowledge Graph Question Answering Yi-Jie Cheng et.al. 2509.07399 null
2025-09-09 Knowledge Distillation Driven Semantic NOMA for Image Transmission with Diffusion Model Qifei Wang et.al. 2509.07363 null
2025-09-09 Word2Spike: Poisson Rate Coding for Associative Memories and Neuromorphic Algorithms Archit Kalra et.al. 2509.07361 null
2025-09-09 Quantization of the electromagnetic fields from single atomic or molecular radiators Valerica Raicu et.al. 2509.07359 null
2025-09-08 Recursive algorithm for constructing antisymmetric fermionic states in first quantization mapping E. Rule et.al. 2509.07279 null
2025-09-08 HealthSLM-Bench: Benchmarking Small Language Models for Mobile and Wearable Healthcare Monitoring Xin Wang et.al. 2509.07260 null
2025-09-08 Efficient Multi-Agent Coordination via Dynamic Joint-State Graph Construction Yanlin Zhou et.al. 2509.07234 null
2025-09-08 Efficient Low-Memory Fast Stack Decoding with Variance Polarization for PAC Codes Mohsen Moradi et.al. 2509.07231 null
2025-09-08 Explaining How Quantization Disparately Skews a Model Abhimanyu Bellam et.al. 2509.07222 null
2025-09-07 MEGS$^{2}$: Memory-Efficient Gaussian Splatting via Spherical Gaussians and Unified Pruning Jiarui Chen et.al. 2509.07021 null
2025-09-08 H$_{2}$OT: Hierarchical Hourglass Tokenizer for Efficient Video Pose Transformers Wenhao Li et.al. 2509.06956 null
2025-10-13 COMPACT: Common-token Optimized Model Pruning Across Channels and Tokens Eugene Kwek et.al. 2509.06836 null
2025-09-08 Tree of Agents: Improving Long-Context Capabilities of Large Language Models through Multi-Perspective Reasoning Song Yu et.al. 2509.06436 null
2025-09-08 Index-Preserving Lightweight Token Pruning for Efficient Document Understanding in Vision-Language Models Jaemin Son et.al. 2509.06415 null
2025-09-08 3DOF+Quantization: 3DGS quantization for large scenes with limited Degrees of Freedom Matthieu Gendrin et.al. 2509.06400 null
2025-09-08 Variational Garrote for Statistical Physics-based Sparse and Robust Variable Selection Hyungjoon Soh et.al. 2509.06383 null
2025-09-08 Mask-GCG: Are All Tokens in Adversarial Suffixes Necessary for Jailbreak Attacks? Junjie Mu et.al. 2509.06350 null
2025-09-08 LoaQ: Layer-wise Output Approximation Quantization Li Lin et.al. 2509.06297 null
2025-09-15 FineServe: Precision-Aware KV Slab and Two-Level Scheduling for Heterogeneous Precision LLM Serving Kyungmin Bin et.al. 2509.06261 null
2025-09-10 BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models Yuming Li et.al. 2509.06040 null
2025-09-07 StripDet: Strip Attention-Based Lightweight 3D Object Detection from Point Cloud Weichao Wang et.al. 2509.05954 null
2025-09-07 Quantization of bounded symplectic domains associated with compact Lie groups Alexey A. Sharapov et.al. 2509.05931 null
2025-09-06 Batalin-Fradkin-Vilkovisky Quantization of FLPR model Ansha S. Nair et.al. 2509.05632 null
2025-09-06 Quantization of spin circular photogalvanic effect in altermagnetic Weyl semimetals Hiroki Yoshida et.al. 2509.05620 null
2025-09-06 SpecPrune-VLA: Accelerating Vision-Language-Action Models via Action-Aware Self-Speculative Pruning Hanzhen Wang et.al. 2509.05614 null
2025-09-09 Mitigating Spurious Correlations Between Question and Answer via Chain-of-Thought Correctness Perception Distillation Hongyan Xie et.al. 2509.05602 null
2025-09-06 ProfilingAgent: Profiling-Guided Agentic Reasoning for Adaptive Model Optimization Sadegh Jafari et.al. 2509.05584 null
2025-09-06 Sensitivity-Aware Post-Training Quantization for Deep Neural Networks Zekang Zheng et.al. 2509.05576 null
2025-09-05 SuperSNN: A Hardware-Aware Framework for Physically Realizable, High-Performance Superconducting Spiking Neural Network Chips Changxu Song et.al. 2509.05532 null
2025-09-05 Dynamic Sensitivity Filter Pruning using Multi-Agent Reinforcement Learning For DCNN’s Iftekhar Haider Chowdhury et.al. 2509.05446 null
2025-09-05 Accuracy-Constrained CNN Pruning for Efficient and Reliable EEG-Based Seizure Detection Mounvik K et.al. 2509.05190 null
2025-09-05 FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies Moritz Reuss et.al. 2509.04996 null
2025-09-05 PLaMo 2 Technical Report Preferred Networks et.al. 2509.04897 null
2025-09-05 AI-Driven Fronthaul Link Compression in Wireless Communication Systems: Review and Method Design Keqin Zhang et.al. 2509.04805 null
2025-09-05 STADI: Fine-Grained Step-Patch Diffusion Parallelism for Heterogeneous GPUs Han Liang et.al. 2509.04719 null
2025-09-08 Advancing SLM Tool-Use Capability using Reinforcement Learning Dhruvi Paprunia et.al. 2509.04518 null
2025-09-02 ProST: Progressive Sub-task Training for Pareto-Optimal Multi-agent Systems Using Small Language Models Biddut Sarker Bijoy et.al. 2509.04508 null
2025-09-04 PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference Krishna Teja Chitty-Venkata et.al. 2509.04377 null
2025-09-04 Integrating Pruning with Quantization for Efficient Deep Neural Networks Compression Sara Makenali et.al. 2509.04244 null
2025-09-04 Real Time FPGA Based Transformers & VLMs for Vision Tasks: SOTA Designs and Optimizations Safa Mohammed Sali et.al. 2509.04162 null
2025-09-04 Real Time FPGA Based CNNs for Detection, Classification, and Tracking in Autonomous Systems: State of the Art Designs and Optimizations Safa Mohammed Sali et.al. 2509.04153 null
2025-09-04 Duality between polyhedral approximation of value functions and optimal quantization of measures Abdellah Bulaich Mehamdi et.al. 2509.04101 null
2025-09-04 Robust MIMO Semantic Communication with Imperfect CSI via Knowledge Distillation Mingze Gong et.al. 2509.04005 null
2025-09-04 Data-Augmented Quantization-Aware Knowledge Distillation Justin Kur et.al. 2509.03850 null
2025-09-03 QuantV2X: A Fully Quantized Multi-Agent System for Cooperative Perception Seth Z. Zhao et.al. 2509.03704 null
2025-09-03 DPQuant: Efficient and Differentially-Private Model Training via Dynamic Quantization Scheduling Yubo Gao et.al. 2509.03472 null
2025-09-08 Amplifying Effective CXL Memory Bandwidth for LLM Inference via Transparent Near-Data Processing Rui Xie et.al. 2509.03377 null
2025-09-03 NeurStore: Efficient In-database Deep Learning Model Management System Siqi Xiang et.al. 2509.03228 null
2025-09-03 BAMG: A Block-Aware Monotonic Graph Index for Disk-Based Approximate Nearest Neighbor Search Huiling Li et.al. 2509.03226 null
2025-09-03 CapsBeam: Accelerating Capsule Network based Beamformer for Ultrasound Non-Steered Plane Wave Imaging on Field Programmable Gate Array Abdul Rahoof et.al. 2509.03201 null
2025-09-03 Deep Self-knowledge Distillation: A hierarchical supervised learning for coronary artery segmentation Mingfeng Lin et.al. 2509.03173 null
2025-09-03 FastCaps: A Design Methodology for Accelerating Capsule Network on Field Programmable Gate Arrays Abdul Rahoof et.al. 2509.03103 null
2025-09-03 Binary Quantization For LLMs Through Dynamic Grouping Xinzhe Zheng et.al. 2509.03054 null
2025-09-02 LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference Krishna Teja Chitty-Venkata et.al. 2509.02753 null
2025-09-02 A quantization of the $\operatorname{SL}_2(\mathbb{C})$-Chern-Simons invariant of tangle exteriors Calvin McPhail-Snyder et.al. 2509.02365 null
2025-09-02 All-optical band structure reconstruction and onset of Landau quantization of Dirac fermions Josef Riepl et.al. 2509.02362 null
2025-09-02 Operator Algebras and Third Quantization Yidong Chen et.al. 2509.02293 null
2025-08-11 Less is More: Selective Reflection for Compatible and Efficient Knowledge Distillation in Large Language Models Lingyuan Liu et.al. 2508.06135 null
2025-08-06 Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models He Xiao et.al. 2508.03332 null
2025-07-29 Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study Yiran Huang et.al. 2507.20749 null
2025-07-22 Collaborative Distillation Strategies for Parameter-Efficient Language Model Deployment Xiandong Meng et.al. 2507.15198 null
2025-07-11 Exploring the Limits of Model Compression in LLMs: A Knowledge Distillation Study on QA Tasks Joyeeta Datta et.al. 2507.07630 null
2025-07-08 Put Teacher in Student’s Shoes: Cross-Distillation for Ultra-compact Model Compression Framework Maolin Wang et.al. 2507.04636 null
2025-06-17 TensorSLM: Energy-efficient Embedding Compression of Sub-billion Parameter Language Models on Low-end Devices Mingxue Xu et.al. 2506.13514 null
2025-05-30 Small Language Models: Architectures, Techniques, Evaluation, Problems and Future Adaptation Tanjil Hasan Sakib et.al. 2505.19529 null
2025-10-14 Shifting AI Efficiency From Model-Centric to Data-Centric Compression Xuyang Liu et.al. 2505.19147 null
2025-05-27 Knowledge Grafting of Large Language Models Guodong Du et.al. 2505.18502 null
2025-04-25 Does Knowledge Distillation Matter for Large Language Model based Bundle Generation? Kaidong Feng et.al. 2504.17220 null
2025-04-24 Case Study: Fine-tuning Small Language Models for Accurate and Private CWE Detection in Python Code Md. Azizul Hakim Bappy et.al. 2504.16584 null
2025-04-22 Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions Luyang Fang et.al. 2504.14772 null
2025-04-09 Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression Ivan Ilin et.al. 2504.05346 null
2025-07-01 Distillation and Refinement of Reasoning in Small Language Models for Document Re-ranking Chris Samarinas et.al. 2504.03947 null
2025-03-17 Small Vision-Language Models: A Survey on Compact Architectures and Techniques Nitesh Patnaik et.al. 2503.10665 null
2025-10-23 Using (Not-so) Large Language Models to Generate Simulation Models in a Formal DSL: A Study on Reaction Networks Justin N. Kreikemeyer et.al. 2503.01675 null
2025-03-06 Rethinking Data: Towards Better Performing Domain-Specific Small Language Models Boris Nazarov et.al. 2503.01464 null
2025-03-04 ReaderLM-v2: Small Language Model for HTML to Markdown and JSON Feng Wang et.al. 2503.01151 null
2025-05-27 Beyond the Tip of Efficiency: Uncovering the Submerged Threats of Jailbreak Attacks in Small Language Models Sibo Yi et.al. 2502.19883 null
2025-02-26 AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages Joshua Sakthivel Raju et.al. 2502.18020 null
2025-06-17 Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training Rui Pan et.al. 2502.03460 null
2025-03-03 TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models Makoto Shing et.al. 2501.16937 null
2025-06-09 GRASP: Replace Redundant Layers with Adaptive Singular Parameters for Efficient Model Compression Kainan Liu et.al. 2501.00339 link
2024-12-30 Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models Shuo Wang et.al. 2412.19449 null
2024-12-23 PruneVid: Visual Token Pruning for Efficient Video Large Language Models Xiaohu Huang et.al. 2412.16117 null
2024-11-22 Hymba: A Hybrid-head Architecture for Small Language Models Xin Dong et.al. 2411.13676 null
2025-02-19 Efficient Alignment of Large Language Models via Data Sampling Amrit Khera et.al. 2411.10545 null
2024-11-27 SlimLM: An Efficient Small Language Model for On-Device Document Assistance Thang M. Pham et.al. 2411.09944 null
2025-02-26 LLM-NEO: Parameter Efficient Knowledge Distillation for Large Language Models Runming Yang et.al. 2411.06839 null
2024-11-12 Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation Yu-Liang Zhan et.al. 2411.06448 null
2025-04-09 Fox-1: Open Small Language Model for Cloud and Edge Zijian Hu et.al. 2411.05281 null
2024-10-29 KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation Rambod Azimi et.al. 2410.20777 link
2024-10-29 A Survey of Small Language Models Chien Van Nguyen et.al. 2410.20011 link
2025-07-15 Self-calibration for Language Model Quantization and Pruning Miles Williams et.al. 2410.17170 null
2024-10-21 Efficient Vision-Language Models by Summarizing Visual Tokens into Compact Registers Yuxin Wen et.al. 2410.14072 null
2025-06-03 RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates Md Kowsher et.al. 2410.10075 null
2024-09-20 Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights Mohamad Ballout et.al. 2409.12586 null
2024-10-04 FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation KaShun Shum et.al. 2408.12168 null
2024-11-05 Compact Language Models via Pruning and Knowledge Distillation Saurav Muralidharan et.al. 2407.14679 null
2024-07-09 Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations Bowen Shen et.al. 2407.05690 null
2025-04-22 SLMRec: Distilling Large Language Models into Small for Sequential Recommendation Wujiang Xu et.al. 2405.17890 null
2024-05-17 Densely Distilling Cumulative Knowledge for Continual Learning Zenglin Shi et.al. 2405.09820 null
2024-04-09 What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models Busayo Awobade et.al. 2404.04759 null
2024-04-05 Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought Jooyoung Lee et.al. 2404.03414 null
2024-06-26 Telecom Language Models: Must They Be Large? Nicola Piovesan et.al. 2403.04666 null
2024-05-31 Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models Xudong Lu et.al. 2402.14800 null
2024-02-16 Model Compression and Efficient Inference for Large Language Models: A Survey Wenxiao Wang et.al. 2402.09748 null
2024-04-09 A Survey on Transformer Compression Yehui Tang et.al. 2402.05964 null
2025-07-23 L4Q: Parameter Efficient Quantization-Aware Fine-Tuning on Large Language Models Hyesung Jeon et.al. 2402.04902 null
2024-06-25 Shortened LLaMA: Depth Pruning for Large Language Models with Comparison of Retraining Methods Bo-Kyeong Kim et.al. 2402.02834 null
2024-02-07 Dual Knowledge Distillation for Efficient Sound Event Detection Yang Xiao et.al. 2402.02781 null
2024-02-02 EPSD: Early Pruning with Self-Distillation for Efficient Model Compression Dong Chen et.al. 2402.00084 null
2024-06-05 APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference Bowen Zhao et.al. 2401.12200 null
2024-06-05 TinyLlama: An Open-Source Small Language Model Peiyuan Zhang et.al. 2401.02385 null
2024-02-23 LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model Yichen Zhu et.al. 2401.02330 null
2024-06-24 TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones Zhengqing Yuan et.al. 2312.16862 null
2024-03-19 Language Model Knowledge Distillation for Efficient Question Answering in Spanish Adrián Bazaga et.al. 2312.04193 link
2024-02-07 Compressed Context Memory For Online Language Model Interaction Jang-Hyun Kim et.al. 2312.03414 null
2023-11-09 PB-LLM: Partially Binarized Large Language Models Yuzhang Shang et.al. 2310.00034 null
2023-08-29 Improving Knowledge Distillation for BERT Models: Loss Functions, Mapping Methods, and Weight Tuning Apoorv Dankar et.al. 2308.13958 null
2023-06-27 Low-Rank Prune-And-Factorize for Language Model Compression Siyu Ren et.al. 2306.14152 null
2023-06-21 Categories of Response-Based, Feature-Based, and Relation-Based Knowledge Distillation Chuanguang Yang et.al. 2306.10687 null
2023-05-29 Neural Architecture Search for Parameter-Efficient Fine-tuning of Large Pre-trained Language Models Neal Lawton et.al. 2305.16597 null
2023-05-22 Parameter-Efficient Fine-Tuning with Layer Pruning on Free-Text Sequence-to-Sequence Modeling Yunqi Zhu et.al. 2305.08285 null
2023-04-20 An Empirical Study of Leveraging Knowledge Distillation for Compressing Multilingual Neural Machine Translation Models Varun Gumma et.al. 2304.09388 null
2023-02-15 Multi-teacher knowledge distillation as an effective method for compressing ensembles of neural networks Konrad Zuchniak et.al. 2302.07215 null
2022-10-17 EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning Tiannan Wang et.al. 2210.07795 null
2022-10-11 AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models Se Jung Kwon et.al. 2210.03858 null
2022-08-04 Efficient Fine-Tuning of Compressed Language Models with Learners Danilo Vucetic et.al. 2208.02070 null
2022-06-01 Parameter-Efficient and Student-Friendly Knowledge Distillation Jun Rao et.al. 2205.15308 null
2022-05-24 Parameter-Efficient Sparsity for Large Language Models Fine-Tuning Yuchao Li et.al. 2205.11005 null
2022-05-04 Structured Pruning Learns Compact and Accurate Models Mengzhou Xia et.al. 2204.00408 null
2022-03-23 DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization Zheng Li et.al. 2203.11239 null
2022-03-09 HyperPELT: Unified Parameter-Efficient Language Model Tuning for Both Language and Vision-and-Language Tasks Zhengkun Zhang et.al. 2203.03878 null
2022-04-05 CHIP: CHannel Independence-based Pruning for Compact Neural Networks Yang Sui et.al. 2110.13981 null
2022-02-03 Towards a Unified View of Parameter-Efficient Transfer Learning Junxian He et.al. 2110.04366 null
2023-07-18 Pruning Ternary Quantization Dan Liu et.al. 2107.10998 null
2021-06-29 PQK: Model Compression via Pruning, Quantization, and Knowledge Distillation Jangho Kim et.al. 2106.14681 null
2022-02-11 An Information-Theoretic Justification for Model Pruning Berivan Isik et.al. 2102.08329 null
2022-11-03 Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains Haojie Pan et.al. 2012.01266 null
2020-11-18 Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning Bingbing Li et.al. 2009.08065 null
2020-08-04 Differentiable Feature Aggregation Search for Knowledge Distillation Yushuo Guan et.al. 2008.00506 null
2020-05-19 MicroNet for Efficient Language Modeling Zhongxia Yan et.al. 2005.07877 null
2020-04-20 Triplet Loss for Knowledge Distillation Hideki Oki et.al. 2004.08116 null
2019-10-18 Training Compact Models for Low Resource Entity Tagging using Pre-trained Language Models Peter Izsak et.al. 1910.06294 null
2021-03-05 Revisiting Knowledge Distillation via Label Smoothing Regularization Li Yuan et.al. 1909.11723 null
2019-08-27 Patient Knowledge Distillation for BERT Model Compression Siqi Sun et.al. 1908.09355 null
2019-06-18 Scalable Syntax-Aware Language Models Using Knowledge Distillation Adhiguna Kuncoro et.al. 1906.06438 null
2019-05-07 Creating Lightweight Object Detectors with Model Compression for Deployment on Edge Devices Yiwu Yao et.al. 1905.01787 null
2018-12-04 Knowledge Distillation with Feature Maps for Image Classification Wei-Chun Chen et.al. 1812.00660 null
2018-11-07 Compact Personalized Models for Neural Machine Translation Joern Wuebker et.al. 1811.01990 null
2016-08-17 Fast, Small and Exact: Infinite-order Language Modelling with Compressed Suffix Trees Ehsan Shareghi et.al. 1608.04465 null

🔄 Data Augmentation

📊 992 papers

📅 Publish Date 📝 Title 👥 Authors 📄 PDF 💻 Code
2026-04-01 PDA: Text-Augmented Defense Framework for Robust Vision-Language Models against Adversarial Image Attacks Jingning Xu et.al. 2604.01010 null
2026-04-01 Mine-JEPA: In-Domain Self-Supervised Learning for Mine-Like Object Classification in Side-Scan Sonar Taeyoun Kwon et.al. 2604.00383 null
2026-03-30 UltraG-Ray: Physics-Based Gaussian Ray Casting for Novel Ultrasound View Synthesis Felix Duelmer et.al. 2603.29022 null
2026-03-30 LDDMM stochastic interpolants: an application to domain uncertainty quantification in hemodynamics Sarah Katz et.al. 2603.28324 null
2026-03-28 GIFT: Bootstrapping Image-to-CAD Program Synthesis via Geometric Feedback Giorgio Giannone et.al. 2603.27448 null
2026-03-28 Evaluating Large and Lightweight Vision Models for Irregular Component Segmentation in E-Waste Disassembly Xinyao Zhang et.al. 2603.27441 null
2026-03-28 Hybrid Deep Learning with Temporal Data Augmentation for Accurate Remaining Useful Life Prediction of Lithium-Ion Batteries Yun Tian et.al. 2603.27186 null
2026-03-27 Hybrid Diffusion Model for Breast Ultrasound Image Augmentation Farhan Fuad Abir et.al. 2603.26834 null
2026-03-27 Central-to-Local Adaptive Generative Diffusion Framework for Improving Gene Expression Prediction in Data-Limited Spatial Transcriptomics Yaoyu Fang et.al. 2603.26827 null
2026-03-25 PhyDCM: A Reproducible Open-Source Framework for AI-Assisted Brain Tumor Classification from Multi-Sequence MRI Hayder Saad Abdulbaqi et.al. 2603.26794 null
2026-03-26 A generalized Bayesian approach to multiple changepoint analysis Yuhui Wang et.al. 2603.25668 null
2026-03-26 Insights on back marking for the automated identification of animals David Brunner et.al. 2603.25535 null
2026-03-26 Lightweight GenAI for Network Traffic Synthesis: Fidelity, Augmentation, and Classification Giampaolo Bovenzi et.al. 2603.25507 null
2026-03-26 Translation Asymmetry in LLMs as a Data Augmentation Factor: A Case Study for 6 Romansh Language Varieties Jannis Vamvas et.al. 2603.25489 null
2026-03-26 MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness Yuto Matsuo et.al. 2603.25109 null
2026-03-26 $π$, But Make It Fly: Physics-Guided Transfer of VLA Models to Aerial Manipulation Johnathan Tucker et.al. 2603.25038 null
2026-03-26 Toward domain-specific machine translation and quality estimation systems Javad Pourmostafa Roshan Sharami et.al. 2603.24955 null
2026-03-26 CVA: Context-aware Video-text Alignment for Video Temporal Grounding Sungho Moon et.al. 2603.24934 null
2026-03-26 Decoding Market Emotions in Cryptocurrency Tweets via Predictive Statement Classification with Machine Learning and Transformers Moein Shahiki Tash et.al. 2603.24933 null
2026-03-25 Amplified Patch-Level Differential Privacy for Free via Random Cropping Kaan Durmaz et.al. 2603.24695 null
2026-03-29 BCMDA: Bidirectional Correlation Maps Domain Adaptation for Mixed Domain Semi-Supervised Medical Image Segmentation Bentao Song et.al. 2603.24691 null
2026-03-25 How unconstrained machine-learning models learn physical symmetries Michelangelo Domina et.al. 2603.24638 null
2026-03-25 A Bayesian Dynamic Latent Space Model for Weighted Networks Roberto Casarin et.al. 2603.24201 null
2026-03-25 Enhancing and Reporting Robustness Boundary of Neural Code Models for Intelligent Code Understanding Tingxu Han et.al. 2603.24119 null
2026-03-25 SynMVCrowd: A Large Synthetic Benchmark for Multi-view Crowd Counting and Localization Qi Zhang et.al. 2603.23956 null
2026-03-25 3D-LLDM: Label-Guided 3D Latent Diffusion Model for Improving High-Resolution Synthetic MR Imaging in Hepatic Structure Segmentation Kyeonghun Kim et.al. 2603.23845 null
2026-03-30 Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG Seungju Han et.al. 2603.23562 null
2026-03-24 Parametric Knowledge and Retrieval Behavior in RAG Fine-Tuning for Electronic Design Automation Julian Oestreich et.al. 2603.23047 null
2026-03-25 Instrument-Splatting++: Towards Controllable Surgical Instrument Digital Twin Using Gaussian Splatting Shuojue Yang et.al. 2603.22792 null
2026-03-24 DALDALL: Data Augmentation for Lexical and Semantic Diverse in Legal Domain by leveraging LLM-Persona Janghyeok Choi et.al. 2603.22765 null
2026-03-23 Generalized multi-object classification and tracking with sparse feature resonator networks Lazar Supic et.al. 2603.22539 null
2026-03-23 SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection Kexian Tang et.al. 2603.22213 null
2026-03-23 Data Curation for Machine Learning Interatomic Potentials by Determinantal Point Processes Joanna Zou et.al. 2603.22160 null
2026-03-23 ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention Xinyan Wang et.al. 2603.22016 null
2026-03-23 Ctrl-A: Control-Driven Online Data Augmentation Jesper B. Christensen et.al. 2603.21819 null
2026-03-23 HACMatch Semi-Supervised Rotation Regression with Hardness-Aware Curriculum Pseudo Labeling Mei Li et.al. 2603.21583 null
2026-03-22 AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling Liang Ding et.al. 2603.21357 null
2026-03-22 Positional Segmentor-Guided Counterfactual Fine-Tuning for Spatially Localized Image Synthesis Tian Xia et.al. 2603.21213 null
2026-03-21 negMIX: Negative Mixup for OOD Generalization in Open-Set Node Classification Junwei Gong et.al. 2603.20798 null
2026-03-11 Abjad-Kids: An Arabic Speech Classification Dataset for Primary Education Abdul Aziz Snoubara et.al. 2603.20255 null
2026-03-19 A Novel Solution for Zero-Day Attack Detection in IDS using Self-Attention and Jensen-Shannon Divergence in WGAN-GP Ziyu Mu et.al. 2603.19350 null
2026-03-10 Maximizing mutual information between user-contexts and responses improve LLM personalization with no additional data Hyunji Nam et.al. 2603.19294 null
2026-03-19 PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment Tianci Luo et.al. 2603.18891 null
2026-03-19 Data-efficient pre-training by scaling synthetic megadocs Konwoo Kim et.al. 2603.18534 null
2026-03-19 R&D: Balancing Reliability and Diversity in Synthetic Data Augmentation for Semantic Segmentation Huy Che et.al. 2603.18427 null
2026-03-19 Where are the Hidden Gems? Applying Transformer Models for Design Discussion Detection Lawrence Arkoh et.al. 2603.18393 null
2026-03-18 Synthetic Data, Information, and Prior Knowledge: Why Synthetic Data Augmentation to Boost Sample Doesn’t Work for Statistical Inference Reid Dale et.al. 2603.18345 null
2026-03-20 R2-Dreamer: Redundancy-Reduced World Models without Decoders or Augmentation Naoki Morihira et.al. 2603.18202 null
2026-03-18 Towards Motion-aware Referring Image Segmentation Chaeyun Kim et.al. 2603.17413 null
2026-03-17 Machine intelligence supports the full chain of 2D dendrite synthesis Wenqiang Huang et.al. 2603.16959 null
2026-03-17 Spectral Property-Driven Data Augmentation for Hyperspectral Single-Source Domain Generalization Taiqin Chen et.al. 2603.16662 null
2026-03-17 Dexterous grasp data augmentation based on grasp synthesis with fingertip workspace cloud and contact-aware sampling Liqi Wu et.al. 2603.16609 null
2026-03-17 AW-MoE: All-Weather Mixture of Experts for Robust Multi-Modal 3D Object Detection Hongwei Lin et.al. 2603.16261 null
2026-03-17 When Generative Augmentation Hurts: A Benchmark Study of GAN and Diffusion Models for Bias Correction in AI Classification Systems Shesh Narayan Gupta et.al. 2603.16134 null
2026-03-16 Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech Jaesung Bae et.al. 2603.15988 null
2026-03-16 Benchmarking Machine Learning Approaches for Polarization Mapping in Ferroelectrics Using 4D-STEM Matej Martinc et.al. 2603.15582 null
2026-03-16 Low-Complexity and Consistent Graphon Estimation from Multiple Networks Roland Boniface Sogan et.al. 2603.15578 null
2026-03-16 Data Augmentation via Causal-Residual Bootstrapping Mateusz Gajewski et.al. 2603.15335 null
2026-03-16 Datasets for Verb Alternations across Languages: BLM Templates and Data Augmentation Strategies Giuseppe Samo et.al. 2603.15295 null
2026-03-18 ViSA: Visited-State Augmentation for Generalized Goal-Space Contrastive Reinforcement Learning Issa Nakamura et.al. 2603.14887 null
2026-03-17 Topology-Preserving Data Augmentation for Ring-Type Polygon Annotations Sudip Laudari et.al. 2603.14764 null
2026-03-15 A Heterogeneous Ensemble for Multi-Center COVID-19 Classification from Chest CT Scans Aadit Nilay et.al. 2603.14621 null
2026-03-15 Infinite Problem Generator: Verifiably Scaling Physics Reasoning Data with Agentic Workflows Aditya Sharan et.al. 2603.14486 null
2026-03-15 PGcGAN: Pathological Gait-Conditioned GAN for Human Gait Synthesis Mritula Chandrasekaran et.al. 2603.14409 null
2026-03-15 A Physically-Grounded Attack and Adaptive Defense Framework for Real-World Low-Light Image Enhancement Tongshun Zhang et.al. 2603.14304 null
2026-03-14 EchoLVFM: One-Step Video Generation via Latent Flow Matching for Echocardiogram Synthesis Emmanuel Oladokun et.al. 2603.13967 null
2026-03-14 Close to Reality: Interpretable and Feasible Data Augmentation for Imbalanced Learning Matheus Camilo da Silva et.al. 2603.13927 null
2026-03-14 FMS$^2$: Unified Flow Matching for Segmentation and Synthesis of Thin Structures Babak Asadi et.al. 2603.13659 null
2026-03-11 Layout-Guided Controllable Pathology Image Generation with In-Context Diffusion Transformers Yuntao Shou et.al. 2603.13386 null
2026-03-10 A Computer-aided Framework for Detecting Osteosarcoma in Computed Tomography Scans Maximo Rodriguez-Herrero et.al. 2603.13376 null
2026-03-09 Multimodal Deep Learning for Dynamic and Static Neuroimaging: Integrating MRI and fMRI for Alzheimer Disease Analysis Anima Kujur et.al. 2603.13367 null
2026-03-13 Surrogates for Physics-based and Data-driven Modelling of Parametric Systems: Review and New Perspectives Matteo Giacomini et.al. 2603.12870 null
2026-03-13 On Using Machine Learning to Early Detect Catastrophic Failures in Marine Diesel Engines Francesco Maione et.al. 2603.12733 null
2026-03-16 Overcoming the Modality Gap in Context-Aided Forecasting Vincent Zhihao Zheng et.al. 2603.12451 null
2026-03-16 Room Impulse Response Completion Using Signal-Prediction Diffusion Models Conditioned on Simulated Early Reflections Zeyu Xu et.al. 2603.12442 null
2026-03-12 LLM-Augmented Therapy Normalization and Aspect-Based Sentiment Analysis for Treatment-Resistant Depression on Reddit Yuxin Zhu et.al. 2603.12343 null
2026-03-24 Multi-Station WiFi CSI Sensing Framework Robust to Station-wise Feature Missingness and Limited Labeled Data Keita Kayano et.al. 2603.11858 null
2026-03-12 In the LLM era, Word Sense Induction remains unsolved Anna Mosolova et.al. 2603.11686 null
2026-03-12 FBCIR: Balancing Cross-Modal Focuses in Composed Image Retrieval Chenchen Zhao et.al. 2603.11520 null
2026-03-12 Dynamic Bayesian regression quantile synthesis for forecasting outlook-at-risk Genya Kobayashi et.al. 2603.11474 null
2026-03-11 Data Augmentation and Convolutional Network Architecture Influence on Distributed Learning Victor Forattini Jansen et.al. 2603.10902 null
2026-03-11 Riemannian Geometry-Preserving Variational Autoencoder for MI-BCI Data Augmentation Viktorija Poļaka et.al. 2603.10563 link
2026-03-11 Sparse Task Vector Mixup with Hypernetworks for Efficient Knowledge Transfer in Whole-Slide Image Prognosis Pei Liu et.al. 2603.10526 link
2026-03-11 FAR-Dex: Few-shot Data Augmentation and Adaptive Residual Policy Refinement for Dexterous Manipulation Yushan Bai et.al. 2603.10451 null
2026-03-10 Finetuning a Text-to-Audio Model for Room Impulse Response Generation Kirak Kim et.al. 2603.09708 null
2026-03-10 Improving 3D Foot Motion Reconstruction in Markerless Monocular Human Motion Capture Tom Wehrbein et.al. 2603.09681 null
2026-03-10 Grounding Synthetic Data Generation With Vision and Language Models Ümit Mert Çağlar et.al. 2603.09625 null
2026-03-10 Contrastive Bayesian Inference for Unnormalized Models Naruki Sonobe et.al. 2603.09306 null
2026-03-10 Multi-model approach for autonomous driving: A comprehensive study on traffic sign-, vehicle- and lane detection and behavioral cloning Kanishkha Jaisankar et.al. 2603.09255 null
2026-03-10 Acoustic and Semantic Modeling of Emotion in Spoken Language Soumya Dutta et.al. 2603.09212 null
2026-03-10 Wrong Code, Right Structure: Learning Netlist Representations from Imperfect LLM-Generated RTL Siyang Cai et.al. 2603.09161 null
2026-03-10 Scalable Neural Vocoder from Range-Null Space Decomposition Andong Li et.al. 2603.08574 null
2026-03-09 Diffusion-Based Data Augmentation for Image Recognition: A Systematic Analysis and Evaluation Zekun Li et.al. 2603.08364 null
2026-03-09 Seed2Scale: A Self-Evolving Data Engine for Embodied AI via Small to Large Model Synergy and Multimodal Evaluation Cong Tai et.al. 2603.08260 null
2026-03-09 WhispEar: A Bi-directional Framework for Scaling Whispered Speech Conversion via Pseudo-Parallel Whisper Generation Zihao Fang et.al. 2603.08046 null
2026-03-09 Hard/Soft NLoS Detection via Combinatorial Data Augmentation for 6G Positioning Sang-Hyeok Kim et.al. 2603.07932 null
2026-03-08 Nwāchā Munā: A Devanagari Speech Corpus and Proximal Transfer Benchmark for Nepal Bhasha ASR Rishikesh Kumar Sharma et.al. 2603.07554 null
2026-03-08 An efficient method of posterior sampling for Poisson INGARCH models Yixuan Fan et.al. 2603.07527 null
2026-03-08 InterReal: A Unified Physics-Based Imitation Framework for Learning Human-Object Interaction Skills Dayang Liang et.al. 2603.07516 null
2026-03-07 Neural Control and Learning of Simulated Hand Movements With an EMG-Based Closed-Loop Interface Balint K. Hodossy et.al. 2603.07364 null
2026-03-07 MedSteer: Counterfactual Endoscopic Synthesis via Training-Free Activation Steering Trong-Thang Pham et.al. 2603.07066 null
2026-03-07 OV-DEIM: Real-time DETR-Style Open-Vocabulary Object Detection with GridSynthetic Augmentation Leilei Wang et.al. 2603.07022 null
2026-03-06 Learning From Design Procedure To Generate CAD Programs for Data Augmentation Yan-Ying Chen et.al. 2603.06894 null
2026-03-06 Physics-Informed Diffusion Model for Generating Synthetic Extreme Rare Weather Events Data Marawan Yakout et.al. 2603.06782 null
2026-03-05 On the Generalization Capacities of MLLMs for Spatial Intelligence Gongjie Zhang et.al. 2603.06704 null
2026-03-02 EnsAug: Augmentation-Driven Ensembles for Human Motion Sequence Analysis Bikram De et.al. 2603.06661 null
2026-03-06 NOBLE: Accelerating Transformers with Nonlinear Low-Rank Branches Ethan Smith et.al. 2603.06492 null
2026-03-06 Computer vision-based estimation of invertebrate biomass Mikko Impiö et.al. 2603.06362 null
2026-03-06 MLLMRec-R1: Incentivizing Reasoning Capability in Large Language Models for Multimodal Sequential Recommendation Yu Wang et.al. 2603.06243 null
2026-03-06 AnyCamVLA: Zero-Shot Camera Adaptation for Viewpoint Robust Vision-Language-Action Models Hyeongjun Heo et.al. 2603.05868 null
2026-03-09 SAIL: Similarity-Aware Guidance and Inter-Caption Augmentation-based Learning for Weakly-Supervised Dense Video Captioning Ye-Chan Kim et.al. 2603.05437 null
2026-03-05 CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection Zhaonian Kuang et.al. 2603.05042 null
2026-03-05 Why Is RLHF Alignment Shallow? A Gradient Analysis Robin Young et.al. 2603.04851 null
2026-03-05 Revisiting Shape from Polarization in the Era of Vision Foundation Models Chenhao Li et.al. 2603.04817 null
2026-03-10 Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling Yong Liu et.al. 2603.04791 null
2026-03-05 Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement Brian Jing Hong Nge et.al. 2603.04698 null
2026-03-04 Structure-Guided Histopathology Synthesis via Dual-LoRA Diffusion Xuan Xu et.al. 2603.04565 null
2026-03-04 Balancing Fidelity, Utility, and Privacy in Synthetic Cardiac MRI Generation: A Comparative Study Madhura Edirisooriya et.al. 2603.04340 null
2026-03-04 A Multi-Fidelity Parametric Framework for Reduced-Order Modeling using Optimal Transport-based Interpolation: Applications to Diffused-Interface Two-Phase Flows Moaad Khamlich et.al. 2603.04232 null
2026-03-04 Mask-Guided Attention Regulation for Anatomically Consistent Counterfactual CXR Synthesis Zichun Zhang et.al. 2603.04130 null
2026-03-16 QD-PCQA: Quality-Aware Domain Adaptation for Point Cloud Quality Assessment Guohua Zhang et.al. 2603.03726 null
2026-03-03 An Effective Data Augmentation Method by Asking Questions about Scene Text Images Xu Yao et.al. 2603.03580 null
2026-03-05 AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis Pei Yang et.al. 2603.03378 null
2026-03-03 Joint Training Across Multiple Activation Sparsity Regimes Haotian Wang et.al. 2603.03131 null
2026-03-03 Single Microphone Own Voice Detection based on Simulated Transfer Functions for Hearing Aids Mathuranathan Mayuravaani et.al. 2603.02724 null
2026-03-03 Maximizing Generalization: The Effect of Different Augmentation Techniques on Lightweight Vision Transformer for Bengali Character Classification Rafi Hassan Chowdhury et.al. 2603.02591 null
2026-03-02 Symbol-Equivariant Recurrent Reasoning Models Richard Freinschlag et.al. 2603.02193 null
2026-03-02 CTForensics: A Comprehensive Dataset and Method for AI-Generated CT Image Detection Yiheng Li et.al. 2603.01878 null
2026-03-02 Investigating Group Relative Policy Optimization for Diffusion Transformer based Text-to-Audio Generation Yi Gu et.al. 2603.01565 null
2026-03-16 Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing Zijin Yin et.al. 2603.01535 null
2026-03-02 Conversational Speech Naturalness Predictor Anfeng Xu et.al. 2603.01467 null
2026-03-05 Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards Seungwook Kim et.al. 2603.00918 null
2026-02-28 Revisiting the machine-learning density functional for the one-dimensional Hubbard model with random external potential Octavio D. R. Salmon et.al. 2603.00802 null
2026-02-28 TGM-VLA: Task-Guided Mixup for Sampling-Efficient and Robust Robotic Manipulation Fanqi Pu et.al. 2603.00615 null
2026-02-28 LangGap: Diagnosing and Closing the Language Gap in Vision-Language-Action Models Yuchen Hou et.al. 2603.00592 null
2026-02-27 Synthetic Priors Nick Polson et.al. 2603.00347 null
2026-02-27 NAU-QMUL: Utilizing BERT and CLIP for Multi-modal AI-Generated Image Detection Xiaoyu Guo et.al. 2602.23863 null
2026-02-27 BRIDGE the Gap: Mitigating Bias Amplification in Automated Scoring of English Language Learners via Inter-group Data Augmentation Yun Wang et.al. 2602.23580 null
2026-02-26 Towards Better RL Training Data Utilization via Second-Order Rollout Zhe Yang et.al. 2602.22765 null
2026-02-26 TabDLM: Free-Form Tabular Data Generation via Joint Numerical-Language Diffusion Donghong Cai et.al. 2602.22586 null
2026-02-26 DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation Zhechao Wang et.al. 2602.22549 null
2026-02-24 Computing a Characteristic Orientation for Rotation-Independent Image Analysis Cristian Valero-Abundio et.al. 2602.20930 null
2026-02-24 Federated Learning for Cross-Modality Medical Image Segmentation via Augmentation-Driven Generalization Sachin Dudda Nagaraju et.al. 2602.20773 null
2026-02-23 Shape-informed cardiac mechanics surrogates in data-scarce regimes via geometric encoding and generative augmentation Davide Carrara et.al. 2602.20306 null
2026-02-23 The Sim-to-Real Gap in MRS Quantification: A Systematic Deep Learning Validation for GABA Zien Ma et.al. 2602.20289 null
2026-03-02 Adaptive Data Augmentation with Multi-armed Bandit: Sample-Efficient Embedding Calibration for Implicit Pattern Recognition Minxue Tang et.al. 2602.19385 null
2026-02-22 PerSoMed: A Large-Scale Balanced Dataset for Persian Social Media Text Classification Isun Chehreh et.al. 2602.19333 null
2026-02-22 RetinaVision: XAI-Driven Augmented Regulation for Precise Retinal Disease Classification using deep learning framework Mohammad Tahmid Noor et.al. 2602.19324 null
2026-02-22 Controlled Face Manipulation and Synthesis for Data Augmentation Joris Kirchner et.al. 2602.19219 null
2026-02-21 YOLOv10-Based Multi-Task Framework for Hand Localization and Laterality Classification in Surgical Videos Kedi Sun et.al. 2602.18959 null
2026-02-19 MARS: Margin-Aware Reward-Modeling with Self-Refinement Payel Bhattacharjee et.al. 2602.17658 null
2026-02-19 Reverso: Efficient Time Series Foundation Models for Zero-shot Forecasting Xinghong Fu et.al. 2602.17634 null
2026-02-19 RPDR: A Round-trip Prediction-Based Data Augmentation Framework for Long-Tail Question Answering Yiming Zhang et.al. 2602.17366 null
2026-02-18 Learning to unfold cloth: Scaling up world models to deformable object manipulation Jack Rome et.al. 2602.16675 null
2026-02-18 Label-Consistent Data Generation for Aspect-Based Sentiment Analysis Using LLM Agents Mohammad H. A. Monfared et.al. 2602.16379 null
2026-02-18 Spatial Audio Question Answering and Reasoning on Dynamic Source Movements Arvind Krishna Sridhar et.al. 2602.16334 null
2026-02-18 Peeking Ahead of the Field Study: Exploring VLM Personas as Support Tools for Embodied Studies in HCI Xinyue Gui et.al. 2602.16157 null
2026-02-03 IT-OSE: Exploring Optimal Sample Size for Industrial Data Augmentation Mingchun Sun et.al. 2602.15878 null
2026-02-17 RaCo: Ranking and Covariance for Practical Learned Keypoints Abhiram Shenoi et.al. 2602.15755 null
2026-02-22 Refine Now, Query Fast: A Decoupled Refinement Paradigm for Implicit Neural Fields Tianyu Xiong et.al. 2602.15155 null
2026-02-16 Hidden Markov Individual-level Models of Infectious Disease Transmission Dirk Douwes-Schultz et.al. 2602.15007 null
2026-02-16 Data Augmentation for Pathological Speech Enhancement Mingchi Hou et.al. 2602.14671 null
2026-02-16 Breaking Data Efficiency Dilemma: A Federated and Augmented Learning Framework For Alzheimer’s Disease Detection via Speech Xiao Wei et.al. 2602.14655 null
2026-02-23 MedVAR: Towards Scalable and Efficient Medical Image Generation via Next-scale Autoregressive Prediction Zhicheng He et.al. 2602.14512 null
2026-02-16 The geometry of invariant learning: an information-theoretic analysis of data augmentation and generalization Abdelali Bouyahia et.al. 2602.14423 null
2026-02-16 A Generative AI Approach for Reducing Skin Tone Bias in Skin Cancer Classification Areez Muhammed Shabu et.al. 2602.14356 null
2026-02-18 Bridging the Urban Divide: Adaptive Cross-City Learning for Disaster Sentiment Understanding Zihui Ma et.al. 2602.14352 null
2026-02-15 RoboAug: One Annotation to Hundreds of Scenes via Region-Contrastive Data Augmentation for Robotic Manipulation Xinhua Wang et.al. 2602.14032 null
2026-02-14 Synthetic Dataset Generation and Validation for Robotic Surgery Instrument Segmentation Giorgio Chiesa et.al. 2602.13844 null
2026-02-13 Backdooring Bias in Large Language Models Anudeep Das et.al. 2602.13427 null
2026-02-04 Deep Learning CNN for Pneumonia Detection: Advancing Digital Health in Society 5.0 Hadi Almohab et.al. 2602.13270 null
2026-02-13 Data Augmentation and Attention for massive MIMO-based Indoor Localization in Changing Environments Luisa Schuhmacher et.al. 2602.12954 null
2026-02-13 Beyond Benchmarks of IUGC: Rethinking Requirements of Deep Learning Methods for Intrapartum Ultrasound Biometry from Fetal Ultrasound Videos Jieyun Bai et.al. 2602.12922 null
2026-02-13 Robustness of Object Detection of Autonomous Vehicles in Adverse Weather Conditions Fox Pettersen et.al. 2602.12902 null
2026-02-12 GAN-based data augmentation for rare and exotic hadron searches in Pb–Pb collisions in ALICE Anisa Khatun et.al. 2602.12088 null
2026-02-12 CSEval: A Framework for Evaluating Clinical Semantics in Text-to-Image Generation Robert Cronshaw et.al. 2602.12004 null
2026-02-11 LCIP: Loss-Controlled Inverse Projection of High-Dimensional Image Data Yu Wang et.al. 2602.11141 null
2026-02-11 Healthy Harvests: A Comparative Look at Guava Disease Classification Using InceptionV3 Samanta Ghosh et.al. 2602.10967 null
2026-02-11 AugVLA-3D: Depth-Driven Feature Augmentation for Vision-Language-Action Models Zhifeng Rao et.al. 2602.10698 null
2026-02-11 Enhancing Weakly Supervised Multimodal Video Anomaly Detection through Text Guidance Shengyang Sun et.al. 2602.10549 null
2026-02-11 LakeMLB: Data Lake Machine Learning Benchmark Feiyu Pan et.al. 2602.10441 null
2026-02-10 MalMoE: Mixture-of-Experts Enhanced Encrypted Malicious Traffic Detection Under Graph Drift Yunpeng Tan et.al. 2602.10157 null
2026-02-09 MPA: Multimodal Prototype Augmentation for Few-Shot Learning Liwen Wu et.al. 2602.10143 null
2026-02-10 DexImit: Learning Bimanual Dexterous Manipulation from Monocular Human Videos Juncheng Mu et.al. 2602.10105 null
2026-02-10 Context-Aware Counterfactual Data Augmentation for Gender Bias Mitigation in Language Models Shweta Parihar et.al. 2602.09590 null
2026-02-26 HLGFA: High-Low Resolution Guided Feature Alignment for Unsupervised Anomaly Detection Han Zhou et.al. 2602.09524 null
2026-02-10 Empowering Contrastive Federated Sequential Recommendation with LLMs Thi Minh Chau Nguyen et.al. 2602.09306 null
2026-02-09 One RNG to Rule Them All: How Randomness Becomes an Attack Vector in Machine Learning Kotekar Annapoorna Prabhu et.al. 2602.09182 null
2026-02-04 The SJTU X-LANCE Lab System for MSR Challenge 2025 Jinxuan Zhu et.al. 2602.09042 null
2026-02-09 SynSacc: A Blender-to-V2E Pipeline for Synthetic Neuromorphic Eye-Movement Data and Sim-to-Real Spiking Model Training Khadija Iddrisu et.al. 2602.08726 null
2026-02-09 Chamelion: Reliable Change Detection for Long-Term LiDAR Mapping in Transient Environments Seoyeon Jang et.al. 2602.08189 null
2026-02-08 Enhancing Bandit Algorithms with LLMs for Time-varying User Preferences in Streaming Recommendations Chenglei Shen et.al. 2602.08067 null
2026-02-08 DINO-Mix: Distilling Foundational Knowledge with Cross-Domain CutMix for Semi-supervised Class-imbalanced Medical Image Segmentation Xinyu Liu et.al. 2602.07819 null
2026-02-07 ComPass: Contrastive Learning for Automated Patch Correctness Assessment in Program Repair Quanjun Zhang et.al. 2602.07561 null
2026-02-07 Fine-Grained Cat Breed Recognition with Global Context Vision Transformer Mowmita Parvin Hera et.al. 2602.07534 null
2026-02-07 Pull Requests as a Training Signal for Repo-Level Code Editing Qinglin Zhu et.al. 2602.07457 null
2026-02-07 Echoes in the Loop: Diagnosing Risks in LLM-Powered Recommender Systems under Feedback Loops Donguk Park et.al. 2602.07442 null
2026-02-06 Sequences as Nodes for Contrastive Multimodal Graph Recommendation Bucher Sahyouni et.al. 2602.07208 null
2026-02-06 Calibrating Generative AI to Produce Realistic Essays for Data Augmentation Edward W. Wolfe et.al. 2602.06772 null
2026-02-06 Diffeomorphism-Equivariant Neural Networks Josephine Elisabeth Oettinger et.al. 2602.06695 null
2026-02-06 AlertBERT: A noise-robust alert grouping framework for simultaneous cyber attacks Lukas Karner et.al. 2602.06534 null
2026-02-05 InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions Sirui Xu et.al. 2602.06035 null
2026-02-05 Mapper-GIN: Lightweight Structural Graph Abstraction for Corrupted 3D Point Cloud Classification Jeongbin You et.al. 2602.05522 null
2026-02-05 Balanced Anomaly-guided Ego-graph Diffusion Model for Inductive Graph Anomaly Detection Chunyu Wei et.al. 2602.05232 null
2026-02-04 Fast Compute via MC Boosting Sarah Polson et.al. 2602.05032 null
2026-02-04 Smart Diagnosis and Early Intervention in PCOS: A Deep Learning Approach to Women’s Reproductive Health Shayan Abrar et.al. 2602.04944 null
2026-02-04 Speaker-Aware Simulation Improves Conversational Speech Recognition Máté Gedeon et.al. 2602.04776 null
2026-02-04 Turbulence teaches equivariance to neural networks Ryley McConkey et.al. 2602.04695 null
2026-02-04 LatentTune: Efficient Tuning of High Dimensional Database Parameters via Latent Representation Learning Sein Kwon et.al. 2602.04190 null
2026-02-03 SEIS: Subspace-based Equivariance and Invariance Scores for Neural Representations Huahua Lin et.al. 2602.04054 null
2026-02-03 Quasi-multimodal-based pathophysiological feature learning for retinal disease diagnosis Lu Zhang et.al. 2602.03622 null
2026-02-03 Cut to the Mix: Simple Data Augmentation Outperforms Elaborate Ones in Limited Organ Segmentation Datasets Chang Liu et.al. 2602.03555 null
2026-02-03 Invisible Clean-Label Backdoor Attacks for Generative Data Augmentation Ting Xiang et.al. 2602.03316 null
2026-02-03 PQTNet: Pixel-wise Quantitative Thermography Neural Network for Estimating Defect Depth in Polylactic Acid Parts by Additive Manufacturing Lei Deng et.al. 2602.03314 null
2026-02-03 Convolutional Neural Networks for classifying galaxy mergers: Can faint tidal features aid in classifying mergers? Yeonkyung Lee et.al. 2602.03312 null
2026-02-03 Beyond Cropping and Rotation: Automated Evolution of Powerful Task-Specific Augmentations with Generative Models Judah Goldfeder et.al. 2602.03123 null
2026-02-03 The High Cost of Data Augmentation for Learning Equivariant Models Henri Klintebäck et.al. 2602.03118 null
2026-02-03 Structuring Value Representations via Geometric Coherence in Markov Decision Processes Zuyuan Zhang et.al. 2602.02978 null
2026-02-03 Synthetic Data Augmentation for Medical Audio Classification: A Preliminary Evaluation David McShannon et.al. 2602.02955 null
2026-02-03 3D-Learning: Diffusion-Augmented Distributionally Robust Decision-Focused Learning Jiaqi Wen et.al. 2602.02943 null
2026-02-09 Semantics-Aware Generative Latent Data Augmentation for Learning in Low-Resource Domains Jaesung Bae et.al. 2602.02841 null
2026-01-30 Auto-Augmentation Contrastive Learning for Wearable-based Human Activity Recognition Qingyu Wu et.al. 2602.02542 null
2026-02-02 HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos Yinhuai Wang et.al. 2602.02473 null
2026-02-02 Transfer Learning Through Conditional Quantile Matching Yikun Zhang et.al. 2602.02358 null
2026-02-02 Enhancing Generalization in Evolutionary Feature Construction for Symbolic Regression through Vicinal Jensen Gap Minimization Hengzhe Zhang et.al. 2602.01510 null
2026-02-01 Understanding vision transformer robustness through the lens of out-of-distribution detection Joey Kuang et.al. 2602.01459 null
2026-02-01 PedagoSense: A Pedology Grounded LLM System for Pedagogical Strategy Detection and Contextual Response Generation in Learning Dialogues Shahem Sultan et.al. 2602.01169 null
2026-02-01 Key Principles of Graph Machine Learning: Representation, Robustness, and Generalization Yassine Abbahaddou et.al. 2602.01139 null
2026-02-03 Data Augmentation for High-Fidelity Generation of CAR-T/NK Immunological Synapse Images Xiang Zhang et.al. 2602.00949 null
2026-01-31 Safety-Efficacy Trade Off: Robustness against Data-Poisoning Diego Granziol et.al. 2602.00822 null
2026-01-27 1S-DAug: One-Shot Data Augmentation for Robust Few-Shot Generalization Yunwei Bai et.al. 2602.00114 null
2026-01-30 Adaptive Edge Learning for Density-Aware Graph Generation Seyedeh Ava Razi Razavi et.al. 2601.23052 null
2026-01-30 Improving Supervised Machine Learning Performance in Optical Quality Control via Generative AI for Dataset Expansion Dennis Sprute et.al. 2601.22961 null
2026-01-30 WED-Net: A Weather-Effect Disentanglement Network with Causal Augmentation for Urban Flow Prediction Qian Hong et.al. 2601.22586 null
2026-01-30 Cross-Domain Few-Shot Learning for Hyperspectral Image Classification Based on Mixup Foundation Model Naeem Paeedeh et.al. 2601.22581 null
2026-01-30 CoDCL: Counterfactual Data Augmentation Contrastive Learning for Continuous-Time Dynamic Network Link Prediction Hantong Feng et.al. 2601.22427 null
2026-01-29 Mechanistic Data Attribution: Tracing the Training Origins of Interpretable LLM Units Jianhui Chen et.al. 2601.21996 null
2026-01-29 Localizing Speech Deepfakes Beyond Transitions via Segment-Aware Learning Yuchen Mao et.al. 2601.21925 null
2026-01-29 Generative Design of Ship Propellers using Conditional Flow Matching Patrick Kruger et.al. 2601.21637 null
2026-01-29 Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes Yang Zhou et.al. 2601.21551 null
2026-02-06 inversedMixup: Data Augmentation via Inverting Mixed Embeddings Fanshuang Kong et.al. 2601.21543 null
2026-01-20 BioNIC: Biologically Inspired Neural Network for Image Classification Using Connectomics Principles Diya Prasanth et.al. 2601.20876 null
2026-01-28 Cross-Country Learning for National Infectious Disease Forecasting Using European Data Zacharias Komodromos et.al. 2601.20771 null
2026-01-28 Replicating weak-lensing summary-statistic covariances with normalizing flows Joaquin Armijo et.al. 2601.20669 null
2026-01-28 IoT Device Identification with Machine Learning: Common Pitfalls and Best Practices Kahraman Kostas et.al. 2601.20548 null
2026-01-28 PalmBridge: A Plug-and-Play Feature Alignment Framework for Open-Set Palmprint Verification Chenke Zhang et.al. 2601.20351 null
2026-01-28 Demonstration-Free Robotic Control via LLM Agents Brian Y. Tsui et.al. 2601.20334 null
2026-01-16 oculomix: Hierarchical Sampling for Retinal-Based Systemic Disease Prediction Hyunmin Kim et.al. 2601.19939 null
2026-01-29 Real-Time Pulsatile Flow Prediction for Realistic, Diverse Intracranial Aneurysm Morphologies using a Graph Transformer and Steady-Flow Data Augmentation Yiying Sheng et.al. 2601.19876 null
2026-01-27 Grasynda: Graph-based Synthetic Time Series Generation Luis Amorim et.al. 2601.19668 null
2026-01-27 Algorithmic Prompt-Augmentation for Efficient LLM-Based Heuristic Design for A* Search Thomas Bömer et.al. 2601.19622 null
2026-01-27 DSTCS: Dual-Student Teacher Framework with Segment Anything Model for Semi-Supervised Pubic Symphysis Fetal Head Segmentation Yalin Luo et.al. 2601.19446 null
2026-01-27 High-quality data augmentation for code comment classification Thomas Borsani et.al. 2601.19383 null
2026-01-27 Binary Token-Level Classification with DeBERTa for All-Type MWE Identification: A Lightweight Approach with Linguistic Enhancement Diego Rossini et.al. 2601.19360 null
2026-01-27 Implicit Non-Causal Factors are Out via Dataset Splitting for Domain Generalization Object Detection Zhilong Zhang et.al. 2601.19127 null
2026-01-27 Leveraging Sentence-oriented Augmentation and Transformer-Based Architecture for Vietnamese-Bahnaric Translation Tan Sang Nguyen et.al. 2601.19124 null
2026-01-27 Exploring Weaknesses in Function Call Models via Reinforcement Learning: An Adversarial Data Augmentation Approach Weiran Guo et.al. 2601.19122 null
2026-01-26 OATS: Online Data Augmentation for Time Series Foundation Models Junwei Deng et.al. 2601.19040 null
2026-01-26 ExoGS: A 4D Real-to-Sim-to-Real Framework for Scalable Manipulation Data Collection Yiming Wang et.al. 2601.18629 null
2026-01-26 Generative Diffusion Augmentation with Quantum-Enhanced Discrimination for Medical Image Diagnosis Jingsong Xia et.al. 2601.18556 null
2026-01-26 A Dataset for Automatic Vocal Mode Classification Reemt Hinrichs et.al. 2601.18339 null
2026-01-26 Analytic Incremental Learning For Sound Source Localization With Imbalance Rectification Zexia Fan et.al. 2601.18335 null
2026-01-26 Facial Emotion Recognition on FER-2013 using an EfficientNetB2-Based Approach Sahil Naik et.al. 2601.18228 null
2026-01-25 MarketGANs: Multivariate financial time-series data augmentation using generative adversarial networks Jeonggyu Huh et.al. 2601.17773 null
2026-01-25 Training-Free Text-to-Image Compositional Food Generation via Prompt Grafting Xinyue Pan et.al. 2601.17666 null
2026-01-24 Stylizing ViT: Anatomy-Preserving Instance Style Transfer for Domain Generalization Sebastian Doerrich et.al. 2601.17586 null
2026-01-24 SpatialMath: Spatial Comprehension-Infused Symbolic Reasoning for Mathematical Problem-Solving Ashutosh Bajpai et.al. 2601.17489 null
2026-01-23 Semi-Supervised Domain Adaptation with Latent Diffusion for Pathology Image Classification Tengyue Zhang et.al. 2601.17228 null
2026-01-23 Fully 3D Unrolled Magnetic Resonance Fingerprinting Reconstruction via Staged Pretraining and Implicit Gridding Yonatan Urman et.al. 2601.17143 null
2026-01-22 A Computer Vision Pipeline for Iterative Bullet Hole Tracking in Rifle Zeroing Robert M. Belcher et.al. 2601.17062 null
2026-01-22 Frequency-aware Adaptive Contrastive Learning for Sequential Recommendation Zhikai Wang et.al. 2601.17057 null
2026-01-20 Arabic Sign Language Recognition using Multimodal Approach Ghadeer Alanazi et.al. 2601.17041 null
2026-01-23 Latent Diffusion for Internet of Things Attack Data Generation in Intrusion Detection Estela Sánchez-Carballo et.al. 2601.16976 null
2026-01-23 A Novel Transfer Learning Approach for Mental Stability Classification from Voice Signal Rafiul Islam et.al. 2601.16793 null
2026-01-22 Synthetic Augmentation in Imbalanced Learning: When It Helps, When It Hurts, and How Much to Add Zhengchi Ma et.al. 2601.16120 null
2026-01-22 synthocr-gen: A synthetic ocr dataset generator for low-resource languages- breaking the data barrier Haq Nawaz Malik et.al. 2601.16113 null
2026-01-23 Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model Chenghao Fan et.al. 2601.15892 null
2026-01-22 Beyond Off-the-Shelf Models: A Lightweight and Accessible Machine Learning Pipeline for Ecologists Working with Image Data Clare Chemery et.al. 2601.15813 null
2026-01-22 Diffusion Model-Based Data Augmentation for Enhanced Neuron Segmentation Liuyun Jiang et.al. 2601.15779 null
2026-01-22 Materealize: a multi-agent deliberation system for end-to-end material design and synthesis Seongmin Kim et.al. 2601.15743 null
2026-01-21 AI-Based Culvert-Sewer Inspection Christina Thrainer et.al. 2601.15366 null
2026-01-21 Synthetic Data Augmentation for Multi-Task Chinese Porcelain Classification: A Stable Diffusion Approach Ziyao Ling et.al. 2601.14791 null
2026-01-21 Context Patch Fusion With Class Token Enhancement for Weakly Supervised Semantic Segmentation Yiyang Fu et.al. 2601.14718 null
2026-01-21 Counterfactual Modeling with Fine-Tuned LLMs for Health Intervention Design and Sensor Data Augmentation Shovito Barua Soumma et.al. 2601.14590 null
2026-01-20 Attention-Based Offline Reinforcement Learning and Clustering for Interpretable Sepsis Treatment Punit Kumar et.al. 2601.14228 null
2026-01-21 RL-BioAug: Label-Efficient Reinforcement Learning for Self-Supervised EEG Representation Learning Cheol-Hui Lee et.al. 2601.13964 null
2026-01-20 Towards Effective Negation Modeling in Joint Audio-Text Models for Music Yannis Vasilakis et.al. 2601.13931 null
2026-01-20 Inverting Self-Organizing Maps: A Unified Activation-Based Framework Alessandro Londei et.al. 2601.13851 null
2026-01-19 Discrete-Time Optimal Control of Species Augmentation for Predator-Prey Model Munkaila Dasumani et.al. 2601.13394 null
2026-01-30 Diffusion-Driven Synthetic Tabular Data Generation for Enhanced DoS/DDoS Attack Classification Aravind B et.al. 2601.13197 null
2026-01-19 NeuroShield: A Neuro-Symbolic Framework for Adversarial Robustness Ali Shafiee Sarvestani et.al. 2601.13162 null
2026-01-19 Adaptive Speaker Embedding Self-Augmentation for Personal Voice Activity Detection with Short Enrollment Speech Fuyuan Feng et.al. 2601.12769 null
2026-01-18 Single-index Semiparametric Transformation Cure Models with Interval-censored Data Xiaoru Huang et.al. 2601.12370 null
2026-01-18 Beyond Human Annotation: Recent Advances in Data Generation Methods for Document Intelligence Dehao Ying et.al. 2601.12318 null
2026-01-18 An Innovative Framework for Breast Cancer Detection Using Pyramid Adaptive Atrous Convolution, Transformer Integration, and Multi-Scale Feature Fusion Ehsan Sadeghi Pour et.al. 2601.12249 null
2026-01-16 Isotropy-Optimized Contrastive Learning for Semantic Course Recommendation Ali Khreis et.al. 2601.11427 null
2026-01-16 How DDAIR you? Disambiguated Data Augmentation for Intent Recognition Galo Castillo-López et.al. 2601.11234 null
2026-01-16 Tail-Aware Data Augmentation for Long-Tail Sequential Recommendation Yizhou Dang et.al. 2601.10933 null
2026-01-15 A Unified 3D Object Perception Framework for Real-Time Outside-In Multi-Camera Systems Yizhou Wang et.al. 2601.10819 null
2026-01-15 Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models Zirui Ren et.al. 2601.10679 null
2026-01-15 History Is Not Enough: An Adaptive Dataflow System for Financial Time-Series Synthesis Haochong Xia et.al. 2601.10143 null
2026-01-14 Explainable Deep Learning for Pediatric Pneumonia Detection in Chest X-Ray Images Adil O. Khadidos et.al. 2601.09814 null
2026-01-30 WiFo-M$^2$: Empower Wireless Communications With Plug-and-Play Environment Sensing via Foundation Model Haotian Zhang et.al. 2601.09179 null
2026-01-14 From Snow to Rain: Evaluating Robustness, Calibration, and Complexity of Model-Based Robust Training Josué Martínez-Martínez et.al. 2601.09153 null
2026-01-14 Enhancing Imbalanced Electrocardiogram Classification: A Novel Approach Integrating Data Augmentation through Wavelet Transform and Interclass Fusion Haijian Shao et.al. 2601.09103 null
2026-01-09 Bias Detection and Rotation-Robustness Mitigation in Vision-Language Models and Generative Image Models Tarannum Mithila et.al. 2601.08860 null
2026-01-13 Get away with less: Need of source side data curation to build parallel corpus for low resource Machine Translation Saumitra Yadav et.al. 2601.08629 null
2026-01-13 REVNET: Rotation-Equivariant Point Cloud Completion via Vector Neuron Anchor Transformer Zhifan Ni et.al. 2601.08558 null
2026-01-13 Hybrid Distillation with CoT Guidance for Edge-Drone Control Code Generation Yizhan Feng et.al. 2601.08412 null
2026-01-13 VGG Induced Deep Hand Sign Language Detection Subham Sharma et.al. 2601.08262 null
2026-01-13 Towards Cross-Platform Generalization: Domain Adaptive 3D Detection with Augmentation and Pseudo-Labeling Xiyan Feng et.al. 2601.08174 null
2026-01-13 PathoGen: Diffusion-Based Synthesis of Realistic Lesions in Histopathology Images Mohamad Koohi-Moghadam et.al. 2601.08127 null
2026-01-12 Bayesian nonparametric models for zero-inflated count-compositional data using ensembles of regression trees André F. B. Menezes et.al. 2601.08067 null
2026-01-12 AdaField: Generalizable Surface Pressure Modeling with Physics-Informed Pre-training and Flow-Conditioned Adaptation Junhong Zou et.al. 2601.07139 null
2026-01-11 Paraphrasing Adversarial Attack on LLM-as-a-Reviewer Masahiro Kaneko et.al. 2601.06884 null
2026-01-04 AIS-CycleGen: A CycleGAN-Based Framework for High-Fidelity Synthetic AIS Data Generation and Augmentation SM Ashfaq uz Zaman et.al. 2601.06127 null
2026-01-09 Cedalion Tutorial: A Python-based framework for comprehensive analysis of multimodal fNIRS & DOT from the lab to the everyday world E. Middell et.al. 2601.05923 null
2026-01-09 Data Augmented Pipeline for Legal Information Extraction and Reasoning Nguyen Minh Phuong et.al. 2601.05609 null
2026-01-09 Learn to Evolve: Self-supervised Neural JKO Operator for Wasserstein Gradient Flow Xue Feng et.al. 2601.05583 null
2026-01-09 Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection Hanyi Wang et.al. 2601.05580 null
2026-01-09 LEAPS: An LLM-Empowered Adaptive Plugin for Taobao AI Search Lei Wang et.al. 2601.05513 null
2026-01-08 FlowLet: Conditional 3D Brain MRI Synthesis using Wavelet Flow Matching Danilo Danese et.al. 2601.05212 null
2026-01-08 SimuAgent: An LLM-Based Simulink Modeling Assistant Enhanced with Reinforcement Learning Yanchang Liang et.al. 2601.05187 null
2026-01-08 Approximate equivariance via projection-based regularisation Torben Berndt et.al. 2601.05028 null
2026-01-08 A new method for augmenting short time series, with application to pain events in sickle cell disease Kumar Utkarsh et.al. 2601.04538 null
2026-01-20 Causal Data Augmentation for Robust Fine-Tuning of Tabular Foundation Models Magnus Bühler et.al. 2601.04110 null
2026-01-07 Investigation into respiratory sound classification for an imbalanced data set using hybrid LSTM-KAN architectures Nithinkumar K. et.al. 2601.03610 null
2026-01-07 Artificial Intelligence and Skills: Evidence from Contrastive Learning in Online Job Vacancies Hangyu Chen et.al. 2601.03558 null
2026-01-07 Persona-aware and Explainable Bikeability Assessment: A Vision-Language Model Approach Yilong Dai et.al. 2601.03534 null
2026-01-10 Improving Indigenous Language Machine Translation with Synthetic Data and Language-Specific Preprocessing Aashish Dhawan et.al. 2601.03135 null
2026-01-06 ToxiGAN: Toxic Data Augmentation via LLM-Guided Directional Adversarial Generation Peiran Li et.al. 2601.03121 null
2026-01-06 Enhancing Multilingual RAG Systems with Debiased Language Preference-Guided Query Fusion Jeonghyun Park et.al. 2601.02956 null
2026-01-13 Training Language Models with homotokens Leads to Delayed Overfitting Adrian Cosma et.al. 2601.02867 null
2026-01-06 Adversarial Question Answering Robustness: A Multi-Level Error Analysis and Mitigation Study Agniv Roy Choudhury et.al. 2601.02700 null
2026-01-05 API: Empowering Generalizable Real-World Image Dehazing via Adaptive Patch Importance Learning Chen Zhu et.al. 2601.01992 null
2026-01-05 Thinking with Blueprints: Assisting Vision-Language Models in Spatial Reasoning via Structured Object Representation Weijian Ma et.al. 2601.01984 null
2026-01-05 Theoretical Convergence of SMOTE-Generated Samples Firuz Kamalov et.al. 2601.01927 null
2026-01-05 AlignDrive: Aligned Lateral-Longitudinal Planning for End-to-End Autonomous Driving Yanhao Wu et.al. 2601.01762 null
2026-01-04 FALCON: Few-Shot Adversarial Learning for Cross-Domain Medical Image Segmentation Abdur R. Fayjie et.al. 2601.01687 null
2026-01-04 DiffKD-DCIS: Predicting Upgrade of Ductal Carcinoma In Situ with Diffusion Augmentation and Knowledge Distillation Tao Li et.al. 2601.01507 null
2026-01-04 DeepInv: A Novel Self-supervised Learning Approach for Fast and Accurate Diffusion Inversion Ziyue Zhang et.al. 2601.01487 null
2026-01-04 iFlip: Iterative Feedback-driven Counterfactual Example Refinement Yilong Wang et.al. 2601.01446 null
2026-01-04 In defense of the two-stage framework for open-set domain adaptive semantic segmentation Wenqi Ren et.al. 2601.01439 null
2026-01-03 DST-Calib: A Dual-Path, Self-Supervised, Target-Free LiDAR-Camera Extrinsic Calibration Network Zhiwei Huang et.al. 2601.01188 null
2026-01-03 Comparative Evaluation of VAE, GAN, and SMOTE for Tor Detection in Encrypted Network Traffic Saravanan A et.al. 2601.01183 null
2026-01-03 600k-ks-ocr: a large-scale synthetic dataset for optical character recognition in kashmiri script Haq Nawaz Malik et.al. 2601.01088 null
2026-01-03 Enhanced Leukemic Cell Classification Using Attention-Based CNN and Data Augmentation Douglas Costa Braga et.al. 2601.01026 null
2026-01-02 A Deep Learning Approach for Automated Skin Lesion Diagnosis with Explainable AI Md. Maksudul Haque et.al. 2601.00964 null
2026-01-01 Four-Stage Alzheimer’s Disease Classification from MRI Using Topological Feature Extraction, Feature Selection, and Ensemble Learning Faisal Ahmed et.al. 2601.00918 null
2025-12-25 ShrimpXNet: A Transfer Learning Framework for Shrimp Disease Classification with Augmented Regularization, Adversarial Training, and Explainable AI Israk Hasan Jone et.al. 2601.00832 null
2026-01-08 RoboReward: General-Purpose Vision-Language Reward Models for Robotics Tony Lee et.al. 2601.00675 null
2026-01-01 Detecting Spike Wave Discharges (SWD) using 1-dimensional Residual UNet Saurav Sengupta et.al. 2601.00459 null
2026-01-01 ReMA: A Training-Free Plug-and-Play Mixing Augmentation for Video Behavior Recognition Feng-Qi Cui et.al. 2601.00311 null
2026-01-01 Towards Automated Differential Diagnosis of Skin Diseases Using Deep Learning and Imbalance-Aware Strategies Ali Anaissi et.al. 2601.00286 null
2026-01-01 Parallel Universes, Parallel Languages: A Comprehensive Study on LLM-based Multilingual Counterfactual Example Generation Qianli Wang et.al. 2601.00263 null
2026-01-01 Application Research of a Deep Learning Model Integrating CycleGAN and YOLO in PCB Infrared Defect Detection Chao Yang et.al. 2601.00237 null
2025-12-31 MUSIC: MUlti-Step Instruction Contrast for Multi-Turn Reward Models Wenzhe Li et.al. 2512.24693 null
2025-12-30 Comparing Approaches to Automatic Summarization in Less-Resourced Languages Chester Palen-Michel et.al. 2512.24410 null
2025-12-30 One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training Jia Yu et.al. 2512.24278 null
2025-12-30 Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes Shuyun Wang et.al. 2512.24227 null
2026-01-13 Enhanced Web Payload Classification Using WAMM: An AI-Based Framework for Dataset Refinement and Model Evaluation Heba Osama et.al. 2512.23610 null
2025-12-29 Detection Fire in Camera RGB-NIR Nguyen Truong Khai et.al. 2512.23594 null
2025-12-29 GeoTeacher: Geometry-Guided Semi-Supervised 3D Object Detection Jingyu Li et.al. 2512.23147 null
2025-12-31 A Context-Aware Temporal Modeling through Unified Multi-Scale Temporal Encoding and Hierarchical Sequence Learning for Single-Channel EEG Sleep Staging Amirali Vakili et.al. 2512.22976 null
2025-12-28 Robust LLM-based Column Type Annotation via Prompt Augmentation with LoRA Tuning Hanze Meng et.al. 2512.22742 null
2025-12-28 Data Augmentation for Classification of Negative Pregnancy Outcomes in Imbalanced Data Md Badsha Biswas et.al. 2512.22732 null
2025-12-20 SAMM2D: Scale-Aware Multi-Modal 2D Dual-Encoder for High-Sensitivity Intracrania Aneurysm Screening Antara Titikhsha et.al. 2512.22185 null
2025-12-26 High-Fidelity and Long-Duration Human Image Animation with Diffusion Transformer Shen Zheng et.al. 2512.21905 null
2025-12-26 Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning YuXiang Kong et.al. 2512.21828 null
2025-12-25 AVP-Fusion: Adaptive Multi-Modal Fusion and Contrastive Learning for Two-Stage Antiviral Peptide Identification Xinru Wen et.al. 2512.21544 null
2025-12-25 Intelligent recognition of GPR road hidden defect images based on feature fusion and attention mechanism Haotian Lv et.al. 2512.21452 null
2025-12-24 Granular-ball Guided Masking: Structure-aware Data Augmentation Shuyin Xia et.al. 2512.21011 null
2025-12-23 Convergence analysis of data augmentation algorithms in Bayesian lasso models with log-concave likelihoods Jingkai Cui et.al. 2512.20041 null
2025-12-23 GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators Jiacheng Guo et.al. 2512.19682 null
2025-12-22 No Data? No Problem: Robust Vision-Tabular Learning with Missing Values Marta Hasny et.al. 2512.19602 null
2025-12-22 srvar-toolkit: A Python Implementation of Shadow-Rate Vector Autoregressions with Stochastic Volatility Charles Shaw et.al. 2512.19589 null
2025-12-22 BabyFlow: 3D modeling of realistic and expressive infant faces Antonia Alomar et.al. 2512.19560 null
2025-12-22 GANeXt: A Fully ConvNeXt-Enhanced Generative Adversarial Network for MRI- and CBCT-to-CT Synthesis Siyuan Mei et.al. 2512.19336 null
2025-12-22 IndoorUAV: Benchmarking Vision-Language UAV Navigation in Continuous Indoor Environments Xu Liu et.al. 2512.19024 null
2025-12-22 DTCCL: Disengagement-Triggered Contrastive Continual Learning for Autonomous Bus Planners Yanding Yang et.al. 2512.18988 null
2025-12-20 Generalization Gaps in Political Fake News Detection: An Empirical Study on the LIAR Dataset S Mahmudul Hasan et.al. 2512.18533 null
2025-12-10 Continual Learning for Acoustic Event Classification Yang Xiao et.al. 2512.17932 null
2026-01-09 SCOPE: Sequential Causal Optimization of Process Interventions Jakob De Moor et.al. 2512.17629 null
2025-12-19 SkinGenBench: Generative Model and Preprocessing Effects for Synthetic Dermoscopic Augmentation in Melanoma Diagnosis N. A. Adarsh Pritam et.al. 2512.17585 null
2025-12-19 SCAR: Semantic Cardiac Adversarial Representation via Spatiotemporal Manifold Optimization in ECG Shunbo Jia et.al. 2512.17423 null
2025-12-18 Data Augmentation Supporting a Conversational Agent Designed for Smoking Cessation Support Groups Salar Hashemitaheri et.al. 2512.17092 null
2025-12-18 Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation Min-Jung Kim et.al. 2512.17040 null
2025-12-23 Do Generalized-Gamma Scale Mixtures of Normals Fit Large Image Datasets? Brandon Marks et.al. 2512.17038 null
2025-12-18 Exploration of Augmentation Strategies in Multi-modal Retrieval-Augmented Generation for the Biomedical Domain: A Case Study Evaluating Question Answering in Glycobiology Primož Kocbek et.al. 2512.16802 null
2025-12-13 Two-Step Data Augmentation for Masked Face Detection and Recognition: Turning Fake Masks to Real Yan Yang et.al. 2512.15774 null
2025-12-12 Data-Chain Backdoor: Do You Trust Diffusion Models as Generative Data Supplier? Junchi Lu et.al. 2512.15769 null
2025-12-19 Stylized Synthetic Augmentation further improves Corruption Robustness Georg Siedel et.al. 2512.15675 null
2025-12-17 BEAT2AASIST model with layer fusion for ESDD 2026 Challenge Sanghyeok Chung et.al. 2512.15180 null
2025-12-16 Bayesian Latent Class Regression and Variable Selection with Applications to Sleep Patterns Data Matthew Heaney et.al. 2512.14903 null
2025-12-15 Revisiting the Reliability of Language Models in Instruction-Following Jianshuo Dong et.al. 2512.14754 null
2025-12-16 CHIP: Adaptive Compliance for Humanoid Control through Hindsight Perturbation Sirui Chen et.al. 2512.14689 null
2025-12-16 Robust Training of Singing Voice Synthesis Using Prior and Posterior Uncertainty Yiwen Zhao et.al. 2512.14653 null
2025-12-16 Attention-Based Preprocessing Framework for Improving Rare Transient Classification Xinyue Sheng et.al. 2512.14644 null
2025-12-18 Synthetic Electrogram Generation with Variational Autoencoders for ECGI Miriam Gutiérrez-Fernández et.al. 2512.14537 null
2025-12-16 Mimicking Human Visual Development for Learning Robust Image Representations Ankita Raj et.al. 2512.14360 null
2025-12-16 4D-RaDiff: Latent Diffusion for 4D Radar Point Cloud Generation Jimmie Kwok et.al. 2512.14235 null
2025-12-16 AsarRec: Adaptive Sequential Augmentation for Robust Self-supervised Sequential Recommendation Kaike Zhang et.al. 2512.14047 null
2025-12-15 Comparative Evaluation of Embedding Representations for Financial News Sentiment Analysis Joyjit Roy et.al. 2512.13749 null
2025-12-15 Advancing Machine Learning Optimization of Chiral Photonic Metasurface: Comparative Study of Neural Network and Genetic Algorithm Approaches Davide Filippozzi et.al. 2512.13656 null
2026-01-05 Test-Time Modification: Inverse Domain Transformation for Robust Perception Arpit Jadon et.al. 2512.13454 null
2025-12-15 Measurement of Material Volume Fractions in a Microwave Resonant Cavity Sensor Using Convolutional Neural Network Mojtaba Joodaki et.al. 2512.13233 null
2025-12-15 Comprehensive Deployment-Oriented Assessment for Cross-Environment Generalization in Deep Learning-Based mmWave Radar Sensing Tomoya Tanaka et.al. 2512.13018 null
2025-12-15 BLADE: A Behavior-Level Data Augmentation Framework with Dual Fusion Modeling for Multi-Behavior Sequential Recommendation Yupeng Li et.al. 2512.12964 null
2025-12-14 Adapting Multimodal Foundation Models for Few-Shot Learning: A Comprehensive Study on Contrastive Captioners N. K. B. M. P. K. B. Narasinghe et.al. 2512.12824 null
2025-12-14 Personalized QoE Prediction: A Demographic-Augmented Machine Learning Framework for 5G Video Streaming Networks Syeda Zunaira Ahmed et.al. 2512.12736 null
2025-12-14 Supervised Contrastive Frame Aggregation for Video Representation Learning Shaif Chowdhury et.al. 2512.12549 null
2025-12-14 Generative Spatiotemporal Data Augmentation Jinfan Zhou et.al. 2512.12508 null
2025-12-12 Towards Channel-Robust and Receiver-Independent Radio Frequency Fingerprint Identification Jie Ma et.al. 2512.12070 null
2025-12-07 Pseudo-Label Refinement for Robust Wheat Head Segmentation via Two-Stage Hybrid Training Jiahao Jiang et.al. 2512.11874 null
2025-12-12 Depth-Copy-Paste: Multimodal and Depth-Aware Compositing for Robust Face Detection Qiushi Guo et.al. 2512.11683 null
2025-12-12 Kinetic Mining in Context: Few-Shot Action Synthesis via Text-to-Motion Distillation Luca Cazzola et.al. 2512.11654 null
2025-12-12 DREAM-B3P: Dual-Stream Transformer Network Enhanced by Feedback Diffusion Model for Blood-Brain Barrier Penetrating Peptide Prediction Kaijie Wang et.al. 2512.11511 null
2025-12-12 Towards Logic-Aware Manipulation: A Knowledge Primitive for VLM-Based Assistants in Smart Manufacturing Suchang Chen et.al. 2512.11275 null
2025-12-15 Template-Free Retrosynthesis with Graph-Prior Augmented Transformers Youjun Zhao et.al. 2512.10770 null
2025-12-12 Textual Data Bias Detection and Mitigation – An Extensible Pipeline with Experimental Evaluation Rebekka Görge et.al. 2512.10734 null
2025-12-11 A Conditional Generative Framework for Synthetic Data Augmentation in Segmenting Thin and Elongated Structures in Biological Images Yi Liu et.al. 2512.10334 null
2025-12-11 CIEGAD: Cluster-Conditioned Interpolative and Extrapolative Framework for Geometry-Aware and Domain-Aligned Data Augmentation Keito Inoshita et.al. 2512.10178 null
2025-12-10 Knowledge Graph Enrichment and Reasoning for Nobel Laureates Thanh-Lam T. Nguyen et.al. 2512.09707 null
2025-12-10 Hands-on Evaluation of Visual Transformers for Object Recognition and Detection Dimitrios N. Vlachogiannis et.al. 2512.09579 null
2025-12-09 Protein Secondary Structure Prediction Using Transformers Manzi Kevin Maxime et.al. 2512.08613 null
2025-12-09 LLM-based Vulnerable Code Augmentation: Generate or Refactor? Dyna Soumhane Ouchebara et.al. 2512.08493 null
2025-12-09 FastBEV++: Fast by Algorithm, Deployable by Design Yuanpeng Chen et.al. 2512.08237 null
2025-12-08 Coherent Audio-Visual Editing via Conditional Audio Generation Following Video Edits Masato Ishii et.al. 2512.07209 null
2025-12-07 OXtal: An All-Atom Diffusion Model for Organic Crystal Structure Prediction Emily Jin et.al. 2512.06987 null
2025-12-07 RMAdapter: Reconstruction-based Multi-Modal Adapter for Vision-Language Models Xiang Lin et.al. 2512.06811 null
2025-12-07 Stitch and Tell: A Structured Multimodal Data Augmentation Method for Spatial Understanding Hang Yin et.al. 2512.06769 null
2025-12-07 XM-ALIGN: Unified Cross-Modal Embedding Alignment for Face-Voice Association Zhihua Fang et.al. 2512.06757 null
2025-12-20 Monotone data augmentation algorithm for longitudinal continuous, binary and ordinal outcomes: a unifying approach Yongqiang Tang et.al. 2512.06621 null
2025-12-12 Less Is More for Multi-Step Logical Reasoning of LLM Generalisation Under Rule Removal, Paraphrasing, and Compression Qiming Bao et.al. 2512.06393 null
2025-12-06 DaGRPO: Rectifying Gradient Conflict in Reasoning via Distinctiveness-Aware Group Relative Policy Optimization Xuan Xie et.al. 2512.06337 null
2025-12-10 Stronger is not better: Better Augmentations in Contrastive Learning for Medical Image Segmentation Azeez Idris et.al. 2512.05992 null
2025-12-05 LeAD-M3D: Leveraging Asymmetric Distillation for Real-time Monocular 3D Detection Johannes Meier et.al. 2512.05663 null
2025-12-05 Matching Ranks Over Probability Yields Truly Deep Safety Alignment Jason Vega et.al. 2512.05518 null
2025-12-04 The Erosion of LLM Signatures: Can We Still Distinguish Human and LLM-Generated Scientific Ideas After Iterative Paraphrasing? Sadat Shahriar et.al. 2512.05311 null
2025-12-04 Uncertainty-Aware Data-Efficient AI: An Information-Theoretic Perspective Osvaldo Simeone et.al. 2512.05267 null
2025-12-04 Multi-Loss Learning for Speech Emotion Recognition with Energy-Adaptive Mixup and Frame-Level Attention Cong Wang et.al. 2512.04551 null
2025-12-04 EvoEdit: Lifelong Free-Text Knowledge Editing through Latent Perturbation Augmentation and Knowledge-driven Parameter Fusion Pengfei Cao et.al. 2512.04545 null
2025-12-03 Text-Only Training for Image Captioning with Retrieval Augmentation and Modality Gap Correction Rui Fonseca et.al. 2512.04309 null
2025-12-03 Studying Various Activation Functions and Non-IID Data for Machine Learning Model Robustness Long Dang et.al. 2512.04264 null
2025-12-03 SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows Qinyu Zhao et.al. 2512.04084 null
2025-12-03 SELF: A Robust Singular Value and Eigenvalue Approach for LLM Fingerprinting Hanxiu Zhang et.al. 2512.03620 null
2025-12-03 MAGE-ID: A Multimodal Generative Framework for Intrusion Detection Systems Mahdi Arab Loodaricheh et.al. 2512.03375 null
2025-12-02 OmniPerson: Unified Identity-Preserving Pedestrian Generation Changxiao Ma et.al. 2512.02554 null
2025-12-02 VibOmni: Towards Scalable Bone-conduction Speech Enhancement on Earables Lixing He et.al. 2512.02515 null
2025-12-02 VACoT: Rethinking Visual Data Augmentation with VLMs Zhengzhuo Xu et.al. 2512.02361 null
2025-12-02 Training Dynamics of Learning 3D-Rotational Equivariance Max W. Shen et.al. 2512.02303 null
2025-12-01 Feature Selection Empowered BERT for Detection of Hate Speech with Vocabulary Augmentation Pritish N. Desai et.al. 2512.02141 null
2025-12-01 StyleYourSmile: Cross-Domain Face Retargeting Without Paired Multi-Style Data Avirup Dey et.al. 2512.01895 null
2025-12-01 GRASP: Guided Residual Adapters with Sample-wise Partitioning Felix Nützel et.al. 2512.01675 null
2025-12-01 Neural Networks for Predicting Permeability Tensors of 2D Porous Media: Comparison of Convolution- and Transformer-based Architectures Sigurd Vargdal et.al. 2512.01517 null
2025-12-01 ChronosObserver: Taming 4D World with Hyperspace Diffusion Sampling Qisen Wang et.al. 2512.01481 null
2025-12-01 $\mathbf{M^3A}$ Policy: Mutable Material Manipulation Augmentation Policy through Photometric Re-rendering Jiayi Li et.al. 2512.01446 null
2025-12-01 MEGConformer: Conformer-Based MEG Decoder for Robust Speech and Phoneme Classification Xabier de Zuazo et.al. 2512.01443 null
2025-12-01 Teaching by Failure: Counter-Example-Driven Curricula for Transformer Self-Improvement Harshil Vejendla et.al. 2512.01187 null
2025-11-30 MS-PPO: Morphological-Symmetry-Equivariant Policy for Legged Robot Locomotion Sizhe Wei et.al. 2512.00727 null
2025-11-30 Graph Data Augmentation with Contrastive Learning on Covariate Distribution Shift Fanlong Zeng et.al. 2512.00716 null
2025-12-03 XAI-Driven Skin Disease Classification: Leveraging GANs to Augment ResNet-50 Performance Kim Gerard A. Villanueva et.al. 2512.00626 null
2025-11-29 Explainable Multi-Modal Deep Learning for Automatic Detection of Lung Diseases from Respiratory Audio Signals S M Asiful Islam Saky et.al. 2512.00563 null
2025-11-28 SD-CGAN: Conditional Sinkhorn Divergence GAN for DDoS Anomaly Detection in IoT Networks Henry Onyeka et.al. 2512.00251 null
2025-11-28 Mesh Augmentation of LoRaWAN-based IoT Networks Ram Ramanathan et.al. 2512.00161 null
2025-11-28 Local and Global Context-and-Object-part-Aware Superpixel-based Data Augmentation for Deep Visual Recognition Fadi Dornaika et.al. 2512.00130 null
2025-12-16 ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts Hang Yu et.al. 2511.23442 null
2025-11-28 Robust In-the-Wild Exercise Recognition from a Single Wearable: Data-Side Fusion, Sensor Rotation, and Feature Engineering Hoang Khang Phan et.al. 2511.23173 null
2025-11-28 A General Bayesian Nonparametric Approach for Estimating Population-Level and Conditional Causal Effects Yongseok Hur et.al. 2511.23085 null
2025-11-28 RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video Haiyang Mei et.al. 2511.22950 null
2025-11-27 Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield Dongyang Liu et.al. 2511.22677 null
2025-11-27 Selecting User Histories to Generate LLM Users for Cold-Start Item Recommendation Nachiket Subbaraman et.al. 2511.21989 null
2025-11-26 Deep Learning Architectures for Code-Modulated Visual Evoked Potentials Detection Kiran Nair et.al. 2511.21940 null
2025-11-26 A Comparative Study of LLM Prompting and Fine-Tuning for Cross-genre Authorship Attribution on Chinese Lyrics Yuxin Li et.al. 2511.21930 null
2025-11-26 Advancing Marine Bioacoustics with Deep Generative Models: A Hybrid Augmentation Strategy for Southern Resident Killer Whale Detection Bruno Padovese et.al. 2511.21872 null
2025-11-26 Revolutionizing Glioma Segmentation & Grading Using 3D MRI - Guided Hybrid Deep Learning Models Pandiyaraju V et.al. 2511.21673 null
2025-11-29 Deep Learning-Based Multiclass Classification of Oral Lesions with Stratified Augmentation Joy Naoum et.al. 2511.21582 null
2025-11-26 Shift-Equivariant Complex-Valued Convolutional Neural Networks Quentin Gabot et.al. 2511.21250 null
2025-11-26 Dynamic Stratified Contrastive Learning with Upstream Augmentation for MILP Branching Tongkai Lu et.al. 2511.21107 null
2025-11-26 A Probabilistic Framework for Temporal Distribution Generalization in Industry-Scale Recommender Systems Yuxuan Zhu et.al. 2511.21032 null
2025-11-26 FANoise: Singular Value-Adaptive Noise Modulation for Robust Multimodal Representation Learning Jiaoyang Li et.al. 2511.20997 null
2025-11-25 Winning with Less for Low Resource Languages: Advantage of Cross-Lingual English_Persian Argument Mining Model over LLM Augmentation Ali Jahan et.al. 2511.20872 null
2025-11-25 Bridging the Language Gap: Synthetic Voice Diversity via Latent Mixup for Equitable Speech Recognition Wesley Bian et.al. 2511.20534 null
2025-11-25 Data Augmentation Techniques to Reverse-Engineer Neural Network Weights from Input-Output Queries Alexander Beiser et.al. 2511.20312 null
2025-11-25 Robust 3D Brain MRI Inpainting with Random Masking Augmentation Juexin Zhang et.al. 2511.20202 null
2025-11-25 SEDA: A Self-Adapted Entity-Centric Data Augmentation for Boosting Gird-based Discontinuous NER Models Wen-Fang Su et.al. 2511.20143 null
2025-11-25 BERT-APC: A Reference-free Framework for Automatic Pitch Correction via Musical Context Inference Sungjae Kim et.al. 2511.20006 null
2025-11-25 Rethinking Semi-Supervised Node Classification with Self-Supervised Graph Clustering Songbo Wang et.al. 2511.19976 null
2025-11-26 TiCT: A Synthetically Pre-Trained Foundation Model for Time Series Classification Chin-Chia Michael Yeh et.al. 2511.19694 null
2025-11-24 Blinking Beyond EAR: A Stable Eyelid Angle Metric for Driver Drowsiness Detection and Data Augmentation Mathis Wolter et.al. 2511.19519 null
2025-11-22 A Multi-Stage Deep Learning Framework with PKCP-MixUp Augmentation for Pediatric Liver Tumor Diagnosis Using Multi-Phase Contrast-Enhanced CT Wanqi Wang et.al. 2511.19478 null
2025-11-24 BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation Rachit Saluja et.al. 2511.19394 null
2025-11-24 Tiny-TSM: Efficiently Training a Lightweight SOTA Time Series Foundation Model Felix Birkel et.al. 2511.19272 null
2025-11-24 Experimental insights into data augmentation techniques for deep learning-based multimode fiber imaging: limitations and success Jawaria Maqbool et.al. 2511.19072 null
2025-11-24 Skeletons Matter: Dynamic Data Augmentation for Text-to-Query Yuchen Ji et.al. 2511.18934 null
2025-11-24 Multidimensional Music Aesthetic Evaluation via Semantically Consistent C-Mixup Augmentation Shuyang Liu et.al. 2511.18869 null
2025-11-24 Higgs Production Classifier using Weak Supervision Kai-Feng Chen et.al. 2511.18726 null
2025-11-24 Data Augmentation Strategies for Robust Lane Marking Detection Flora Lian et.al. 2511.18668 null
2025-11-23 Re(Visiting) Time Series Foundation Models in Finance Eghbal Rahimikia et.al. 2511.18578 null
2025-11-23 Stro-VIGRU: Defining the Vision Recurrent-Based Baseline Model for Brain Stroke Classification Subhajeet Das et.al. 2511.18316 null
2025-11-23 MultiDiffNet: A Multi-Objective Diffusion Framework for Generalizable Brain Decoding Mengchun Zhang et.al. 2511.18294 null
2025-11-22 Enhancing Large Language Models for Automated Homework Assessment in Undergraduate Circuit Analysis Liangliang Chen et.al. 2511.18221 null
2025-11-22 Generating Synthetic Human Blastocyst Images for In-Vitro Fertilization Blastocyst Grading Pavan Narahari et.al. 2511.18204 null
2025-11-22 LocaGen: Low-Overhead Indoor Localization Through Spatial Augmentation Abdelrahman Abdelmotlb et.al. 2511.18158 null
2025-11-22 Diverse Instance Generation via Diffusion Models for Enhanced Few-Shot Object Detection in Remote Sensing Images Yanxing Liu et.al. 2511.18031 null
2025-11-21 Group Equivariant Convolutional Networks for Pathloss Estimation Ziyue Yang et.al. 2511.17841 null
2025-11-21 Addressing A Posteriori Performance Degradation in Neural Network Subgrid Stress Models Andy Wu et.al. 2511.17475 null
2025-11-21 ATAC: Augmentation-Based Test-Time Adversarial Correction for CLIP Linxiang Su et.al. 2511.17362 null
2025-11-20 Motion Transfer-Enhanced StyleGAN for Generating Diverse Macaque Facial Expressions Takuya Igaue et.al. 2511.16711 null
2025-11-20 Boosting Predictive Performance on Tabular Data through Data Augmentation with Latent-Space Flow-Based Diffusion Md. Tawfique Ihsan et.al. 2511.16571 null
2025-11-20 Contrastive vision-language learning with paraphrasing and negation Kwun Ho Ngan et.al. 2511.16527 null
2025-11-20 Physics-Informed Machine Learning for Efficient Sim-to-Real Data Augmentation in Micro-Object Pose Estimation Zongcai Tan et.al. 2511.16494 null
2025-11-25 Prediction of atomic H adsorption energies in metalloid doped MSSe (M = Mo/W) Janus layers: A combined DFT and machine learning study G. Tejaswini et.al. 2511.16263 null
2025-11-20 LLMs-based Augmentation for Domain Adaptation in Long-tailed Food Datasets Qing Wang et.al. 2511.16037 null
2025-11-26 KRAL: Knowledge and Reasoning Augmented Learning for LLM-assisted Clinical Antimicrobial Therapy Zhe Li et.al. 2511.15974 null
2025-11-19 Learning from Imperfect Labels: A Physics-Aware Neural Operator with Application to DAS Data Denoising Yang Cui et.al. 2511.15638 null
2025-11-19 A Hybrid CNN-ViT-GNN Framework with GAN-Based Augmentation for Intelligent Weed Detection in Precision Agriculture Pandiyaraju V et.al. 2511.15535 null
2025-11-19 Advancing Identification method of Gamma-Ray Bursts with Data and Feature Enhancement Peng Zhang et.al. 2511.15470 null
2025-11-20 Selective Mixup for Debiasing Question Selection in Computerized Adaptive Testing Mi Tian et.al. 2511.15241 null
2025-11-19 Data-driven Prediction of Species-Specific Plant Responses to Spectral-Shifting Films from Leaf Phenotypic and Photosynthetic Traits Jun Hyeun Kang et.al. 2511.15173 null
2025-11-19 Deep Learning Assisted Prediction of Electrochemical Lithiation State in Spinel Lithium Titanium Oxide Thin Films Devin Chugh et.al. 2511.15109 null
2025-11-18 Structured Contrastive Learning for Interpretable Latent Representations Zhengyang Shen et.al. 2511.14920 null
2025-11-18 Tell Me: An LLM-powered Mental Well-being Assistant with RAG, Synthetic Dialogue Generation, and Agentic Planning Trishala Jayesh Ahalpara et.al. 2511.14445 null
2025-11-18 Continuous Vision-Language-Action Co-Learning with Semantic-Physical Alignment for Behavioral Cloning Xiuxiu Qi et.al. 2511.14396 null
2025-11-18 H-LDM: Hierarchical Latent Diffusion Models for Controllable and Interpretable PCG Synthesis from Clinical Metadata Chenyang Xu et.al. 2511.14312 null
2025-11-18 A Comprehensive Study of Implicit and Explicit Biases in Large Language Models Fatima Kazi et.al. 2511.14153 null
2025-11-17 Segment Anything Across Shots: A Method and Benchmark Hengrui Hu et.al. 2511.13715 null
2025-11-17 MMWSTM-ADRAN+: A Novel Hybrid Deep Learning Architecture for Enhanced Climate Time Series Forecasting and Extreme Event Prediction Shaheen Mohammed Saleh Ahmed et.al. 2511.13419 null
2025-11-17 A Lightweight 3D Anomaly Detection Method with Rotationally Invariant Features Hanzhe Liang et.al. 2511.13115 null
2025-11-17 Learning from the Undesirable: Robust Adaptation of Language Models without Forgetting Yunhun Nam et.al. 2511.13052 null
2025-11-17 Medal S: Spatio-Textual Prompt Model for Medical Segmentation Pengcheng Shi et.al. 2511.13001 null
2025-11-17 CalibrateMix: Guided-Mixup Calibration of Image Semi-Supervised Models Mehrab Mustafy Rahman et.al. 2511.12964 null
2025-11-16 Evaluating Autoformalization Robustness via Semantically Similar Paraphrasing Hayden Moore et.al. 2511.12784 null
2025-11-16 Mitigating Length Bias in RLHF through a Causal Lens Hyeonji Kim et.al. 2511.12573 null
2025-11-16 HiGFA: Hierarchical Guidance for Fine-grained Data Augmentation with Diffusion Models Zhiguang Lu et.al. 2511.12547 null
2025-11-16 Designing-with More-than-Human Through Human Augmentation Botao ‘Amber’ Hu et.al. 2511.12533 null
2025-11-16 Task-Aware Retrieval Augmentation for Dynamic Recommendation Zhen Tao et.al. 2511.12495 null
2025-11-15 Leveraging Quantum-Based Architectures for Robust Diagnostics Shabnam Sodagari et.al. 2511.12386 null
2025-11-15 Rethinking Bias in Generative Data Augmentation for Medical AI: a Frequency Recalibration Method Chi Liu et.al. 2511.12301 null
2025-11-15 Understanding InfoNCE: Transition Probability Matrix Induced Feature Clustering Ge Cheng et.al. 2511.12180 null
2025-11-15 FIA-Edit: Frequency-Interactive Attention for Efficient and High-Fidelity Inversion-Free Text-Guided Image Editing Kaixiang Yang et.al. 2511.12151 null
2025-11-15 Breaking the Modality Wall: Time-step Mixup for Efficient Spiking Knowledge Transfer from Static to Event Domain Yuqi Xie et.al. 2511.12150 null
2025-11-15 Did Models Sufficient Learn? Attribution-Guided Training via Subset-Selected Counterfactual Augmentation Yannan Chen et.al. 2511.12100 null
2025-11-15 Treatment Stitching with Schrödinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment Strategies Dong-Hee Shin et.al. 2511.12075 null
2025-11-15 Informed Bootstrap Augmentation Improves EEG Decoding Woojae Jeong et.al. 2511.12073 null
2025-11-14 Augmenting The Weather: A Hybrid Counterfactual-SMOTE Algorithm for Improving Crop Growth Prediction When Climate Changes Mohammed Temraz et.al. 2511.11945 null
2025-11-14 CLUE: Controllable Latent space of Unprompted Embeddings for Diversity Management in Text-to-Image Synthesis Keunwoo Park et.al. 2511.10993 null
2025-11-14 Automated Analysis of Learning Outcomes and Exam Questions Based on Bloom’s Taxonomy Ramya Kumar et.al. 2511.10903 null
2025-11-12 Graph Neural Field with Spatial-Correlation Augmentation for HRTF Personalization De Hu et.al. 2511.10697 null
2025-11-13 Panda: Test-Time Adaptation with Negative Data Augmentation Ruxi Deng et.al. 2511.10481 null
2025-11-13 Causal Model-Based Reinforcement Learning for Sample-Efficient IoT Channel Access Aswin Arun et.al. 2511.10291 null
2025-11-13 MTP: Exploring Multimodal Urban Traffic Profiling with Modality Augmentation and Spectrum Fusion Haolong Xiang et.al. 2511.10218 null
2025-11-14 Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL Qifeng Cai et.al. 2511.10192 null
2025-11-13 ELYADATA & LIA at NADI 2025: ASR and ADI Subtasks Haroun Elleuch et.al. 2511.10090 null
2025-11-13 Opinion: Towards Unified Expressive Policy Optimization for Robust Robot Learning Haidong Huang et.al. 2511.10087 null
2025-11-13 Learning phase diversity for solving ill-posed inverse problems in imaging Jasleen Birdi et.al. 2511.09952 null
2025-11-13 A Study on Enhancing the Generalization Ability of Visuomotor Policies via Data Augmentation Hanwen Wang et.al. 2511.09932 null
2025-11-13 Simulating Distribution Dynamics: Liquid Temporal Feature Evolution for Single-Domain Generalized Object Detection Zihao Zhang et.al. 2511.09909 null
2025-11-12 PANDA - Patch And Distribution-Aware Augmentation for Long-Tailed Exemplar-Free Continual Learning Siddeshwar Raghavan et.al. 2511.09791 null
2025-11-12 LLM-Guided Dynamic-UMAP for Personalized Federated Graph Learning Sai Puppala et.al. 2511.09438 null
2025-11-12 AuthSig: Safeguarding Scanned Signatures Against Unauthorized Reuse in Paperless Workflows RuiQiang Zhang et.al. 2511.08967 null
2025-11-11 3D-TDA – Topological feature extraction from 3D images for Alzheimer’s disease classification Faisal Ahmed et.al. 2511.08663 null
2025-11-14 Bot Meets Shortcut: How Can LLMs Aid in Handling Unknown Invariance OOD Scenarios? Shiyan Zheng et.al. 2511.08455 null
2025-11-12 SASG-DA: Sparse-Aware Semantic-Guided Diffusion Augmentation For Myoelectric Gesture Recognition Chen Liu et.al. 2511.08344 null
2025-11-11 Learning Omnidirectional Locomotion for a Salamander-Like Quadruped Robot Zhiang Liu et.al. 2511.08299 null
2025-11-11 Forgetting Alternation and Blossoms: A New Framework for Fast Matching Augmentation and Its Applications to Sequential/Distributed/Streaming Computation Taisuke Izumi et.al. 2511.08210 null
2025-11-11 I2E: Real-Time Image-to-Event Conversion for High-Performance Spiking Neural Networks Ruichen Ma et.al. 2511.08065 null
2025-11-11 From LLMs to Agents: A Comparative Evaluation of LLMs and LLM-based Agents in Security Patch Detection Junxiao Han et.al. 2511.08060 null
2025-11-11 Computational Blueprints: Generating Isomorphic Mathematics Problems with Large Language Models Jeong-Hoon Kim et.al. 2511.07932 null
2025-11-11 IBMA: An Imputation-Based Mixup Augmentation Using Self-Supervised Learning for Time Series Data Dang Nha Nguyen et.al. 2511.07930 null
2025-11-10 ACE-ICD: Acronym Expansion As Data Augmentation For Automated ICD Coding Tuan-Dung Le et.al. 2511.07311 null
2025-11-10 Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation Yuxuan Zhou et.al. 2511.07051 null
2025-11-10 Evaluating Large Language Models for Anxiety, Depression, and Stress Detection: Insights into Prompting Strategies and Synthetic Data Mihael Arcan et.al. 2511.07044 null
2025-11-10 Hybrid Autoencoders for Tabular Data: Leveraging Model-Based Augmentation in Low-Label Settings Erel Naor et.al. 2511.06961 null
2025-11-10 GNN-Enabled Robust Hybrid Beamforming with Score-Based CSI Generation and Denoising Yuhang Li et.al. 2511.06663 null
2025-11-10 On the Potential of Digital Twins for Distribution System State Estimation with Randomly Missing Data in Heterogeneous Measurements Ying Zhang et.al. 2511.06583 null
2025-11-09 Adaptive PID Control for Robotic Systems via Hierarchical Meta-Learning and Reinforcement Learning with Physics-Based Data Augmentation JiaHao Wu et.al. 2511.06500 null
2025-11-09 LLM-Driven Completeness and Consistency Evaluation for Cultural Heritage Data Augmentation in Cross-Modal Retrieval Jian Zhang et.al. 2511.06268 null
2025-11-09 Analyzing and Mitigating Negation Artifacts using Data Augmentation for Improving ELECTRA-Small Model Accuracy Mojtaba Noghabaei et.al. 2511.06234 null
2025-11-08 Reperio-rPPG: Relational Temporal Graph Neural Networks for Periodicity Learning in Remote Physiological Measurement Ba-Thinh Nguyen et.al. 2511.05946 null
2025-11-07 Persian Musical Instruments Classification Using Polyphonic Data Augmentation Diba Hadi Esfangereh et.al. 2511.05717 null
2025-11-07 Robust Neural Audio Fingerprinting using Music Foundation Models Shubhr Singh et.al. 2511.05399 null
2025-11-07 Entropy-Rank Ratio: A Novel Entropy-Based Perspective for DNA Complexity and Classification Emmanuel Pio Pastore et.al. 2511.05300 null
2025-11-07 Embedding-Space Data Augmentation to Prevent Membership Inference Attacks in Clinical Time Series Forecasting Marius Fracarolli et.al. 2511.05289 null
2025-11-07 Less Is More: Generating Time Series with LLaMA-Style Autoregression in Simple Factorized Latent Spaces Siyuan Li et.al. 2511.04973 null
2025-11-06 PromptSep: Generative Audio Separation via Multimodal Prompting Yutong Wen et.al. 2511.04623 null
2025-11-06 Comparative Study of CNN Architectures for Binary Classification of Horses and Motorcycles in the VOC 2008 Dataset Muhammad Annas Shaikh et.al. 2511.04344 null
2025-11-06 Black-Box Guardrail Reverse-engineering Attack Hongwei Yao et.al. 2511.04215 null
2025-11-06 MedDChest: A Content-Aware Multimodal Foundational Vision Model for Thoracic Imaging Mahmoud Soliman et.al. 2511.04016 null
2025-11-05 Desert Waste Detection and Classification Using Data-Based and Model-Based Enhanced YOLOv12 DL Model Abdulmumin Sa’ad et.al. 2511.03888 null
2025-11-05 A Lightweight 3D-CNN for Event-Based Human Action Recognition with Privacy-Preserving Potential Mehdi Sefidgar Dilmaghani et.al. 2511.03665 null
2025-11-05 Towards Transparent Stance Detection: A Zero-Shot Approach Using Implicit and Explicit Interpretability Apoorva Upadhyaya et.al. 2511.03635 null
2025-11-05 The Bradley-Terry Stochastic Block Model Lapo Santi et.al. 2511.03467 null
2025-11-05 Knowledge-Augmented Question Error Correction for Chinese Question Answer System with QuestionRAG Longpeng Qiu et.al. 2511.03410 null
2025-11-05 Overcoming the Generalization Limits of SLM Finetuning for Shape-Based Extraction of Datatype and Object Properties Célian Ringwald et.al. 2511.03407 null
2025-11-05 LFC-DA: Logical Formula-Controlled Data Augmentation for Enhanced Logical Reasoning Shenghao Li et.al. 2511.03372 null
2025-11-05 Decoupling Augmentation Bias in Prompt Learning for Vision-Language Models Gahyeon Kim et.al. 2511.03367 null
2025-11-05 An Augmentation Overlap Theory of Contrastive Learning Qi Zhang et.al. 2511.03114 null
2025-11-04 Generative Hints Andy Dimnaku et.al. 2511.02933 null
2025-11-04 IllumFlow: Illumination-Adaptive Low-Light Enhancement via Conditional Rectified Flow and Retinex Decomposition Wenyang Wei et.al. 2511.02411 null
2025-10-29 An Experimental Comparison of Alternative Techniques for Event-Log Augmentation Alessandro Padella et.al. 2511.01896 null
2025-11-03 DINO-MX: A Modular & Flexible Framework for Self-Supervised Learning Mahmut Selman Gokmen et.al. 2511.01610 null
2025-11-03 Driving scenario generation and evaluation using a structured layer representation and foundational models Arthur Hubert et.al. 2511.01541 null
2025-11-03 Difficulty-Controllable Cloze Question Distractor Generation Seokhoon Kang et.al. 2511.01526 null
2025-11-03 Conditional Diffusion Model-Enabled Scenario-Specific Neural Receivers for Superimposed Pilot Schemes Xingyu Zhou et.al. 2511.01173 null
2025-11-02 A Distributed Plug-and-Play MCMC Algorithm for High-Dimensional Inverse Problems Maxime Bouton et.al. 2511.00870 null
2025-11-07 Med-Banana-50K: A Cross-modality Large-Scale Dataset for Text-guided Medical Image Editing Zhihui Chen et.al. 2511.00801 null
2025-11-01 Exploring and Mitigating Gender Bias in Encoder-Based Transformer Models Ariyan Hossain et.al. 2511.00519 null
2025-11-04 Simple and Behavior-Driven Augmentation for Recommendation with Rich Collaborative Signals Doyun Choi et.al. 2511.00436 null
2025-10-31 Casing Collar Identification using AlexNet-based Neural Networks for Depth Measurement in Oil and Gas Wells Siyu Xiao et.al. 2511.00129 null
2025-10-26 Mutual Information guided Visual Contrastive Learning Hanyang Chen et.al. 2511.00028 null
2025-10-31 Effect of Domain Generalization Techniques in Low Resource Systems Mahi Aminu et.al. 2510.27512 null
2025-10-31 FedSM: Robust Semantics-Guided Feature Mixup for Bias Reduction in Federated Learning with Long-Tail Data Jingrui Zhang et.al. 2510.27240 null
2025-10-31 A Survey on Generative Recommendation: Data, Model, and Tasks Min Hou et.al. 2510.27157 null
2025-10-30 Dataset Creation and Baseline Models for Sexism Detection in Hausa Fatima Adam Muhammad et.al. 2510.27038 null
2025-10-30 SYNAPSE-Net: A Unified Framework with Lesion-Aware Hierarchical Gating for Robust Segmentation of Heterogeneous Brain Lesions Md. Mehedi Hassan et.al. 2510.26961 null
2025-10-31 Offline Clustering of Preference Learning with Active-data Augmentation Jingyuan Liu et.al. 2510.26301 null
2025-10-29 An Analysis of Causal Effect Estimation using Outcome Invariant Data Augmentation Uzair Akbar et.al. 2510.25128 null
2025-10-15 Comparative Analysis of Data Augmentation for Clinical ECG Classification with STAR Nader Nemati et.al. 2510.24740 null
2025-10-28 SPARTA: Evaluating Reasoning Segmentation Robustness through Black-Box Adversarial Paraphrasing in Text Autoencoder Latent Space Viktoriia Zinkovich et.al. 2510.24446 null
2025-10-28 UtilGen: Utility-Centric Generative Data Augmentation with Dual-Level Task Adaptation Jiyu Guo et.al. 2510.24262 null
2025-10-27 Learning Linearity in Audio Consistency Autoencoders via Implicit Regularization Bernardo Torres et.al. 2510.23530 null
2025-10-27 DPGLA: Bridging the Gap between Synthetic and Real Data for Unsupervised Domain Adaptation in 3D LiDAR Semantic Segmentation Wanmeng Li et.al. 2510.23525 null
2025-10-27 MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding Xin Jin et.al. 2510.23479 null
2025-10-27 Treble10: A high-quality dataset for far-field speech recognition, dereverberation, and enhancement Sarabeth S. Mullins et.al. 2510.23141 null
2025-10-27 Tagging-Augmented Generation: Assisting Language Models in Finding Intricate Knowledge In Long Contexts Anwesan Pal et.al. 2510.22956 null
2025-10-26 ConMatFormer: A Multi-attention and Transformer Integrated ConvNext based Deep Learning Model for Enhanced Diabetic Foot Ulcer Classification Raihan Ahamed Rifat et.al. 2510.22743 null
2025-10-26 Learning Without Augmenting: Unsupervised Time Series Representation Learning via Frame Projections Berken Utku Demirel et.al. 2510.22655 null
2025-11-01 Knowledge-guided Continual Learning for Behavioral Analytics Systems Yasas Senarath et.al. 2510.22405 null
2025-10-24 AutoSciDACT: Automated Scientific Discovery through Contrastive Embedding and Hypothesis Testing Samuel Bright-Thonney et.al. 2510.21935 null
2025-10-24 Foundation Models in Dermatopathology: Skin Tissue Classification Riya Gupta et.al. 2510.21664 null
2025-10-24 TerraGen: A Unified Multi-Task Layout Generation Framework for Remote Sensing Data Augmentation Datao Tang et.al. 2510.21391 null
2025-10-24 Generative Federated Learning for Smart Prediction and Recommendation Applications Anwesha Mukherjee et.al. 2510.21183 null
2025-10-24 SafetyPairs: Isolating Safety Critical Image Features with Counterfactual Image Generation Alec Helbling et.al. 2510.21120 null
2025-10-24 Bridging Language Gaps with Adaptive RAG: Improving Indonesian Language Question Answering William Christian et.al. 2510.21068 null
2025-10-24 Deep learning-based automated damage detection in concrete structures using images from earthquake events Abdullah Turer et.al. 2510.21063 null
2025-10-23 Information Theoretic Learning for Diffusion Models with Warm Start Yirong Shen et.al. 2510.20903 null
2025-10-23 Analyticup E-commerce Product Search Competition Technical Report from Team Tredence_AICOE Rakshith R et.al. 2510.20674 null
2025-10-23 LM-mixup: Text Data Augmentation via Language Model based Mixup Zhijie Deng et.al. 2510.20449 null
2025-10-23 Neural Networks for Censored Expectile Regression Based on Data Augmentation Wei Cao et.al. 2510.20344 null
2025-10-25 DB-FGA-Net: Dual Backbone Frequency Gated Attention Network for Multi-Class Brain Tumor Classification with Grad-CAM Interpretability Saraf Anzum Shreya et.al. 2510.20299 null
2025-10-21 Cyberattack Detection in Critical Infrastructure and Supply Chains Smita Khapre et.al. 2510.19859 null
2025-10-22 Curvilinear Structure-preserving Unpaired Cross-domain Medical Image Translation Zihao Chen et.al. 2510.19679 null
2025-10-22 Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts Chen Li et.al. 2510.19487 null
2025-10-22 KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Augmentations and Constraints Kailin Jiang et.al. 2510.19316 null
2025-10-21 SO(3)-invariant PCA with application to molecular data Michael Fraiman et.al. 2510.18827 null
2025-10-21 Finding the Sweet Spot: Optimal Data Augmentation Ratio for Imbalanced Credit Scoring Using ADASYN Luis H. Chia et.al. 2510.18252 null
2025-10-20 Learning from Generalization Patterns: An Evaluation-Driven Approach to Enhanced Data Augmentation for Fine-Tuning Small Language Models Huan Song et.al. 2510.18143 null
2025-10-24 ViBED-Net: Video Based Engagement Detection Network Using Face-Aware and Scene-Aware Spatiotemporal Cues Prateek Gothwal et.al. 2510.18016 null
2025-10-15 CMIS-Net: A Cascaded Multi-Scale Individual Standardization Network for Backchannel Agreement Estimation Yuxuan Huang et.al. 2510.17855 null
2025-10-10 MAT-Agent: Adaptive Multi-Agent Training Optimization Jusheng Zhang et.al. 2510.17845 null
2025-10-20 PANER: A Paraphrase-Augmented Framework for Low-Resource Named Entity Recognition Nanda Kumar Rengarajan et.al. 2510.17720 null
2025-10-20 Handling Extreme Class Imbalance: Using GANs in Data Augmentation for Suicide Prediction Vaishnavi Visweswaraiah et.al. 2510.17661 null
2025-10-20 ZACH-ViT: A Zero-Token Vision Transformer with ShuffleStrides Data Augmentation for Robust Lung Ultrasound Classification Athanasios Angelakis et.al. 2510.17650 null
2025-10-24 RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation Yuquan Xue et.al. 2510.17640 null
2025-10-20 Fair and Interpretable Deepfake Detection in Videos Akihito Yoshii et.al. 2510.17264 null
2025-10-19 Addressing data scarcity in structural health monitoring through generative augmentation Sasan Farhadi et.al. 2510.16889 null
2025-10-19 Connecting Domains and Contrasting Samples: A Ladder for Domain Generalization Tianxin Wei et.al. 2510.16704 null
2025-10-19 Zero- and One-Shot Data Augmentation for Sentence-Level Dysarthric Speech Recognition in Constrained Scenarios Shiyao Wang et.al. 2510.16700 null
2025-10-18 ViT-Transformer: Self-attention mechanism based constitutive modeling for nonlinear heterogeneous materials Yijing Zhou et.al. 2510.16575 null
2025-10-18 ReviewGuard: Enhancing Deficient Peer Review Detection via LLM-Driven Data Augmentation Haoxuan Zhang et.al. 2510.16549 null
2025-10-17 Data-Centric AI for Tropical Agricultural Mapping: Challenges, Strategies and Scalable Solutions Mateus Pinto da Silva et.al. 2510.16207 null
2025-11-03 Bridging Symmetry and Robustness: On the Role of Equivariance in Enhancing Adversarial Robustness Longwei Wang et.al. 2510.16171 null
2025-10-17 Learning density ratios in causal inference using Bregman-Riesz regression Oliver J. Hines et.al. 2510.16127 null
2025-10-17 Data-Driven Analysis of Intersectional Bias in Image Classification: A Framework with Bias-Weighted Augmentation Farjana Yesmin et.al. 2510.16072 null
2025-10-13 Bolster Hallucination Detection via Prompt-Guided Data Augmentation Wenyun Li et.al. 2510.15977 null
2025-10-17 ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection Haowei Zhu et.al. 2510.15783 null
2025-10-17 SAMix: Calibrated and Accurate Continual Learning via Sphere-Adaptive Mixup and Neural Collapse Trung-Anh Dang et.al. 2510.15751 null
2025-10-17 Improving Micro-Expression Recognition with Phase-Aware Temporal Augmentation Vu Tram Anh Khuong et.al. 2510.15466 null
2025-10-17 Robust Optimization in Causal Models and G-Causal Normalizing Flows Gabriele Visentin et.al. 2510.15458 null
2025-10-17 Towards In-Situ Failure Assessment: Deep Learning on DIC Results for Laminated Composites Amir Mohammad Mirzaei et.al. 2510.15424 null
2025-10-16 Salient Concept-Aware Generative Data Augmentation Tianchen Zhao et.al. 2510.15194 null
2025-10-16 Automated Snippet-Alignment Data Augmentation for Code Translation Zhiming Zhang et.al. 2510.15004 null
2025-10-16 What is missing from this picture? Persistent homology and mixup barcodes as a means of investigating negative embedding space Himanshu Yadav et.al. 2510.14327 null
2025-10-15 Do Slides Help? Multi-modal Context for Automatic Transcription of Conference Talks Supriti Sinhamahapatra et.al. 2510.13979 null
2025-10-15 OralGPT: A Two-Stage Vision-Language Model for Oral Mucosal Disease Diagnosis and Description Jia Zhang et.al. 2510.13911 null
2025-10-15 A fully automated and scalable Parallel Data Augmentation for Low Resource Languages using Image and Text Analytics Prawaal Sharma et.al. 2510.13211 null
2025-10-15 LLM-Guided Synthetic Augmentation (LGSA) for Mitigating Bias in AI Systems Sai Suhruth Reddy Karri et.al. 2510.13202 null
2025-10-15 GRACE: Globally-Seeded Representation-Aware Cluster-Specific Evolution for Compiler Auto-Tuning Haolin Pan et.al. 2510.13176 null
2025-10-13 Data-Augmented Machine Learning for Predicting Biomass-Derived Hard Carbon Anode Performance in Sodium-Ion Batteries Gang Chen et.al. 2510.12833 null
2025-10-14 A Text-Image Fusion Method with Data Augmentation Capabilities for Referring Medical Image Segmentation Shurong Chai et.al. 2510.12482 null
2025-10-14 A Function Centric Perspective On Flat and Sharp Minima Israel Mason-Williams et.al. 2510.12451 null
2025-10-14 APGNet: Adaptive Prior-Guided for Underwater Camouflaged Object Detection Xinxin Huang et.al. 2510.12056 null
2025-10-13 MammoDINO: Anatomically Aware Self-Supervision for Mammographic Images Sicheng Zhou et.al. 2510.11883 null
2025-10-13 MS-Mix: Unveiling the Power of Mixup for Multimodal Sentiment Analysis Hongyu Zhu et.al. 2510.11579 null
2025-10-13 DiffStyleTS: Diffusion Model for Style Transfer in Time Series Mayank Nagda et.al. 2510.11335 null
2025-10-13 LightPneumoNet: Lightweight Pneumonia Classifier Neilansh Chauhan et.al. 2510.11232 null
2025-10-13 Mixup Helps Understanding Multimodal Video Better Xiaoyu Ma et.al. 2510.10986 null
2025-10-12 From Detection to Mitigation: Addressing Bias in Deep Learning Models for Chest X-Ray Diagnosis Clemence Mottez et.al. 2510.10822 null
2025-10-12 Proficiency-Aware Adaptation and Data Augmentation for Robust L2 ASR Ling Sun et.al. 2510.10738 null
2025-10-11 A Survey of Inductive Reasoning for Large Language Models Kedi Chen et.al. 2510.10182 null
2025-10-11 Diversity Augmentation of Dynamic User Preference Data for Boosting Personalized Text Summarizers Parthiv Chatterjee et.al. 2510.10082 null
2025-10-11 Improving Speech Emotion Recognition with Mutual Information Regularized Generative Model Chung-Soo Ahn et.al. 2510.10078 null
2025-10-10 Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs Xu Pan et.al. 2510.09885 null
2025-10-10 Accent-Invariant Automatic Speech Recognition via Saliency-Driven Spectrogram Masking Mohammad Hossein Sameti et.al. 2510.09528 null
2025-10-10 Cattle-CLIP: A Multimodal Framework for Cattle Behaviour Recognition Huimin Liu et.al. 2510.09203 null
2025-10-10 Augmented data and neural networks for robust epidemic forecasting: application to COVID-19 in Italy Giacomo Dimarco et.al. 2510.09192 null
2025-10-10 Generative Data Augmentation in Graph Contrastive Learning for Recommendation Yansong Wang et.al. 2510.09129 null
2025-10-14 Denoised Diffusion for Object-Focused Image Augmentation Nisha Pillai et.al. 2510.08955 null
2025-10-09 SAFER-AiD: Saccade-Assisted Foveal-peripheral vision Enhanced Reconstruction for Adversarial Defense Jiayang Liu et.al. 2510.08761 null
2025-10-08 Reproducible Evaluation of Data Augmentation and Loss Functions for Brain Tumor Segmentation Saumya B et.al. 2510.08617 null
2025-10-09 Hyperspectral data augmentation with transformer-based diffusion models Mattia Ferrari et.al. 2510.08363 null
2025-10-10 A Multimodal Depth-Aware Method For Embodied Reference Understanding Fevziye Irem Eyiokur et.al. 2510.08278 null
2025-10-09 Robust Canonicalization through Bootstrapped Data Re-Alignment Johann Schmidt et.al. 2510.08178 null
2025-10-09 Long-tailed Recognition with Model Rebalancing Jiaan Luo et.al. 2510.08177 null
2025-10-09 Self-Improving LLM Agents at Test-Time Emre Can Acikgoz et.al. 2510.07841 null
2025-10-09 Enhancing Visual Prompting through Expanded Transformation Space and Overfitting Mitigation Shohei Enomoto et.al. 2510.07823 null
2025-10-07 Enhancing Maritime Object Detection in Real-Time with RT-DETR and Data Augmentation Nader Nemati et.al. 2510.07346 null
2025-10-08 Vision-Language-Action Models for Robotics: A Review Towards Real-World Applications Kento Kawaharazuka et.al. 2510.07077 null
2025-10-08 Lung Infection Severity Prediction Using Transformers with Conditional TransMix Augmentation and Cross-Attention Bouthaina Slika et.al. 2510.06887 null
2025-10-08 PTEB: Towards Robust Text Embedding Evaluation via Stochastic Paraphrasing at Evaluation Time with LLMs Manuel Frank et.al. 2510.06730 null
2025-10-07 Data Factory with Minimal Human Effort Using VLMs Jiaojiao Ye et.al. 2510.05722 null
2025-10-07 Transfer Learning on Edge Connecting Probability Estimation under Graphon Model Yuyao Wang et.al. 2510.05527 null
2025-10-06 NASP-T: A Fuzzy Neuro-Symbolic Transformer for Logic-Constrained Aviation Safety Report Classification Fadi Al Machot et.al. 2510.05451 null
2025-09-30 CARE: Cognitive-reasoning Augmented Reinforcement for Emotional Support Conversation Jie Zhu et.al. 2510.05122 null
2025-10-06 How does the optimizer implicitly bias the model merging loss landscape? Chenxiang Zhang et.al. 2510.04686 null
2025-10-05 RAP: 3D Rasterization Augmented End-to-End Planning Lan Feng et.al. 2510.04333 null
2025-10-05 PABSA: Hybrid Framework for Persian Aspect-Based Sentiment Analysis Mehrzad Tareh et.al. 2510.04291 null
2025-10-05 Enhancing Fake News Video Detection via LLM-Driven Creative Process Simulation Yuyan Bu et.al. 2510.04024 null
2025-10-04 Exploring the Challenge and Value of Deep Learning in Automated Skin Disease Diagnosis Runhao Liu et.al. 2510.03869 null
2025-10-04 Cellular Learning: Scattered Data Regression in High Dimensions via Voronoi Cells Shankar Prasad Sastry et.al. 2510.03810 null
2025-10-09 From Moments to Models: Graphon Mixture-Aware Mixup and Contrastive Learning Ali Azizpour et.al. 2510.03690 null
2025-10-15 Diffusion-Classifier Synergy: Reward-Aligned Learning via Mutual Boosting Loop for FSCIL Ruitao Wu et.al. 2510.03608 null
2025-10-04 Exploring the Hierarchical Reasoning Model for Small Natural-Image Classification Without Augmentation Alexander V. Mantzaris et.al. 2510.03598 null
2025-10-09 How We Won BraTS-SSA 2025: Brain Tumor Segmentation in the Sub-Saharan African Population Using Segmentation-Aware Data Augmentation and Model Ensembling Claudia Takyi Ankomah et.al. 2510.03568 null
2025-10-03 InsideOut: An EfficientNetV2-S Based Deep Learning Framework for Robust Multi-Class Facial Emotion Recognition Ahsan Farabi et.al. 2510.03066 null
2025-10-03 Denoising and Augmentation: A Dual Use of Diffusion Model for Enhanced CSI Recovery Yupeng Li et.al. 2510.02744 null
2025-10-03 Hybrid-Collaborative Augmentation and Contrastive Sample Adaptive-Differential Awareness for Robust Attributed Graph Clustering Tianxiang Zhao et.al. 2510.02731 null
2025-10-03 Mind the Gap: Linguistic Divergence and Adaptation Strategies in Human-LLM Assistant vs. Human-Human Interactions Fulei Zhang et.al. 2510.02645 null
2025-10-02 Extreme value forecasting using relevance-based data augmentation with deep learning models Junru Hua et.al. 2510.02407 null
2025-10-02 Non-Asymptotic Analysis of Data Augmentation for Precision Matrix Estimation Lucas Morisset et.al. 2510.02119 null
2025-10-02 Mapping Historic Urban Footprints in France: Balancing Quality, Scalability and AI Techniques Walid Rabehi et.al. 2510.02097 null
2025-10-02 Explicit Discovery of Nonlinear Symmetries from Dynamic Data Lexiang Hu et.al. 2510.01855 null
2025-10-02 NGGAN: Noise Generation GAN Based on the Practical Measurement Dataset for Narrowband Powerline Communications Ying-Ren Chien et.al. 2510.01850 null
2025-10-01 RealClass: A Framework for Classroom Speech Simulation with Public Datasets and Game Engines Ahmed Adel Attia et.al. 2510.01462 null
2025-10-01 Diffusion Modeling of the Three-Dimensional Magnetic Field in the Sun’s Corona Daniel E. da Silva et.al. 2510.01441 null
2025-10-01 To Augment or Not to Augment? Diagnosing Distributional Symmetry Breaking Hannah Lawrence et.al. 2510.01349 null
2025-10-01 Towards Adversarial Training under Hyperspectral Images Weihua Zhang et.al. 2510.01014 null
2025-10-01 EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty Yuchen Tian et.al. 2510.00732 null
2025-10-01 Disentangling Foreground and Background for vision-Language Navigation via Online Augmentation Yunbo Xu et.al. 2510.00604 null
2025-10-01 SAGE-LD: Towards Scalable and Generalizable End-to-End Language Diarization via Simulated Data Augmentation Sangmin Lee et.al. 2510.00582 null
2025-10-01 On-the-Fly Data Augmentation via Gradient-Guided and Sample-Aware Influence Estimation Suorong Yang et.al. 2510.00434 null
2025-09-30 Subjective quality evaluation of personalized own voice reconstruction systems Mattes Ohlenbusch et.al. 2510.00256 null
2025-09-26 Deep Learning-Based Pneumonia Detection from Chest X-ray Images: A CNN Approach with Performance Analysis and Clinical Implications P K Dutta et.al. 2510.00035 null
2025-09-25 Learning Inter-Atomic Potentials without Explicit Equivariance Ahmed A. Elhag et.al. 2510.00027 null
2025-10-08 OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction Lujie Yang et.al. 2509.26633 null
2025-09-30 Source Separation for A Cappella Music Luca A. Lanzendörfer et.al. 2509.26580 null
2025-09-30 GastroViT: A Vision Transformer Based Ensemble Learning Approach for Gastrointestinal Disease Classification with Grad CAM & SHAP Visualization Sumaiya Tabassum et.al. 2509.26502 null
2025-10-14 Limited Preference Data? Learning Better Reward Model with Latent Space Synthesis Leitian Tao et.al. 2509.26074 null
2025-09-30 Geometric Learning of Canonical Parameterizations of $2D$-curves Ioana Ciuclea et.al. 2509.26070 null
2025-09-30 ASR Under Noise: Exploring Robustness for Sundanese and Javanese Salsabila Zahirah Pranida et.al. 2509.25878 null
2025-09-30 MIDAS: Misalignment-based Data Augmentation Strategy for Imbalanced Multimodal Learning Seong-Hyeon Hwang et.al. 2509.25831 null
2025-09-30 Less is More: Towards Simple Graph Contrastive Learning Yanan Zhao et.al. 2509.25742 null
2025-10-03 YOLO-Based Defect Detection for Metal Sheets Po-Heng Chou et.al. 2509.25659 null
2025-09-29 On-the-Fly Data Augmentation for Brain Tumor Segmentation Ishika Jain et.al. 2509.24973 null
2025-09-29 Adaptive Canonicalization with Application to Invariant Anisotropic Geometric Networks Ya-Wei Eileen Lin et.al. 2509.24886 null
2025-09-29 Intelligent Optimization of Wireless Access Point Deployment for Communication-Based Train Control Systems Using Deep Reinforcement Learning Kunyu Wu et.al. 2509.24819 null
2025-09-29 Fidelity-Aware Data Composition for Robust Robot Generalization Zizhao Tong et.al. 2509.24797 null
2025-09-29 Toward a Vision-Language Foundation Model for Medical Data: Multimodal Dataset and Benchmarks for Vietnamese PET/CT Report Generation Huu Tien Nguyen et.al. 2509.24739 null
2025-09-29 Circuit-Aware Reward Training: A Mechanistic Framework for Longtail Robustness in RLHF Jing Liu et.al. 2509.24713 null
2025-09-29 LEAF: A Robust Expert-Based Framework for Few-Shot Continual Event Detection Bao-Ngoc Dao et.al. 2509.24547 null
2025-09-29 Beyond Repetition: Text Simplification and Curriculum Learning for Data-Constrained Pretraining Matthew Theodore Roque et.al. 2509.24356 null
2025-09-29 Cycle Diffusion Model for Counterfactual Image Generation Fangrui Huang et.al. 2509.24267 null
2025-09-28 Clebsch-Gordan Transformer: Fast and Global Equivariant Attention Owen Lewis Howell et.al. 2509.24093 null
2025-09-28 DexFlyWheel: A Scalable and Self-improving Data Generation Framework for Dexterous Manipulation Kefei Zhu et.al. 2509.23829 null
2025-09-30 VioPTT: Violin Technique-Aware Transcription from Synthetic Data Augmentation Ting-Kang Wang et.al. 2509.23759 null
2025-09-28 A Hierarchical Structure-Enhanced Personalized Recommendation Model for Traditional Chinese Medicine Formulas Based on KG Diffusion Guidance ChaoBo Zhang et.al. 2509.23560 null
2025-09-26 FishAI 2.0: Marine Fish Image Classification with Multi-modal Few-shot Learning Chenghan Yang et.al. 2509.22930 null
2025-09-24 Achieving Fair Skin Lesion Detection through Skin Tone Normalization and Channel Pruning Zihan Wei et.al. 2509.22712 null
2025-09-26 FreqDebias: Towards Generalizable Deepfake Detection via Consistency-Driven Frequency Debiasing Hossein Kashiani et.al. 2509.22412 null
2025-09-26 Deep Learning-Based Cross-Anatomy CT Synthesis Using Adapted nnResU-Net with Anatomical Feature Prioritized Loss Javier Sequeiro González et.al. 2509.22394 null
2025-09-26 Cross-Dialect Bird Species Recognition with Dialect-Calibrated Augmentation Jiani Ding et.al. 2509.22317 null
2025-09-26 Debiasing Large Language Models in Thai Political Stance Detection via Counterfactual Calibration Kasidit Sermsri et.al. 2509.21946 null
2025-09-25 Expert-guided Clinical Text Augmentation via Query-Based Model Collaboration Dongkyu Cho et.al. 2509.21530 null
2025-09-25 Contrastive Mutual Information Learning: Toward Robust Representations without Positive-Pair Augmentations Micha Livne et.al. 2509.21511 null
2025-09-25 Filtering with Confidence: When Data Augmentation Meets Conformal Prediction Zixuan Wu et.al. 2509.21479 null
2025-09-25 Dense Semantic Matching with VGGT Prior Songlin Yang et.al. 2509.21263 null
2025-09-25 From Physics to Machine Learning and Back: Part II - Learning and Observational Bias in PHM Olga Fink et.al. 2509.21207 null
2025-09-25 An Improved Quantum Software Challenges Classification Approach using Transfer Learning and Explainable AI Nek Dil Khan et.al. 2509.21068 null
2025-09-25 A Real-Time On-Device Defect Detection Framework for Laser Power-Meter Sensors via Unsupervised Learning Dongqi Zheng et.al. 2509.20946 null
2025-09-25 LiLAW: Lightweight Learnable Adaptive Weighting to Meta-Learn Sample Difficulty and Improve Noisy Training Abhishek Moturu et.al. 2509.20786 null
2025-09-25 Addressing Gradient Misalignment in Data-Augmented Training for Robust Speech Deepfake Detection Duc-Tuan Truong et.al. 2509.20682 null
2025-10-05 Region-of-Interest Augmentation for Mammography Classification under Patient-Level Cross-Validation Farbod Bigdeli et.al. 2509.20585 null
2025-10-02 Feature Dynamics as Implicit Data Augmentation: A Depth-Decomposed View on Deep Neural Network Generalization Tianyu Ruan et.al. 2509.20334 null
2025-09-24 Z-Scores: A Metric for Linguistically Assessing Disfluency Removal Maria Teleki et.al. 2509.20319 null
2025-09-24 Enhancing Requirement Traceability through Data Augmentation Using Large Language Models Jianzhang Zhang et.al. 2509.20149 null
2025-09-24 A Simple Data Augmentation Strategy for Text-in-Image Scientific VQA Belal Shoer et.al. 2509.20119 null
2025-09-25 Diffusion-Augmented Contrastive Learning: A Noise-Robust Encoder for Biosignal Representations Rami Zewail et.al. 2509.20048 null
2025-09-24 Rectified Decoupled Dataset Distillation: A Closer Look for Fair and Comprehensive Evaluation Xinhao Zhong et.al. 2509.19743 null
2025-09-23 Quantum Harmonic Analysis and the Structure in Data: Augmentation Monika Doerfler et.al. 2509.19474 null
2025-09-23 ROPA: Synthetic Robot Pose Generation for RGB-D Bimanual Data Augmentation Jason Chen et.al. 2509.19454 null
2025-09-23 Improving Outdoor Multi-cell Fingerprinting-based Positioning via Mobile Data Augmentation Tony Chahoud et.al. 2509.19405 null
2025-09-16 Fine-Grained AI Model Caching and Downloading With Coordinated Multipoint Broadcasting in Multi-Cell Edge Networks Yang Fu et.al. 2509.19341 null
2025-09-23 Generative data augmentation for biliary tract detection on intraoperative images Cristina Iacono et.al. 2509.18958 null
2025-09-23 PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving Chengran Yuan et.al. 2509.18609 null
2025-09-24 SynSonic: Augmenting Sound Event Detection through Text-to-Audio Diffusion ControlNet and Effective Sample Filtering Jiarui Hai et.al. 2509.18603 null
2025-09-23 Efficient Breast and Ovarian Cancer Classification via ViT-Based Preprocessing and Transfer Learning Richa Rawat et.al. 2509.18553 null
2025-09-23 Reverse-Complement Consistency for DNA Language Models Mingqian Ma et.al. 2509.18529 null
2025-09-21 Automatic Classification of Magnetic Chirality of Solar Filaments from H-Alpha Observations Alexis Chalmers et.al. 2509.18214 null
2025-09-22 Intra-Cluster Mixup: An Effective Data Augmentation Technique for Complementary-Label Learning Tan-Ha Mai et.al. 2509.17971 null
2025-09-22 SeqUDA-Rec: Sequential User Behavior Enhanced Recommendation via Global Unsupervised Data Augmentation for Personalized Content Marketing Ruihan Luo et.al. 2509.17361 null
2025-09-21 Enhanced Detection of Tiny Objects in Aerial Images Kihyun Kim et.al. 2509.17078 null
2025-09-23 Penalizing Boundary Activation for Object Completeness in Diffusion Models Haoyang Xu et.al. 2509.16968 null
2025-09-20 IPF-RDA: An Information-Preserving Framework for Robust Data Augmentation Suorong Yang et.al. 2509.16678 null
2025-09-20 MedCutMix: A Data-Centric Approach to Improve Radiology Vision-Language Pre-training with Disease Awareness Sinuo Wang et.al. 2509.16673 null
2025-09-20 AISTAT lab system for DCASE2025 Task6: Language-based audio retrieval Hyun Jun Kim et.al. 2509.16649 null
2025-09-19 Intrinsic Meets Extrinsic Fairness: Assessing the Downstream Impact of Bias Mitigation in Large Language Models Mina Arzaghi et.al. 2509.16462 null
2025-09-19 Evaluating the Effectiveness and Scalability of LLM-Based Data Augmentation for Retrieval Pranjal A. Chitale et.al. 2509.16442 null
2025-09-19 DistillMatch: Leveraging Knowledge Distillation from Vision Foundation Model for Multimodal Image Matching Meng Yang et.al. 2509.16017 null
2025-09-19 Chunk Based Speech Pre-training with High Resolution Finite Scalar Quantization Yun Tang et.al. 2509.15579 null
2025-09-19 Contrastive Learning with Spectrum Information Augmentation in Abnormal Sound Detection Xinxin Meng et.al. 2509.15570 null
2025-09-18 Generative AI Meets Wireless Sensing: Towards Wireless Foundation Model Zheng Yang et.al. 2509.15258 null
2025-09-17 GenCAD-3D: CAD Program Generation using Multimodal Latent Space Alignment and Synthetic Dataset Balancing Nomi Yu et.al. 2509.15246 null
2025-09-18 Synthetic-to-Real Object Detection using YOLOv11 and Domain Randomization Strategies Luisa Torquato Niño et.al. 2509.15045 null
2025-09-18 Data Augmentation via Latent Diffusion Models for Detecting Smell-Related Objects in Historical Artworks Ahmed Sheta et.al. 2509.14755 null
2025-09-18 SpeechMLC: Speech Multi-label Classification Miseul Kim et.al. 2509.14677 null
2025-09-18 How Does Instrumental Music Help SingFake Detection? Xuanjun Chen et.al. 2509.14675 null
2025-09-18 SWE-QA: Can Language Models Answer Repository-level Code Questions? Weihan Peng et.al. 2509.14635 null
2025-09-18 Mitigating Intra-Speaker Variability in Diarization with Style-Controllable Speech Augmentation Miseul Kim et.al. 2509.14632 null
2025-09-18 LSTC-MDA: A Unified Framework for Long-Short Term Temporal Convolution and Mixed Data Augmentation in Skeleton-Based Action Recognition Feng Ding et.al. 2509.14619 null
2025-09-18 Leveraging IndoBERT and DistilBERT for Indonesian Emotion Classification in E-Commerce Reviews William Christian et.al. 2509.14611 null
2025-09-18 VisMoDAl: Visual Analytics for Evaluating and Improving Corruption Robustness of Vision-Language Models Huanchen Wang et.al. 2509.14571 null
2025-09-18 Learning to Retrieve for Environmental Knowledge Discovery: An Augmentation-Adaptive Self-Supervised Learning Framework Shiyuan Luo et.al. 2509.14563 null
2025-09-18 Data coarse graining can improve model performance Alex Nguyen et.al. 2509.14498 null
2025-09-17 Sequential Data Augmentation for Generative Recommendation Geon Lee et.al. 2509.13648 null
2025-09-17 Multimodal signal fusion for stress detection using deep neural networks: a novel approach for converting 1D signals to unified 2D images Yasin Hasanpoor et.al. 2509.13636 null
2025-09-16 Adversarial Appearance Learning in Augmented Cityscapes for Pedestrian Recognition in Autonomous Driving Artem Savkin et.al. 2509.13507 null
2025-09-16 Contrastive timbre representations for musical instrument and synthesizer retrieval Gwendal Le Vaillant et.al. 2509.13285 null
2025-09-16 Time-step Mixup for Efficient Spiking Knowledge Transfer from Appearance to Event Domain Yuqi Xie et.al. 2509.12959 null
2025-09-16 Synthetic Protein-Ligand Complex Generation for Deep Molecular Docking Sofiene Khiari et.al. 2509.12915 null
2025-09-16 Cumulative Consensus Score: Label-Free and Model-Agnostic Evaluation of Object Detectors in Deployment Avinaash Manoharan et.al. 2509.12871 null
2025-09-20 Data Augmentation for Maltese NLP using Transliterated and Machine Translated Arabic Data Kurt Micallef et.al. 2509.12853 null
2025-09-16 Double Helix Diffusion for Cross-Domain Anomaly Image Generation Linchun Wu et.al. 2509.12787 null
2025-09-15 Robust Fetal Pose Estimation across Gestational Ages via Cross-Population Augmentation Sebastian Diaz et.al. 2509.12062 null
2025-09-15 Learning to Generate 4D LiDAR Sequences Ao Liang et.al. 2509.11959 null
2025-09-15 Automated training of neural-network interatomic potentials Davide Bidoggia et.al. 2509.11703 null
2025-09-15 DTGen: Generative Diffusion-Based Few-Shot Data Augmentation for Fine-Grained Dirty Tableware Recognition Lifei Hao et.al. 2509.11661 null
2025-09-15 Task Decoding based on Eye Movements using Synthetic Data Augmentation Shanmuka Sadhu et.al. 2509.11547 null
2025-09-14 An Entropy-Guided Curriculum Learning Strategy for Data-Efficient Acoustic Scene Classification under Domain Shift Peihong Zhang et.al. 2509.11168 null
2025-09-14 An Advanced Convolutional Neural Network for Bearing Fault Diagnosis under Limited Data Shengke Sun et.al. 2509.11053 null
2025-09-13 Point-Plane Projections for Accurate LiDAR Semantic Segmentation in Small Data Scenarios Simone Mosco et.al. 2509.10841 null
2025-09-01 MIDOG 2025 Track 2: A Deep Learning Model for Classification of Atypical and Normal Mitotic Figures under Class and Hardness Imbalances Sujatha Kotte et.al. 2509.10502 null
2025-09-12 Improving Audio Event Recognition with Consistency Regularization Shanmuka Sadhu et.al. 2509.10391 null
2025-09-12 Scaling Arabic Medical Chatbots Using Synthetic Data: Enhancing Generative AI with Synthetic Patient Records Abdulrahman Allam et.al. 2509.10108 null
2025-09-11 Combining Textual and Spectral Features for Robust Classification of Pilot Communications Abdullah All Tanvir et.al. 2509.09752 null
2025-09-24 Structure Matters: Brain Graph Augmentation via Learnable Edge Masking for Data-efficient Psychiatric Diagnosis Mujie Liu et.al. 2509.09744 null
2025-09-11 Virtual staining for 3D X-ray histology of bone implants Sarah C. Irvine et.al. 2509.09235 null
2025-09-11 Target-oriented Multimodal Sentiment Classification with Counterfactual-enhanced Debiasing Zhiyue Liu et.al. 2509.09160 null
2025-09-10 Handling Open-Vocabulary Constructs in Formalizing Specifications: Retrieval-Augmented Parsing with Expert Knowledge Mohammad Saqib Hasan et.al. 2509.08808 null
2025-09-10 ADHDeepNet From Raw EEG to Diagnosis: Improving ADHD Diagnosis through Temporal-Spatial Processing, Adaptive Attention Mechanisms, and Explainability in Raw EEG Signals Ali Amini et.al. 2509.08779 null
2025-09-10 Ensemble Distribution Distillation for Self-Supervised Human Activity Recognition Matthew Nolan et.al. 2509.08225 null
2025-09-09 Transformer-Based Approach to Optimal Sensor Placement for Structural Health Monitoring of Probe Cards Mehdi Bejani et.al. 2509.07603 null
2025-10-21 From Scarcity to Efficiency: Investigating the Effects of Data Augmentation on African Machine Translation Mardiyyah Oduwole et.al. 2509.07471 null
2025-09-08 Breast Cancer Detection in Thermographic Images via Diffusion-Based Augmentation and Nonlinear Feature Fusion Sepehr Salem et.al. 2509.07277 null
2025-09-08 Pothole Detection and Recognition based on Transfer Learning Mang Hu et.al. 2509.06750 null
2025-09-08 Contrastive Self-Supervised Network Intrusion Detection using Augmented Negative Pairs Jack Wilkie et.al. 2509.06550 null
2025-09-08 IGAff: Benchmarking Adversarial Iterative and Genetic Affine Algorithms on Deep Neural Networks Sebastian-Vasile Echim et.al. 2509.06459 null
2025-09-08 CAPMix: Robust Time Series Anomaly Detection Based on Abnormal Assumptions with Dual-Space Mixup Xudong Mou et.al. 2509.06419 null
2025-09-08 PL-CA: A Parametric Legal Case Augmentation Framework Ao Chang et.al. 2509.06356 null
2025-09-07 Exploring Light-Weight Object Recognition for Real-Time Document Detection Lucas Wojcik et.al. 2509.06246 null
2025-09-07 Learning in ImaginationLand: Omnidirectional Policies through 3D Generative Models (OP-Gen) Yifei Ren et.al. 2509.06191 null
2025-09-06 CardiacFlow: 3D+t Four-Chamber Cardiac Shape Completion and Generation via Flow Matching Qiang Ma et.al. 2509.05754 null
2025-09-05 DuoCLR: Dual-Surrogate Contrastive Learning for Skeleton-based Human Action Segmentation Haitao Tian et.al. 2509.05543 null
2025-09-05 Handling Data Gaps for the Next Generation of Gravitational-Wave Observatories Noah Pearson et.al. 2509.05479 null
2025-09-01 Handling imbalance and few-sample size in ML based Onion disease classification Abhijeet Manoj Pal et.al. 2509.05341 null
2025-08-30 A Dataset Generation Scheme Based on Video2EEG-SPGN-Diffusion for SEED-VD Yunfei Guo et.al. 2509.05321 null
2025-09-05 Uncertain but Useful: Leveraging CNN Variability into Data Augmentation Inés Gonzalez-Pepe et.al. 2509.05238 null
2025-09-05 SL-SLR: Self-Supervised Representation Learning for Sign Language Recognition Ariel Basso Madjoukeng et.al. 2509.05188 null
2025-09-05 Hybrid Matrix Factorization Based Graph Contrastive Learning for Recommendation System Hao Chen et.al. 2509.05115 null
2025-09-05 Leveraging Transfer Learning and Mobile-enabled Convolutional Neural Networks for Improved Arabic Handwritten Character Recognition Mohsine El Khayati et.al. 2509.05019 null
2025-09-05 Optimizing Small Transformer-Based Language Models for Multi-Label Sentiment Analysis in Short Texts Julius Neumann et.al. 2509.04982 null
2025-09-05 DeGuV: Depth-Guided Visual Reinforcement Learning for Generalization and Interpretability in Manipulation Tien Pham et.al. 2509.04970 null
2025-09-05 A transformer-BiGRU-based framework with data augmentation and confident learning for network intrusion detection Jiale Zhang et.al. 2509.04925 null
2025-09-05 Evaluating Multiple Instance Learning Strategies for Automated Sebocyte Droplet Counting Maryam Adelipour et.al. 2509.04895 null
2025-08-29 MOSAIC: A Multilingual, Taxonomy-Agnostic, and Computationally Efficient Approach for Radiological Report Classification Alice Schiavone et.al. 2509.04471 null
2025-09-04 TauGenNet: Plasma-Driven Tau PET Image Synthesis via Text-Guided 3D Diffusion Models Yuxin Gong et.al. 2509.04269 null
2025-09-04 How many patients could we save with LLM priors? Shota Arai et.al. 2509.04250 null
2025-09-04 Explicit and Implicit Data Augmentation for Social Event Detection Congbo Ma et.al. 2509.04202 null
2025-09-04 Chest X-ray Pneumothorax Segmentation Using EfficientNet-B4 Transfer Learning in a U-Net Architecture Alvaro Aranibar Roque et.al. 2509.03950 null
2025-09-04 A Generative Foundation Model for Chest Radiography Yuanfeng Ji et.al. 2509.03903 null
2025-09-04 Data-Augmented Quantization-Aware Knowledge Distillation Justin Kur et.al. 2509.03850 null
2025-09-03 Lightweight image segmentation for echocardiography Anders Kjelsrud et.al. 2509.03631 null
2025-09-04 Invariant Features for Global Crop Type Classification Xin-Yi Tong et.al. 2509.03497 null
2025-09-03 Joint Training of Image Generator and Detector for Road Defect Detection Kuan-Chuan Peng et.al. 2509.03465 null
2025-09-02 Enhancing Machine Learning for Imbalanced Medical Data: A Quantum-Inspired Approach to Synthetic Oversampling (QI-SMOTE) Vikas Kashtriya et.al. 2509.02863 null
2025-08-29 Foundation Model-Driven Classification of Atypical Mitotic Figures with Domain-Aware Training Strategies Piotr Giedziun et.al. 2509.02601 null
2025-09-02 PalmX 2025: The First Shared Task on Benchmarking LLMs on Arabic and Islamic Culture Fakhraddin Alwajih et.al. 2509.02550 null
2025-09-02 EmoPerso: Enhancing Personality Detection with Self-Supervised Emotion-Aware Modelling Lingzhi Shen et.al. 2509.02450 null
2025-09-02 Improving Electroencephalogram-Based Deception Detection in Concealed Information Test under Low Stimulus Heterogeneity Suhye Kim et.al. 2509.02234 null
2025-09-02 Enhancing Zero-Shot Pedestrian Attribute Recognition with Synthetic Data Generation: A Comparative Study with Image-To-Image Diffusion Models Pablo Ayuso-Albizu et.al. 2509.02161 null
2025-09-02 A Data-Centric Approach to Pedestrian Attribute Recognition: Synthetic Augmentation via Prompt-driven Diffusion Models Alejandro Alonso et.al. 2509.02099 null
2025-09-16 Abex-rat: Synergizing Abstractive Augmentation and Adversarial Training for Classification of Occupational Accident Reports Jian Chen et.al. 2509.02072 null
2025-09-01 CabinSep: IR-Augmented Mask-Based MVDR for Real-Time In-Car Speech Separation with Distributed Heterogeneous Arrays Runduo Han et.al. 2509.01399 null
2025-09-01 MARS: Modality-Aligned Retrieval for Sequence Augmented CTR Prediction Yutian Xiao et.al. 2509.01184 null
2025-08-31 A Unified Denoising and Adaptation Framework for Self-Supervised Bengali Dialectal ASR Swadhin Biswas et.al. 2509.00988 null
2025-09-05 Semi-Supervised Bayesian GANs with Log-Signatures for Uncertainty-Aware Credit Card Fraud Detection David Hirnschall et.al. 2509.00931 null
2025-08-30 NoiseCutMix: A Novel Data Augmentation Approach by Mixing Estimated Noise in Diffusion Models Shumpei Takezaki et.al. 2509.00378 null
2025-08-26 Amplifying Emotional Signals: Data-Efficient Deep Learning for Robust Speech Emotion Recognition Tai Vu et.al. 2509.00077 null
2025-08-29 A Multi-Stage Fine-Tuning and Ensembling Strategy for Pancreatic Tumor Segmentation in Diagnostic and Therapeutic MRI Omer Faruk Durugol et.al. 2508.21775 null
2025-08-29 QZhou-Embedding Technical Report Peng Yu et.al. 2508.21632 null
2025-08-29 Towards On-Device Personalization: Cloud-device Collaborative Data Augmentation for Efficient On-device Language Model Zhaofeng Zhong et.al. 2508.21313 null
2025-08-28 Reverse Imaging for Wide-spectrum Generalization of Cardiac MRI Segmentation Yidong Zhao et.al. 2508.21254 null
2025-08-26 CoBA: Counterbias Text Augmentation for Mitigating Various Spurious Correlations via Semantic Triples Kyohoon Jin et.al. 2508.21083 null
2025-08-28 Improved photometric redshift estimations through self-organising map-based data augmentation Yun-Hao Zhang et.al. 2508.20903 null
2025-08-28 Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision Ao Cheng et.al. 2508.20729 null
2025-08-28 Compositionality in Time Series: A Proof of Concept using Symbolic Dynamics and Compositional Data Augmentation Michael Hagmann et.al. 2508.20656 null
2025-08-28 Mask-Guided Multi-Channel SwinUNETR Framework for Robust MRI Classification Smriti Joshi et.al. 2508.20621 null
2025-08-28 KCS: Diversify Multi-hop Question Generation with Knowledge Composition Sampling Yangfan Wang et.al. 2508.20567 null
2025-08-28 Enhancing Health Fact-Checking with LLM-Generated Synthetic Data Jingze Zhang et.al. 2508.20525 null
2025-08-27 IELDG: Suppressing Domain-Specific Noise with Inverse Evolution Layers for Domain Generalized Semantic Segmentation Qizhe Fan et.al. 2508.19604 null
2025-08-27 Improving Recommendation Fairness via Graph Structure and Representation Augmentation Tongxin Xu et.al. 2508.19547 null
2025-08-26 Database Entity Recognition with Data Augmentation and Deep Learning Zikun Fu et.al. 2508.19372 null
2025-08-26 HuBE: Cross-Embodiment Human-like Behavior Execution for Humanoid Robots Shipeng Lyu et.al. 2508.19002 null
2025-08-26 Enhancing compact convolutional transformers with super attention Simpenzwe Honore Leandre et.al. 2508.18960 null
2025-08-26 SegReConcat: A Data Augmentation Method for Voice Anonymization Attack Ridwan Arefeen et.al. 2508.18907 null
2025-08-26 Enhancing Video-Based Robot Failure Detection Using Task Knowledge Santosh Thoduka et.al. 2508.18705 null
2025-08-26 Auditing Approximate Machine Unlearning for Differentially Private Models Yuechun Gu et.al. 2508.18671 null
2025-08-25 Analise de Desaprendizado de Maquina em Modelos de Classificacao de Imagens Medicas Andreza M. C. Falcao et.al. 2508.18509 null
2025-08-25 Data Augmentation Improves Machine Unlearning Andreza M. C. Falcao et.al. 2508.18502 null
2025-08-29 German4All – A Dataset and Model for Readability-Controlled Paraphrasing in German Miriam Anschütz et.al. 2508.17973 null
2025-08-25 Diffusion-Based Data Augmentation for Medical Image Segmentation Maham Nazir et.al. 2508.17844 null
2025-08-25 LLMulator: Generalizable Cost Modeling for Dataflow Accelerators with Input-Adaptive Control Flow Kaiyan Chang et.al. 2508.17826 null
2025-08-24 LodeStar: Long-horizon Dexterity via Synthetic Data Augmentation from Human Demonstrations Weikang Wan et.al. 2508.17547 null
2025-07-28 Data Augmentation for Spoken Grammatical Error Correction Penny Karanasou et.al. 2507.19374 null
2025-07-08 Music Boomerang: Reusing Diffusion Models for Data Augmentation and Audio Manipulation Alexander Fichtinger et.al. 2507.04864 null
2025-04-07 Mind the Prompt: Prompting Strategies in Audio Generations for Improving Sound Classification Francesca Ronchini et.al. 2504.03329 null
2025-03-25 Multimodal Large Language Models for Image, Text, and Speech Data Augmentation: A Survey Ranjan Sapkota et.al. 2501.18648 null
2025-01-24 Generative Data Augmentation Challenge: Synthesis of Room Acoustics for Speaker Distance Estimation Jackie Lin et.al. 2501.13250 null
2024-12-03 Sample adaptive data augmentation with progressive scheduling Hongxuan Lu et.al. 2412.00415 null
2024-10-15 SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs Wenxi Chen et.al. 2410.09503 null
2025-02-05 Exploring Empty Spaces: Human-in-the-Loop Data Augmentation Catherine Yeh et.al. 2410.01088 null
2024-06-28 Leveraging Synthetic Audio Data for End-to-End Low-Resource Speech Translation Yasmin Moslem et.al. 2406.17363 null
2024-06-25 Revisiting Interpolation Augmentation for Speech-to-Text Generation Chen Xu et.al. 2406.15846 null
2024-01-18 On the Effect of Data-Augmentation on Local Embedding Properties in the Contrastive Learning of Music Audio Representations Matthew C. McCallum et.al. 2401.08889 null
2024-01-17 Maximum-Entropy Adversarial Audio Augmentation for Keyword Spotting Zuzhao Ye et.al. 2401.06897 null
2023-12-27 Exploring data augmentation in bias mitigation against non-native-accented speech Yuanyuan Zhang et.al. 2312.15499 null
2023-12-15 Towards Automatic Data Augmentation for Disordered Speech Recognition Zengrui Jin et.al. 2312.08641 null
2023-12-15 PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition Chengxi Lei et.al. 2312.08571 null
2023-10-27 Dialect Adaptation and Data Augmentation for Low-Resource ASR: TalTech Systems for the MADASR 2023 Challenge Tanel Alumäe et.al. 2310.17448 null
2024-01-11 Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation Shih-Lun Wu et.al. 2309.17352 null
2024-02-21 Deepfake audio as a data augmentation technique for training automatic speech to text transcription models Alexandre R. Ferreira et.al. 2309.12802 null
2024-07-02 Reduce, Reuse, Recycle: Is Perturbed Data better than Other Language augmentation for Low Resource Self-Supervised Speech Models Asad Ullah et.al. 2309.12763 null
2024-01-10 Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation Emiru Tsunoo et.al. 2309.08876 null
2023-08-01 Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text Eric Sun et.al. 2307.16332 null
2023-06-08 Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation Massa Baali et.al. 2306.04368 null
2023-04-26 Selective Data Augmentation for Robust Speech Translation Rajul Acharya et.al. 2304.03169 null
2024-04-01 A Comparison of Speech Data Augmentation Methods Using S3PRL Toolkit Mina Huh et.al. 2303.00510 null
2023-11-02 SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations Ioannis Tsiamas et.al. 2212.09699 null
2023-05-24 Exploring Train and Test-Time Augmentations for Audio-Language Learning Eungbeom Kim et.al. 2210.17143 null
2022-11-01 Improving Natural-Language-based Audio Retrieval with Transfer Learning and Audio & Text Augmentations Paul Primus et.al. 2208.11460 null
2022-08-11 Generative Data Augmentation Guided by Triplet Loss for Speech Emotion Recognition Shijun Wang et.al. 2208.04994 null
2022-07-21 Improving Data Driven Inverse Text Normalization using Data Augmentation Laxmi Pandey et.al. 2207.09674 null
2022-07-15 Data Augmentation for Low-Resource Quechua ASR Improvement Rodolfo Zevallos et.al. 2207.06872 null
2022-07-19 Data Augmentation for Dementia Detection in Spoken Language Anna Hlédiková et.al. 2206.12879 null
2023-06-02 Audio Data Augmentation for Acoustic-to-articulatory Speech Inversion using Bidirectional Gated RNNs Yashish M. Siriwardena et.al. 2205.13086 null
2025-11-05 Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition Zengrui Jin et.al. 2205.06445 null
2022-04-12 Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition Zehai Tu et.al. 2204.04284 null
2022-04-11 Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning Salah Zaiem et.al. 2204.04170 null
2022-07-07 SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy Shuai Guo et.al. 2203.17001 null
2023-06-12 Sample, Translate, Recombine: Leveraging Audio Alignments for Data Augmentation in End-to-end Speech Translation Tsz Kin Lam et.al. 2203.08757 null
2022-09-02 A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification Qing Wang et.al. 2203.04114 null
2022-02-22 ImportantAug: a data augmentation agent for speech Viet Anh Trinh et.al. 2112.07156 null
2022-05-20 Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection Ricardo Falcon-Perez et.al. 2110.06126 null
2021-10-14 SpliceOut: A Simple and Efficient Audio Augmentation Method Arjit Jain et.al. 2110.00046 null
2021-08-17 Data Augmentation for Scene Text Recognition Rowel Atienza et.al. 2108.06949 null
2021-08-09 SpecMix: A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features Gwantae Kim et.al. 2108.03020 null
2021-08-03 Adversarial Data Augmentation for Disordered Speech Recognition Zengrui Jin et.al. 2108.00899 null
2021-04-27 Semantic Data Augmentation for End-to-End Mandarin Speech Recognition Jianwei Sun et.al. 2104.12521 null
2021-04-16 EnvGAN: Adversarial Synthesis of Environmental Sounds for Data Augmentation Aswathy Madhu et.al. 2104.07326 null
2023-06-13 On-the-Fly Aligned Data Augmentation for Sequence-to-Sequence ASR Tsz Kin Lam et.al. 2104.01393 null
2021-06-16 SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification Helin Wang et.al. 2103.16858 null
2021-02-26 MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition Linghui Meng et.al. 2102.12664 null
2022-11-17 Back Translation Survey for Improving Text Augmentation Matthew Ciolino et.al. 2102.09708 null
2021-02-19 Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition Gary Yeung et.al. 2102.09106 null
2021-02-17 Adaptive Weighting Scheme for Automatic Time-Series Data Augmentation Elizabeth Fons et.al. 2102.08310 null
2021-04-20 Enhancing Audio Augmentation Methods with Consistency Learning Turab Iqbal et.al. 2102.05151 null
2023-03-08 A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection Qing Wang et.al. 2101.02919 null
2022-02-17 Multi-Window Data Augmentation Approach for Speech Emotion Recognition Sarala Padi et.al. 2010.09895 null
2020-09-01 Data augmentation using prosody and false starts to recognize non-native children’s speech Hemant Kathania et.al. 2008.12914 null
2020-08-18 StoRIR: Stochastic Room Impulse Response Generation for Audio Data Augmentation Piotr Masztalski et.al. 2008.07231 null
2020-09-25 Data augmentation and loss normalization for deep noise suppression Sebastian Braun et.al. 2008.06412 null
2021-03-29 Data augmentation enhanced speaker enrollment for text-dependent speaker verification Achintya Kumar Sarkar et.al. 2007.08004 null
2020-06-11 Data Augmentation for Training Dialog Models Robust to Speech Recognition Errors Longshaokan Wang et.al. 2006.05635 null
2020-09-04 On the Effectiveness of Neural Text Generation based Data Augmentation for Recognition of Morphologically Rich Speech Balázs Tarján et.al. 2006.05129 null
2020-01-16 A Multi-cascaded Model with Data Augmentation for Enhanced Paraphrase Detection in Short Texts Muhammad Haroon Shakeel et.al. 1912.12068 null
2022-01-04 Audiogmenter: a MATLAB Toolbox for Audio Data Augmentation Gianluca Maguolo et.al. 1912.05472 null
2020-02-04 Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation Thai-Son Nguyen et.al. 1910.13296 null
2019-12-04 SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition Daniel S. Park et.al. 1904.08779 null
2020-12-07 Data Augmentation of Room Classifiers using Generative Adversarial Networks Constantinos Papayiannis et.al. 1901.03257 null
2018-08-14 Sample Mixed-Based Data Augmentation for Domestic Audio Tagging Shengyun Wei et.al. 1808.03883 null
2018-09-13 CNN+LSTM Architecture for Speech Emotion Recognition with Data Augmentation Caroline Etienne et.al. 1802.05630 null
2022-07-06 Efficient data augmentation techniques for some classes of state space models Linda S. L. Tan et.al. 1712.08887 null

🎨 Synthetic Generation

📊 1506 papers

📅 Publish Date 📝 Title 👥 Authors 📄 PDF 💻 Code
2026-04-01 Bridging the Simulation-to-Experiment Gap with Generative Models using Adversarial Distribution Alignment Kai Nelson et.al. 2604.01169 null
2026-04-01 Looking into a Pixel by Nonlinear Unmixing – A Generative Approach Maofeng Tang et.al. 2604.01141 null
2026-04-01 Optimsyn: Influence-Guided Rubrics Optimization for Synthetic Data Generation Zhiting Fan et.al. 2604.00536 null
2026-03-31 SANA I2I: A Text Free Flow Matching Framework for Paired Image to Image Translation with a Case Study in Fetal MRI Artifact Reduction Italo Felix Santos et.al. 2604.00298 null
2026-03-31 SYNTHONY: A Stress-Aware, Intent-Conditioned Agent for Deep Tabular Generative Models Selection Hochan Son et.al. 2604.00293 null
2026-03-31 RawGen: Learning Camera Raw Image Generation Dongyoung Kim et.al. 2604.00093 null
2026-03-31 Reasoning-Driven Synthetic Data Generation and Evaluation Tim R. Davidson et.al. 2603.29791 null
2026-03-31 Multi-Feature Fusion Approach for Generative AI Images Detection Abderrezzaq Sendjasni et.al. 2603.29788 null
2026-03-31 Leveraging Synthetic Data for Enhancing Egocentric Hand-Object Interaction Detection Rosario Leonardi et.al. 2603.29733 null
2026-03-31 6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management Jiao Chen et.al. 2603.29656 null
2026-03-31 Concept frustration: Aligning human concepts and machine representations Enrico Parisini et.al. 2603.29654 null
2026-03-31 CIPHER: Counterfeit Image Pattern High-level Examination via Representation Kyeonghun Kim et.al. 2603.29356 null
2026-03-31 Differentiable Normative Guidance for Nash Bargaining Solution Recovery Moirangthem Tiken Singh et.al. 2603.29297 null
2026-03-31 Customer Analysis and Text Generation for Small Retail Stores Using LLM-Generated Marketing Presence Shiori Nakamura et.al. 2603.29273 null
2026-03-30 Generating Humanless Environment Walkthroughs from Egocentric Walking Tour Videos Yujin Ham et.al. 2603.29036 null
2026-03-30 The Ultimate Tutorial for AI-driven Scale Development in Generative Psychometrics: Releasing AIGENIE from its Bottle Lara Russell-Lasalandra et.al. 2603.28643 null
2026-03-30 Unrestrained Simplex Denoising for Discrete Data. A Non-Markovian Approach Applied to Graph Generation Yoann Boget et.al. 2603.28572 null
2026-03-30 A Probabilistic Generative Model for Spectral Speech Enhancement Marco Hidalgo-Araya et.al. 2603.28436 null
2026-03-30 From Independent to Correlated Diffusion: Generalized Generative Modeling with Probabilistic Computers Nihal Sanjay Singh et.al. 2603.27996 null
2026-03-30 Scaling Atomistic Protein Binder Design with Generative Pretraining and Test-Time Compute Kieran Didi et.al. 2603.27950 null
2026-03-29 Diversity Matters: Dataset Diversification and Dual-Branch Network for Generalized AI-Generated Image Detection Nusrat Tasnim et.al. 2603.27800 null
2026-03-29 Emergent Social Intelligence Risks in Generative Multi-Agent Systems Yue Huang et.al. 2603.27771 null
2026-03-29 Test-Time Instance-Specific Parameter Composition: A New Paradigm for Adaptive Generative Modeling Minh-Tuan Tran et.al. 2603.27665 null
2026-03-29 Understanding Semantic Perturbations on In-Processing Generative Image Watermarks Anirudh Nakra et.al. 2603.27513 null
2026-03-28 Beyond Descriptions: A Generative Scene2Audio Framework for Blind and Low-Vision Users to Experience Vista Landscapes Chitralekha Gupta et.al. 2603.27295 null
2026-03-28 Amalgam: Hybrid LLM-PGM Synthesis Algorithm for Accuracy and Realism Antheas Kapenekakis et.al. 2603.27254 null
2026-03-27 Material Identification using Multi-Modal Intrinsic Radiation and Radiography Khoa Nguyen et.al. 2603.27036 null
2026-03-27 Generative Shape Reconstruction with Geometry-Guided Langevin Dynamics Linus Härenstam-Nielsen et.al. 2603.27016 null
2026-03-27 Transparency as Architecture: Structural Compliance Gaps in EU AI Act Article 50 II Vera Schmitt et.al. 2603.26983 null
2026-03-27 Synthesizing the Counterfactual: A CTGAN-Augmented Causal Evaluation of Palliative Care on Spousal Depression Pietro Grassi et.al. 2603.26913 null
2026-03-27 Strategic Candidacy in Generative AI Arenas Chris Hays et.al. 2603.26891 null
2026-03-27 AFSS: Artifact-Focused Self-Synthesis for Mitigating Bias in Audio Deepfake Detection Hai-Son Nguyen-Le et.al. 2603.26856 null
2026-03-27 Generative Modeling in Protein Design: Neural Representations, Conditional Generation, and Evaluation Standards Senura Hansaja Wanasekara et.al. 2603.26378 null
2026-03-27 A Formal Framework for Uncertainty Analysis of Text Generation with Large Language Models Steffen Herbold et.al. 2603.26363 null
2026-03-27 Generative Score Inference for Multimodal Data Xinyu Tian et.al. 2603.26349 null
2026-03-27 Physics-Informed Neural Networks and Sequence Encoder: Application to heating and early cooling of thermo-stamping process Mouad Elaarabi et.al. 2603.26245 null
2026-03-27 Cinematic Audio Source Separation Using Visual Cues Kang Zhang et.al. 2603.26113 null
2026-03-27 JRM: Joint Reconstruction Model for Multiple Objects without Alignment Qirui Wu et.al. 2603.25985 null
2026-03-26 Seeing Through Smoke: Surgical Desmoking for Improved Visual Perception Jingpei Lu et.al. 2603.25867 null
2026-03-26 ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners? Haonan Han et.al. 2603.25823 null
2026-03-26 SoftMimicGen: A Data Generation System for Scalable Robot Learning in Deformable Object Manipulation Masoud Moghani et.al. 2603.25725 null
2026-03-26 Knowledge-Guided Retrieval-Augmented Generation for Zero-Shot Psychiatric Data: Privacy Preserving Synthetic Data Generation Adam Jakobsen et.al. 2603.25186 null
2026-03-26 AnyDoc: Enhancing Document Generation via Large-Scale HTML/CSS Data Synthesis and Height-Aware Reinforcement Optimization Jiawei Lin et.al. 2603.25118 null
2026-03-26 Discrete Causal Representation Learning Wenjin Zhang et.al. 2603.25017 null
2026-03-25 Post-selection inference in generalized linear models via parametric programming Qinyan Shen et.al. 2603.24875 null
2026-03-25 Synthetic Rewriting as a Quality Multiplier: Evidence from Portuguese Continued Pretraining Thales Sales Almeida et.al. 2603.24826 null
2026-03-25 Synthetic Cardiac MRI Image Generation using Deep Generative Models Ishan Kumarasinghe et.al. 2603.24764 null
2026-03-25 Contrastive Learning Boosts Deterministic and Generative Models for Weather Data Nathan Bailey et.al. 2603.24744 null
2026-03-25 Training LLMs for Multi-Step Tool Orchestration with Constrained Data Synthesis and Graduated Rewards Cheng Jiayang et.al. 2603.24709 null
2026-03-25 Saranga: MilliWatt Ultrasound for Navigation in Visually Degraded Environments on Palm-Sized Aerial Robots Manoj Velmurugan et.al. 2603.24699 null
2026-03-25 SpinGQE: A Generative Quantum Eigensolver for Spin Hamiltonians Alexander Holden et.al. 2603.24298 null
2026-03-25 PosterIQ: A Design Perspective Benchmark for Poster Understanding and Generation Yuheng Feng et.al. 2603.24078 null
2026-03-27 CVPD at QIAS 2026: RAG-Guided LLM Reasoning for Al-Mawarith Share Computation and Heir Allocation Wassim Swaileh et.al. 2603.24012 null
2026-03-25 GARP-EFM: Improving Foundation Models with Revealed Preference Structure Victor H. Aguiar et.al. 2603.23993 null
2026-03-25 Argument Mining as a Text-to-Text Generation Task Masayuki Kawarada et.al. 2603.23949 null
2026-03-24 CDMT-EHR: A Continuous-Time Diffusion Framework for Generating Mixed-Type Time-Series Electronic Health Records Shaonan Liu et.al. 2603.23719 null
2026-03-24 GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models Zekai Gu et.al. 2603.23246 null
2026-03-25 DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models Donya Jafari et.al. 2603.23140 null
2026-03-24 HUydra: Full-Range Lung CT Synthesis via Multiple HU Interval Generative Modelling António Cardoso et.al. 2603.23041 null
2026-03-24 Few-Shot Generative Model Adaption via Identity Injection and Preservation Yeqi He et.al. 2603.22965 null
2026-03-23 MIOFlow 2.0: A unified framework for inferring cellular stochastic dynamics from single cell and spatial transcriptomics data Xingzhi Sun et.al. 2603.22564 null
2026-03-23 MCLR: Improving Conditional Modeling in Visual Generative Models via Inter-Class Likelihood-Ratio Maximization and Establishing the Equivalence between Classifier-Free Guidance and Alignment Objectives Xiang Li et.al. 2603.22364 null
2026-03-23 GenOpticalFlow: A Generative Approach to Unsupervised Optical Flow Learning Yixuan Luo et.al. 2603.22270 null
2026-03-23 Gumbel Distillation for Parallel Text Generation Chi Zhang et.al. 2603.22216 null
2026-03-23 Dual-Space Knowledge Distillation with Key-Query Matching for Large Language Models with Vocabulary Mismatch Stella Eva Tsiapali et.al. 2603.22056 null
2026-03-23 DiT-Flow: Speech Enhancement Robust to Multiple Distortions based on Flow Matching in Latent Space and Diffusion Transformers Tianyu Cao et.al. 2603.21608 null
2026-03-23 SynSym: A Synthetic Data Generation Framework for Psychiatric Symptom Identification Migyeong Kang et.al. 2603.21529 null
2026-03-22 Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs Mariela M. Nina et.al. 2603.21418 null
2026-03-22 Amortized Variational Inference for Logistic Regression with Missing Covariates M. Cherifi et.al. 2603.21244 null
2026-03-22 Does Mechanistic Interpretability Transfer Across Data Modalities? A Cross-Domain Causal Circuit Analysis of Variational Autoencoders Dip Roy et.al. 2603.21236 null
2026-03-22 Incentivizing Generative Zero-Shot Learning via Outcome-Reward Reinforcement Learning with Visual Cues Wenjin Hou et.al. 2603.21138 null
2026-03-21 NextSense: A Semi-Synthetic Sensing Data generation Platform David Rico Menéndez et.al. 2603.20789 null
2026-03-21 Generative Diffusion Model for Risk-Neutral Derivative Pricing Nilay Tiwari et.al. 2603.20582 null
2026-03-20 Revenue-Sharing as Infrastructure: A Distributed Business Model for Generative AI Platforms Ghislain Dorian Tchuente Mondjo et.al. 2603.20533 null
2026-03-20 Diffutron: A Masked Diffusion Language Model for Turkish Language Şuayp Talha Kocabay et.al. 2603.20466 null
2026-03-20 MME-CoF-Pro: Evaluating Reasoning Coherence in Video Generative Models with Text and Visual Hints Yu Qi et.al. 2603.20194 null
2026-03-20 Deterministic Mode Proposals: An Efficient Alternative to Generative Sampling for Ambiguous Segmentation Sebastian Gerard et.al. 2603.20191 null
2026-03-20 Kolmogorov-Arnold causal generative models Alejandro Almodóvar et.al. 2603.20184 null
2026-03-20 Audio Avatar Fingerprinting: An Approach for Authorized Use of Voice Cloning in the Era of Synthetic Audio Candice R. Gerstner et.al. 2603.20165 null
2026-03-20 Var-JEPA: A Variational Formulation of the Joint-Embedding Predictive Architecture – Bridging Predictive and Generative Self-Supervised Learning Moritz Gögl et.al. 2603.20111 null
2026-03-20 GO-GenZip: Goal-Oriented Generative Sampling and Hybrid Compression Pietro Talli et.al. 2603.20109 null
2026-03-20 FoleyDirector: Fine-Grained Temporal Steering for Video-to-Audio Generation via Structured Scripts You Li et.al. 2603.19857 null
2026-03-20 Diminishing Returns in Expanding Generative Models and Godel-Tarski-Lob Limits Angshul Majumdar et.al. 2603.19687 null
2026-03-24 LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment Shuaibang Peng et.al. 2603.19609 null
2026-03-19 Warm-Start Flow Matching for Guaranteed Fast Text/Image Generation Minyoung Kim et.al. 2603.19360 null
2026-03-19 MIDST Challenge at SaTML 2025: Membership Inference over Diffusion-models-based Synthetic Tabular data Masoumeh Shafieinejad et.al. 2603.19185 null
2026-03-19 Revisiting Autoregressive Models for Generative Image Classification Ilia Sudakov et.al. 2603.19122 null
2026-03-19 Foundations of Schrödinger Bridges for Generative Modeling Sophia Tang et.al. 2603.18992 null
2026-03-19 Translating MRI to PET through Conditional Diffusion Models with Enhanced Pathology Awareness Yitong Li et.al. 2603.18896 null
2026-03-19 A Human-in/on-the-Loop Framework for Accessible Text Generation Lourdes Moreno et.al. 2603.18879 null
2026-03-19 Seasoning Generative Models for a Generalization Aftertaste Hisham Husain et.al. 2603.18817 null
2026-03-19 Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds Andrew Choi et.al. 2603.18532 null
2026-03-19 From Snapshots to Symphonies: The Evolution of Protein Prediction from Static Structures to Generative Dynamics and Multimodal Interactions Jingzhi Chen et.al. 2603.18505 null
2026-03-18 Synthetic Data Generation for Training Diversified Commonsense Reasoning Models Tianhui Zhang et.al. 2603.18361 null
2026-03-18 Epistemic Generative Adversarial Networks Muhammad Mubashar et.al. 2603.18348 null
2026-03-20 MOSS-TTS Technical Report Yitian Gong et.al. 2603.18090 null
2026-03-18 Generative Replica-Exchange: A Flow-based Framework for Accelerating Replica Exchange Simulations Shengjie Huang et.al. 2603.18076 null
2026-03-18 Machine Learning for Network Attacks Classification and Statistical Evaluation of Machine Learning for Network Attacks Classification and Adversarial Learning Methodologies for Synthetic Data Generation Iakovos-Christos Zarkadis et.al. 2603.17717 null
2026-03-18 Cohomological Obstructions to Global Counterfactuals: A Sheaf-Theoretic Foundation for Generative Causal Models Rui Wu et.al. 2603.17384 null
2026-03-18 Neuron-Level Emotion Control in Speech-Generative Large Audio-Language Models Xiutian Zhao et.al. 2603.17231 null
2026-03-17 Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization Wenhao Zhao et.al. 2603.17052 null
2026-03-17 SCE-LITE-HQ: Smooth visual counterfactual explanations with generative foundation models Ahmed Zeid et.al. 2603.17048 null
2026-03-17 Dependence Fidelity and Downstream Inference Stability in Generative Models Nazia Riasat et.al. 2603.17041 null
2026-03-19 HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning Shenzhi Wang et.al. 2603.17024 null
2026-03-17 SegviGen: Repurposing 3D Generative Model for Part Segmentation Lin Li et.al. 2603.16869 null
2026-03-17 A Semantic Timbre Dataset for the Electric Guitar Joseph Cameron et.al. 2603.16682 null
2026-03-17 VideoMatGen: PBR Materials through Joint Generative Modeling Jon Hasselgren et.al. 2603.16566 null
2026-03-17 Unlearning for One-Step Generative Models via Unbalanced Optimal Transport Hyundo Choi et.al. 2603.16489 null
2026-03-17 DermaFlux: Synthetic Skin Lesion Generation with Rectified Flows for Enhanced Image Classification Stathis Galanakis et.al. 2603.16392 null
2026-03-17 Out-of-Distribution Object Detection in Street Scenes via Synthetic Outlier Exposure and Transfer Learning Sadia Ilyas et.al. 2603.16122 null
2026-03-17 Diffusion Models for Joint Audio-Video Generation Alejandro Paredes La Torre et.al. 2603.16093 null
2026-03-16 FlatLands: Generative Floormap Completion From a Single Egocentric View Subhransu S. Bhattacharjee et.al. 2603.16016 null
2026-03-16 Time-Aware Prior Fitted Networks for Zero-Shot Forecasting with Exogenous Variables Andres Potapczynski et.al. 2603.15802 null
2026-03-16 AC-Foley: Reference-Audio-Guided Video-to-Audio Synthesis with Acoustic Transfer Pengjun Fang et.al. 2603.15597 null
2026-03-16 Clinically Aware Synthetic Image Generation for Concept Coverage in Chest X-ray Models Amy Rafferty et.al. 2603.15525 null
2026-03-18 NV-Bench: Benchmark of Nonverbal Vocalization Synthesis for Expressive Text-to-Speech Generation Qinke Ni et.al. 2603.15352 null
2026-03-16 Faster Inference of Flow-Based Generative Models via Improved Data-Noise Coupling Aram Davtyan et.al. 2603.15279 null
2026-03-16 Modeling Matches as Language: A Generative Transformer Approach for Counterfactual Player Valuation in Football Miru Hong et.al. 2603.15212 null
2026-03-16 PhonemeDF: A Synthetic Speech Dataset for Audio Deepfake Detection and Naturalness Evaluation Vamshi Nallaguntla et.al. 2603.15037 null
2026-03-18 Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods Omer Ben Hayun et.al. 2603.15026 null
2026-03-16 OrgForge: A Multi-Agent Simulation Framework for Verifiable Synthetic Corporate Corpora Jeffrey Flynt et.al. 2603.14997 null
2026-03-16 Preconditioned One-Step Generative Modeling for Bayesian Inverse Problems in Function Spaces Zilan Cheng et.al. 2603.14798 null
2026-03-16 LiDAR-EVS: Enhance Extrapolated View Synthesis for 3D Gaussian Splatting with Pseudo-LiDAR Supervision Yiming Huang et.al. 2603.14763 null
2026-03-16 Chain-of-Trajectories: Unlocking the Intrinsic Generative Optimality of Diffusion Models via Graph-Theoretic Planning Ping Chen et.al. 2603.14704 null
2026-03-15 QiMeng-CodeV-SVA: Training Specialized LLMs for Hardware Assertion Generation via RTL-Grounded Bidirectional Data Synthesis Yutong Wu et.al. 2603.14239 null
2026-03-15 Fair Benchmarking of Emerging One-Step Generative Models Against Multistep Diffusion and Flow Models Advaith Ravishankar et.al. 2603.14186 null
2026-03-14 Sat-JEPA-Diff: Bridging Self-Supervised Learning and Generative Diffusion for Remote Sensing Kursat Komurcu et.al. 2603.13943 null
2026-03-14 Discriminative Flow Matching Via Local Generative Predictors Om Govind Jha et.al. 2603.13928 null
2026-03-14 Evaluating Semantic Fragility in Text-to-Audio Generation Systems Under Controlled Prompt Perturbations Jiahui Wu et.al. 2603.13824 null
2026-03-14 PhysAlign: Physics-Coherent Image-to-Video Generation through Feature and 3D Representation Alignment Zhexiao Xiong et.al. 2603.13770 null
2026-03-14 Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control Grayson Lee et.al. 2603.13733 null
2026-03-14 Steering Generative Models for Accessibility: EasyRead Image Generation Nicolas Dickenmann et.al. 2603.13695 null
2026-03-13 EmDT: Embedding Diffusion Transformer for Tabular Data Generation in Fraud Detection En-Ya Kuo et.al. 2603.13566 null
2026-03-13 MIRAGE: Model-agnostic Industrial Realistic Anomaly Generation and Evaluation for Visual Anomaly Detection Jinwei Hu et.al. 2603.13507 null
2026-03-13 Understanding the strengths and weaknesses of SSL models for audio deepfake model attribution Gabriel Pîrlogeanu et.al. 2603.13488 null
2026-03-13 A Generative Model of Conspicuous Consumption and Status Signaling Logan Cross et.al. 2603.13220 null
2026-03-13 V-Bridge: Bridging Video Generative Priors to Versatile Few-shot Image Restoration Shenghe Zheng et.al. 2603.13089 null
2026-03-16 DS$^2$-Instruct: Domain-Specific Data Synthesis for Large Language Models Instruction Tuning Ruiyao Xu et.al. 2603.12932 null
2026-03-13 HaltNav: Reactive Visual Halting over Lightweight Topological Priors for Robust Vision-Language Navigation Pingcong Li et.al. 2603.12696 null
2026-03-12 RAW-Domain Degradation Models for Realistic Smartphone Super-Resolution Ali Mosleh et.al. 2603.12493 null
2026-03-12 Sinkhorn-Drifting Generative Models Ping He et.al. 2603.12366 null
2026-03-11 Synthetic Data Generation for Brain-Computer Interfaces: Overview, Benchmarking, and Future Directions Ziwei Wang et.al. 2603.12296 null
2026-03-12 DVD: Deterministic Video Depth Estimation with Generative Priors Hongfei Zhang et.al. 2603.12250 null
2026-03-15 QAQ: Bidirectional Semantic Coherence for Selecting High-Quality Synthetic Code Instructions Jiayin Lei et.al. 2603.12165 null
2026-03-16 Structure Selection for Fairness-Constrained Differentially Private Data Synthesis Naeim Ghahramanpour et.al. 2603.12112 null
2026-03-12 Efficient Generative Modeling with Unitary Matrix Product States Using Riemannian Optimization Haotong Duan et.al. 2603.12026 null
2026-03-12 AS-Bridge: A Bidirectional Generative Framework Bridging Next-Generation Astronomical Surveys Dichang Zhang et.al. 2603.11928 null
2026-03-12 Language Generation with Replay: A Learning-Theoretic View of Model Collapse Giorgio Racca et.al. 2603.11784 null
2026-03-12 Anomaly detection in time-series via inductive biases in the latent space of conditional normalizing flows David Baumgartner et.al. 2603.11756 null
2026-03-12 Gender Bias in Generative AI-assisted Recruitment Processes Martina Ullasci et.al. 2603.11736 null
2026-03-12 Resonate: Reinforcing Text-to-Audio Generation via Online Feedback from Large Audio Language Models Xiquan Li et.al. 2603.11661 null
2026-03-12 Personalized Federated Learning via Gaussian Generative Modeling Peng Hu et.al. 2603.11620 null
2026-03-12 Gen-Fab: A Variation-Aware Generative Model for Predicting Fabrication Variations in Nanophotonic Devices Rambod Azimi et.al. 2603.11505 null
2026-03-12 Reproducible Synthetic Clinical Letters for Seizure Frequency Information Extraction Yujian Gan et.al. 2603.11407 null
2026-03-11 A Standardized Framework For Evaluating Gene Expression Generative Models Andrea Rubbi et.al. 2603.11244 null
2026-03-11 Generative modeling with Gaussian Boson Sampling: classically trainable Bosonic Born Machines Zoltán Kolarovszki et.al. 2603.11195 null
2026-03-11 Interventional Time Series Priors for Causal Foundation Models Dennis Thumm et.al. 2603.11090 null
2026-03-11 V2A-DPO: Omni-Preference Optimization for Video-to-Audio Generation Nolan Chan et.al. 2603.11089 null
2026-03-11 Universality of Classically Trainable, Quantum-Deployed Boson-Sampling Generative Models Andrii Kurkin et.al. 2603.11014 null
2026-03-11 Quantifying Membership Disclosure Risk for Tabular Synthetic Data Using Kernel Density Estimators Rajdeep Pathak et.al. 2603.10937 null
2026-03-11 SNPgen: Phenotype-Supervised Genotype Representation and Synthetic Data Generation via Latent Diffusion Andrea Lampis et.al. 2603.10873 null
2026-03-11 ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning Xiaofeng Lin et.al. 2603.10823 null
2026-03-11 Semantic Satellite Communications for Synchronized Audiovisual Reconstruction Fangyu Liu et.al. 2603.10791 null
2026-03-12 Probabilistic Verification of Voice Anti-Spoofing Models Evgeny Kushnir et.al. 2603.10713 null
2026-03-11 AlphaFlowTSE: One-Step Generative Target Speaker Extraction via Conditional AlphaFlow Duojia Li et.al. 2603.10701 null
2026-03-11 Learning Bimanual Cloth Manipulation with Vision-based Tactile Sensing via Single Robotic Arm Dongmyoung Lee et.al. 2603.10609 null
2026-03-11 HyPER-GAN: Hybrid Patch-Based Image-to-Image Translation for Real-Time Photorealism Enhancement Stefanos Pasios et.al. 2603.10604 null
2026-03-11 Layer Consistency Matters: Elegant Latent Transition Discrepancy for Generalizable Synthetic Image Detection Yawen Yang et.al. 2603.10598 null
2026-03-11 Gradient Flow Drifting: Generative Modeling via Wasserstein Gradient Flows of KDE-Approximated Divergences Jiarui Cao et.al. 2603.10592 null
2026-03-10 Improving TabPFN’s Synthetic Data Generation by Integrating Causal Structure Davide Tugnoli et.al. 2603.10254 null
2026-03-10 Generative Drifting is Secretly Score Matching: a Spectral and Variational Perspective Erkan Turan et.al. 2603.09936 null
2026-03-10 You Didn’t Have to Say It like That: Subliminal Learning from Faithful Paraphrases Isaia Gisler et.al. 2603.09517 null
2026-03-10 Cognitively Layered Data Synthesis for Domain Adaptation of LLMs to Space Situational Awareness Ding Linghu et.al. 2603.09231 null
2026-03-10 Differentiable Stochastic Traffic Dynamics: Physics-Informed Generative Modelling in Transportation Wuping Xin et.al. 2603.09174 null
2026-03-09 Statistical Inference via Generative Models: Flow Matching and Causal Inference Shinto Eguchi et.al. 2603.09009 null
2026-03-09 VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMs Hezhao Zhang et.al. 2603.08936 null
2026-03-09 HeteroFedSyn: Differentially Private Tabular Data Synthesis for Heterogeneous Federated Settings Xiaochen Li et.al. 2603.08832 null
2026-03-09 Efficient training of photonic quantum generative models Felix Gottlieb et.al. 2603.08793 null
2026-03-09 Generative Adversarial Regression (GAR): Learning Conditional Risk Scenarios Saeed Asadi et.al. 2603.08553 null
2026-03-09 Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows Shentong Mo et.al. 2603.08126 null
2026-03-12 Evaluating Generative Models via One-Dimensional Code Distributions Zexi Jia et.al. 2603.08064 null
2026-03-08 Uncertainty-Gated Generative Modeling Xingrui Gu et.al. 2603.07753 null
2026-03-08 Evaluating Synthetic Data for Baggage Trolley Detection in Airport Logistics Abdeldjalil Taibi et.al. 2603.07645 null
2026-03-08 Targeted Speaker Poisoning Framework in Zero-Shot Text-to-Speech Thanapat Trachu et.al. 2603.07551 null
2026-03-08 Learning-free L2-Accented Speech Generation using Phonological Rules Thanathai Lertpetchpun et.al. 2603.07550 null
2026-03-08 Contact-Guided 3D Genome Structure Generation of E. coli via Diffusion Transformers Mingxin Zhang et.al. 2603.07472 null
2026-03-07 DiffSIM: Unconditional and conditional facies simulation based on denoising diffusion generative models Minghui Xu et.al. 2603.07383 null
2026-03-07 ConfHit: Conformal Generative Design with Oracle Free Guarantees Siddhartha Laghuvarapu et.al. 2603.07371 null
2026-03-10 Latent Generative Models with Tunable Complexity for Compressed Sensing and other Inverse Problems Sean Gunn et.al. 2603.07357 null
2026-03-07 Agentic Planning with Reasoning for Image Styling via Offline RL Subhojyoti Mukherjee et.al. 2603.07148 null
2026-03-07 Resource-Adaptive Federated Text Generation with Differential Privacy Jiayi Wang et.al. 2603.07027 null
2026-03-07 Conditional Unbalanced Optimal Transport Maps: An Outlier-Robust Framework for Conditional Generative Modeling Jiwoo Yoon et.al. 2603.06972 null
2026-03-06 Stability-Guided Exploration for Diverse Motion Generation Eckart Cobo-Briesewitz et.al. 2603.06773 null
2026-03-06 Improved Constrained Generation by Bridging Pretrained Generative Models Xiaoxuan Liang et.al. 2603.06742 null
2026-03-06 From Statistical Fidelity to Clinical Consistency: Scalable Generation and Auditing of Synthetic Patient Trajectories Guanglin Zhou et.al. 2603.06720 null
2026-03-06 Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis Hila Chefer et.al. 2603.06507 null
2026-03-06 Training Flow Matching: The Role of Weighting and Parameterization Anne Gagneux et.al. 2603.06454 null
2026-03-06 Toward Generative Quantum Utility via Correlation-Complexity Map Chen-Yu Liu et.al. 2603.06440 null
2026-03-10 Making Training-Free Diffusion Segmentors Scale with the Generative Power Benyuan Meng et.al. 2603.06178 null
2026-03-06 Longitudinal NSCLC Treatment Progression via Multimodal Generative Models Massimiliano Mantegna et.al. 2603.06147 null
2026-03-06 A Hazard-Informed Data Pipeline for Robotics Physical Safety Alexei Odinokov et.al. 2603.06130 null
2026-03-06 PixARMesh: Autoregressive Mesh-Native Single-View Scene Reconstruction Xiang Zhang et.al. 2603.05888 null
2026-03-06 StreamWise: Serving Multi-Modal Generation in Real-Time at Scale Haoran Qiu et.al. 2603.05800 null
2026-03-06 CBCT-Based Synthetic CT Generation Using Conditional Flow Matching Model Junbo Peng et.al. 2603.05796 null
2026-03-05 EigenData: A Self-Evolving Multi-Agent Platform for Function-Calling Data Synthesis, Auditing, and Repair Jiaao Chen et.al. 2603.05553 null
2026-03-05 Building Enterprise Realtime Voice Agents from Scratch: A Technical Tutorial Jielin Qiu et.al. 2603.05413 null
2026-03-05 Harnessing Synthetic Data from Generative AI for Statistical Inference Ahmad Abdel-Azim et.al. 2603.05396 null
2026-03-05 WavSLM: Single-Stream Speech Language Modeling via WavLM Distillation Luca Della Libera et.al. 2603.05299 null
2026-03-05 How far have we gone in Generative Image Restoration? A study on its capability, limitations and evaluation practices Xiang Yin et.al. 2603.05010 link
2026-03-05 HiFlow: Hierarchical Feedback-Driven Optimization for Constrained Long-Form Text Generation Yifan Zhu et.al. 2603.04996 null
2026-03-05 Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models Sean Lamont et.al. 2603.04893 null
2026-03-04 Semi-Supervised Generative Learning via Latent Space Distribution Matching Kwong Yu Chong et.al. 2603.04223 null
2026-03-05 TumorFlow: Physics-Guided Longitudinal MRI Synthesis of Glioblastoma Growth Valentin Biller et.al. 2603.04058 null
2026-03-04 Towards Generalized Multimodal Homography Estimation Jinkun You et.al. 2603.03956 null
2026-03-04 Harnessing Selective State Space Models to Enhance Semianalytical Design of Fabrication-Ready Multilayered Huygens’ Metasurfaces: Part II - Generative Inverse Design (MetaMamba) Natanel Nissan et.al. 2603.03877 null
2026-03-04 Relational In-Context Learning via Synthetic Pre-training with Structural Prior Yanbo Wang et.al. 2603.03805 null
2026-03-04 JANUS: Structured Bidirectional Generation for Guaranteed Constraints and Analytical Uncertainty Taha Racicot et.al. 2603.03748 null
2026-03-03 PRIVATEEDIT: A Privacy-Preserving Pipeline for Face-Centric Generative Image Editing Dipesh Tamboli et.al. 2603.03412 null
2026-03-03 Infinite dimensional generative sensing Paolo Angella et.al. 2603.03196 null
2026-03-04 QFlowNet: Fast, Diverse, and Efficient Unitary Synthesis with Generative Flow Networks Inhoe Koo et.al. 2603.03045 null
2026-03-03 Integrating Homomorphic Encryption and Synthetic Data in FL for Privacy and Learning Quality Yenan Wang et.al. 2603.02969 null
2026-03-03 On Discriminative vs. Generative classifiers: Rethinking MLLMs for Action Understanding Zhanzhong Pang et.al. 2603.02546 null
2026-03-02 A Directed Graph Model and Experimental Framework for Design and Study of Time-Dependent Text Visualisation Songhai Fan et.al. 2603.02422 null
2026-03-02 RO-N3WS: Enhancing Generalization in Low-Resource ASR with Diverse Romanian Speech Benchmarks Alexandra Diaconu et.al. 2603.02368 null
2026-03-02 CausalWrap: Model-Agnostic Causal Constraint Wrappers for Tabular Synthetic Data Amir Asiaee et.al. 2603.02015 null
2026-03-02 Noise-Calibrated Inference from Differentially Private Sufficient Statistics in Exponential Families Amir Asiaee et.al. 2603.02010 null
2026-03-02 CoVAE: correlated multimodal generative modeling Federico Caretti et.al. 2603.01965 null
2026-03-02 Phase-Type Variational Autoencoders for Heavy-Tailed Data Abdelhakim Ziani et.al. 2603.01800 null
2026-03-02 A Diffusion-Driven Fine-Grained Nodule Synthesis Framework for Enhanced Lung Nodule Detection from Chest Radiographs Aryan Goyal et.al. 2603.01659 null
2026-03-02 Transform-Invariant Generative Ray Path Sampling for Efficient Radio Propagation Modeling Jérome Eertmans et.al. 2603.01655 null
2026-03-02 RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry Xinchang Wang et.al. 2603.01544 null
2026-03-02 PhysFormer: A Physics-Embedded Generative Model for Physically Self-Consistent Spectral Synthesis Siqi Wang et.al. 2603.01459 null
2026-03-02 Autoregressive Synthesis of Sparse and Semi-Structured Mixed-Type Data Thomas Rückstieß et.al. 2603.01444 null
2026-03-02 LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval Jiajie Jin et.al. 2603.01425 null
2026-03-01 Velocity Model Building and Editing with Guided Denoising Diffusion Implicit Models Francesco Brandolin et.al. 2603.01231 null
2026-03-01 Generative AI & Fictionality: How Novels Power Large Language Models Edwin Roland et.al. 2603.01220 null
2026-02-28 Constitutional Black-Box Monitoring for Scheming in LLM Agents Simon Storf et.al. 2603.00829 null
2026-02-28 Designing the Haystack: Programmable Chemical Space for Generative Molecular Discovery Yuchen Zhu et.al. 2603.00614 null
2026-02-28 SesaHand: Enhancing 3D Hand Reconstruction via Controllable Generation with Semantic and Structural Alignment Zhuoran Zhao et.al. 2603.00443 null
2026-02-28 Mamba-CAD: State Space Model For 3D Computer-Aided Design Generative Modeling Xueyang Li et.al. 2603.00439 null
2026-02-27 SKeDA: A Generative Watermarking Framework for Text-to-video Diffusion Models Yang Yang et.al. 2603.00194 null
2026-02-27 TradeFM: A Generative Foundation Model for Trade-flow and Market Microstructure Maxime Kawawa-Beaudan et.al. 2602.23784 null
2026-02-27 DashengTokenizer: One layer is enough for unified audio understanding and generation Heinrich Dinkel et.al. 2602.23765 null
2026-02-27 MMKG-RDS: Reasoning Data Synthesis via Deep Mining of Multimodal Knowledge Graphs Lun Zhan et.al. 2602.23632 null
2026-02-27 Synthetic Data Powers Product Retrieval for Long-tail Knowledge-Intensive Queries in E-commerce Search Gui Ling et.al. 2602.23620 null
2026-02-27 Flowette: Flow Matching with Graphette Priors for Graph Generation Asiri Wijesinghe et.al. 2602.23566 null
2026-03-02 Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos Ziqi Gao et.al. 2602.23543 null
2026-02-26 Uncovering Physical Drivers of Dark Matter Halo Structures with Auxiliary-Variable-Guided Generative Models Arkaprabha Ganguli et.al. 2602.23518 null
2026-02-26 SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation Vaibhav Agrawal et.al. 2602.23359 null
2026-02-26 SemanticVocoder: Bridging Audio Generation and Audio Understanding via Semantic Latents Zeyu Xie et.al. 2602.23333 null
2026-02-26 Data-Efficient Generative Modeling of Non-Gaussian Global Climate Fields via Scalable Composite Transformations Johannes Brachem et.al. 2602.23311 null
2026-02-26 Efficient training of generative models from multireference simulations and its application to the design of Dy complexes with large magnetic anisotropy Zahra Khatibi et.al. 2602.23230 null
2026-02-26 Q-Tag: Watermarking Quantum Circuit Generative Models Yang Yang et.al. 2602.23085 null
2026-02-28 Deepfake Word Detection by Next-token Prediction using Fine-tuned Whisper Hoan My Tran et.al. 2602.22658 null
2026-02-26 CRAG: Can 3D Generative Models Help 3D Assembly? Zeyu Jiang et.al. 2602.22629 null
2026-02-26 BetterScene: 3D Scene Synthesis with Representation-Aligned Generative Model Yuci Han et.al. 2602.22596 null
2026-02-26 Where Relevance Emerges: A Layer-Wise Study of Internal Attention for Zero-Shot Re-Ranking Haodong Chen et.al. 2602.22591 null
2026-02-25 Bayesian Generative Adversarial Networks via Gaussian Approximation for Tabular Data Synthesis Bahrul Ilmi Nasution et.al. 2602.21948 null
2026-02-25 Joint Shadow Generation and Relighting via Light-Geometry Interaction Maps Shan Wang et.al. 2602.21820 null
2026-02-26 SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model Guibin Chen et.al. 2602.21818 null
2026-02-25 Inverse prediction of capacitor multiphysics dynamic parameters using deep generative model Kart-Leong Lim et.al. 2602.21606 null
2026-02-27 Provably Safe Generative Sampling with Constricting Barrier Functions Darshan Gadginmath et.al. 2602.21429 null
2026-02-24 Archetypal Graph Generative Models: Explainable and Identifiable Communities via Anchor-Dominant Convex Hulls Nikolaos Nakis et.al. 2602.21342 null
2026-02-24 SOM-VQ: Topology-Aware Tokenization for Interactive Generative Models Alessandro Londei et.al. 2602.21133 null
2026-02-25 Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models Christian Simon et.al. 2602.20981 null
2026-02-24 See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis Jaehyun Park et.al. 2602.20951 null
2026-02-24 BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity Juil Koo et.al. 2602.20666 null
2026-02-24 CAD-Prompted SAM3: Geometry-Conditioned Instance Segmentation for Industrial Objects Zhenran Tang et.al. 2602.20551 null
2026-02-23 gQIR: Generative Quanta Image Reconstruction Aryan Garg et.al. 2602.20417 null
2026-02-23 CaDrift: A Time-dependent Causal Generator of Drifting Data Streams Eduardo V. L. Barboza et.al. 2602.20329 null
2026-02-27 Discrete Diffusion with Sample-Efficient Estimators for Conditionals Karthik Elamvazhuthi et.al. 2602.20293 null
2026-02-22 OrgFlow: Generative Modeling of Organic Crystal Structures from Molecular Graphs Mohammadmahdi Vahediahmar et.al. 2602.20195 null
2026-02-22 FedAvg-Based CTMC Hazard Model for Federated Bridge Deterioration Assessment Takato Yasuno et.al. 2602.20194 null
2026-02-23 ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models Andre He et.al. 2602.20117 null
2026-02-25 Training-Free Generative Modeling via Kernelized Stochastic Interpolants Florentin Coeurdoux et.al. 2602.20070 null
2026-02-23 Schrödinger bridges with jumps for time series generation Stefano De Marco et.al. 2602.20011 null
2026-02-23 RL-RIG: A Generative Spatial Reasoner via Intrinsic Reflection Tianyu Wang et.al. 2602.19974 null
2026-02-23 Make Some Noise: Unsupervised Remote Sensing Change Detection Using Latent Space Perturbations Blaž Rolih et.al. 2602.19881 null
2026-02-27 Multimodal Dataset Distillation Made Simple by Prototype-Guided Data Synthesis Junhyeok Choi et.al. 2602.19756 null
2026-02-23 Hardware-Accelerated Geometrical Simulation of Biological and Engineered In-Air Ultrasonic Systems Wouter Jansen et.al. 2602.19652 null
2026-02-23 Manifold-Aligned Generative Transport Xinyu Tian et.al. 2602.19600 null
2026-02-26 DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces Li Zhang et.al. 2602.19565 null
2026-02-23 Agentic AI as a Cybersecurity Attack Surface: Threats, Exploits, and Defenses in Runtime Supply Chains Xiaochong Jiang et.al. 2602.19555 null
2026-02-23 Laplacian Multi-scale Flow Matching for Generative Modeling Zelin Zhao et.al. 2602.19461 null
2026-02-22 IDLM: Inverse-distilled Diffusion Language Models David Li et.al. 2602.19066 null
2026-02-22 A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse Vibhas Kumar Vats et.al. 2602.19033 null
2026-02-21 DeepInterestGR: Mining Deep Multi-Interest Using Multi-Modal LLMs for Generative Recommendation Yangchen Zeng et.al. 2602.18907 null
2026-02-21 IDperturb: Enhancing Variation in Synthetic Face Generation via Angular Perturbation Fadi Boutros et.al. 2602.18831 null
2026-02-21 RadioGen3D: 3D Radio Map Generation via Adversarial Learning on Large-Scale Synthetic Data Junshen Chen et.al. 2602.18744 null
2026-02-21 RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning Seungku Kim et.al. 2602.18742 null
2026-02-20 DP-RFT: Learning to Generate Synthetic Text via Differentially Private Reinforcement Fine-Tuning Fangyuan Xu et.al. 2602.18633 null
2026-02-20 Generative Model via Quantile Assignment Georgi Hrusanov et.al. 2602.18216 null
2026-02-19 Market Games for Generative Models: Equilibria, Welfare, and Strategic Entry Xiukun Wei et.al. 2602.17787 null
2026-02-24 A Theoretical Framework for Modular Learning of Robust Generative Models Corinna Cortes et.al. 2602.17554 null
2026-02-19 QuPAINT: Physics-Aware Instruction Tuning Approach to Quantum Material Discovery Xuan-Bac Nguyen et.al. 2602.17478 null
2026-02-19 From Labor to Collaboration: A Methodological Experiment Using AI Agents to Augment Research Perspectives in Taiwan’s Humanities and Social Sciences Yi-Chih Huang et.al. 2602.17221 null
2026-02-19 HybridPrompt: Bridging Generative Priors and Traditional Codecs for Mobile Streaming Liming Liu et.al. 2602.17120 null
2026-02-19 Epistemology of Generative AI: The Geometry of Knowing Ilya Levin et.al. 2602.17116 null
2026-02-19 Synergizing Transport-Based Generative Models and Latent Geometry for Stochastic Closure Modeling Xinghao Dong et.al. 2602.17089 null
2026-02-19 Generative modeling for the bootstrap Leon Tran et.al. 2602.17052 null
2026-02-18 Synthetic-Powered Multiple Testing with FDR Control Yonghoon Lee et.al. 2602.16690 null
2026-02-19 Style-Aware Gloss Control for Generative Non-Photorealistic Rendering Santiago Jimenez-Navarro et.al. 2602.16611 null
2026-02-18 GICDM: Mitigating Hubness for Reliable Distance-Based Generative Model Evaluation Nicolas Salvy et.al. 2602.16449 null
2026-02-17 Can Generative Artificial Intelligence Survive Data Contamination? Theoretical Guarantees under Contaminated Recursive Training Kevin Wang et.al. 2602.16065 null
2026-02-17 VideoSketcher: Video Models Prior Enable Versatile Sequential Sketch Generation Hui Ren et.al. 2602.15819 null
2026-02-17 Developing AI Agents with Simulated Data: Why, what, and how? Xiaoran Liu et.al. 2602.15816 null
2026-02-19 A Generative-First Neural Audio Autoencoder Jonah Casebeer et.al. 2602.15749 null
2026-02-17 LLM-to-Speech: A Synthetic Data Pipeline for Training Dialectal Text-to-Speech Models Ahmed Khaled Khamis et.al. 2602.15675 null
2026-02-17 Molecular Design beyond Training Data with Novel Extended Objective Functionals of Generative AI Models Driven by Quantum Annealing Computer Hayato Kunugi et.al. 2602.15451 null
2026-02-17 Efficient Generative Modeling beyond Memoryless Diffusion via Adjoint Schrödinger Bridge Matching Jeongwoo Shin et.al. 2602.15396 null
2026-02-17 Making Large Language Models Speak Tulu: Structured Prompting for an Extremely Low-Resource Language Prathamesh Devadiga et.al. 2602.15378 null
2026-02-17 GMAIL: Generative Modality Alignment for generated Image Learning Shentong Mo et.al. 2602.15368 null
2026-02-20 Non-Stationary Covariance Functions for Spatial Data on Linear Networks Alfredo Alegría et.al. 2602.15328 null
2026-02-17 Enhancing Diversity and Feasibility: Joint Population Synthesis from Multi-source Data Using Generative Models Farbod Abbasi et.al. 2602.15270 null
2026-02-16 Exposing Diversity Bias in Deep Generative Models: Statistical Origins and Correction of Diversity Error Farzan Farnia et.al. 2602.14682 null
2026-02-15 BitDance: Scaling Autoregressive Generative Models with Binary Tokens Yuang Ai et.al. 2602.14041 null
2026-02-14 GSRM: Generative Speech Reward Model for Speech RLHF Maohao Shen et.al. 2602.13891 null
2026-02-14 Generative Latent Representations of 3D Brain MRI for Multi-Task Downstream Analysis in Down Syndrome Jordi Malé et.al. 2602.13731 null
2026-02-14 A WDLoRA-Based Multimodal Generative Framework for Clinically Guided Corneal Confocal Microscopy Image Synthesis in Diabetic Neuropathy Xin Zhang et.al. 2602.13693 null
2026-02-10 Situation Graph Prediction: Structured Perspective Inference for User Modeling Jisung Shin et.al. 2602.13319 null
2026-02-13 A Calibrated Memorization Index (MI) for Detecting Training Data Leakage in Generative MRI Models Yash Deo et.al. 2602.13066 null
2026-02-13 QTabGAN: A Hybrid Quantum-Classical GAN for Tabular Data Synthesis Subhangi Kumari et.al. 2602.12704 null
2026-02-13 Formalizing the Sampling Design Space of Diffusion-Based Generative Models via Adaptive Solvers and Wasserstein-Bounded Timesteps Sangwoo Jo et.al. 2602.12624 null
2026-02-13 Generative Site-Specific Beamforming via Information-Maximizing Codebook Cheng-Jie Zhao et.al. 2602.12552 null
2026-02-12 Synthetic Interaction Data for Scalable Personalization in Large Language Models Yuchen Ma et.al. 2602.12394 null
2026-02-12 Synthetic Image Detection with CLIP: Understanding and Assessing Predictive Cues Marco Willi et.al. 2602.12381 null
2026-02-13 T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization Tunyu Zhang et.al. 2602.12262 null
2026-02-16 “Sorry, I Didn’t Catch That”: How Speech Models Miss What Matters Most Kaitlyn Zhou et.al. 2602.12249 null
2026-02-12 Pedagogically-Inspired Data Synthesis for Language Model Knowledge Distillation Bowei He et.al. 2602.12172 null
2026-02-12 Affordance-Graphed Task Worlds: Self-Evolving Task Generation for Scalable Embodied Learning Xiang Liu et.al. 2602.12065 null
2026-02-15 VLAW: Iterative Co-Improvement of Vision-Language-Action Policy and World Model Yanjiang Guo et.al. 2602.12063 null
2026-02-12 Fourier Transformers for Latent Crystallographic Diffusion and Generative Modeling Jed A. Duersch et.al. 2602.12045 null
2026-02-13 When Should LLMs Be Less Specific? Selective Abstraction for Reliable Long-Form Text Generation Shani Goren et.al. 2602.11908 null
2026-02-12 How to Sample High Quality 3D Fractals for Action Recognition Pre-Training? Marko Putak et.al. 2602.11810 null
2026-02-12 RELATE: A Reinforcement Learning-Enhanced LLM Framework for Advertising Text Generation Jinfang Wang et.al. 2602.11780 link
2026-02-13 Bizarre Love Triangle: Generative AI, Art, and Kitsch Dejan Grba et.al. 2602.11353 null
2026-02-11 TabICLv2: A better, faster, scalable, and open tabular foundation model Jingang Qu et.al. 2602.11139 null
2026-02-11 Beyond Confidence: The Rhythms of Reasoning in Generative Models Deyuan Liu et.al. 2602.10816 null
2026-02-11 A Diffusion-Based Generative Prior Approach to Sparse-view Computed Tomography Davide Evangelista et.al. 2602.10722 null
2026-02-11 Evaluation metrics for temporal preservation in synthetic longitudinal patient data Katariina Perkonoja et.al. 2602.10643 null
2026-02-11 Generative clinical time series models trained on moderate amounts of patient data are privacy preserving Rustam Zhumagambetov et.al. 2602.10631 null
2026-02-11 Flow of Spans: Generalizing Language Models to Dynamic Span-Vocabulary via GFlowNets Bo Xue et.al. 2602.10583 null
2026-02-11 From Collapse to Improvement: Statistical Perspectives on the Evolutionary Dynamics of Iterative Training on Contaminated Sources Soham Bakshi et.al. 2602.10531 null
2026-02-12 Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs Zhongzhi Li et.al. 2602.10388 null
2026-02-10 Temper-Then-Tilt: Principled Unlearning for Generative Models through Tempering and Classifier Guidance Jacob L. Block et.al. 2602.10217 null
2026-02-10 Anatomy-Preserving Latent Diffusion for Generation of Brain Segmentation Masks with Ischemic Infarct Lucia Borrego et.al. 2602.10167 null
2026-02-10 CAPID: Context-Aware PII Detection for Question-Answering Systems Mariia Ponomarenko et.al. 2602.10074 null
2026-02-10 Preventing Barren Plateaus in Continuous Quantum Generative Models Olli Hirviniemi et.al. 2602.10049 null
2026-02-11 Monocular Normal Estimation via Shading Sequence Estimation Zongrui Li et.al. 2602.09929 null
2026-02-10 AmharicIR+Instr: A Two-Dataset Resource for Neural Retrieval and Instruction Tuning Tilahun Yeshambel et.al. 2602.09914 null
2026-02-10 Efficient Unsupervised Environment Design through Hierarchical Policy Representation Learning Dexun Li et.al. 2602.09813 null
2026-02-10 Allure of Craquelure: A Variational-Generative Approach to Crack Detection in Paintings Laura Paul et.al. 2602.09730 null
2026-02-10 Why the Counterintuitive Phenomenon of Likelihood Rarely Appears in Tabular Anomaly Detection with Deep Generative Models? Donghwan Kim et.al. 2602.09593 null
2026-02-10 MieDB-100k: A Comprehensive Dataset for Medical Image Editing Yongfan Lai et.al. 2602.09587 null
2026-02-10 Smaller is Better: Generative Models Can Power Short Video Preloading Liming Liu et.al. 2602.09484 null
2026-02-10 The Wisdom of Many Queries: Complexity-Diversity Principle for Dense Retriever Training Xincan Feng et.al. 2602.09448 null
2026-02-10 AgentSkiller: Scaling Generalist Agent Intelligence through Semantically Integrated Cross-Domain Data Synthesis Zexu Sun et.al. 2602.09372 null
2026-02-10 How Far Can You Grow? Characterizing the Extrapolation Frontier of Graph Generative Models for Materials Science Can Polat et.al. 2602.09309 null
2026-02-10 Measuring Privacy Risks and Tradeoffs in Financial Synthetic Data Generation Michael Zuo et.al. 2602.09288 null
2026-02-09 RAPID: Risk of Attribute Prediction-Induced Disclosure in Synthetic Microdata Matthias Templ et.al. 2602.09235 null
2026-02-09 What do Geometric Hallucination Detection Metrics Actually Measure? Eric Yeats et.al. 2602.09158 null
2026-02-09 Distributionally Robust Optimization via Generative Ambiguity Modeling Jiaqi Wen et.al. 2602.08976 null
2026-02-09 How University Disability Services Professionals Write Image Descriptions for HCI Figures Using Generative AI Muhammad Raees et.al. 2602.08937 null
2026-02-10 MOVA: Towards Scalable and Synchronized Video-Audio Generation SII-OpenMOSS Team et.al. 2602.08794 null
2026-02-09 Equalized Generative Treatment: Matching f-divergences for Fairness in Generative Models Alexandre Verine et.al. 2602.08660 null
2026-02-09 Projected Gradient Ascent for Efficient Reward-Guided Updates with One-Step Generative Models Jisung Hwang et.al. 2602.08646 null
2026-02-09 Modeling Protein Evolution via Generative Inference From Monte Carlo Chains to Population Genetics Leonardo Di Bari et.al. 2602.08641 null
2026-02-09 Inspiration Seeds: Learning Non-Literal Visual Combinations for Generative Exploration Kfir Goldberg et.al. 2602.08615 null
2026-02-09 CoTZero: Annotation-Free Human-Like Vision Reasoning via Hierarchical Synthetic CoT Chengyi Du et.al. 2602.08339 null
2026-02-09 An Attention-over-Attention Generative Model for Joint Multiple Intent Detection and Slot Filling Wei Zhu et.al. 2602.08322 null
2026-02-09 Cyclic Adaptive Private Synthesis for Sharing Real-World Data in Education Hibiki Ito et.al. 2602.08299 null
2026-02-09 Nansde-net: A neural sde framework for generating time series with memory Hiromu Ozai et.al. 2602.08182 null
2026-02-08 Cross-Linguistic Persona-Driven Data Synthesis for Robust Multimodal Cognitive Decline Detection Rui Feng et.al. 2602.07978 null
2026-02-08 Time Series Reasoning via Process-Verifiable Thinking Data Synthesis and Scheduling for Tailored LLM Reasoning Jiahui Zhou et.al. 2602.07830 null
2026-02-07 Automated rock joint trace mapping using a supervised learning model trained on synthetic data generated by parametric modelling Jessica Ka Yi Chiu et.al. 2602.07590 null
2026-02-07 Capturing the Topological Phase Transition and Thermodynamics of the 2D XY Model via Manifold-Aware Score-Based Generative Modeling Pratyush Jha et.al. 2602.07548 null
2026-02-06 VideoNeuMat: Neural Material Extraction from Generative Video Models Bowen Xue et.al. 2602.07272 null
2026-02-06 Discrete Adjoint Matching Oswin So et.al. 2602.07132 null
2026-02-06 Finding Connections: Membership Inference Attacks for the Multi-Table Synthetic Data Setting Joshua Ward et.al. 2602.07126 null
2026-02-05 MRI Cross-Modal Synthesis: A Comparative Study of Generative Models for T1-to-T2 Reconstruction Ali Alqutayfi et.al. 2602.07068 null
2026-02-06 Learning a Generative Meta-Model of LLM Activations Grace Luo et.al. 2602.06964 null
2026-02-09 Improved Sampling Schedules for Discrete Diffusion Models Alberto Foresti et.al. 2602.06849 null
2026-02-06 RAIGen: Rare Attribute Identification in Text-to-Image Generative Models Silpa Vadakkeeveetil Sreelatha et.al. 2602.06806 null
2026-02-06 Force Generative Imitation Learning: Bridging Position Trajectory and Force Commands through Control Technique Hiroshi Sato et.al. 2602.06620 null
2026-02-06 Generating High-quality Privacy-preserving Synthetic Data David Yavo et.al. 2602.06390 null
2026-02-06 Misophonia Trigger Sound Detection on Synthetic Soundscapes Using a Hybrid Model with a Frozen Pre-Trained CNN and a Time-Series Module Kurumi Sashida et.al. 2602.06271 null
2026-02-05 From Blurry to Believable: Enhancing Low-quality Talking Heads with 3D Generative Priors Ding-Jiun Huang et.al. 2602.06122 null
2026-02-05 Discrete diffusion samplers and bridges: Off-policy algorithms and applications in latent spaces Arran Carter et.al. 2602.05961 null
2026-02-05 Verification of the Implicit World Model in a Generative Model via Adversarial Sequences András Balogh et.al. 2602.05903 null
2026-02-05 FHAIM: Fully Homomorphic AIM For Private Synthetic Data Generation Mayank Kumar et.al. 2602.05838 null
2026-02-05 Synthesizing Realistic Test Data without Breaking Privacy Laura Plein et.al. 2602.05833 null
2026-02-05 Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech Generation from SSL features Hien Ohnaka et.al. 2602.05443 null
2026-02-05 Synthetic Defect Geometries of Cast Metal Objects Modeled via 2d Voronoi Tessellations Natascha Jeziorski et.al. 2602.05440 null
2026-02-05 GAS: Enhancing Reward-Cost Balance of Generative Model-assisted Offline Safe RL Zifan Liu et.al. 2602.05323 null
2026-02-05 GT-SVJ: Generative-Transformer-Based Self-Supervised Video Judge For Efficient Video Reward Modeling Shivanshu Shekhar et.al. 2602.05202 null
2026-02-04 Data Kernel Perspective Space Performance Guarantees for Synthetic Data from Transformer Models Michael Browder et.al. 2602.05106 null
2026-02-04 Private PoEtry: Private In-Context Learning via Product of Experts Rob Romijnders et.al. 2602.05012 null
2026-02-03 Privacy Amplification Persists under Unlimited Synthetic Data Release Clément Pierquin et.al. 2602.04895 null
2026-02-06 Generative Modeling via Drifting Mingyang Deng et.al. 2602.04770 null
2026-02-04 Audio ControlNet for Fine-Grained Audio Generation and Editing Haina Zhu et.al. 2602.04680 null
2026-02-04 PFluxTTS: Hybrid Flow-Matching TTS with Robust Cross-Lingual Voice Cloning and Inference-Time Model Fusion Vikentii Pankov et.al. 2602.04160 null
2026-02-06 PromptSplit: Revealing Prompt-Level Disagreement in Generative Models Mehdi Lotfian et.al. 2602.04009 null
2026-02-03 pop-cosmos: Forward modeling KiDS-1000 redshift distributions using realistic galaxy populations Boris Leistedt et.al. 2602.03935 null
2026-02-03 HY3D-Bench: Generation of 3D Assets Team Hunyuan3D et.al. 2602.03907 null
2026-02-03 DiffLOB: Diffusion Models for Counterfactual Generation in Limit Order Books Zhuohan Wang et.al. 2602.03776 null
2026-02-03 Efficient Variance-reduced Estimation from Generative EHR Models: The SCOPE and REACH Estimators Luke Solo et.al. 2602.03730 null
2026-02-03 CTTVAE: Latent Space Structuring for Conditional Tabular Data Generation on Imbalanced Datasets Milosh Devic et.al. 2602.03641 null
2026-02-03 Generator-based Graph Generation via Heat Diffusion Anthony Stephenson et.al. 2602.03612 null
2026-02-03 Riemannian Neural Optimal Transport Alessandro Micheli et.al. 2602.03566 null
2026-02-03 R1-SyntheticVL: Is Synthetic Data from Generative Models Ready for Multimodal Large Language Model? Jingyi Zhang et.al. 2602.03300 null
2026-02-03 Beyond Quantity: Trajectory Diversity Scaling for Code Agents Guhong Chen et.al. 2602.03219 null
2026-02-03 Consensus Group Relative Policy Optimization for Text Generation Yuki Ichihara et.al. 2602.03102 null
2026-02-03 Distance Marching for Generative Modeling Zimo Wang et.al. 2602.02928 null
2026-02-02 Beyond Content: Behavioral Policies Reveal Actors in Information Operations Philipp J. Schneider et.al. 2602.02838 null
2026-02-04 daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently Mohan Jiang et.al. 2602.02619 null
2026-02-01 VividVoice: A Unified Framework for Scene-Aware Visually-Driven Speech Synthesis Chengyuan Ma et.al. 2602.02591 null
2026-02-02 MIRROR: Manifold Ideal Reference ReconstructOR for Generalizable AI-Generated Image Detection Ruiqi Liu et.al. 2602.02222 null
2026-02-02 The Verification Crisis: Expert Perceptions of GenAI Disinformation and the Case for Reproducible Provenance Alexander Loth et.al. 2602.02100 null
2026-02-02 Logic-Guided Vector Fields for Constrained Generative Modeling Ali Baheri et.al. 2602.02009 null
2026-02-02 Synesthesia of Vehicles: Tactile Data Synthesis from Visual Inputs Rui Wang et.al. 2602.01832 null
2026-02-02 Reconstruction of instantaneous flow fields from transient velocity snapshots using physics-informed neural networks: Applications to pulsatile blood flow behind a stenosis Kakeru Ueda et.al. 2602.01542 null
2026-02-01 Addressing Explainability of Generative AI using SMILE (Statistical Model-agnostic Interpretability with Local Explanations) Zeinab Dehghani et.al. 2602.01206 null
2026-01-31 Factuality on Demand: Controlling the Factuality-Informativeness Trade-off in Text Generation Ziwei Gong et.al. 2602.00848 null
2026-01-31 Scalable Generative Game Engine: Breaking the Resolution Wall via Hardware-Algorithm Co-Design Wei Zeng et.al. 2602.00608 null
2026-01-31 RVCBench: Benchmarking the Robustness of Voice Cloning Across Modern Audio Generation Models Xinting Liao et.al. 2602.00443 null
2026-01-31 Toward Autonomous Laboratory Safety Monitoring with Vision Language Models: Learning to See Hazards Through Scene Structure Trishna Chakraborty et.al. 2602.00414 null
2026-01-30 Planning with Language and Generative Models: Toward General Reward-Guided Wireless Network Design Chenyang Yuan et.al. 2602.00357 null
2026-01-30 Reducing Memorisation in Generative Models via Riemannian Bayesian Inference Johanna Marie Gegenfurtner et.al. 2602.00199 null
2026-01-30 How well do generative models solve inverse problems? A benchmark study Patrick Krüger et.al. 2601.23238 null
2026-01-30 JobResQA: A Benchmark for LLM Machine Reading Comprehension on Multilingual Résumés and JDs Casimiro Pio Carrino et.al. 2601.23183 null
2026-01-30 Behemoth: Benchmarking Unlearning in LLMs Using Fully Synthetic Data Eugenia Iofinova et.al. 2601.23153 null
2026-01-30 Manifold-Aware Perturbations for Constrained Generative Modeling Katherine Keegan et.al. 2601.23151 null
2026-01-30 ExplainerPFN: Towards tabular foundation models for model-free zero-shot feature importance estimations Joao Fonseca et.al. 2601.23068 null
2026-01-30 MoVE: Mixture of Value Embeddings – A New Axis for Scaling Parametric Memory in Autoregressive Models Yangyan Li et.al. 2601.22887 null
2026-01-30 Generative and Nonparametric Approaches for Conditional Distribution Estimation: Methods, Perspectives, and Comparative Evaluations Yen-Shiu Chin et.al. 2601.22650 null
2026-01-30 Beyond Medical Chatbots: Meddollina and the Rise of Continuous Clinical Intelligence Vaibhav Ram S. V. N. S et.al. 2601.22645 null
2026-01-30 VocBulwark: Towards Practical Generative Speech Watermarking via Additional-Parameter Injection Weizhi Liu et.al. 2601.22556 null
2026-01-30 Towards the Holographic Characteristic of LLMs for Efficient Short-text Generation Shun Qian et.al. 2601.22546 null
2026-01-30 DNA: Uncovering Universal Latent Forgery Knowledge Jingtong Dou et.al. 2601.22515 null
2026-01-30 ScribbleSense: Generative Scribble-Based Texture Editing with Intent Prediction Yudi Zhang et.al. 2601.22455 null
2026-01-30 Rethinking Anonymity Claims in Synthetic Data Generation: A Model-Centric Privacy Attack Perspective Georgi Ganev et.al. 2601.22434 null
2026-01-29 Conformal Prediction for Generative Models via Adaptive Cluster-Based Density Estimation Qidong Yang et.al. 2601.22298 null
2026-01-29 Investigating Associational Biases in Inter-Model Communication of Large Generative Models Fethiye Irmak Dogan et.al. 2601.22093 null
2026-01-29 Holographic generative flows with AdS/CFT Ehsan Mirafzali et.al. 2601.22033 null
2026-01-29 The Ensemble Inverse Problem: Applications and Methods Zhengyan Huan et.al. 2601.22029 null
2026-01-30 From Tokens to Blocks: A Block-Diffusion Perspective on Molecular Generation Qianwei Yang et.al. 2601.21964 null
2026-01-29 From Generative Modeling to Clinical Classification: A GPT-Based Architecture for EHR Notes Fariba Afrin Irany et.al. 2601.21955 null
2026-01-29 On Forgetting and Stability of Score-based Generative models Stanislas Strasman et.al. 2601.21868 null
2026-01-29 Generative Modeling of Discrete Data Using Geometric Latent Subspaces Daniel Gonzalez-Alvarado et.al. 2601.21831 null
2026-01-29 DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning Mingshuang Luo et.al. 2601.21716 null
2026-01-30 SmartMeterFM: Unifying Smart Meter Data Generative Tasks Using Flow Matching Models Nan Lin et.al. 2601.21706 null
2026-01-30 Bi-Anchor Interpolation Solver for Accelerating Generative Modeling Hongxu Chen et.al. 2601.21542 null
2026-01-29 HERS: Hidden-Pattern Expert Learning for Risk-Specific Vehicle Damage Adaptation in Diffusion Models Teerapong Panboonyuen et.al. 2601.21517 null
2026-01-29 Nimbus: A Unified Embodied Synthetic Data Generation Framework Zeyu He et.al. 2601.21449 null
2026-01-29 SemanticAudio: Audio Generation and Editing in Semantic Space Zheqi Dai et.al. 2601.21402 null
2026-01-29 Understanding Frechet Speech Distance for Synthetic Speech Quality Evaluation June-Woo Kim et.al. 2601.21386 null
2026-01-29 Conditional Generative Framework with Peak-Aware Attention for Robust Chemical Detection under Interferences Namkyung Yoon et.al. 2601.21246 null
2026-01-29 Rethinking Refinement: Correcting Generative Bias without Noise Injection Xin Peng et.al. 2601.21182 null
2026-01-29 WheelArm-Sim: A Manipulation and Navigation Combined Multimodal Synthetic Data Generation Simulator for Unified Control in Assistive Robotics Guangping Liu et.al. 2601.21129 null
2026-01-28 MapPFN: Learning Causal Perturbation Maps in Context Marvin Sextro et.al. 2601.21092 null
2026-01-28 Accelerated Inorganic Electrides Discovery by Generative Models and Hierarchical Screening Shuo Tao et.al. 2601.21077 null
2026-01-28 Signal from Structure: Exploiting Submodular Upper Bounds in Generative Flow Networks Alexandre Larouche et.al. 2601.21061 null
2026-01-28 Privatization of Synthetic Gaze: Attenuating State Signatures in Diffusion-Generated Eye Movements Kamrul Hasan et.al. 2601.21057 null
2026-01-28 A Diffusive Classification Loss for Learning Energy-based Generative Models Louis Grenioux et.al. 2601.21025 null
2026-01-28 Low performing pixel correction in computed tomography with unrolled network and synthetic data training Hongxu Yang et.al. 2601.20995 null
2026-01-28 Gen-SER: When the generative model meets speech emotion recognition Taihui Wang et.al. 2601.20573 null
2026-01-28 Audio Deepfake Detection in the Age of Advanced Text-to-Speech models Robin Singh et.al. 2601.20510 null
2026-01-28 StormDiT: A generative AI model bridges the 2-6 hour ‘gray zone’ in precipitation nowcasting Haofei Sun et.al. 2601.20342 null
2026-01-28 BLENDER: Blended Text Embeddings and Diffusion Residuals for Intra-Class Image Synthesis in Deep Metric Learning Jan Niklas Kolf et.al. 2601.20246 null
2026-01-28 Quantum statistics from classical simulations via generative Gibbs sampling Weizhou Wang et.al. 2601.20228 null
2026-01-28 Parametric and Generative Forecasts of Day-Ahead Market Curves for Storage Optimization Julian Gutierrez et.al. 2601.20226 null
2026-01-27 GenCP: Towards Generative Modeling Paradigm of Coupled Physics Tianrun Gao et.al. 2601.19541 null
2026-01-27 Cortex-Grounded Diffusion Models for Brain Image Generation Fabian Bongratz et.al. 2601.19498 null
2026-01-27 Cross-Examination Framework: A Task-Agnostic Diagnostic for Information Fidelity in Text-to-Text Generation Tathagata Raha et.al. 2601.19350 null
2026-01-27 Handcrafted Feature Fusion for Reliable Detection of AI-Generated Images Syed Mehedi Hasan Nirob et.al. 2601.19262 null
2026-01-27 E-QRGMM: Efficient Generative Metamodeling for Covariate-Dependent Uncertainty Quantification Zhiyang Liang et.al. 2601.19256 null
2026-01-27 EnzyPGM: Pocket-conditioned Generative Model for Substrate-specific Enzyme Design Zefeng Lin et.al. 2601.19205 null
2026-01-27 A Hybrid Discriminative and Generative System for Universal Speech Enhancement Yinghao Liu et.al. 2601.19113 null
2026-01-27 Proactive Hardening of LLM Defenses with HASTE Henry Chen et.al. 2601.19051 null
2026-01-26 Advances in Diffusion-Based Generative Compression Yibo Yang et.al. 2601.18932 null
2026-01-26 OptiGAN for Crystal Arrays: Physics-Informed Generative Modeling of Optical Photon Transport in PET Detector Arrays Stephan Naunheim et.al. 2601.18780 null
2026-01-26 Riemannian AmbientFlow: Towards Simultaneous Manifold Learning and Generative Modeling from Corrupted Data Willem Diepeveen et.al. 2601.18728 null
2026-01-26 Conditioned Generative Modeling of Molecular Glues: A Realistic AI Approach for Synthesizable Drug-like Molecules Naeyma N. Islam et.al. 2601.18716 null
2026-01-26 Neural Multi-Speaker Voice Cloning for Nepali in Low-Resource Settings Aayush M. Shrestha et.al. 2601.18694 null
2026-01-26 Quasi Monte Carlo methods enable extremely low-dimensional deep generative models Miles Martinez et.al. 2601.18676 null
2026-01-26 GCFX: Generative Counterfactual Explanations for Deep Graph Models at the Model Level Jinlong Hu et.al. 2601.18447 null
2026-01-26 GenCI: Generative Modeling of User Interest Shift via Cohort-based Intent Learning for CTR Prediction Kesha Ou et.al. 2601.18251 null
2026-01-25 Feature-Space Generative Models for One-Shot Class-Incremental Learning Jack Foster et.al. 2601.17905 null
2026-01-25 Controlling Reading Ease with Gaze-Guided Text Generation Andreas Säuberli et.al. 2601.17781 null
2026-01-24 Correct-by-Construction Vision-based Pose Estimation using Geometric Generative Models Ulices Santa Cruz et.al. 2601.17556 null
2026-01-24 Error Analysis of Bayesian Inverse Problems with Generative Priors Bamdad Hosseini et.al. 2601.17374 null
2026-01-24 TheoremForge: Scaling up Formal Data Synthesis with Low-Budget Agentic Workflow Yicheng Tao et.al. 2601.17332 null
2026-01-23 HapticMatch: An Exploration for Generative Material Haptic Simulation and Interaction Mingxin Zhang et.al. 2601.16639 null
2026-01-23 SCHIGAND: A Synthetic Facial Generation Mode Pipeline Ananya Kadali et.al. 2601.16627 null
2026-01-23 MultiLexNorm++: A Unified Benchmark and a Generative Model for Lexical Normalization for Asian Languages Weerayut Buaphet et.al. 2601.16623 null
2026-01-23 LLM-based Semantic Search for Conversational Queries in E-commerce Emad Siddiqui et.al. 2601.16492 null
2026-01-23 Beyond the Training Domain: Robust Generative Transition State Models for Unseen Chemistry Samir Darouich et.al. 2601.16469 null
2026-01-22 Better as Generators Than Classifiers: Leveraging LLMs and Synthetic Data for Low-Resource Multilingual Classification Branislav Pecher et.al. 2601.16278 null
2026-01-24 Point Bridge: 3D Representations for Cross Domain Policy Learning Siddhant Haldar et.al. 2601.16212 null
2026-01-24 PAL*M: Property Attestation for Large Generative Models Prach Chantasantitam et.al. 2601.16199 null
2026-01-22 Learning to Watermark in the Latent Space of Generative Models Sylvestre-Alvise Rebuffi et.al. 2601.16140 null
2026-01-23 Recursive Flow: A Generative Framework for MIMO Channel Estimation Zehua Jiang et.al. 2601.15767 null
2026-01-22 Communication-efficient Federated Graph Classification via Generative Diffusion Modeling Xiuling Wang et.al. 2601.15722 null
2026-01-22 Explainable Deepfake Detection with RL Enhanced Self-Blended Images Ning Jiang et.al. 2601.15624 null
2026-01-22 DeepASMR: LLM-Based Zero-Shot ASMR Speech Generation for Anyone of Any Voice Leying Zhang et.al. 2601.15596 null
2026-01-21 Reliability by design: quantifying and eliminating fabrication risk in LLMs. From generative to consultative AI: a comparative analysis in the legal domain and lessons for high-stakes knowledge bases Alex Dantart et.al. 2601.15476 null
2026-01-21 Ambient Dataloops: Generative Models for Dataset Refinement Adrián Rodríguez-Muñoz et.al. 2601.15417 null
2026-01-21 GeMM-GAN: A Multimodal Generative Model Conditioned on Histopathology Images and Clinical Descriptions for Gene Expression Profile Generation Francesca Pia Panaccione et.al. 2601.15392 null
2026-01-21 SpooFL: Spoofing Federated Learning Isaac Baglin et.al. 2601.15055 null
2026-01-21 SpatialV2A: Visual-Guided High-fidelity Spatial Audio Generation Yanan Wang et.al. 2601.15017 null
2026-01-21 AQAScore: Evaluating Semantic Alignment in Text-to-Audio Generation via Audio Question Answering Chun-Yi Kuan et.al. 2601.14728 null
2026-01-20 Business Logic-Driven Text-to-SQL Data Synthesis for Business Intelligence Jinhui Liu et.al. 2601.14518 null
2026-01-20 Self-Supervised Score-Based Despeckling for SAR Imagery via Log-Domain Transformation Junhyuk Heo et.al. 2601.14334 null
2026-01-18 Guided by the Plan: Enhancing Faithful Autoregressive Text-to-Audio Generation with Guided Decoding Juncheng Wang et.al. 2601.14304 null
2026-01-16 Guardrails for trust, safety, and ethical development and deployment of Large Language Models (LLM) Anjanava Biswas et.al. 2601.14298 null
2026-01-20 Domain-Adaptation through Synthetic Data: Fine-Tuning Large Language Models for German Law Ali Hamza Bashir et.al. 2601.14160 null
2026-01-20 Style Transfer as Bias Mitigation: Diffusion Models for Synthetic Mental Health Text for Arabic Saad Mankarious et.al. 2601.14124 null
2026-01-20 GOMPSNR: Reflourish the Signal-to-Noise Ratio Metric for Audio Generation Tasks Lingling Dai et.al. 2601.13758 null
2026-01-20 Beyond Known Facts: Generating Unseen Temporal Knowledge to Address Data Contamination in LLM Evaluation Arthur Amalvy et.al. 2601.13658 null
2026-01-19 BladeSDF: Unconditional and Conditional Generative Modeling of Representative Blade Geometries Using Signed Distance Functions Ashish S. Nair et.al. 2601.13445 null
2026-01-19 CausationEntropy: Pythonic Optimal Causation Entropy Kevin Slote et.al. 2601.13365 null
2026-01-19 OFA-MAS: One-for-All Multi-Agent System Topology Design based on Mixture-of-Experts Graph Generative Models Shiyuan Li et.al. 2601.12996 null
2026-01-19 Beyond Visual Realism: Toward Reliable Financial Time Series Generation Fan Zhang et.al. 2601.12990 null
2026-01-19 ImmersiveFlow: Stereo-to-7.1.4 spatial audio generation with flow matching Zining Liang et.al. 2601.12950 null
2026-01-21 AI-generated data contamination erodes pathological variability and diagnostic reliability Hongyu He et.al. 2601.12946 null
2026-01-19 SciCoQA: Quality Assurance for Scientific Paper–Code Alignment Tim Baumgärtner et.al. 2601.12910 null
2026-01-19 Text2Structure3D: Graph-Based Generative Modeling of Equilibrium Structures with Diffusion Transformers Lazlo Bleker et.al. 2601.12870 null
2026-01-18 A Unified Neural Codec Language Model for Selective Editable Text to Speech Generation Hanchen Pei et.al. 2601.12480 null
2026-01-18 S^2F-Net: A Robust Spatial-Spectral Fusion Framework for Cross-Model AIGC Detection Xiangyu Hu et.al. 2601.12313 null
2026-01-18 ParaMETA: Towards Learning Disentangled Paralinguistic Speaking Styles Representations from Speech Haowei Lou et.al. 2601.12289 null
2026-01-17 SynQP: A Framework and Metrics for Evaluating the Quality and Privacy Risk of Synthetic Data Bing Hu et.al. 2601.12124 null
2026-01-16 Cleansing the Artificial Mind: A Self-Reflective Detoxification Framework for Large Language Models Kaituo Zhang et.al. 2601.11776 null
2026-01-16 Generative Scenario Rollouts for End-to-End Autonomous Driving Rajeev Yasarla et.al. 2601.11475 null
2026-01-16 FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning Tanyu Chen et.al. 2601.11141 null
2026-01-16 PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models Qiyuan Zhang et.al. 2601.11087 null
2026-01-16 Your One-Stop Solution for AI-Generated Video Detection Long Ma et.al. 2601.11035 null
2026-01-15 BYOL: Bring Your Own Language Into LLMs Syed Waqas Zamir et.al. 2601.10804 null
2026-01-15 Inference-time Physics Alignment of Video Generative Models with Latent World Models Jianhao Yuan et.al. 2601.10553 null
2026-01-15 TF3-RO-50M: Training Compact Romanian Language Models from Scratch on Synthetic Moral Microfiction Mihai Dan Nadas et.al. 2601.10410 null
2026-01-15 Joint Bayesian inference of Earth’s magnetic field and core surface flow on millennial timescales Andreas Nilsson et.al. 2601.10344 null
2026-01-15 Boundary-Aware NL2SQL: Integrating Reliability through Hybrid Reward and Data Synthesis Songsong Tian et.al. 2601.10318 null
2026-01-15 ADVOSYNTH: A Synthetic Multi-Advocate Dataset for Speaker Identification in Courtroom Scenarios Aniket Deroy et.al. 2601.10315 null
2026-01-15 In-Context Operator Learning on the Space of Probability Measures Frank Cole et.al. 2601.09979 null
2026-01-14 Terminally constrained flow-based generative models from an optimal control perspective Weiguo Gao et.al. 2601.09474 null
2026-01-14 Long-term Task-oriented Agent: Proactive Long-term Intent Maintenance in Dynamic Environments Qinglong Shi et.al. 2601.09382 null
2026-01-14 Honesty-Aware Multi-Agent Framework for High-Fidelity Synthetic Data Generation in Digital Psychiatric Intake Doctor-Patient Interactions Xinyuan Zhang et.al. 2601.09216 null
2026-01-14 Seeking Human Security Consensus: A Unified Value Scale for Generative AI Value Safety Ying He et.al. 2601.09112 null
2026-01-14 Mi:dm 2.0 Korea-centric Bilingual Language Models Donghoon Shin et.al. 2601.09066 null
2026-01-13 Spectral Generative Flow Models: A Physics-Inspired Replacement for Vectorized Large Language Models Andrew Kiruluta et.al. 2601.08893 null
2026-01-13 RAGShaper: Eliciting Sophisticated Agentic RAG Skills via Automated Data Synthesis Zhengwei Tao et.al. 2601.08699 null
2026-01-13 Creativity in AI as Emergence from Domain-Limited Generative Models Corina Chutaux et.al. 2601.08388 null
2026-01-13 Training-Free Distribution Adaptation for Diffusion Models via Maximum Mean Discrepancy Guidance Matina Mahdizadeh Sani et.al. 2601.08379 null
2026-01-13 Intra-tree Column Subsampling Hinders XGBoost Learning of Ratio-like Interactions Mykola Pinchuk et.al. 2601.08121 null
2026-01-12 Studying the Role of Synthetic Data for Machine Learning-based Wireless Networks Traffic Forecasting José Pulido et.al. 2601.07646 null
2026-01-12 Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning Sijia Li et.al. 2601.07463 null
2026-01-12 SceneNAT: Masked Generative Modeling for Language-Guided Indoor Scene Synthesis Jeongjun Choi et.al. 2601.07218 null
2026-01-12 Agents of Diffusion: Enhancing Diffusion Language Models with Multi-Agent Reinforcement Learning for Structured Data Generation (Extended Version) Aja Khanal et.al. 2601.07152 null
2026-01-11 Bridging Attribution and Open-Set Detection using Graph-Augmented Instance Learning in Synthetic Speech Mohd Mujtaba Akhtar et.al. 2601.07064 null
2026-01-11 Codified Foreshadowing-Payoff Text Generation Longfei Yun et.al. 2601.07033 null
2026-01-11 Continuous Energy Landscape Model for Analyzing Brain State Transitions Triet M. Tran et.al. 2601.06991 null
2026-01-11 X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests Jie Wu et.al. 2601.06953 null
2026-01-11 Generative Modeling of Human-Computer Interfaces with Diffusion Processes and Conditional Control Rui Liu et.al. 2601.06823 null
2026-01-11 Cross-Modal Computational Model of Brain-Heart Interactions via HRV and EEG Feature Malavika Pradeep et.al. 2601.06792 null
2026-01-11 CyberLLM-FINDS 2025: Instruction-Tuned Fine-tuning of Domain-Specific LLMs with Retrieval-Augmented Generation and Graph Integration for MITRE Evaluation Vasanth Iyer et.al. 2601.06779 null
2026-01-11 When Humans Judge Irises: Pupil Size Normalization as an Aid and Synthetic Irises as a Challenge Mahsa Mitcheff et.al. 2601.06725 null
2026-01-10 Characterising Toxicity in Generative Large Language Models Zhiyao Zhang et.al. 2601.06700 null
2026-01-10 From Easy to Hard++: Promoting Differentially Private Image Synthesis Through Spatial-Frequency Curriculum Chen Gong et.al. 2601.06368 null
2026-01-09 CARD: Cluster-level Adaptation with Reward-guided Decoding for Personalized Text Generation Yutong Song et.al. 2601.06352 null
2026-01-09 Multi-Agent Framework for Controllable and Protected Generative Content Creation: Addressing Copyright and Provenance in AI-Generated Media Haris Khan et.al. 2601.06232 null
2026-01-09 Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models Bang Zeng et.al. 2601.06006 null
2026-01-09 GenCtrl – A Formal Controllability Toolkit for Generative Models Emily Cheng et.al. 2601.05637 null
2026-01-12 Continual Pretraining on Encrypted Synthetic Data for Privacy-Preserving LLMs Honghao Liu et.al. 2601.05635 null
2026-01-09 Research Integrity and Academic Authority in the Age of Artificial Intelligence: From Discovery to Curation? Simon Chesterman et.al. 2601.05574 null
2026-01-08 A Bayesian Generative Modeling Approach for Arbitrary Conditional Inference Qiao Liu et.al. 2601.05355 null
2026-01-08 Generate, Transfer, Adapt: Learning Functional Dexterous Grasping from a Single Human Demonstration Xingyi He et.al. 2601.05243 null
2026-01-08 DocDancer: Towards Agentic Document-Grounded Information Seeking Qintong Zhang et.al. 2601.05163 null
2026-01-08 EvolSQL: Structure-Aware Evolution for Scalable Text-to-SQL Data Synthesis Xuanguang Pan et.al. 2601.04875 null
2026-01-08 PROMISE: Process Reward Models Unlock Test-Time Scaling Laws in Generative Recommendations Chengcheng Guo et.al. 2601.04674 null
2026-01-08 Know Thy Enemy: Securing LLMs Against Prompt Injection via Diverse Data Synthesis and Instruction-Level Chain-of-Thought Learning Zhiyuan Chang et.al. 2601.04666 null
2026-01-08 LLMs-Integrated Automatic Hate Speech Recognition Using Controllable Text Generation Models Ryutaro Oshima et.al. 2601.04654 null
2026-01-08 3D Conditional Image Synthesis of Left Atrial LGE MRI from Composite Semantic Masks Yusri Al-Sanaani et.al. 2601.04588 null
2026-01-08 BanglaLorica: Design and Evaluation of a Robust Watermarking Algorithm for Large Language Models in Bangla Text Generation Amit Bin Tariqul et.al. 2601.04534 null
2026-01-07 From Domains to Instances: Dual-Granularity Data Synthesis for LLM Unlearning Xiaoyu Xu et.al. 2601.04278 null
2026-01-04 LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with Generative Speech Models Zhiyuan Zhao et.al. 2601.04233 null
2026-01-07 SpeakerSleuth: Evaluating Large Audio-Language Models as Judges for Multi-turn Speaker Consistency Jonggeun Lee et.al. 2601.04029 null
2026-01-12 Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control Changhao Jiang et.al. 2601.03973 null
2026-01-07 Local Interpolation via Low-Rank Tensor Trains Siddhartha E. Guzman et.al. 2601.03885 null
2026-01-07 Logic Tensor Network-Enhanced Generative Adversarial Network Nijesh Upreti et.al. 2601.03839 null
2026-01-07 Prompt Tuning without Labeled Samples for Zero-Shot Node Classification in Text-Attributed Graphs Sethupathy Parameswaran et.al. 2601.03793 null
2026-01-07 VietMed-MCQ: A Consistency-Filtered Data Synthesis Framework for Vietnamese Traditional Medicine Evaluation Huynh Trung Kiet et.al. 2601.03792 null
2026-01-07 A Comparative Study of 3D Model Acquisition Methods for Synthetic Data Generation of Agricultural Products Steven Moonen et.al. 2601.03784 null
2026-01-07 Evaluation of Multilingual LLMs Personalized Text Generation Capabilities Targeting Groups and Social-Media Platforms Dominik Macko et.al. 2601.03752 null
2026-01-07 Domain Adaptation of the Pyannote Diarization Pipeline for Conversational Indonesian Audio Muhammad Daffa’i Rafi Prasetyo et.al. 2601.03684 null
2026-01-07 Towards Compositional Generalization of LLMs via Skill Taxonomy Guided Data Synthesis Yifan Wei et.al. 2601.03676 null
2026-01-07 eTracer: Towards Traceable Text Generation via Claim-Level Grounding Bohao Chu et.al. 2601.03669 null
2026-01-06 Fine-tuning Small Language Models as Efficient Enterprise Search Relevance Labelers Yue Kang et.al. 2601.03211 null
2026-01-06 UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward Yile Liu et.al. 2601.03205 null
2026-01-06 Quality Degradation Attack in Synthetic Data Qinyi Liu et.al. 2601.02947 null
2026-01-06 Vulnerabilities of Audio-Based Biometric Authentication Systems Against Deepfake Speech Synthesis Mengze Hong et.al. 2601.02914 null
2026-01-06 Q-Regularized Generative Auto-Bidding: From Suboptimal Trajectories to Optimal Policies Mingming Zhang et.al. 2601.02754 null
2026-01-06 Omni2Sound: Towards Unified Video-Text-to-Audio Generation Yusheng Dai et.al. 2601.02731 null
2026-01-06 GRRE: Leveraging G-Channel Removed Reconstruction Error for Robust Detection of AI-Generated Images Shuman He et.al. 2601.02709 null
2026-01-05 Generative Site-Specific Beamforming for Next-Generation Spatial Intelligence Zhaolin Wang et.al. 2601.02301 null
2026-01-05 HeadLighter: Disentangling Illumination in Generative 3D Gaussian Heads via Lightstage Captures Yating Wang et.al. 2601.02103 null
2026-01-05 SerpentFlow: Generative Unpaired Domain Alignment via Shared-Structure Decomposition Julie Keisler et.al. 2601.01979 null
2026-01-05 Forget Less by Learning from Parents Through Hierarchical Relationships Arjun Ramesh Kaushik et.al. 2601.01892 null
2026-01-04 Deep Linear Discriminant Analysis Revisited Maxat Tezekbayev et.al. 2601.01619 null
2026-01-08 MM-Sonate: Multimodal Controllable Audio-Video Generation with Zero-Shot Voice Cloning Chunyu Qiang et.al. 2601.01568 null
2026-01-08 Logics-STEM: Empowering LLM Reasoning via Failure-Driven Post-Training and Document Knowledge Enhancement Mingyu Xu et.al. 2601.01562 null
2026-01-04 DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving Yang Zhou et.al. 2601.01528 null
2026-01-03 GenCAMO: Scene-Graph Contextual Decoupling for Environment-aware and Mask-free Camouflage Image-Dense Annotation Generation Chenglizhao Chen et.al. 2601.01181 null
2026-01-03 Histogram Assisted Quality Aware Generative Model for Resolution Invariant NIR Image Colorization Abhinav Attri et.al. 2601.01103 null
2026-01-03 Luminark: Training-free, Probabilistically-Certified Watermarking for General Vision Generative Models Jiayi Xu et.al. 2601.01085 null
2026-01-06 Coarse-Grained Kullback–Leibler Control of Diffusion-Based Generative AI Tatsuaki Tsuruyama et.al. 2601.01045 null
2025-12-31 A Chemically Grounded Evaluation Framework for Generative Models in Materials Discovery Elohan Veillon et.al. 2601.00886 null
2025-12-30 Path Integral Solution for Dissipative Generative Dynamics Xidi Wang et.al. 2601.00860 null
2026-01-02 FedHypeVAE: Federated Learning with Hypernetwork Generated Conditional VAEs for Differentially Private Embedding Sharing Sunny Gupta et.al. 2601.00785 null
2026-01-02 Gradient-free ensemble transform methods for generalized Bayesian inference in generative models Diksha Bhandari et.al. 2601.00760 null
2026-01-02 Peak-Nadir Encoding for Efficient CGM Data Compression and High-Fidelity Reconstruction Clara Bender et.al. 2601.00608 null
2026-01-01 Unknown Aware AI-Generated Content Attribution Ellie Thieu et.al. 2601.00218 null
2025-12-31 Generative Classifiers Avoid Shortcut Solutions Alexander C. Li et.al. 2512.25034 null
2025-12-31 ShowUI-$π$: Flow-based Generative Models as GUI Dexterous Hands Siyuan Hu et.al. 2512.24965 null
2025-12-31 Limits of quantum generative models with classical sampling hardness Sabrina Herbst et.al. 2512.24801 null
2025-12-31 HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment Yunsheng Pang et.al. 2512.24787 null
2025-12-30 Generative forecasting with joint probability models Patrick Wyrod et.al. 2512.24446 null
2025-12-30 GPT-like transformer model for silicon tracking detector simulation Tadej Novak et.al. 2512.24254 null
2025-12-30 Assured Autonomy: How Operations Research Powers and Orchestrates Generative AI Systems Tinglong Dai et.al. 2512.23978 null
2025-12-30 Assessing generative modeling approaches for free energy estimates in condensed matter Maximilian Schebek et.al. 2512.23930 null
2025-12-29 Flow Matching Neural Processes Hussen Abu Hamad et.al. 2512.23853 null
2025-12-29 Exploiting the Prior of Generative Time Series Imputation YuYang Miao et.al. 2512.23832 null
2025-12-26 State-of-the-art Small Language Coder Model: Mify-Coder Abhinav Parmar et.al. 2512.23747 null
2025-12-29 Diffusion priors enhanced velocity model building from time-lag images using a neural operator Xiao Ma et.al. 2512.23375 null
2025-12-29 AGRO-SQL: Agentic Group-Relative Optimization with High-Fidelity Data Synthesis Cehua Yang et.al. 2512.23366 null
2025-12-29 Flow2GAN: Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-step High-Fidelity Audio Generation Zengwei Yao et.al. 2512.23278 null
2025-12-29 Anomaly Detection by Effectively Leveraging Synthetic Images Sungho Kang et.al. 2512.23227 null
2025-12-29 PathoSyn: Imaging-Pathology MRI Synthesis via Disentangled Deviation Diffusion Jian Wang et.al. 2512.23130 null
2025-12-27 Quantum Generative Models for Computational Fluid Dynamics: A First Exploration of Latent Space Learning in Lattice Boltzmann Simulations Achraf Hsain et.al. 2512.22672 null
2025-12-27 Visual Autoregressive Modelling for Monocular Depth Estimation Amir El-Ghoussani et.al. 2512.22653 null
2025-12-26 LLA: Enhancing Security and Privacy for Generative Models with Logic-Locked Accelerators You Li et.al. 2512.22307 null
2025-12-25 Human-Aligned Generative Perception: Bridging Psychophysics and Generative Models Antara Titikhsha et.al. 2512.22272 null
2025-12-26 From In Silico to In Vitro: Evaluating Molecule Generative Models for Hit Generation Nagham Osman et.al. 2512.22031 null
2025-12-29 Deep Generative Models for Synthetic Financial Data: Applications to Portfolio and Risk Modeling Christophe D. Hounwanou et.al. 2512.21798 null
2025-12-25 Synthetic Financial Data Generation for Enhanced Financial Modelling Christophe D. Hounwanou et.al. 2512.21791 null
2025-12-25 BeHGAN: Bengali Handwritten Word Generation from Plain Text Using Generative Adversarial Networks Md. Rakibul Islam et.al. 2512.21694 null
2025-12-25 Dictionary-Transform Generative Adversarial Networks Angshul Majumdar et.al. 2512.21677 null
2025-12-25 Residual Prior Diffusion: A Probabilistic Framework Integrating Coarse Latent Priors with Diffusion Models Takuro Kutsuna et.al. 2512.21593 null
2025-12-25 Generative Actor Critic Aoyang Qin et.al. 2512.21527 null
2025-12-24 A Reinforcement Learning Approach to Synthetic Data Generation Natalia Espinosa-Dice et.al. 2512.21395 null
2025-12-24 A Turn Toward Better Alignment: Few-Shot Generative Adaptation with Equivariant Feature Rotation Chenghao Xu et.al. 2512.21174 null
2025-12-24 Active inference and artificial reasoning Karl Friston et.al. 2512.21129 null
2025-12-24 PUFM++: Point Cloud Upsampling via Enhanced Flow Matching Zhi-Song Liu et.al. 2512.20988 null
2025-12-24 X-ray Insights Unleashed: Pioneering the Enhancement of Multi-Label Long-Tail Data Xinquan Yang et.al. 2512.20980 null
2025-12-24 GenTSE: Enhancing Target Speaker Extraction via a Coarse-to-Fine Generative Language Model Haoyang Li et.al. 2512.20978 null
2025-12-24 Beyond Artifacts: Real-Centric Envelope Modeling for Reliable AI-Generated Image Detection Ruiqi Liu et.al. 2512.20937 null
2025-12-23 Improving Matrix Exponential for Generative AI Flows: A Taylor-Based Approach Beyond Paterson–Stockmeyer Jorge Sastre et.al. 2512.20777 null
2025-12-23 UTDesign: A Unified Framework for Stylized Text Editing and Generation in Graphic Design Images Yiming Zhao et.al. 2512.20479 null
2025-12-23 Enriching Earth Observation labeled data with Quantum Conditioned Diffusion Models Francesco Mauro et.al. 2512.20448 null
2025-12-23 Structured Visualization Design Knowledge for Grounding Generative Reasoning and Situated Feedback Péter Ferenc Gyarmati et.al. 2512.20306 null
2025-12-23 HGAN-SDEs: Learning Neural Stochastic Differential Equations with Hermite-Guided Adversarial Training Yuanjian Xu et.al. 2512.20272 null
2025-12-23 Automated Training of Learned Database Components with Generative AI Angjela Davitkova et.al. 2512.20271 null
2025-12-23 Aliasing-Free Neural Audio Synthesis Yicheng Gu et.al. 2512.20211 null
2025-12-23 QuarkAudio Technical Report Chengwei Liu et.al. 2512.20151 null
2025-12-22 Modeling Non-Ergodic Path Effects Using Conditional Generative Model for Fourier Amplitude Spectra Maxime Lacour et.al. 2512.19909 null
2025-12-22 Generative diffusion models for agricultural AI: plant image generation, indoor-to-outdoor translation, and expert preference alignment Da Tan et.al. 2512.19632 null
2025-12-22 MapTrace: Scalable Data Generation for Route Tracing on Maps Artemis Panagopoulou et.al. 2512.19609 null
2025-12-22 GLUE: Generative Latent Unification of Expertise-Informed Engineering Models Tim Aebersold et.al. 2512.19469 null
2025-12-23 SiamGPT: Quality-First Fine-Tuning for Stable Thai Text Generation Thittipat Pairatsuppawat et.al. 2512.19455 null
2025-12-22 A Rate-Distortion Perspective on the Emergence of Number Sense in Unsupervised Generative Models Leo D’Amato et.al. 2512.19450 null
2025-12-26 Real-Time Streamable Generative Speech Restoration with Flow Matching Simon Welker et.al. 2512.19442 null
2025-12-22 Generative Krylov Subspace Representations for Scalable Quantum Eigensolvers Changwon Lee et.al. 2512.19420 null
2025-12-22 Interpretable Hybrid Deep Q-Learning Framework for IoT-Based Food Spoilage Prediction with Synthetic Data Generation and Hardware Validation Isshaan Singh et.al. 2512.19361 null
2025-12-22 VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis Meng Chu et.al. 2512.19243 null
2025-12-22 JoyVoice: Long-Context Conditioning for Anthropomorphic Multi-Speaker Conversational Synthesis Fan Yu et.al. 2512.19090 null
2025-12-22 Efficient Personalization of Generative Models via Optimal Experimental Design Guy Schacht et.al. 2512.19057 null
2025-12-22 Decoupled Generative Modeling for Human-Object Interaction Synthesis Hwanhee Jung et.al. 2512.19049 null
2025-12-22 On Conditional Stochastic Interpolation for Generative Nonlinear Sufficient Dimension Reduction Shuntuo Xu et.al. 2512.18971 null
2025-12-22 Symmetrization of 3D Generative Models Nicolas Caytuiro et.al. 2512.18953 null
2025-12-21 Generative Modeling through Spectral Analysis of Koopman Operator Yuanchao Xu et.al. 2512.18837 null
2025-12-23 Social Comparison without Explicit Inference of Others’ Reward Values: A Constructive Approach Using a Probabilistic Generative Model Yosuke Taniuchi et.al. 2512.18687 null
2025-12-20 Feature-Enhanced Graph Neural Networks for Classification of Synthetic Graph Generative Models: A Benchmarking Study Janek Dyer et.al. 2512.18524 null
2025-12-20 Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models Chao Wen et.al. 2512.18388 null
2025-12-19 Generative Multi-Objective Bayesian Optimization with Scalable Batch Evaluations for Sample-Efficient De Novo Molecular Design Madhav R. Muthyala et.al. 2512.17659 null
2025-12-19 Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs Zhaolin Cai et.al. 2512.17640 null
2025-12-19 InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion Hoiyeong Jin et.al. 2512.17504 null
2025-12-19 3D-RE-GEN: 3D Reconstruction of Indoor Scenes with a Generative Framework Tobias Sautter et.al. 2512.17459 null
2025-12-22 Generative modeling of conditional probability distributions on the level-sets of collective variables Fatima-Zahrae Akhyar et.al. 2512.17374 null
2025-12-18 Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection Kaixin Ding et.al. 2512.16905 null
2025-12-18 Sceniris: A Fast Procedural Scene Generation Framework Jinghuan Shang et.al. 2512.16896 null
2025-12-18 Task-Oriented Data Synthesis and Control-Rectify Sampling for Remote Sensing Semantic Segmentation Yunkai Yang et.al. 2512.16740 null
2025-12-18 Empirical Evaluation of Structured Synthetic Data Privacy Metrics: Novel experimental framework Milton Nicolás Plasencia Palacios et.al. 2512.16284 null
2025-12-18 Scaling Spatial Reasoning in MLLMs through Programmatic Data Synthesis Zhi Helu et.al. 2512.16237 null
2025-12-18 ToolForge: A Data Synthesis Pipeline for Multi-Hop Search without Real-World APIs Hao Chen et.al. 2512.16149 null
2025-12-18 Evaluation of Generative Models for Emotional 3D Animation Generation in VR Kiran Chhatre et.al. 2512.16081 null
2025-12-17 SoFlow: Solution Flow Models for One-Step Generative Modeling Tianze Luo et.al. 2512.15657 null
2025-12-17 On Assessing the Relevance of Code Reviews Authored by Generative Models Robert Heumüller et.al. 2512.15466 null
2025-12-17 Expand and Prune: Maximizing Trajectory Diversity for Effective GRPO in Generative Models Shiran Ge et.al. 2512.15347 null
2025-12-17 SynthSeg-Agents: Multi-Agent Synthetic Data Generation for Zero-Shot Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2512.15310 null
2025-12-16 Polypersona: Persona-Grounded LLM for Synthetic Survey Responses Tejaswani Dash et.al. 2512.14562 null
2025-12-16 C-ing Clearly: Enhanced Binary Code Explanations using C code Teodor Poncu et.al. 2512.14500 null
2025-12-16 A data-physics hybrid generative model for patient-specific post-stroke motor rehabilitation using wearable sensor data Yanning Dai et.al. 2512.14329 null
2025-12-16 SS4D: Native 4D Generative Model via Structured Spacetime Latents Zhibing Li et.al. 2512.14284 null
2025-12-16 Beyond MMD: Evaluating Graph Generative Models with Geometric Deep Learning Salvatore Romano et.al. 2512.14241 null
2025-12-16 Estimating problem difficulty without ground truth using Large Language Model comparisons Marthe Ballon et.al. 2512.14220 null
2025-12-16 Random-Bridges as Stochastic Transports for Generative Models Stefano Goria et.al. 2512.14190 null
2025-12-16 Quantum-Inspired Approach to Analyzing Complex System Dynamics Parsa Kafashi et.al. 2512.14169 null
2025-12-16 An intercomparison of generative machine learning methods for downscaling precipitation at fine spatial scales Bryn Ward-Leikis et.al. 2512.13987 null
2025-12-15 An evaluation of SVBRDF Prediction from Generative Image Models for Appearance Modeling of 3D Scenes Alban Gauthier et.al. 2512.13950 null
2025-12-15 Deepfakes in the 2025 Canadian Election: Prevalence, Partisanship, and Platform Dynamics Victor Livernoche et.al. 2512.13915 null
2025-12-15 SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning Jitesh Jain et.al. 2512.13874 null
2025-12-15 Improving the Plausibility of Pressure Distributions Synthesized from Depth through Generative Modeling Neevkumar Manavar et.al. 2512.13757 null
2025-12-16 Lyra: A Hardware-Accelerated RISC-V Verification Framework with Generative Model-Based Processor Fuzzing Juncheng Huo et.al. 2512.13686 null
2025-12-15 JoVA: Unified Multimodal Learning for Joint Video-Audio Generation Xiaohu Huang et.al. 2512.13677 null
2025-12-15 PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation Hour Kaing et.al. 2512.13552 null
2025-12-19 Non-Resolution Reasoning (NRR): A Computational Framework for Contextual Identity and Ambiguity Preservation Kei Saito et.al. 2512.13478 null
2025-12-15 ALIGN-FL: Architecture-independent Learning through Invariant Generative component sharing in Federated Learning Mayank Gulati et.al. 2512.13316 null
2025-12-18 DisCo-Speech: Controllable Zero-Shot Speech Generation with A Disentangled Speech Codec Tao Li et.al. 2512.13251 null
2025-12-16 POLAR: A Portrait OLAT Dataset and Generative Framework for Illumination-Aware Face Modeling Zhuo Chen et.al. 2512.13192 null
2025-12-16 A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis Xianchao Guan et.al. 2512.13164 null
2025-12-14 NagaNLP: Bootstrapping NLP for Low-Resource Nagamese Creole with Human-in-the-Loop Synthetic Data Agniva Maiti et.al. 2512.12537 null
2025-12-13 Bayesian Full-waveform Monitoring of CO2 Storage with Fluid-flow Priors via Generative Modeling Haipeng Li et.al. 2512.12482 null
2025-12-13 ArtGen: Conditional Generative Modeling of Articulated Objects in Arbitrary Part-Level States Haowen Wang et.al. 2512.12395 null
2025-12-13 Quantum-Aware Generative AI for Materials Discovery: A Framework for Robust Exploration Beyond DFT Biases Mahule Roy et.al. 2512.12288 null
2025-12-13 Rethinking Label Consistency of In-Context Learning: An Implicit Transductive Label Propagation Perspective Haoyang Chen et.al. 2512.12175 null
2025-12-13 A comparative study of generative models for child voice conversion Protima Nomo Sudro et.al. 2512.12129 null
2025-12-12 AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis Junjie Ye et.al. 2512.11797 null
2025-12-12 Super Suffixes: Bypassing Text Generation Alignment and Guard Models Simultaneously Andrew Adiletta et.al. 2512.11783 null
2025-12-12 Referring Change Detection in Remote Sensing Imagery Yilmaz Korkmaz et.al. 2512.11719 null
2025-12-12 Emergence of Nonequilibrium Latent Cycles in Unsupervised Generative Modeling Marco Baiesi et.al. 2512.11415 null
2025-12-12 Iterative Compositional Data Generation for Robot Control Anh-Quan Pham et.al. 2512.10891 null
2025-12-11 Generative Modeling from Black-box Corruptions via Self-Consistent Stochastic Interpolants Chirag Modi et.al. 2512.10857 null
2025-12-11 Beyond the Black Box: Identifiable Interpretation and Control in Generative Models via Causal Minimality Lingjing Kong et.al. 2512.10720 null
2025-12-11 TriDF: Evaluating Perception, Detection, and Hallucination for Interpretable DeepFake Detection Jian-Yu Jiang-Lin et.al. 2512.10652 null
2025-12-11 AgriGPT-Omni: A Unified Speech-Vision-Text Framework for Multilingual Agricultural Intelligence Bo Yang et.al. 2512.10624 null
2025-12-11 Breaking the Vicious Cycle: Coherent 3D Gaussian Splatting from Sparse and Motion-Blurred Views Zhankuo Xu et.al. 2512.10369 null
2025-12-10 Generative Modeling of Entangled Polymers with a Distance-Based Variational Autoencoder Pietro Chiarantoni et.al. 2512.10131 null
2025-12-10 Workflow is All You Need: Escaping the “Statistical Smoothing Trap” via High-Entropy Information Foraging and Adversarial Pacing Zhongjie Jiang et.al. 2512.10121 null
2025-12-10 PathCo-LatticE: Pathology-Constrained Lattice-Of Experts Framework for Fully-supervised Few-Shot Cardiac MRI Segmentation Mohamed Elbayumi et.al. 2512.09779 null
2025-12-10 Membership and Dataset Inference Attacks on Large Audio Generative Models Jakub Proboszcz et.al. 2512.09654 null
2025-12-10 ImageTalk: Designing a Multimodal AAC Text Generation System Driven by Image Recognition and Natural Language Generation Boyin Yang et.al. 2512.09610 null
2025-12-10 Lazy Diffusion: Mitigating spectral collapse in generative diffusion-based stable autoregressive emulation of turbulent flows Anish Sambamurthy et.al. 2512.09572 null
2025-12-10 Toward Closed-loop Molecular Discovery via Language Model, Property Alignment and Strategic Search Junkai Ji et.al. 2512.09566 null
2025-12-10 Transport Novelty Distance: A Distributional Metric for Evaluating Material Generative Models Paul Hagemann et.al. 2512.09514 null
2025-12-10 Color encoding in Latent Space of Stable Diffusion Models Guillem Arias et.al. 2512.09477 null
2025-12-10 Generative Point Cloud Registration Haobo Jiang et.al. 2512.09407 null
2025-12-10 ASSIST-3D: Adapted Scene Synthesis for Class-Agnostic 3D Instance Segmentation Shengchao Zhou et.al. 2512.09364 null
2025-12-12 SDialog: A Python Toolkit for End-to-End Agent Building, User Simulation, Dialog Generation, and Evaluation Sergio Burdisso et.al. 2512.09142 null
2025-12-09 Contrast transfer functions help quantify neural network out-of-distribution generalization in HRTEM Luis Rangel DaCosta et.al. 2512.09067 null
2025-12-09 A Survey of Body and Face Motion: Datasets, Performance Evaluation Metrics and Generative Techniques Lownish Rai Sookha et.al. 2512.09005 null
2025-12-08 Demo: Generative AI helps Radiotherapy Planning with User Preference Riqiang Gao et.al. 2512.08996 null
2025-12-09 When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation Joshua Ward et.al. 2512.08875 null
2025-12-09 Differentially Private Synthetic Data Generation Using Context-Aware GANs Anantaa Kotal et.al. 2512.08869 null
2025-12-09 Democratizing ML for Enterprise Security: A Self-Sustained Attack Detection Framework Sadegh Momeni et.al. 2512.08802 null
2025-12-09 LoFA: Learning to Predict Personalized Priors for Fast Adaptation of Visual Generative Models Yiming Hao et.al. 2512.08785 null
2025-12-09 Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank Shaofeng Zhang et.al. 2512.08648 null
2025-12-09 HealthcareNLP: where are we and what is next? Lifeng Han et.al. 2512.08617 null
2025-12-09 A Novel Wasserstein Quaternion Generative Adversarial Network for Color Image Generation Zhigang Jia et.al. 2512.08542 null
2025-12-09 PAVAS: Physics-Aware Video-to-Audio Synthesis Oh Hyun-Bin et.al. 2512.08282 null
2025-12-09 Worst-case generation via minimax optimization in Wasserstein space Xiuyuan Cheng et.al. 2512.08176 null
2025-12-08 SwissGov-RSD: A Human-annotated, Cross-lingual Benchmark for Token-level Recognition of Semantic Differences Between Related Documents Michelle Wastl et.al. 2512.07538 null
2025-12-07 Progress Ratio Embeddings: An Impatience Signal for Robust Length Control in Neural Text Generation Ivanhoé Botcazou et.al. 2512.06938 null
2025-12-07 RunawayEvil: Jailbreaking the Image-to-Video Generative Models Songping Wang et.al. 2512.06674 null
2025-12-06 Generic visuality of war? How image-generative AI models (mis)represent Russia’s war against Ukraine Mykola Makhortykh et.al. 2512.06570 null
2025-12-06 SUGAR: A Sweeter Spot for Generative Unlearning of Many Identities Dung Thuy Nguyen et.al. 2512.06562 null
2025-12-06 LOCUS: A System and Method for Low-Cost Customization for Universal Specialization Dhanasekar Sundararaman et.al. 2512.06239 null
2025-12-05 When Privacy Isn’t Synthetic: Hidden Data Leakage in Generative AI Models S. M. Mustaqim et.al. 2512.06062 null
2025-12-04 DreamFoley: Scalable VLMs for High-Fidelity Video-to-Audio Generation Fu Li et.al. 2512.06022 null
2025-12-05 MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution Sara Patel et.al. 2512.05958 null
2025-12-05 Impugan: Learning Conditional Generative Models for Robust Data Imputation Zalish Mahmud et.al. 2512.05950 null
2025-12-05 Measuring the Effect of Background on Classification and Feature Importance in Deep Learning for AV Perception Anne Sielemann et.al. 2512.05937 null
2025-12-05 Synset Signset Germany: a Synthetic Dataset for German Traffic Sign Recognition Anne Sielemann et.al. 2512.05936 null
2025-12-05 A Comparative Study on Synthetic Facial Data Generation Techniques for Face Recognition Pedro Vidal et.al. 2512.05928 null
2025-12-05 3D Path Planning for Robot-assisted Vertebroplasty from Arbitrary Bi-plane X-ray via Differentiable Rendering Blanca Inigo et.al. 2512.05803 null
2025-12-08 General and Domain-Specific Zero-shot Detection of Generated Images via Conditional Likelihood Roy Betser et.al. 2512.05590 null
2025-12-05 SSDLabeler: Realistic semi-synthetic data generation for multi-label artifact classification in EEG Taketo Akama et.al. 2512.05500 null
2025-12-05 SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling Elisabetta Fedele et.al. 2512.05343 null
2025-12-04 Light-X: Generative 4D Video Rendering with Camera and Illumination Control Tianqi Liu et.al. 2512.05115 null
2025-12-04 ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Shengyuan Ding et.al. 2512.05111 null
2025-12-04 OMTRA: A Multi-Task Generative Model for Structure-Based Drug Design Ian Dunn et.al. 2512.05080 null
2025-12-04 Object Reconstruction under Occlusion with Generative Priors and Contact-induced Constraints Minghan Zhu et.al. 2512.05079 null
2025-12-04 HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition Pham Thach Thanh Truc et.al. 2512.05021 null
2025-12-04 Generative Neural Video Compression via Video Diffusion Prior Qi Mao et.al. 2512.05016 null
2025-12-04 Reflection Removal through Efficient Adaptation of Diffusion Transformers Daniyar Zakarin et.al. 2512.05000 null
2025-12-04 Efficient Generative Transformer Operators For Million-Point PDEs Armand Kassaï Koupaï et.al. 2512.04974 null
2025-12-04 LatentFM: A Latent Flow Matching Approach for Generative Medical Image Segmentation Huynh Trinh Ngoc et.al. 2512.04821 null
2025-12-04 LaFiTe: A Generative Latent Field for 3D Native Texturing Chia-Hao Chen et.al. 2512.04786 null
2025-12-04 Complementary Characterization of Agent-Based Models via Computational Mechanics and Diffusion Models Roberto Garrone et.al. 2512.04771 null
2025-12-04 LeMat-GenBench: A Unified Evaluation Framework for Crystal Generative Models Siddharth Betala et.al. 2512.04562 null
2025-12-04 One-Step Generative Channel Estimation via Average Velocity Field Zehua Jiang et.al. 2512.04501 null
2025-12-04 UniTS: Unified Time Series Generative Model for Remote Sensing Yuxiang Zhang et.al. 2512.04461 null
2025-12-03 ActVAE: Modelling human activity schedules with a deep conditional generative approach Fred Shone et.al. 2512.04223 null
2025-12-03 ReasonX: MLLM-Guided Intrinsic Image Decomposition Alara Dirik et.al. 2512.04222 null
2025-12-03 Stable Signer: Hierarchical Sign Language Generative Model Sen Fang et.al. 2512.04048 null
2025-12-03 Fast & Efficient Normalizing Flows and Applications of Image Generative Models Sandeep Nagar et.al. 2512.04039 null
2025-12-03 Towards Privacy-Preserving Range Queries with Secure Learned Spatial Index over Encrypted Data Zuan Wang et.al. 2512.03669 null
2025-12-03 AdaPower: Specializing World Foundation Models for Predictive Manipulation Yuhang Huang et.al. 2512.03538 null
2025-12-02 SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning Salman Rahman et.al. 2512.03244 null
2025-12-02 InvertiTune: High-Quality Data Synthesis for Cost-Effective Single-Shot Text-to-Knowledge Graph Generation Faezeh Faez et.al. 2512.03197 null
2025-12-02 ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation Mengchen Zhang et.al. 2512.03036 null
2025-12-04 LORE: A Large Generative Model for Search Relevance Chenji Lu et.al. 2512.03025 null
2025-12-02 In Silico Development of Psychometric Scales: Feasibility of Representative Population Data Simulation with LLMs Enrico Cipriani et.al. 2512.02910 null
2025-12-02 Leveraging generative adversarial networks with spatially adaptive denormalization for multivariate stochastic seismic data inversion Roberto Miele et.al. 2512.02863 null
2025-12-02 Making Dialogue Grounding Data Rich: A Three-Tier Data Synthesis Framework for Generalized Referring Expression Comprehension Juexi Shao et.al. 2512.02791 null
2025-12-02 Self-Improving AI Agents through Self-Play Przemyslaw Chojecki et.al. 2512.02731 null
2025-12-02 Generative modeling using evolved quantum Boltzmann machines Mark M. Wilde et.al. 2512.02721 null
2025-12-02 ClimaOoD: Improving Anomaly Segmentation via Physically Realistic Synthetic Data Yuxing Liu et.al. 2512.02686 null
2025-12-02 Generative Multi-modal Feedback for Singing Voice Synthesis Evaluation Xueyan Li et.al. 2512.02523 null
2025-12-02 Data Curation Through the Lens of Spectral Dynamics: Static Limits, Dynamic Acceleration, and Practical Oracles Yizhou Zhang et.al. 2512.02409 null
2025-12-01 InstructLR: A Scalable Approach to Create Instruction Dataset for Under-Resourced Languages Mamadou K. Keita et.al. 2512.02213 null
2025-12-01 Generative Video Motion Editing with 3D Point Tracks Yao-Chih Lee et.al. 2512.02015 null
2025-12-01 Improved Mean Flows: On the Challenges of Fastforward Generative Models Zhengyang Geng et.al. 2512.02012 null
2025-12-02 Consistent Synthetic Sequences Unlock Structural Diversity in Fully Atomistic De Novo Protein Design Danny Reidenbach et.al. 2512.01976 null
2025-12-01 From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning Sitao Cheng et.al. 2512.01970 null
2025-12-01 Exploring Human Perceptions of AI Responses: Insights from a Mixed-Methods Study on Risk Mitigation in Generative Models Heloisa Candello et.al. 2512.01892 null
2025-12-01 Tahr: The Generative Attribute Grammar Framework Matteo Ciccaglione et.al. 2512.01872 null
2025-12-01 Deconstructing Generative Diversity: An Information Bottleneck Analysis of Discrete Latent Generative Models Yudi Wu et.al. 2512.01831 null
2025-12-01 Dimension-free error estimate for diffusion model and optimal scheduling Valentin de Bortoli et.al. 2512.01820 null
2025-12-01 Much Ado About Noising: Dispelling the Myths of Generative Robotic Control Chaoyi Pan et.al. 2512.01809 null
2025-12-01 Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos Xavier Thomas et.al. 2512.01803 null
2025-11-28 Object-Centric Data Synthesis for Category-level Object Detection Vikhyat Agarwal et.al. 2511.23450 null
2025-11-28 MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation Mahdi Rahmani et.al. 2511.23397 null
2025-11-28 Identifying bars in galaxies using machine learning Rajit Shrivastava et.al. 2511.23383 null
2025-11-28 Flow Straighter and Faster: Efficient One-Step Generative Modeling via MeanFlow on Rectified Trajectories Xinxi Zhang et.al. 2511.23342 null
2025-11-28 Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach Shuqi Liu et.al. 2511.23335 null
2025-11-28 Synthetic Industrial Object Detection: GenAI vs. Feature-Based Methods Jose Moises Araya-Martinez et.al. 2511.23241 null
2025-11-28 db-SP: Accelerating Sparse Attention for Visual Generative Models with Dual-Balanced Sequence Parallelism Siqi Chen et.al. 2511.23113 null
2025-11-28 Evaluating the Clinical Impact of Generative Inpainting on Bone Age Estimation Felipe Akio Matsuoka et.al. 2511.23066 null
2025-11-26 Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework Dong Wang et.al. 2511.21686 null
2025-11-26 TAB-DRW: A DFT-based Robust Watermark for Generative Tabular Data Yizhou Zhao et.al. 2511.21600 null
2025-11-26 Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy Teng Hu et.al. 2511.21579 null
2025-11-26 Guiding Generative Models for Protein Design: Prompting, Steering and Aligning Filippo Stocco et.al. 2511.21476 null
2025-11-26 Ensemble Performance Through the Lens of Linear Independence of Classifier Votes in Data Streams Enes Bektas et.al. 2511.21465 null
2025-11-26 TSGM: Regular and Irregular Time-series Generation using Score-based Generative Models Haksoo Lim et.al. 2511.21335 null
2025-11-25 MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models Chieh-Yun Chen et.al. 2511.20629 null
2025-11-25 Copyright Detection in Large Language Models: An Ethical Approach to Generative AI Development David Szczecina et.al. 2511.20623 null
2025-11-25 Anatomica: Localized Control over Geometric and Topological Properties for Anatomical Diffusion Models Karim Kadry et.al. 2511.20587 null
2025-11-25 Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward Yuwei Niu et.al. 2511.20561 null
2025-11-25 AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs Kuniaki Saito et.al. 2511.20515 null
2025-11-25 FRAGMENTA: End-to-end Fragmentation-based Generative Model with Agentic Tuning for Drug Lead Optimization Yuto Suzuki et.al. 2511.20510 null
2025-11-25 Generative Modeling with Manifold Percolation Rui Tong et.al. 2511.20503 null
2025-11-25 Quantifying the Privacy Implications of High-Fidelity Synthetic Network Traffic Van Tran et.al. 2511.20497 null
2025-11-25 Efficient and Fast Generative-Based Singing Voice Separation using a Latent Diffusion Model Genís Plaja-Roglans et.al. 2511.20470 null
2025-11-25 STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow Jiatao Gu et.al. 2511.20462 null
2025-11-25 Diffusion for Fusion: Designing Stellarators with Generative AI Misha Padidar et.al. 2511.20445 null
2025-11-24 In-Video Instructions: Visual Signals as Generative Control Gongfan Fang et.al. 2511.19401 null
2025-11-24 Historical Reconstruction of Solar Surface Magnetism from Cycle 1-24 Using the Synthetic Active Region Generator (SARG) and the Advective Flux Transport (AFT) Model Bibhuti Kumar Jha et.al. 2511.19371 null
2025-11-24 Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning Qihan Huang et.al. 2511.19343 null
2025-11-24 Targeted Manipulation: Slope-Based Attacks on Financial Time-Series Data Dominik Luszczynski et.al. 2511.19330 null
2025-11-24 Automated RF Phase Adjustment for Beam Stabilization in the Fermilab Linac R. R. Chichili et.al. 2511.19141 null
2025-11-21 Designing and Generating Diverse, Equitable Face Image Datasets for Face Verification Tasks Georgia Baltsou et.al. 2511.17393 null
2025-11-21 A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback Bulat Khaertdinov et.al. 2511.17255 null
2025-11-21 Dual-domain Adaptation Networks for Realistic Image Super-resolution Chaowei Fang et.al. 2511.17217 null
2025-11-21 PostCam: Camera-Controllable Novel-View Video Generation with Query-Shared Cross-Attention Yipeng Chen et.al. 2511.17185 null
2025-11-21 Toward Sustainable Generative AI: A Scoping Review of Carbon Footprint and Environmental Impacts Across Training and Inference Stages Min-Kyu Kim et.al. 2511.17179 null
2025-11-21 Towards Generative Design Using Optimal Transport for Shape Exploration and Solution Field Interpolation Sergio Torregrosa et.al. 2511.17111 null
2025-11-21 Modeling memory in time-respecting paths on temporal networks Silvia Guerrini et.al. 2511.17108 null
2025-11-20 Dataset Distillation for Pre-Trained Self-Supervised Vision Models George Cazenavette et.al. 2511.16674 null
2025-11-20 V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models Yang Luo et.al. 2511.16668 null
2025-11-20 InternData-A1: Pioneering High-Fidelity Synthetic Data for Pre-training Generalist Policy Yang Tian et.al. 2511.16651 null
2025-11-20 SAM 3D: 3Dfy Anything in Images SAM 3D Team et.al. 2511.16624 null
2025-11-20 gfnx: Fast and Scalable Library for Generative Flow Networks in JAX Daniil Tiapkin et.al. 2511.16592 null
2025-11-20 The Oracle and The Prism: A Decoupled and Efficient Framework for Generative Recommendation Explanation Jiaheng Zhang et.al. 2511.16543 null
2025-11-20 Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution Jaime Álvarez Urueña et.al. 2511.16541 null
2025-11-20 From generative AI to the brain: five takeaways Claudius Gros et.al. 2511.16432 null
2025-11-20 Generative Modeling of Clinical Time Series via Latent Stochastic Differential Equations Muhammad Aslanimoghanloo et.al. 2511.16427 null
2025-11-20 Denoising weak lensing mass maps with diffusion model and generative adversarial network Shohei D. Aoyama et.al. 2511.16415 null
2025-11-20 Reducing Instability in Synthetic Data Evaluation with a Super-Metric in MalDataGen Anna Luiza Gomes da Silva et.al. 2511.16373 null
2025-11-20 Beyond Generative AI: World Models for Clinical Prediction, Counterfactuals, and Planning Mohammad Areeb Qazi et.al. 2511.16333 null
2025-11-19 Computer-Use Agents as Judges for Generative User Interface Kevin Qinghong Lin et.al. 2511.15567 null
2025-11-19 FunnyNodules: A Customizable Medical Dataset Tailored for Evaluating Explainable AI Luisa Gallée et.al. 2511.15481 null
2025-11-19 Taming Generative Synthetic Data for X-ray Prohibited Item Detection Jialong Sun et.al. 2511.15299 null
2025-11-19 Insert In Style: A Zero-Shot Generative Framework for Harmonious Cross-Domain Object Composition Raghu Vamsi Chittersu et.al. 2511.15197 null
2025-11-18 Zero-shot Synthetic Video Realism Enhancement via Structure-aware Denoising Yifan Wang et.al. 2511.14719 null
2025-11-18 Ground Truth Generation for Multilingual Historical NLP using LLMs Clovis Gladstone et.al. 2511.14688 null
2025-11-18 Streamlining Industrial Contract Management with Retrieval-Augmented LLMs Kristi Topollai et.al. 2511.14671 null
2025-11-18 A Controllable Perceptual Feature Generative Model for Melody Harmonization via Conditional Variational Autoencoder Dengyun Huang et.al. 2511.14600 null
2025-11-18 LiveRAG: A diverse Q&A dataset with varying difficulty level for RAG evaluation David Carmel et.al. 2511.14531 null
2025-11-18 A Generative Data Framework with Authentic Supervision for Underwater Image Restoration and Enhancement Yufeng Tian et.al. 2511.14521 null
2025-11-18 Nonparametric estimation of conditional probability distributions using a generative approach based on conditional push-forward neural networks Nicola Rares Franco et.al. 2511.14455 null
2025-11-18 Infer As You Train: A Symmetric Paradigm of Masked Generative for Click-Through Rate Prediction Moyu Zhang et.al. 2511.14403 null
2025-11-17 Back to Basics: Let Denoising Generative Models Denoise Tianhong Li et.al. 2511.13720 null
2025-11-17 TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models Harold Haodong Chen et.al. 2511.13704 null
2025-11-17 Data Value in the Age of Scaling: Understanding LLM Scaling Dynamics Under Real-Synthetic Data Mixtures Haohui Wang et.al. 2511.13640 null
2025-11-17 Statistically Accurate and Robust Generative Prediction of Rock Discontinuities with A Tabular Foundation Model Han Meng et.al. 2511.13339 null
2025-11-17 AutoMalDesc: Large-Scale Script Analysis for Cyber Threat Research Alexandru-Mihai Apostu et.al. 2511.13333 null
2025-11-17 TacEleven: generative tactic discovery for football open play Siyao Zhao et.al. 2511.13326 null
2025-11-17 PASE: Leveraging the Phonological Prior of WavLM for Low-Hallucination Generative Speech Enhancement Xiaobin Rong et.al. 2511.13300 null
2025-11-17 Grounded by Experience: Generative Healthcare Prediction Augmented with Hierarchical Agentic Retrieval Chuang Zhao et.al. 2511.13293 null
2025-11-17 Examining the Usage of Generative AI Models in Student Learning Activities for Software Programming Rufeng Chen et.al. 2511.13271 null
2025-11-14 Terrain Costmap Generation via Scaled Preference Conditioning Luisa Mao et.al. 2511.11529 null
2025-11-14 SynthSoM-Twin: A Multi-Modal Sensing-Communication Digital-Twin Dataset for Sim2Real Transfer via Synesthesia of Machines Junlong Chen et.al. 2511.11503 null
2025-11-14 From Synthetic Scenes to Real Performance: Enhancing Spatial Reasoning in VLMs Massimo Rizzoli et.al. 2511.11440 null
2025-11-14 YCB-Ev SD: Synthetic event-vision dataset for 6DoF object pose estimation Pavel Rojtberg et.al. 2511.11344 null
2025-11-14 How Physics Professors Use and Frame Generative AI Tools Vidar Skogvoll et.al. 2511.11317 null
2025-11-14 6D Strawberry Pose Estimation: Real-time and Edge AI Solutions Using Purely Synthetic Training Data Saptarshi Neil Sinha et.al. 2511.11307 null
2025-11-14 Improving conditional generative adversarial networks for inverse design of plasmonic structures Petter Persson et.al. 2511.11279 null
2025-11-14 Prompt Engineering vs. Fine-Tuning for LLM-Based Vulnerability Detection in Solana and Algorand Smart Contracts Biagio Boi et.al. 2511.11250 null
2025-11-14 Arcee: Differentiable Recurrent State Chain for Generative Vision Modeling with Mamba SSMs Jitesh Chavan et.al. 2511.11243 null
2025-11-13 Pretrained Joint Predictions for Scalable Batch Bayesian Optimization of Molecular Designs Miles Wang-Henderson et.al. 2511.10590 null
2025-11-13 Don’t Waste It: Guiding Generative Recommenders with Structured Human Priors via Multi-head Decoding Yunkai Zhang et.al. 2511.10492 null
2025-11-13 BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages Guduru Manoj et.al. 2511.10338 null
2025-11-13 Bridging Synthetic and Real Routing Problems via LLM-Guided Instance Generation and Progressive Adaptation Jianghan Zhu et.al. 2511.10233 null
2025-11-10 AdaRec: Adaptive Recommendation with LLMs via Narrative Profiling and Dual-Channel Reasoning Meiyun Wang et.al. 2511.07166 null
2025-11-10 Guiding Generative Models to Uncover Diverse and Novel Crystals via Reinforcement Learning Hyunsoo Park et.al. 2511.07158 null
2025-11-10 Conditional Diffusion as Latent Constraints for Controllable Symbolic Music Generation Matteo Pettenó et.al. 2511.07156 null
2025-11-10 On the Joint Minimization of Regularization Loss Functions in Deep Variational Bayesian Methods for Attribute-Controlled Symbolic Music Generation Matteo Pettenó et.al. 2511.07118 null
2025-11-10 Llama-Embed-Nemotron-8B: A Universal Text Embedding Model for Multilingual and Cross-Lingual Tasks Yauhen Babakhin et.al. 2511.07025 null
2025-11-10 Image Restoration via Primal Dual Hybrid Gradient and Flow Generative Model Ji Li et.al. 2511.06748 null
2025-11-10 Relative Energy Learning for LiDAR Out-of-Distribution Detection Zizhao Li et.al. 2511.06720 null
2025-11-10 F2GAN: A Feature-Feedback Generative Framework for Reliable AI-Based Fault Diagnosis in Inverter-Dominated Microgrids Swetha Rani Kasimalla et.al. 2511.06677 null
2025-11-10 Non-Rival Data as Rival Products: An Encapsulation-Forging Approach for Data Synthesis Kaidong Wang et.al. 2511.06610 null
2025-11-09 Decomate: Leveraging Generative Models for Co-Creative SVG Animation Jihyeon Park et.al. 2511.06297 null
2025-11-09 Synthetic Data-Driven Prompt Tuning for Financial QA over Tables and Documents Yaoning Yu et.al. 2511.06292 null
2025-11-09 Breaking the Modality Barrier: Generative Modeling for Accurate Molecule Retrieval from Mass Spectra Yiwen Zhang et.al. 2511.06259 null
2025-11-09 Gait Recognition via Collaborating Discriminative and Generative Diffusion Models Haijun Xiong et.al. 2511.06245 null
2025-11-08 Adapting Web Agents with Synthetic Supervision Zhaoyang Wang et.al. 2511.06101 null
2025-11-08 Identity Card Presentation Attack Detection: A Systematic Review Esteban M. Ruiz et.al. 2511.06056 null
2025-11-08 CGCE: Classifier-Guided Concept Erasure in Generative Models Viet Nguyen et.al. 2511.05865 null
2025-11-07 Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale David Acuna et.al. 2511.05705 null
2025-11-07 Associative Poisoning to Generative Machine Learning Mathias Lundteigen Mohus et.al. 2511.05177 null
2025-11-07 Role-SynthCLIP: A Role Play Driven Diverse Synthetic Data Approach Yuanxiang Huangfu et.al. 2511.05057 null
2025-11-07 Challenges in 3D Data Synthesis for Training Neural Networks on Topological Features Dylan Peek et.al. 2511.04972 null
2025-11-06 Generate, Evaluate, Iterate: Synthetic Data for Human-in-the-Loop Refinement of LLM Judges Hyo Jin Do et.al. 2511.04478 null
2025-11-06 Towards Causal Market Simulators Dennis Thumm et.al. 2511.04469 null
2025-11-06 Deep learning-based object detection of offshore platforms on Sentinel-1 Imagery and the impact of synthetic training data Robin Spanier et.al. 2511.04304 null
2025-11-06 Proto-LeakNet: Towards Signal-Leak Aware Attribution in Synthetic Human Face Imagery Claudio Giusti et.al. 2511.04260 null
2025-11-06 Learning to Land Anywhere: Transferable Generative Models for Aircraft Trajectories Olav Finne Praesteng Larsen et.al. 2511.04155 null
2025-11-06 Room Envelopes: A Synthetic Dataset for Indoor Layout Reconstruction from Images Sam Bahrami et.al. 2511.03970 null
2025-11-05 Integrating Score-Based Generative Modeling and Neural ODEs for Accurate Representation of Multiscale Chaotic Dynamics Giulio Del Felice et.al. 2511.03862 null
2025-11-05 Human Mesh Modeling for Anny Body Romain Brégier et.al. 2511.03589 null
2025-11-05 Generative Artificial Intelligence in Bioinformatics: A Systematic Review of Models, Applications, and Methodological Advances Riasad Alvi et.al. 2511.03354 null
2025-11-05 From Insight to Exploit: Leveraging LLM Collaboration for Adaptive Adversarial Text Generation Najrin Sultana et.al. 2511.03128 null
2025-11-04 Discrete Bayesian Sample Inference for Graph Generation Ole Petersen et.al. 2511.03015 null
2025-11-04 A Non-Adversarial Approach to Idempotent Generative Modelling Mohammed Al-Jaff et.al. 2511.02614 null
2025-11-04 DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire Understanding Zixuan Liu et.al. 2511.02495 null
2025-11-04 A New Perspective on Precision and Recall for Generative Models Benjamin Sykes et.al. 2511.02414 null
2025-11-04 From Models to Operators: Rethinking Autoscaling Granularity for Large Generative Models Xingqi Cui et.al. 2511.02248 null
2025-11-04 Language-Enhanced Generative Modeling for PET Synthesis from MRI and Blood Biomarkers Zhengjie Zhang et.al. 2511.02206 null
2025-11-04 DoFlow: Causal Generative Flows for Interventional and Counterfactual Time-Series Prediction Dongze Wu et.al. 2511.02137 null
2025-11-03 Quantum-Enhanced Generative Models for Rare Event Prediction M. Z. Haider et.al. 2511.02042 null
2025-11-03 The Born Ultimatum: Conditions for Classical Surrogation of Quantum Generative Models with Correlators Mario Herrero-Gonzalez et.al. 2511.01845 null
2025-11-03 GenDexHand: Generative Simulation for Dexterous Hands Feng Chen et.al. 2511.01791 null
2025-11-03 Game-theoretic distributed learning of generative models for heterogeneous data collections Dmitrij Schlesinger et.al. 2511.01740 null
2025-11-03 Generative Adversarial Synthesis and Deep Feature Discrimination of Brain Tumor MRI Images Md Sumon Ali et.al. 2511.01574 null
2025-11-03 NSYNC: Negative Synthetic Image Generation for Contrastive Training to Improve Stylized Text-To-Image Translation Serkan Ozturk et.al. 2511.01517 null
2025-11-03 UniREditBench: A Unified Reasoning-based Image Editing Benchmark Feng Han et.al. 2511.01295 null
2025-11-03 Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play Jiatong Shi et.al. 2511.01261 null
2025-11-02 Feedback-driven Retrieval-augmented Audio Generation with Large Audio Language Models Junqi Zhao et.al. 2511.01091 null
2025-11-02 SliceVision-F2I: A Synthetic Feature-to-Image Dataset for Visual Pattern Representation on Network Slices Md. Abid Hasan Rafi et.al. 2511.01087 null
2025-11-02 Using Synthetic Data to estimate the True Error is theoretically and practically doable Hai Hoang Thanh et.al. 2511.00964 null
2025-11-04 Deep Generative Models for Enhanced Vitreous OCT Imaging Simone Sarrocco et.al. 2511.00881 null
2025-11-01 Sensitivity Analysis for Climate Science with Generative Flow Models Alex Dobra et.al. 2511.00663 null
2025-10-31 A Technical Exploration of Causal Inference with Hybrid LLM Synthetic Data Dana Kim et.al. 2511.00318 null
2025-10-31 Generative Modeling Enables Molecular Structure Retrieval from Coulomb Explosion Imaging Xiang Li et.al. 2511.00179 null
2025-10-31 GeoFM: Enhancing Geometric Reasoning of MLLMs via Synthetic Data Generation through Formal Language Yuhao Zhang et.al. 2510.27448 null
2025-10-31 Generative Semantic Coding for Ultra-Low Bitrate Visual Communication and Analysis Weiming Chen et.al. 2510.27324 null
2025-10-31 Disrupting Networks: Amplifying Social Dissensus via Opinion Perturbation and Large Language Models Erica Coppolillo et.al. 2510.27152 null
2025-11-03 Generative diffusion modeling protocols for improving the Kikuchi pattern indexing in electron back-scatter diffraction Meghraj Prajapat et.al. 2510.26907 null
2025-10-30 BI-DCGAN: A Theoretically Grounded Bayesian Framework for Efficient and Diverse GANs Mahsa Valizadeh et.al. 2510.26892 null
2025-10-30 Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench Fenfen Lin et.al. 2510.26865 null
2025-10-30 OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes Yukun Huang et.al. 2510.26800 null
2025-10-30 FlowQ-Net: A Generative Framework for Automated Quantum Circuit Design Jun Dai et.al. 2510.26688 null
2025-10-30 Generative sampling with physics-informed kernels Friederike Ihssen et.al. 2510.26678 null
2025-10-30 Metacognition and Confidence Dynamics in Advice Taking from Generative AI Clara Colombatto et.al. 2510.26508 null
2025-10-30 UniTok-Audio: A Unified Audio Generation Framework via Generative Modeling on Discrete Codec Tokens Chengwei Liu et.al. 2510.26372 null
2025-10-30 MisSynth: Improving MISSCI Logical Fallacies Classification with Synthetic Data Mykhailo Poliakov et.al. 2510.26345 null
2025-10-30 Likely Interpolants of Generative Models Frederik Möbius Rygaard et.al. 2510.26266 null
2025-10-30 New Money: A Systematic Review of Synthetic Data Generation for Finance James Meldrum et.al. 2510.26076 null
2025-10-30 Bias-Corrected Data Synthesis for Imbalanced Learning Pengfei Lyu et.al. 2510.26046 null
2025-10-29 Generative Image Restoration and Super-Resolution using Physics-Informed Synthetic Data for Scanning Tunneling Microscopy Nikola L. Kolev et.al. 2510.25921 null
2025-10-29 A Survey on Efficient Large Language Model Training: From Data-centric Perspectives Junyu Luo et.al. 2510.25817 null
2025-10-28 SHA-256 Infused Embedding-Driven Generative Modeling of High-Energy Molecules in Low-Data Regimes Siddharth Verma et.al. 2510.25788 null
2025-10-29 E-Scores for (In)Correctness Assessment of Generative Model Outputs Guneet S. Dhillon et.al. 2510.25770 null
2025-10-29 Distributional Evaluation of Generative Models via Relative Density Ratio Yuliang Xu et.al. 2510.25507 null
2025-10-31 TempoPFN: Synthetic Pre-training of Linear RNNs for Zero-shot Time Series Forecasting Vladyslav Moroshan et.al. 2510.25502 null
2025-10-29 Generative Bayesian Optimization: Generative Models as Acquisition Functions Rafael Oliveira et.al. 2510.25240 null
2025-10-29 Scaling Cultural Resources for Improving Generative Models Hayk Stepanyan et.al. 2510.25167 null
2025-10-29 Target-Guided Bayesian Flow Networks for Quantitatively Constrained CAD Generation Wenhao Zheng et.al. 2510.25163 null
2025-10-28 VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos Qiucheng Wu et.al. 2510.24904 null
2025-10-28 A Parameter-Efficient Multi-Scale Convolutional Adapter for Synthetic Speech Detection Yassine El Kheir et.al. 2510.24852 null
2025-10-28 AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis Xuanzhong Chen et.al. 2510.24695 null
2025-10-28 OrchDAG: Complex Tool Orchestration in Multi-Turn Interactions with Plan DAGs Yifu Lu et.al. 2510.24663 null
2025-10-28 A Comprehensive Evaluation Framework for Synthetic Trip Data Generation in Public Transport Yuanyuan Wu et.al. 2510.24375 null
2025-10-28 Bayesian Speech synthesizers Can Learn from Multiple Teachers Ziyang Zhang et.al. 2510.24372 null
2025-10-28 PRIVET: Privacy Metric Based on Extreme Value Theory Antoine Szatkownik et.al. 2510.24233 null
2025-10-28 Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio Generation Kang Zhang et.al. 2510.24103 null
2025-10-28 Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification William Yang et.al. 2510.24078 null
2025-10-28 Score-based constrained generative modeling via Langevin diffusions with boundary conditions Adam Nordenhög et.al. 2510.23985 null
2025-10-27 Learning Interpretable Features in Audio Latent Spaces via Sparse Autoencoders Nathan Paek et.al. 2510.23802 null
2025-10-27 Robust and Generalizable Background Subtraction on Images of Calorimeter Jets using Unsupervised Generative Learning Yeonju Go et.al. 2510.23717 null
2025-10-27 Variational Masked Diffusion Models Yichi Zhang et.al. 2510.23606 null
2025-10-27 RobotArena $\infty$: Scalable Robot Benchmarking via Real-to-Sim Translation Yash Jangir et.al. 2510.23571 null
2025-10-27 An Efficient Remote Sensing Super Resolution Method Exploring Diffusion Priors and Multi-Modal Constraints for Crop Type Mapping Songxi Yang et.al. 2510.23382 null
2025-10-27 Model-Behavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isn’t the Right One Itamar Avitan et.al. 2510.23321 null
2025-10-27 Increasing LLM Coding Capabilities through Diverse Synthetic Coding Tasks Amal Abed et.al. 2510.23208 null
2025-10-27 On the Anisotropy of Score-Based Generative Models Andreas Floros et.al. 2510.22899 null
2025-10-26 A Comprehensive Dataset for Human vs. AI Generated Text Detection Rajarshi Roy et.al. 2510.22874 null
2025-10-26 SAO-Instruct: Free-form Audio Editing using Natural Language Instructions Michael Ungersböck et.al. 2510.22795 null
2025-10-26 Semi-Supervised Learning under General Causal Models Archer Moore et.al. 2510.22567 null
2025-10-25 GigaEmbeddings: Efficient Russian Language Embedding Model Egor Kolodin et.al. 2510.22369 null
2025-10-24 Linearized Optimal Transport for Analysis of High-Dimensional Point-Cloud and Single-Cell Data Tianxiang Wang et.al. 2510.22033 null
2025-10-24 Embedding Trust: Semantic Isotropy Predicts Nonfactuality in Long-Form Text Generation Dhrupad Bhardwaj et.al. 2510.21891 null
2025-10-23 Generative AI in Depth: A Survey of Recent Advances, Model Variants, and Real-World Applications Shamim Yazdani et.al. 2510.21887 null
2025-10-24 Generative Correlation Manifolds: Generating Synthetic Data with Preserved Higher-Order Correlations Jens E. d’Hondt et.al. 2510.21610 null
2025-10-24 Generalised Flow Maps for Few-Step Generative Modelling on Riemannian Manifolds Oscar Davis et.al. 2510.21608 null
2025-10-24 S3OD: Towards Generalizable Salient Object Detection with Synthetic Data Orest Kupyn et.al. 2510.21605 null
2025-10-24 Are These Even Words? Quantifying the Gibberishness of Generative Speech Models Danilo de Oliveira et.al. 2510.21317 null
2025-10-24 Text-Guided Diffusion Model-based Generative Communication for Wireless Image Transmission Shengkang Chen et.al. 2510.21299 null
2025-10-24 Robust Distortion-Free Watermark for Autoregressive Audio Generation Models Yihan Wu et.al. 2510.21115 null
2025-10-23 Amortized Active Generation of Pareto Sets Daniel M. Steinberg et.al. 2510.21052 null
2025-10-23 Can Current Detectors Catch Face-to-Voice Deepfake Attacks? Nguyen Linh Bao Nguyen et.al. 2510.21004 null
2025-10-23 CUPID: Pose-Grounded Generative 3D Reconstruction from a Single Image Binbin Huang et.al. 2510.20776 null
2025-10-24 EditInfinity: Image Editing with Binary-Quantized Generative Models Jiahuan Wang et.al. 2510.20217 null
2025-10-22 Learning and Simulating Building Evacuation Patterns for Enhanced Safety Design Using Generative Models Jin Han et.al. 2510.19623 null
2025-10-22 The Intricate Dance of Prompt Complexity, Quality, Diversity, and Consistency in T2I Models Xiaofeng Zhang et.al. 2510.19557 link
2025-10-22 Predicting before Reconstruction: A generative prior framework for MRI acceleration Juhyung Park et.al. 2510.19472 null
2025-10-22 Unified Reinforcement and Imitation Learning for Vision-Language Models Byung-Kwan Lee et.al. 2510.19307 null
2025-10-22 Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall Mingyu Jo et.al. 2510.19304 null
2025-10-24 Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks Kai Zeng et.al. 2510.19195 null
2025-10-21 Improving the Generation and Evaluation of Synthetic Data for Downstream Medical Causal Inference Harry Amad et.al. 2510.18768 null
2025-10-21 ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization Yuanhe Guo et.al. 2510.18433 null
2025-10-21 GPTFace: Generative Pre-training of Facial-Linguistic Transformer by Span Masking and Weakly Correlated Text-image Data Yudong Li et.al. 2510.18345 null
2025-10-21 Towards Identifiability of Hierarchical Temporal Causal Representation Learning Zijian Li et.al. 2510.18310 null
2025-10-21 ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive Text-to-Speech Generation Haowei Lou et.al. 2510.18308 null
2025-10-21 Efficient Few-shot Identity Preserving Attribute Editing for 3D-aware Deep Generative Models Vishal Vinod et.al. 2510.18287 null
2025-10-21 ACTG-ARL: Differentially Private Conditional Text Generation with RL-Boosted Control Yuzheng Hu et.al. 2510.18232 null
2025-10-20 Gradient Variance Reveals Failure Modes in Flow-Based Generative Models Teodora Reu et.al. 2510.18118 null
2025-10-20 Fine-tuning Flow Matching Generative Models with Intermediate Feedback Jiajun Fan et.al. 2510.18072 null
2025-10-20 Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models Jiajun Fan et.al. 2510.18053 null
2025-10-20 EvoSyn: Generalizable Evolutionary Data Synthesis for Verifiable Learning He Du et.al. 2510.17928 null
2025-10-20 QueST: Incentivizing LLMs to Generate Difficult Problems Hanxu Hu et.al. 2510.17715 null
2025-10-20 Quantum Synthetic Data Generation for Industrial Bioprocess Monitoring Shawn M. Gibford et.al. 2510.17688 null
2025-10-20 Integrating BIM and UAV-based photogrammetry for Automated 3D Structure Model Segmentation Siqi Chen et.al. 2510.17609 null
2025-10-20 Towards geological inference with process-based and deep generative modeling, part 2: inversion of fluvial deposits and latent-space disentanglement Guillaume Rongier et.al. 2510.17478 null
2025-10-20 Optimal transport by a Lagrangian dynamics of population distribution Babak Benam et.al. 2510.17193 null
2025-10-19 Hephaestus: Mixture Generative Modeling with Energy Guidance for Large-scale QoS Degradation Nguyen Do et.al. 2510.17036 null
2025-10-19 Conditional Synthetic Live and Spoof Fingerprint Generation Syed Konain Abbas et.al. 2510.17035 null
2025-10-19 Towards Real-Time Generative Speech Restoration with Flow-Matching Tsun-An Hsieh et.al. 2510.16997 null
2025-10-19 Differentially Private Linear Regression and Synthetic Data Generation with Statistical Guarantees Shurong Lin et.al. 2510.16974 null
2025-10-19 Class-N-Diff: Classification-Induced Diffusion Model Can Make Fair Skin Cancer Diagnosis Nusrat Munia et.al. 2510.16887 null
2025-10-19 U-Codec: Ultra Low Frame-rate Neural Speech Codec for Fast High-fidelity Speech Generation Xusheng Yang et.al. 2510.16718 null
2025-10-18 Escaping Model Collapse via Synthetic Data Verification: Near-term Improvements and Long-term Convergence Bingji Yi et.al. 2510.16657 null
2025-10-18 Accelerated Learning on Large Scale Screens using Generative Library Models Eli N. Weinstein et.al. 2510.16612 null
2025-10-17 AtomBench: A Benchmark for Generative Atomic Structure Models using GPT, Diffusion, and Flow Architectures Charles Rhys Campbell et.al. 2510.16165 null
2025-10-16 Membership Inference over Diffusion-models-based Synthetic Tabular Data Peini Cheng et.al. 2510.16037 null
2025-10-17 GENESIS: A Generative Model of Episodic-Semantic Interaction Marco D’Alessandro et.al. 2510.15828 null
2025-10-16 Deep generative priors for 3D brain analysis Ana Lawry Aguila et.al. 2510.15119 null
2025-10-16 AlignFlow: Improving Flow-based Generative Models with Semi-Discrete Optimal Transport Lingkai Kong et.al. 2510.15038 null
2025-10-16 Harmonizing Diverse Models: A Layer-wise Merging Strategy for Consistent Generation Xujun Peng et.al. 2510.14915 null
2025-10-16 FraQAT: Quantization Aware Training with Fractional bits Luca Morreale et.al. 2510.14823 null
2025-10-16 SpeechLLM-as-Judges: Towards General and Interpretable Speech Quality Evaluation Hui Wang et.al. 2510.14664 null
2025-10-16 Generative Models From and For Sampling-Based MPC: A Bootstrapped Approach For Adaptive Contact-Rich Manipulation Lara Brudermüller et.al. 2510.14643 null
2025-10-16 Unsupervised Deep Generative Models for Anomaly Detection in Neuroimaging: A Systematic Scoping Review Youwan Mahé et.al. 2510.14462 null
2025-10-16 Towards geological inference with process-based and deep generative modeling, part 1: training on fluvial deposits Guillaume Rongier et.al. 2510.14445 null
2025-10-16 Qwen3Guard Technical Report Haiquan Zhao et.al. 2510.14276 null
2025-10-15 Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms Shrey Pandit et.al. 2510.13913 null
2025-10-13 Joint Discriminative-Generative Modeling via Dual Adversarial Training Xuwang Yin et.al. 2510.13872 null
2025-10-15 Assessing the Geographic Generalization and Physical Consistency of Generative Models for Climate Downscaling Carlo Saccardi et.al. 2510.13722 null
2025-10-15 Manifold Decoders: A Framework for Generative Modeling from Nonlinear Embeddings Riddhish Thakare et.al. 2510.13622 null
2025-10-15 FreshTab: Sourcing Fresh Data for Table-to-Text Generation Evaluation Kristýna Onderková et.al. 2510.13598 null
2025-10-15 Reinforcement Learning Meets Masked Generative Models: Mask-GRPO for Text-to-Image Generation Yifu Luo et.al. 2510.13418 null
2025-10-15 UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE Zhenyu Liu et.al. 2510.13344 null
2025-10-15 Federated Conditional Conformal Prediction via Generative Models Rui Xu et.al. 2510.13297 null
2025-10-15 Generative model for information metamaterial design Jun Ming Hou et.al. 2510.13264 null
2025-10-15 NeuroRVQ: Multi-Scale EEG Tokenization for Generative Large Brainwave Models Konstantinos Barmpas et.al. 2510.13068 null
2025-10-16 Adapting Noise to Data: Generative Flows from 1D Processes Jannis Chemseddine et.al. 2510.12636 null
2025-10-14 Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training Jiachen Lei et.al. 2510.12586 null
2025-10-14 Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance Jincheng Zhong et.al. 2510.12497 null
2025-10-14 Continuous Uniqueness and Novelty Metrics for Generative Modeling of Inorganic Crystals Masahiro Negishi et.al. 2510.12405 null
2025-10-14 Generative Diffusion Model DiffCrysGen Discovers Rare Earth-Free Magnetic Materials Sourav Mal et.al. 2510.12329 null
2025-10-14 Beating Harmful Stereotypes Through Facts: RAG-based Counter-speech Generation Greta Damo et.al. 2510.12316 null
2025-10-14 GOAT: A Training Framework for Goal-Oriented Agent with Tools Hyunji Min et.al. 2510.12218 null
2025-10-15 DiSTAR: Diffusion over a Scalable Token Autoregressive Representation for Speech Generation Yakun Song et.al. 2510.12210 null
2025-10-14 The Impact of Synthetic Data on Object Detection Model Performance: A Comparative Analysis with Real-World Data Muammer Bay et.al. 2510.12208 null
2025-10-14 Audio Palette: A Diffusion Transformer with Multi-Signal Conditioning for Controllable Foley Synthesis Junnuo Wang et.al. 2510.12175 null
2025-10-14 Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing Rongzhi Zhang et.al. 2510.12121 null
2025-10-14 G4Splat: Geometry-Guided Gaussian Splatting with Generative Prior Junfeng Ni et.al. 2510.12099 null
2025-10-14 Your VAR Model is Secretly an Efficient and Explainable Generative Classifier Yi-Chung Chen et.al. 2510.12060 null
2025-10-13 UALM: Unified Audio Language Model for Understanding, Generation and Reasoning Jinchuan Tian et.al. 2510.12000 null
2025-10-15 Y-shaped Generative Flows Arip Asadulaev et.al. 2510.11955 null
2025-10-13 GRAVITY: A Framework for Personalized Text Generation via Profile-Grounded Synthetic Preferences Priyanka Dey et.al. 2510.11952 null
2025-10-13 LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens Armel Zebaze et.al. 2510.11919 null
2025-10-13 Balancing Synthetic Data and Replay for Enhancing Task-Specific Capabilities Urs Spiegelhalter et.al. 2510.11842 null
2025-10-13 Schrödinger bridge for generative AI: Soft-constrained formulation and convergence analysis Jin Ma et.al. 2510.11829 null
2025-10-13 OneRec-Think: In-Text Reasoning for Generative Recommendation Zhanyu Liu et.al. 2510.11639 null
2025-10-13 A Framework for Low-Effort Training Data Generation for Urban Semantic Segmentation Denis Zavadski et.al. 2510.11567 null
2025-10-13 Offline Reinforcement Learning with Generative Trajectory Policies Xinsong Feng et.al. 2510.11499 null
2025-10-13 Into the Unknown: Towards using Generative Models for Sampling Priors of Environment Uncertainty for Planning in Configuration Spaces Subhransu S. Bhattacharjee et.al. 2510.11014 null
2025-10-13 Secret-Protected Evolution for Differentially Private Synthetic Text Generation Tianze Wang et.al. 2510.10990 null
2025-10-13 IUT-Plug: A Plug-in tool for Interleaved Image-Text Generation Zeteng Lin et.al. 2510.10969 null
2025-10-13 Comparative Evaluation of Neural Network Architectures for Generalizable Human Spatial Preference Prediction in Unseen Built Environments Maral Doctorarastoo et.al. 2510.10954 null
2025-10-13 Towards Distribution-Shift Uncertainty Estimation for Inverse Problems with Generative Priors Namhoon Kim et.al. 2510.10947 null
2025-10-13 Find Your Optimal Teacher: Personalized Data Synthesis via Router-Guided Multi-Teacher Distillation Hengyuan Zhang et.al. 2510.10925 null
2025-10-12 DISC-GAN: Disentangling Style and Content for Cluster-Specific Synthetic Underwater Image Generation Sneha Varur et.al. 2510.10782 null
2025-10-12 Controllable Generative Trajectory Prediction via Weak Preference Alignment Yongxi Cao et.al. 2510.10731 null
2025-10-12 Designing ReLU Generative Networks to Enumerate Trees with a Given Tree Edit Distance Mamoona Ghafoor et.al. 2510.10706 null
2025-10-15 Trustworthy Retrosynthesis: Eliminating Hallucinations with a Diverse Ensemble of Reaction Scorers Michal Sadowski et.al. 2510.10645 null
2025-10-12 GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search Heng Zhang et.al. 2510.10581 null
2025-10-12 Reverse Supervision at Scale: Exponential Search Meets the Economics of Annotation Masoud Makrehchi et.al. 2510.10446 null
2025-10-11 Generative Modeling of Aerosol State Representations Ehsan Saleh et.al. 2510.10361 null
2025-10-11 LLM-Friendly Knowledge Representation for Customer Support Hanchen Su et.al. 2510.10331 null
2025-10-11 Calibrating Generative Models Henry D. Smith et.al. 2510.10020 link
2025-10-11 Generative Latent Video Compression Zongyu Guo et.al. 2510.09987 null
2025-10-10 Augmenting generative models with biomedical knowledge graphs improves targeted drug discovery Aditya Malusare et.al. 2510.09914 null
2025-10-10 Domain Knowledge Infused Generative Models for Drug Discovery Synthetic Data Bing Hu et.al. 2510.09837 null
2025-10-10 BaNEL: Exploration Posteriors for Generative Modeling Using Only Negative Rewards Sangyun Lee et.al. 2510.09596 null
2025-10-10 Efficient Autoregressive Inference for Transformer Probabilistic Models Conor Hassan et.al. 2510.09477 null
2025-10-13 Failure Prediction at Runtime for Generative Robot Policies Ralf Römer et.al. 2510.09459 null
2025-10-10 A Biophysically-Conditioned Generative Framework for 3D Brain Tumor MRI Synthesis Valentin Biller et.al. 2510.09365 null
2025-10-10 SOS: Synthetic Object Segments Improve Detection, Segmentation, and Grounding Weikai Huang et.al. 2510.09110 null
2025-10-10 MCMC: Bridging Rendering, Optimization and Generative AI Gurprit Singh et.al. 2510.09078 null
2025-10-10 MMAudioSep: Taming Video-to-Audio Generative Model Towards Video/Text-Queried Sound Separation Akira Takahashi et.al. 2510.09065 null
2025-10-10 O_O-VC: Synthetic Data-Driven One-to-One Alignment for Any-to-Any Voice Conversion Huu Tuong Tu et.al. 2510.09061 null
2025-10-10 Mirror Flow Matching with Heavy-Tailed Priors for Generative Modeling on Convex Domains Yunrui Guan et.al. 2510.08929 null
2025-10-10 ControlAudio: Tackling Text-Guided, Timing-Indicated and Intelligible Audio Generation via Progressive Diffusion Modeling Yuxuan Jiang et.al. 2510.08878 null
2025-10-08 Next Semantic Scale Prediction via Hierarchical Diffusion Language Models Cai Zhou et.al. 2510.08632 null
2025-10-11 MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization Xiangyu Zhao et.al. 2510.08540 null
2025-10-09 Universality and kernel-adaptive training for classically trained, quantum-deployed generative models Andrii Kurkin et.al. 2510.08476 null
2025-10-09 SummDiff: Generative Modeling of Video Summarization with Diffusion Kwanseok Kim et.al. 2510.08458 null
2025-10-09 ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval Jianlyu Chen et.al. 2510.08252 null
2025-10-09 Contrastive Decoding for Synthetic Data Generation in Low-Resource Language Modeling Jannek Ulm et.al. 2510.08245 null
2025-10-09 Detecting and Mitigating Insertion Hallucination in Video-to-Audio Generation Liyang Chen et.al. 2510.08078 null
2025-10-09 IntMeanFlow: Few-step Speech Generation with Integral Velocity Distillation Wei Wang et.al. 2510.07979 null
2025-10-09 Comprehensiveness Metrics for Automatic Evaluation of Factual Recall in Text Generation Adam Dejl et.al. 2510.07926 null
2025-10-09 GeoGen: A Two-stage Coarse-to-Fine Framework for Fine-grained Synthetic Location-based Social Network Trajectory Generation Rongchao Xu et.al. 2510.07735 null
2025-10-09 SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction Wenyue Chen et.al. 2510.07723 null
2025-10-08 Transferable Generative Models Bridge Femtosecond to Nanosecond Time-Step Molecular Dynamics Juan Viguera Diez et.al. 2510.07589 null
2025-10-07 Mitigating Surgical Data Imbalance with Dual-Prediction Video Diffusion Model Danush Kumar Venkatesh et.al. 2510.07345 null
2025-10-08 A Digital Twin Framework for Metamorphic Testing of Autonomous Driving Systems Using Generative Model Tony Zhang et.al. 2510.07133 null
2025-10-08 Sharpness-Aware Data Generation for Zero-shot Quantization Dung Hoang-Anh et.al. 2510.07018 null
2025-10-08 Differentially Private Synthetic Text Generation for Retrieval-Augmented Generation (RAG) Junki Mori et.al. 2510.06719 null
2025-10-08 XLSR-Kanformer: A KAN-Intergrated model for Synthetic Speech Detection Phuong Tuan Dat et.al. 2510.06706 null
2025-10-08 Three Forms of Stochastic Injection for Improved Distribution-to-Distribution Generative Modeling Shiye Su et.al. 2510.06634 null
2025-10-08 AIM 2025 Challenge on Real-World RAW Image Denoising Feiran Li et.al. 2510.06601 null
2025-10-08 SDQM: Synthetic Data Quality Metric for Object Detection Dataset Evaluation Ayush Zenith et.al. 2510.06596 null
2025-10-07 Deep Generative Model for Human Mobility Behavior Ye Hong et.al. 2510.06473 null
2025-10-07 FinLFQA: Evaluating Attributed Text Generation of LLMs in Financial Long-Form Question Answering Yitao Long et.al. 2510.06426 null
2025-10-07 Controllable Stylistic Text Generation with Train-Time Attribute-Regularized Diffusion Fan Zhou et.al. 2510.06386 null
2025-10-07 Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models Jiahao Wang et.al. 2510.06209 null
2025-10-07 Thermodynamic Performance Limits for Score-Based Diffusion Models Nathan X. Kodama et.al. 2510.06174 null
2025-10-07 Towards Data-Efficient Medical Imaging: A Generative and Semi-Supervised Framework Mosong Ma et.al. 2510.06123 null
2025-10-07 Carré du champ flow matching: better quality-generalisation tradeoff in generative models Jacob Bamberger et.al. 2510.05930 null
2025-10-07 FoleyGRAM: Video-to-Audio Generation with GRAM-Aligned Multimodal Encoders Riccardo Fosco Gramaccioni et.al. 2510.05829 null
2025-10-07 StereoSync: Spatially-Aware Stereo Audio Generation from Video Christian Marinoni et.al. 2510.05828 null
2025-10-07 Physicochemically Informed Dual-Conditioned Generative Model of T-Cell Receptor Variable Regions for Cellular Therapy Jiahao Ma et.al. 2510.05747 null
2025-10-07 Redefining Generalization in Visual Domains: A Two-Axis Framework for Fake Image Detection with FusionDetect Amirtaha Amanzadi et.al. 2510.05740 null
2025-10-08 Scalable In-context Ranking with Generative Models Nilesh Gupta et.al. 2510.05396 null
2025-10-06 Watch and Learn: Learning to Use Computers from Online Videos Chan Hee Song et.al. 2510.04673 null
2025-10-06 Forecasting-Based Biomedical Time-series Data Synthesis for Open Data and Robust AI Youngjoon Lee et.al. 2510.04622 null
2025-10-06 Language Model Based Text-to-Audio Generation: Anti-Causally Aligned Collaborative Residual Transformers Juncheng Wang et.al. 2510.04577 null
2025-10-06 Quantum generative model on bicycle-sharing system and an application Fumio Nemoto et.al. 2510.04512 null
2025-10-05 Score-based generative emulation of impact-relevant Earth system model outputs Shahine Bouabid et.al. 2510.04358 null
2025-10-05 Pitch-Conditioned Instrument Sound Synthesis From an Interactive Timbre Latent Space Christian Limberg et.al. 2510.04339 null
2025-10-05 Scaling Sequence-to-Sequence Generative Neural Rendering Shikun Liu et.al. 2510.04236 null
2025-10-05 BLADE: Bias-Linked Adaptive DEbiasing Piyush Arora et.al. 2510.04174 null
2025-10-05 A Multilingual Framework for Dysarthria: Detection, Severity Classification, Speech-to-Text, and Clean Speech Generation Ananya Raghu et.al. 2510.03986 null
2025-10-04 Mirage: Unveiling Hidden Artifacts in Synthetic Images with Large Vision-Language Models Pranav Sharma et.al. 2510.03840 null
2025-10-07 Neon: Negative Extrapolation From Self-Training Improves Image Generation Sina Alemohammad et.al. 2510.03597 null
2025-10-07 Longitudinal Flow Matching for Trajectory Modeling Mohammad Mohaiminul Islam et.al. 2510.03569 null
2025-10-07 Synthetic Audio Forensics Evaluation (SAFE) Challenge Kirill Trapeznikov et.al. 2510.03387 null
2025-10-03 Predicting cell-specific gene expression profile and knockout impact through deep learning Yongjian He et.al. 2510.03359 null
2025-10-03 Wave-GMS: Lightweight Multi-Scale Generative Model for Medical Image Segmentation Talha Ahmed et.al. 2510.03216 null
2025-10-06 What Drives Compositional Generalization in Visual Generative Models? Karim Farid et.al. 2510.03075 null
2025-10-03 SALSA-V: Shortcut-Augmented Long-form Synchronized Audio from Videos Amir Dellali et.al. 2510.02916 null
2025-10-03 Neural Jump ODEs as Generative Models Robert A. Crowell et.al. 2510.02757 null
2025-10-03 Automated Constraint Specification for Job Scheduling by Regulating Generative Model with Domain-Specific Representation Yu-Zhe Shi et.al. 2510.02679 null
2025-10-03 Deep Generative Continual Learning using Functional LoRA: FunLoRA Victor Enescu et.al. 2510.02631 null
2025-10-02 Beyond Linear Diffusions: Improved Representations for Rare Conditional Generative Modeling Kulunu Dharmakeerthi et.al. 2510.02499 null
2025-10-02 Orthogonal Procrustes problem preserves correlations in synthetic data Oussama Ounissi et.al. 2510.02405 null
2025-10-02 Equilibrium Matching: Generative Modeling with Implicit Energy-Based Models Runqian Wang et.al. 2510.02300 null
2025-10-02 Study on LLMs for Promptagator-Style Dense Retriever Training Daniel Gwon et.al. 2510.02241 null
2025-10-02 FlexDoc: Parameterized Sampling for Diverse Multilingual Synthetic Documents for Training Document Understanding Models Karan Dua et.al. 2510.02133 null
2025-10-02 SoundReactor: Frame-level Online Video-to-Audio Generation Koichi Saito et.al. 2510.02110 null
2025-10-04 NGGAN: Noise Generation GAN Based on the Practical Measurement Dataset for Narrowband Powerline Communications Ying-Ren Chien et.al. 2510.01850 null
2025-10-02 Sensitivity, Specificity, and Consistency: A Tripartite Evaluation of Privacy Filters for Synthetic Data Generation Adil Koeken et.al. 2510.01793 null
2025-10-02 A Locally Executable AI System for Improving Preoperative Patient Communication: A Multi-Domain Clinical Evaluation Motoki Sato et.al. 2510.01671 null
2025-10-02 Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls Feiyang Kang et.al. 2510.01631 null
2025-10-02 Posterior Collapse as a Phase Transition in Variational Autoencoders Zhen Li et.al. 2510.01621 null
2025-10-02 TimeGazer: Temporal Modeling of Predictive Gaze Stabilization for AR Interaction Yaozheng Xia et.al. 2510.01561 null
2025-10-01 Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling Huangjie Zheng et.al. 2510.01329 null
2025-10-01 MorphGen: Controllable and Morphologically Plausible Generative Cell-Imaging Berker Demirel et.al. 2510.01298 null
2025-10-01 Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity Jiayi Zhang et.al. 2510.01171 null
2025-10-01 Fiaingen: A financial time series generative method matching real-world data quality Jože M. Rožanec et.al. 2510.01169 null
2025-10-01 GRAD: Generative Retrieval-Aligned Demonstration Sampler for Efficient Few-Shot Reasoning Oussama Gabouj et.al. 2510.01165 null
2025-10-01 Apriel-1.5-15b-Thinker Shruthan Radhakrishna et.al. 2510.01141 null
2025-10-01 Authentic Discrete Diffusion Model Xiao Li et.al. 2510.01047 null
2025-10-01 Making, not Taking, the Best of N Ammar Khairi et.al. 2510.00931 null
2025-10-01 Population Synthesis using Incomplete Information Tanay Rastogi et.al. 2510.00859 null
2025-10-01 From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling Yifei Cao et.al. 2510.00743 null
2025-10-01 Inclusive Easy-to-Read Generation for Individuals with Cognitive Impairments François Ledoyen et.al. 2510.00691 null
2025-10-01 A Geometric Unification of Generative AI with Manifold-Probabilistic Projection Models Leah Bar et.al. 2510.00666 null
2025-10-01 Facilitating Cognitive Accessibility with LLMs: A Multi-Task Approach to Easy-to-Read Text Generation François Ledoyen et.al. 2510.00662 null
2025-10-01 MCM-DPO: Multifaceted Cross-Modal Direct Preference Optimization for Alt-text Generation Jinlan Fu et.al. 2510.00647 null
2025-10-01 PodEval: A Multimodal Evaluation Framework for Podcast Audio Generation Yujia Xiao et.al. 2510.00485 null
2025-09-30 Nonparametric Identification of Latent Concepts Yujia Zheng et.al. 2510.00136 null
2025-09-30 Video Object Segmentation-Aware Audio Generation Ilpo Viertola et.al. 2509.26604 null
2025-09-30 Learning from Hallucinating Critical Points for Navigation in Dynamic Environments Saad Abdul Ghani et.al. 2509.26513 null
2025-09-30 Data-to-Energy Stochastic Dynamics Kirill Tamogashev et.al. 2509.26364 null
2025-09-30 Reframing Generative Models for Physical Systems using Stochastic Interpolants Anthony Zhou et.al. 2509.26282 null
2025-09-30 EnScale: Temporally-consistent multivariate generative downscaling via proper scoring rules Maybritt Schillinger et.al. 2509.26258 null
2025-09-30 MARS: Audio Generation via Multi-Channel Autoregression on Spectrograms Eleonora Ristori et.al. 2509.26007 null
2025-09-30 Think Less, Label Better: Multi-Stage Domain-Grounded Synthetic Data Generation for Fine-Tuning Large Language Models in Telecommunications Chenhua Shi et.al. 2509.25736 null
2025-09-30 CATCH: A Novel Data Synthesis Framework for High Therapy Fidelity and Memory-Driven Planning Chain of Thought in AI Counseling Mingyu Chen et.al. 2509.25733 null
2025-09-30 Controlled Generation for Private Synthetic Text Zihao Zhao et.al. 2509.25729 null
2025-09-30 OmniDFA: A Unified Framework for Open Set Synthesis Image Detection and Few-Shot Attribution Shiyu Wu et.al. 2509.25682 null
2025-09-30 SING-SQL: A Synthetic Data Generation Framework for In-Domain Text-to-SQL Translation Hasan Alp Caferoğlu et.al. 2509.25672 null
2025-09-29 Coupling Generative Modeling and an Autoencoder with the Causal Bridge Ruolin Meng et.al. 2509.25599 null
2025-09-29 Understanding Generative Recommendation with Semantic IDs from a Model-scaling View Jingzhe Liu et.al. 2509.25522 null
2025-09-29 Uncertainty-Aware Generative Oversampling Using an Entropy-Guided Conditional Variational Autoencoder Amirhossein Zare et.al. 2509.25334 null
2025-09-29 Paired by the Teacher: Turning Unpaired Data into High-Fidelity Pairs for Low-Resource Text Generation Yen-Ju Lu et.al. 2509.25144 null
2025-09-29 MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech Chengyao Wang et.al. 2509.25131 null
2025-09-29 Meta-Learning Theory-Informed Inductive Biases using Deep Kernel Gaussian Processes Bahti Zakirov et.al. 2509.24919 null
2025-09-29 VAGUEGAN: Stealthy Poisoning and Backdoor Attacks on Image Generative Pipelines Mostafa Mohaimen Akand Faisal et.al. 2509.24891 null
2025-09-29 ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation Jiuhong Xiao et.al. 2509.24878 null
2025-09-29 Cell2Text: Multimodal LLM for Generating Single-Cell Descriptions from RNA-Seq Data Oussama Kharouiche et.al. 2509.24840 null
2025-09-30 MarS-FM: Generative Modeling of Molecular Dynamics via Markov State Models Kacper Kapuśniak et.al. 2509.24779 null
2025-09-30 VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning Xin Cheng et.al. 2509.24773 null
2025-09-29 Socratic-Zero: Bootstrapping Reasoning via Data-Free Agent Co-evolution Shaobo Wang et.al. 2509.24726 null
2025-09-29 VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning Yixuan Zhou et.al. 2509.24650 null
2025-09-29 When Audio Generators Become Good Listeners: Generative Features for Understanding Tasks Zeyu Xie et.al. 2509.24635 null
2025-09-29 Training-Free Multimodal Guidance for Video to Audio Generation Eleonora Grassucci et.al. 2509.24550 null
2025-09-29 Alternatives To Next Token Prediction In Text Generation – A Survey Charlie Wyatt et.al. 2509.24435 null
2025-09-29 RapidMV: Leveraging Spatio-Angular Representations for Efficient and Consistent Text-to-Multi-View Synthesis Seungwook Kim et.al. 2509.24410 null
2025-09-29 Unsupervised Single-Channel Speech Separation with a Diffusion Prior under Speaker-Embedding Guidance Runwu Shi et.al. 2509.24395 null
2025-09-29 UniFlow-Audio: Unified Flow Matching for Audio Generation from Omni-Modalities Xuenan Xu et.al. 2509.24391 null
2025-09-29 Towards Foundation Models for Cryo-ET Subtomogram Analysis Runmin Jiang et.al. 2509.24311 null
2025-09-28 Define latent spaces by example: optimisation over the outputs of generative models Samuel Willis et.al. 2509.23800 null
2025-09-28 AudioMoG: Guiding Audio Generation with Mixture-of-Guidance Junyou Wang et.al. 2509.23727 null
2025-09-28 ReWatch-R1: Boosting Complex Video Reasoning in Large Vision-Language Models through Agentic Data Synthesis Congzhi Zhang et.al. 2509.23652 null
2025-09-28 From Past To Path: Masked History Learning for Next-Item Prediction in Generative Recommendation KaiWen Wei et.al. 2509.23649 null
2025-09-28 Disentanglement of Variations with Multimodal Generative Modeling Yijie Zhang et.al. 2509.23548 null
2025-09-27 Generative Modeling of Shape-Dependent Self-Contact Human Poses Takehiko Ohkawa et.al. 2509.23393 null
2025-09-27 SynDoc: A Hybrid Discriminative-Generative Framework for Enhancing Synthetic Domain-Adaptive Document Key Information Extraction Yihao Ding et.al. 2509.23273 null
2025-09-27 OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting Atakan Topaloglu et.al. 2509.23258 null
2025-09-27 A Generative Model for Controllable Feature Heterophily in Graphs Haoyu Wang et.al. 2509.23230 null
2025-09-27 Sparse2Dense: A Keypoint-driven Generative Framework for Human Video Compression and Vertex Prediction Bolin Chen et.al. 2509.23169 null
2025-09-27 Dense associative memory on the Bures-Wasserstein space Chandan Tankala et.al. 2509.23162 null
2025-09-26 GDR-learners: Orthogonal Learning of Generative Models for Potential Outcomes Valentyn Melnychuk et.al. 2509.22953 null
2025-09-26 Extract-0: A Specialized Language Model for Document Information Extraction Henrique Godoy et.al. 2509.22906 null
2025-09-26 ArFake: A Multi-Dialect Benchmark and Baselines for Arabic Spoof-Speech Detection Mohamed Maged et.al. 2509.22808 null
2025-09-26 Generative Modeling and Decision Fusion for Unknown Event Detection and Classification Using Synchrophasor Data Yi Hu et.al. 2509.22795 null
2025-09-26 Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance Luc Boudier et.al. 2509.22635 null
2025-09-26 A Theoretical Analysis of Discrete Flow Matching Generative Models Maojiang Su et.al. 2509.22623 null
2025-09-26 Transport Based Mean Flows for Generative Modeling Elaheh Akbari et.al. 2509.22592 null
2025-09-26 ConQuER: Modular Architectures for Control and Bias Mitigation in IQP Quantum Generative Models Xiaocheng Zou et.al. 2509.22551 null
2025-09-26 Overclocking Electrostatic Generative Models Daniil Shlenskii et.al. 2509.22454 null
2025-09-26 SurvDiff: A Diffusion Model for Generating Synthetic Data in Survival Analysis Marie Brockschmidt et.al. 2509.22352 null
2025-09-26 Preventing Model Collapse Under Overparametrization: Optimal Mixing Ratios for Interpolation Learning and Ridge Regression Anvit Garg et.al. 2509.22341 null
2025-09-26 Accuracy-First Rényi Differential Privacy and Post-Processing Immunity Ossi Räisä et.al. 2509.22213 null
2025-09-26 High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling Chao Huang et.al. 2509.22063 null
2025-09-26 Comparative Analysis of GAN and Diffusion for MRI-to-CT translation Emily Honey et.al. 2509.22049 null
2025-09-26 Text2Move: Text-to-moving sound generation via trajectory prediction and temporal alignment Yunyi Liu et.al. 2509.21919 null
2025-09-26 UISim: An Interactive Image-Based UI Simulator for Dynamic Mobile Environments Jiannan Xiang et.al. 2509.21733 null
2025-09-25 HuLA: Prosody-Aware Anti-Spoofing with Multi-Task Learning for Expressive and Emotional Synthetic Speech Aurosweta Mahapatra et.al. 2509.21676 null
2025-09-25 Guiding Audio Editing with Audio Language Model Zitong Lan et.al. 2509.21625 null
2025-09-25 QMill: Representative Quantum Data Generation for Quantum Machine Learning Utility Jason Ludmir et.al. 2509.21622 null
2025-09-25 Federated Flow Matching Zifan Wang et.al. 2509.21250 null
2025-09-25 MeanSE: Efficient Generative Speech Enhancement with Mean Flows Jiahe Wang et.al. 2509.21214 null
2025-09-25 Super-resolution of 4D flow MRI through inverse problem explicit solving Aurélien de Turenne et.al. 2509.21071 null
2025-09-25 Conditionally Whitened Generative Models for Probabilistic Time Series Forecasting Yanfeng Yang et.al. 2509.20928 null
2025-09-25 FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies Shuqiao Liang et.al. 2509.20890 null
2025-09-25 Verification Limits Code LLM Training Srishti Gureja et.al. 2509.20837 null
2025-09-25 Measuring LLM Sensitivity in Transformer-based Tabular Data Synthesis Maria F. Davila R et.al. 2509.20768 null
2025-09-26 Neptune-X: Active X-to-Maritime Generation for Universal Maritime Object Detection Yu Guo et.al. 2509.20745 null
2025-09-24 FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models Amin Karimi Monsefi et.al. 2509.20624 null
2025-09-24 pop-cosmos: Star formation over 12 Gyr from generative modelling of a deep infrared-selected galaxy catalogue Sinan Deger et.al. 2509.20430 null
2025-09-24 Quasi-Synthetic Riemannian Data Generation for Writer-Independent Offline Signature Verification Elias N. Zois et.al. 2509.20420 null
2025-09-24 PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation Chen Wang et.al. 2509.20358 null
2025-09-24 Generative Model Inversion Through the Lens of the Manifold Hypothesis Xiong Peng et.al. 2509.20177 null
2025-09-24 Discrete Diffusion for Generative Modeling of Text-Aligned Speech Tokens Pin-Jui Ku et.al. 2509.20060 null
2025-09-24 MultiSoundGen: Video-to-Audio Generation for Multi-Event Scenarios via SlowFast Contrastive Audio-Visual Pretraining and Direct Preference Optimization Jianxuan Yang et.al. 2509.19999 null
2025-09-24 Learnable Sampler Distillation for Discrete Diffusion Models Feiyang Fu et.al. 2509.19962 null
2025-09-24 When Words Can’t Capture It All: Towards Video-Based User Complaint Text Generation with Multimodal Video Complaint Dataset Sarmistha Das et.al. 2509.19952 null
2025-09-24 TABFAIRGDT: A Fast Fair Tabular Data Generator using Autoregressive Decision Trees Emmanouil Panagiotou et.al. 2509.19927 null
2025-09-25 MAGE: A Coarse-to-Fine Speech Enhancer with Masked Generative Model The Hieu Pham et.al. 2509.19881 null
2025-09-24 SCORE: Scaling audio generation using Standardized COmposite REwards Jaemin Jung et.al. 2509.19831 null
2025-09-24 Efficient Speech Watermarking for Speech Synthesis via Progressive Knowledge Distillation Yang Cui et.al. 2509.19812 null
2025-09-25 StrCGAN: A Generative Framework for Stellar Image Restoration Shantanusinh Parmar et.al. 2509.19805 null
2025-09-24 EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation Sen Yang et.al. 2509.19770 null
2025-09-24 SMILES-Inspired Transfer Learning for Quantum Operators in Generative Quantum Eigensolver Zhi Yin et.al. 2509.19715 null
2025-09-24 Towards Robust In-Context Learning for Medical Image Segmentation via Data Synthesis Jiesi Hu et.al. 2509.19711 null
2025-09-24 Long-Range Dependence in Financial Markets: Empirical Evidence and Generative Modeling Challenges Yifan He et.al. 2509.19663 null
2025-09-24 Statistical Parameter Calibration with the Generalized Fluctuation Dissipation Theorem and Generative Modeling Ludovico T. Giorgini et.al. 2509.19660 null
2025-09-23 TIMED: Adversarial and Autoregressive Refinement of Diffusion-Based Time Series Generation MohammadReza EskandariNasab et.al. 2509.19638 null
2025-09-23 Frame-Stacked Local Transformers For Efficient Multi-Codebook Speech Generation Roy Fejgin et.al. 2509.19592 null
2025-09-23 Synthesizing Artifact Dataset for Pixel-level Detection Dennis Menn et.al. 2509.19589 null
2025-09-23 CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching Chen Chen et.al. 2509.19300 null
2025-09-23 Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation Sherwin Bahmani et.al. 2509.19296 null
2025-09-23 Enabling Plant Phenotyping in Weedy Environments using Multi-Modal Imagery via Synthetic and Generated Training Data Earl Ranario et.al. 2509.19208 null
2025-09-23 GSTM-HMU: Generative Spatio-Temporal Modeling for Human Mobility Understanding Wenying Luo et.al. 2509.19135 null
2025-09-23 Extractive Fact Decomposition for Interpretable Natural Language Inference in one Forward Pass Nicholas Popovič et.al. 2509.18901 null
2025-09-24 Scalable Evaluation for Audio Identification via Synthetic Latent Fingerprint Generation Aditya Bhattacharjee et.al. 2509.18620 null
2025-09-22 Hierarchical Semi-Markov Models with Duration-Aware Dynamics for Activity Sequences Rohit Dube et.al. 2509.18414 null
2025-09-22 Evaluating the Creativity of LLMs in Persian Literary Text Generation Armin Tourajmehr et.al. 2509.18401 null
2025-10-07 StereoFoley: Object-Aware Stereo Audio Generation from Video Tornike Karchkhadze et.al. 2509.18272 null
2025-09-22 Synth-MIA: A Testbed for Auditing Privacy Leakage in Tabular Data Synthesis Joshua Ward et.al. 2509.18014 null
2025-09-22 Autoregressive-Gaussian Mixture Models: Efficient Generative Modeling of WSS Signals Kathrin Klein et.al. 2509.17953 null
2025-09-22 Unsupervised Learning and Representation of Mandarin Tonal Categories by a Generative CNN Kai Schenck et.al. 2509.17859 null
2025-09-22 Semantic and Visual Crop-Guided Diffusion Models for Heterogeneous Tissue Synthesis in Histopathology Saghir Alfasly et.al. 2509.17847 null
2025-09-22 GEM-T: Generative Tabular Data via Fitting Moments Miao Li et.al. 2509.17752 null
2025-09-23 A Generative Framework for Personalized Sticker Retrieval Changjiang Zhou et.al. 2509.17749 null
2025-09-22 PG-CE: A Progressive Generation Dataset with Constraint Enhancement for Controllable Text Generation Yan Zhuang et.al. 2509.17669 null
2025-09-22 Is It Certainly a Deepfake? Reliability Analysis in Detection & Generation Ecosystem Neslihan Kose et.al. 2509.17550 null
2025-09-22 Audiobook-CC: Controllable Long-context Speech Generation for Multicast Audiobook Min Liu et.al. 2509.17516 null
2025-09-21 Echo-Path: Pathology-Conditioned Echo Video Generation Kabir Hamzah Muhammad et.al. 2509.17190 null
2025-09-23 STAR: Speech-to-Audio Generation via Representation Learning Zeyu Xie et.al. 2509.17164 null
2025-09-21 ScenGAN: Attention-Intensive Generative Model for Uncertainty-Aware Renewable Scenario Forecasting Yifei Wu et.al. 2509.17119 null
2025-09-21 Deep Synthetic Cross-Project Approaches for Software Reliability Growth Modeling Taehyoun Kim et.al. 2509.16939 null
2025-09-21 PRISM: Precision-Recall Informed Data-Free Knowledge Distillation via Generative Diffusion Xuewan He et.al. 2509.16897 null
2025-09-20 DoubleGen: Debiased Generative Modeling of Counterfactuals Alex Luedtke et.al. 2509.16842 null
2025-09-23 Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment Xin Lei Lin et.al. 2509.16727 null
2025-09-20 Semi-Supervised Synthetic Data Generation with Fine-Grained Relevance Control for Short Video Search Relevance Modeling Haoran Li et.al. 2509.16717 null
2025-09-20 An Octave-based Multi-Resolution CQT Architecture for Diffusion-based Audio Generation Maurício do V. M. da Costa et.al. 2509.16603 null
2025-09-20 A Novel Metric for Detecting Memorization in Generative Models for Brain MRI Synthesis Antonio Scardace et.al. 2509.16582 null
2025-09-20 SCAN: Self-Denoising Monte Carlo Annotation for Robust Process Reward Learning Yuyang Ding et.al. 2509.16548 null
2025-09-20 ChemOrch: Empowering LLMs with Chemical Intelligence via Synthetic Instructions Yue Huang et.al. 2509.16543 link
2025-09-20 mmExpert: Integrating Large Language Models for Comprehensive mmWave Data Synthesis and Understanding Yifan Yan et.al. 2509.16521 null
2025-09-20 RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation Tianyi Yan et.al. 2509.16500 null
2025-09-19 SynthIPD: assumption-lean synthetic individual patient data generation Zixuan Zhao et.al. 2509.16466 null
2025-09-19 Entropic Causal Inference: Graph Identifiability Spencer Compton et.al. 2509.16463 null
2025-09-19 Introducing Resizable Region Packing Problem in Image Generation, with a Heuristic Solution Hrishikesh Sharma et.al. 2509.16363 null
2025-09-19 Guided Sequence-Structure Generative Modeling for Iterative Antibody Optimization Aniruddh Raghu et.al. 2509.16357 null
2025-09-19 Rethinking Molecule Synthesizability with Chain-of-Reaction Seul Lee et.al. 2509.16084 null
2025-09-19 Sampling String Vacua Using Generative Models Moritz Walden et.al. 2509.16029 null
2025-09-19 Fed-PISA: Federated Voice Cloning via Personalized Identity-Style Adaptation Qi Wang et.al. 2509.16010 null
2025-09-19 On Optimal Steering to Achieve Exact Fairness Mohit Sharma et.al. 2509.15759 null
2025-09-19 TrueMoE: Dual-Routing Mixture of Discriminative Experts for Synthetic Image Detection Laixin Zhang et.al. 2509.15741 null
2025-09-19 Toward Medical Deepfake Detection: A Comprehensive Dataset and Novel Method Shuaibo Li et.al. 2509.15711 null
2025-09-19 Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification Zinan Lin et.al. 2509.15591 null
2025-09-19 LiteLong: Resource-Efficient Long-Context Data Synthesis for LLMs Junlong Jia et.al. 2509.15568 null
2025-09-19 Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech Xinlei Niu et.al. 2509.15492 null
2025-09-18 Discrete Flow-Based Generative Models for Measurement Optimization in Quantum Computing Isaac L. Huidobro-Meezs et.al. 2509.15486 null
2025-09-18 Efficient Multimodal Dataset Distillation via Generative Models Zhenghao Zhao et.al. 2509.15472 null
2025-09-18 PILOT: Steering Synthetic Data Generation with Psychological & Linguistic Output Targeting Caitlin Cisar et.al. 2509.15447 null
2025-09-18 Causal Fingerprints of AI Generative Models Hui Xu et.al. 2509.15406 null
2025-09-18 Autoguided Online Data Curation for Diffusion Model Training Valeria Pais et.al. 2509.15267 null
2025-09-18 Emotion-Aware Speech Generation with Character-Specific Voices for Comics Zhiwen Qian et.al. 2509.15253 null
2025-09-18 Fair-GPTQ: Bias-Aware Quantization for Large Language Models Irina Proskurina et.al. 2509.15206 null
2025-09-18 Learning Mechanistic Subtypes of Neurodegeneration with a Physics-Informed Variational Autoencoder Mixture Model Sanduni Pinnawala et.al. 2509.15124 null
2025-09-19 Sea-ing Through Scattered Rays: Revisiting the Image Formation Model for Realistic Underwater Image Generation Vasiliki Ismiroglou et.al. 2509.15011 null
2025-09-20 SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding Bingsong Bai et.al. 2509.14946 null
2025-09-18 Mitigating data replication in text-to-audio generative diffusion models through anti-memorization guidance Francisco Messina et.al. 2509.14934 null
2025-09-19 MeanFlowSE: one-step generative speech enhancement via conditional mean flow Duojia Li et.al. 2509.14858 null
2025-09-18 SynBench: A Benchmark for Differentially Private Text Generation Yidan Sun et.al. 2509.14594 null
2025-09-18 Cross-Lingual F5-TTS: Towards Language-Agnostic Voice Cloning and Speech Synthesis Qingyu Liu et.al. 2509.14579 null
2025-09-17 A generative model of function growth explains hidden self-similarities across biological and social systems James Holehouse et.al. 2509.14468 null
2025-10-03 SpeechWeave: Diverse Multilingual Synthetic Text & Audio Data Generation Pipeline for Training Text to Speech Models Karan Dua et.al. 2509.14270 null
2025-09-17 Quantum Reinforcement Learning-Guided Diffusion Model for Image Synthesis via Hybrid Quantum-Classical Generative Model Architectures Chi-Sheng Chen et.al. 2509.14163 null
2025-09-19 FlightDiffusion: Revolutionising Autonomous Drone Training with Diffusion Models Generating FPV Video Valerii Serpiva et.al. 2509.14082 null
2025-09-17 Lightweight Implicit Neural Network for Binaural Audio Synthesis Xikun Lu et.al. 2509.14069 null
2025-09-17 Enhancing Time Awareness in Generative Recommendation Sunkyung Lee et.al. 2509.13957 null
2025-09-17 Synthetic Data Generation for Screen Time and App Usage Gustavo Kruger et.al. 2509.13892 null
2025-09-17 EDITS: Enhancing Dataset Distillation with Implicit Textual Semantics Qianxin Xia et.al. 2509.13858 null
2025-09-17 CraftMesh: High-Fidelity Generative Mesh Manipulation via Poisson Seamless Fusion James Jincheng et.al. 2509.13688 null
2025-09-17 AgentCTG: Harnessing Multi-Agent Collaboration for Fine-Grained Precise Control in Text Generation Xinxu Zhou et.al. 2509.13677 null
2025-09-17 LLM-I: LLMs are Naturally Interleaved Multimodal Creators Zirun Guo et.al. 2509.13642 null
2025-09-17 Privacy-Aware In-Context Learning for Large Language Models Bishnu Bhusal et.al. 2509.13625 null
2025-09-14 Synthetic Data and the Shifting Ground of Truth Dietmar Offenhuber et.al. 2509.13355 null
2025-09-16 SURGIN: SURrogate-guided Generative INversion for subsurface multiphase flow with quantified uncertainty Zhao Feng et.al. 2509.13189 null
2025-09-17 TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving Jiawei Wang et.al. 2509.13164 null
2025-09-16 A Synthetic Data Pipeline for Supporting Manufacturing SMEs in Visual Assembly Control Jonas Werheid et.al. 2509.13089 null
2025-09-16 MSR-Codec: A Low-Bitrate Multi-Stream Residual Codec for High-Fidelity Speech Generation with Information Disentanglement Jingyu Li et.al. 2509.13068 null
2025-09-16 MIA-EPT: Membership Inference Attack via Error Prediction for Tabular Data Eyal German et.al. 2509.13046 null
2025-09-16 A Lightweight Pipeline for Noisy Speech Voice Cloning and Accurate Lip Sync Synthesis Javeria Amir et.al. 2509.12831 null
2025-09-16 ConvergeWriter: Data-Driven Bottom-Up Article Construction Binquan Ji et.al. 2509.12811 null
2025-09-16 Toward Ownership Understanding of Objects: Active Question Generation with Large Language Model and Probabilistic Generative Model Saki Hashimoto et.al. 2509.12754 null
2025-09-16 Chat-Driven Text Generation and Interaction for Person Retrieval Zequn Xie et.al. 2509.12662 null
2025-09-15 MTEB-NL and E5-NL: Embedding Benchmark and Models for Dutch Nikolay Banar et.al. 2509.12340 null
2025-09-15 VADER: A Variational Autoencoder to Infer Planetary Masses and Gas-Dust Disk Properties Around Young Stars Sayed Shafaat Mahmud et.al. 2509.12324 null
2025-09-14 Prediction of Stocks Index Price using Quantum GANs Sangram Deshpande et.al. 2509.12286 null
2025-09-15 OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling Yang Zhou et.al. 2509.12201 null
2025-09-15 Learning Majority-to-Minority Transformations with MMD and Triplet Loss for Imbalanced Classification Suman Cha et.al. 2509.11511 null
2025-09-14 Scaling Up Forest Vision with Synthetic Data Yihang She et.al. 2509.11201 null
2025-09-14 Differentially-private text generation degrades output language quality Erion Çano et.al. 2509.11176 null
2025-09-14 STASE: A spatialized text-to-audio synthesis engine for music generation Tutti Chi et.al. 2509.11124 null
2025-09-14 Filling the Gaps: A Multitask Hybrid Multiscale Generative Framework for Missing Modality in Remote Sensing Semantic Segmentation Nhi Kieu et.al. 2509.11102 null
2025-09-14 Patient-Zero: A Unified Framework for Real-Record-Free Patient Agent Generation Yunghwei Lai et.al. 2509.11078 null
2025-09-13 Term2Note: Synthesising Differentially Private Clinical Notes from Medical Terms Yuping Wu et.al. 2509.10882 null
2025-09-13 CogGNN: Cognitive Graph Neural Networks in Generative Connectomics Mayssa Soussia et.al. 2509.10864 null
2025-09-12 Struct-Bench: A Benchmark for Differentially Private Structured Text Generation Shuaiqi Wang et.al. 2509.10696 null
2025-09-12 Humanizing Automated Programming Feedback: Fine-Tuning Generative Models with Student-Written Feedback Victor-Alexandru Pădurean et.al. 2509.10647 null
2025-09-11 The Coding Limits of Robust Watermarking for Generative Models Danilo Francati et.al. 2509.10577 null
2025-09-12 Differentially Private Decentralized Dataset Synthesis Through Randomized Mixing with Correlated Noise Utsab Saha et.al. 2509.10385 null
2025-09-12 Merging Physics-Based Synthetic Data and Machine Learning for Thermal Monitoring of Lithium-ion Batteries: The Role of Data Fidelity Yusheng Zheng et.al. 2509.10380 null
2025-09-12 Arabic Large Language Models for Medical Text Generation Abdulrahman Allam et.al. 2509.10095 null
2025-09-11 A Modular and Multimodal Generative AI Framework for Urban Building Energy Data: Generating Synthetic Homes Jackson Eshbaugh et.al. 2509.09794 null
2025-09-11 OpenFake: An Open Dataset and Platform Toward Large-Scale Deepfake Detection Victor Livernoche et.al. 2509.09495 null
2025-09-11 Diabatic quantum annealing for training energy-based generative models Gilhan Kim et.al. 2509.09374 null
2025-09-11 HISPASpoof: A New Dataset For Spanish Speech Forensics Maria Risques et.al. 2509.09155 null
2025-09-10 Generative quantum advantage for classical and quantum problems Hsin-Yuan Huang et.al. 2509.09033 null
2025-09-12 ForTIFAI: Fending Off Recursive Training Induced Failure for AI Models Soheil Zibakhsh Shabgahi et.al. 2509.08972 null
2025-09-10 PromptGuard: An Orchestrated Prompting Framework for Principled Synthetic Text Generation for Vulnerable Populations using LLMs with Enhanced Safety, Fairness, and Controllability Tung Vu et.al. 2509.08910 null
2025-09-10 GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts Jenna Kang et.al. 2509.08818 null
2025-09-10 Learning Turbulent Flows with Generative Models: Super-resolution, Forecasting, and Sparse Flow Reconstruction Vivek Oommen et.al. 2509.08752 null
2025-09-10 Design-GenNO: A Physics-Informed Generative Model with Neural Operators for Inverse Microstructure Design Yaohua Zang et.al. 2509.08749 null
2025-09-11 Generative Data Refinement: Just Ask for Better Data Minqi Jiang et.al. 2509.08653 null
2025-09-10 Variational Rank Reduction Autoencoders for Generative Thermal Design Alicia Tierz et.al. 2509.08515 null
2025-09-10 A Structured Review of Underwater Object Detection Challenges and Solutions: From Traditional to Large Vision Language Models Edwine Nabahirwa et.al. 2509.08490 null
2025-09-10 Joint Learning using Mixture-of-Expert-Based Representation for Enhanced Speech Generation and Robust Emotion Recognition Jing-Tong Tzeng et.al. 2509.08470 null
2025-09-10 LLM-Guided Ansätze Design for Quantum Circuit Born Machines in Financial Generative Modeling Yaswitha Gujju et.al. 2509.08385 null
2025-09-10 Persistent-DPO: A novel loss function and hybrid learning for generative quantum eigensolver Junya Nakamura et.al. 2509.08351 null
2025-09-09 Performance Assessment Strategies for Generative AI Applications in Healthcare Victor Garcia et.al. 2509.08087 null
2025-09-09 One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation Zheng Geng et.al. 2509.07978 null
2025-09-09 Enhancements in Score-based Channel Estimation for Real-Time Wireless Systems Florian Strasser et.al. 2509.07839 null
2025-09-09 A Generalisable Generative Model for Multi-Detector Calorimeter Simulation Piyush Raikwar et.al. 2509.07700 null
2025-09-09 Spectral Masking and Interpolation Attack (SMIA): A Black-box Adversarial Attack against Voice Authentication and Anti-Spoofing Systems Kamel Kamel et.al. 2509.07677 null
2025-09-09 Target matching based generative model for speech enhancement Taihui Wang et.al. 2509.07521 null
2025-09-09 Synthetic Data Generation with Lorenzetti for Time Series Anomaly Detection in High-Energy Physics Calorimeters Laura Boggia et.al. 2509.07451 null
2025-09-09 When Fine-Tuning is Not Enough: Lessons from HSAD on Hybrid and Adversarial Audio Spoof Detection Bin Hu et.al. 2509.07323 null
2025-09-08 A transformer-based generative model for planetary systems Yann Alibert et.al. 2509.07226 null
2025-09-08 Neurocognitive Modeling for Text Generation: Deep Learning Architecture for EEG Data Khushiyant et.al. 2509.07202 null
2025-09-04 K-Syn: K-space Data Synthesis in Ultra Low-data Regimes Guan Yu et.al. 2509.06997 null
2025-09-08 SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis Zhengqing Chen et.al. 2509.06798 null
2025-09-15 A Statistical 3D Stomach Shape Model for Anatomical Analysis Erez Posner et.al. 2509.06464 null
2025-09-08 MeanFlow-Accelerated Multimodal Video-to-Audio Synthesis via One-Step Generation Xiaoran Yang et.al. 2509.06389 null
2025-09-08 Text4Seg++: Advancing Image Segmentation via Generative Language Modeling Mengcheng Lan et.al. 2509.06321 null
2025-09-07 If generative AI is the answer, what is the question? Ambuj Tewari et.al. 2509.06120 null
2025-09-07 DreamAudio: Customized Text-to-Audio Generation with Diffusion Models Yi Yuan et.al. 2509.06027 null
2025-09-06 GUIDe: Generative and Uncertainty-Informed Inverse Design for On-Demand Nonlinear Functional Responses Haoxuan Dylan Mu et.al. 2509.05641 null
2025-09-04 SasAgent: Multi-Agent AI System for Small-Angle Scattering Data Analysis Lijie Ding et.al. 2509.05363 null
2025-09-02 Ensembling Membership Inference Attacks Against Tabular Generative Models Joshua Ward et.al. 2509.05350 null
2025-09-04 Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control Haruo Fujiwara et.al. 2509.05285 null
2025-09-05 Recomposer: Event-roll-guided generative audio editing Daniel P. W. Ellis et.al. 2509.05256 null
2025-09-08 Probabilistic operator learning: generative modeling and uncertainty quantification for foundation models of differential equations Benjamin J. Zhang et.al. 2509.05186 null
2025-09-05 Painting the market: generative diffusion models for financial limit order book simulation and forecasting Alfred Backhouse et.al. 2509.05107 null
2025-09-05 QCA-MolGAN: Quantum Circuit Associative Molecular GAN with Multi-Agent Reinforcement Learning Aaron Mark Thomas et.al. 2509.05051 null
2025-09-05 Efficient Video-to-Audio Generation via Multiple Foundation Models Mapper Gehui Chen et.al. 2509.04957 null
2025-09-05 SynGen-Vision: Synthetic Data Generation for training industrial vision models Alpana Dubey et.al. 2509.04894 null
2025-09-04 Transition Models: Rethinking the Generative Learning Objective Zidong Wang et.al. 2509.04394 null
2025-09-04 AUDETER: A Large-scale Dataset for Deepfake Audio Detection in Open Worlds Qizhou Wang et.al. 2509.04345 null
2025-09-04 Synthetic Survival Data Generation for Heart Failure Prognosis Using Deep Generative Models Chanon Puttanawarut et.al. 2509.04245 null
2025-09-04 Synthesizing Sheet Music Problems for Evaluation and Reinforcement Learning Zhilin Wang et.al. 2509.04059 null
2025-09-04 An invertible generative model for forward and inverse problems Tristan van Leeuwen et.al. 2509.03910 null
2025-09-04 Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series Zhengyi Guo et.al. 2509.03898 null
2025-09-03 LuxDiT: Lighting Estimation with Video Diffusion Transformer Ruofan Liang et.al. 2509.03680 null
2025-09-05 CEHR-XGPT: A Scalable Multi-Task Foundation Model for Electronic Health Records Chao Pang et.al. 2509.03643 null
2025-09-03 Multi-level SSL Feature Gating for Audio Deepfake Detection Hoan My Tran et.al. 2509.03409 null
2025-09-03 Generative Auto-Bidding in Large-Scale Competitive Auctions via Diffusion Completer-Aligner Yewen Li et.al. 2509.03348 null
2025-09-03 A Comprehensive Guide to Differential Privacy: From Theory to User Expectations Napsu Karmitsa et.al. 2509.03294 null
2025-09-03 Improving Perceptual Audio Aesthetic Assessment via Triplet Loss and Self-Supervised Embeddings Dyah A. M. G. Wisnu et.al. 2509.03292 null
2025-09-03 RTGMFF: Enhanced fMRI-based Brain Disorder Diagnosis via ROI-driven Text Generation and Multimodal Feature Fusion Junhao Jia et.al. 2509.03214 null
2025-09-03 Eigendecompositions of temporal networks Lucas Lacasa et.al. 2509.03135 null
2025-09-03 Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers Xingyue Huang et.al. 2509.03059 null
2025-09-03 Scale-Adaptive Generative Flows for Multiscale Scientific Data Yifan Chen et.al. 2509.02971 null
2025-09-02 Generative AI for Crystal Structures: A Review Pierre-Paul De Breuck et.al. 2509.02723 null
2025-09-02 Top-H Decoding: Adapting the Creativity and Coherence with Bounded Entropy in Text Generation Erfan Baghaei Potraghloo et.al. 2509.02510 null
2025-09-02 Exploring Variational Graph Autoencoders for Distribution Grid Data Generation Syed Zain Abbas et.al. 2509.02469 null
2025-09-02 Exploring Diffusion Models for Generative Forecasting of Financial Charts Taegyeong Lee et.al. 2509.02308 null
2025-09-01 Towards Improved Speech Recognition through Optimized Synthetic Data Generation Yanis Perrin et.al. 2508.21631 null
2025-08-11 Large Language Model Data Generation for Enhanced Intent Recognition in German Speech Theresa Pekarek Rosin et.al. 2508.06277 null
2025-07-25 Synthetic Data Generation for Phrase Break Prediction with Large Language Model Hoyeon Lee et.al. 2507.18044 null
2025-07-15 DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis Wenjie Tian et.al. 2507.10109 null
2025-06-13 Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models Sridhar S et.al. 2506.10005 null
2025-06-11 A Review on Score-based Generative Models for Audio Applications Ge Zhu et.al. 2506.08457 link
2025-06-24 Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection Jinming Zhang et.al. 2505.22029 null
2025-07-01 From Alignment to Advancement: Bootstrapping Audio-Language Alignment with Synthetic Data Chun-Yi Kuan et.al. 2505.20166 null
2025-05-15 DPN-GAN: Inducing Periodic Activations in Generative Adversarial Networks for High-Fidelity Audio Synthesis Zeeshan Ahmad et.al. 2505.09091 null
2025-03-04 Voice Cloning for Dysarthric Speech Synthesis: Addressing Data Scarcity in Speech-Language Pathology Birger Moell et.al. 2503.01266 null
2025-06-09 DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model Lei Zhao et.al. 2502.18952 null
2025-05-23 ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts Ashi Garg et.al. 2502.05674 null
2025-07-24 Koel-TTS: Enhancing LLM based Speech Generation with Preference Alignment and Classifier Free Guidance Shehzeen Hussain et.al. 2502.05236 null
2025-01-29 CosyAudio: Improving Audio Generation with Confidence Scores and Synthetic Captions Xinfa Zhu et.al. 2501.16761 null
2025-09-05 Exposing Synthetic Speech: Model Attribution and Detection of AI-generated Speech via Audio Fingerprints Matías Pizarro et.al. 2411.14013 null
2024-12-20 Exploring the Landscape for Generative Sequence Models for Specialized Data Synthesis Mohammad Zbeeb et.al. 2411.01929 link
2024-10-24 Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation Junwon Lee et.al. 2410.17589 null
2025-03-25 Where are we in audio deepfake detection? A systematic analysis over generative and detection models Xiang Li et.al. 2410.04324 null
2025-07-08 A Framework for Synthetic Audio Conversations Generation using Large Language Models Kaung Myat Kyaw et.al. 2409.00946 null
2024-08-20 Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition Samuele Cornell et.al. 2408.09215 null
2024-08-01 On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition Nick Rossenbach et.al. 2407.21476 null
2024-06-27 SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond Marco Comunità et.al. 2406.17672 null
2024-06-21 Instruction Data Generation and Unsupervised Adaptation for Speech Language Models Vahid Noroozi et.al. 2406.12946 null
2024-06-13 LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation Wenhao Guan et.al. 2406.08203 null
2024-07-10 AudioLCM: Text-to-Audio Generation with Latent Consistency Models Huadai Liu et.al. 2406.00356 null
2024-06-04 Creative Text-to-Audio Generation via Synthesizer Programming Manuel Cherep et.al. 2406.00294 null
2024-05-01 Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis Shivam Mehta et.al. 2404.19622 null
2024-02-09 Listening Between the Lines: Synthetic Speech Detection Disregarding Verbal Content Davide Salvi et.al. 2402.05567 null
2024-02-19 Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis Vinotha R et.al. 2401.11771 null
2024-01-08 Pheme: Efficient and Conversational Speech Generation Paweł Budzianowski et.al. 2401.02839 null
2023-11-21 EDMSound: Spectrogram Based Diffusion Models for Efficient and High-Quality Audio Synthesis Ge Zhu et.al. 2311.08667 null
2024-03-27 Generative Pre-training for Speech with Flow Matching Alexander H. Liu et.al. 2310.16338 null
2024-01-24 Low-latency Speech Enhancement via Speech Token Generation Huaying Xue et.al. 2310.08981 null
2024-05-14 AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Haohe Liu et.al. 2308.05734 null
2023-07-04 FFPDG: Fast, Fair and Private Data Generation Weijie Xu et.al. 2307.00161 null
2023-05-31 Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation Jiawei Huang et.al. 2305.18474 null
2023-03-28 Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis Karren Yang et.al. 2303.14885 null
2023-04-04 A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI Chenshuang Zhang et.al. 2303.13336 null
2024-07-18 Configurable EBEN: Extreme Bandwidth Extension Network to enhance body-conducted speech capture Julien Hauret et.al. 2303.10008 null
2023-01-31 Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models Rongjie Huang et.al. 2301.12661 null
2023-05-26 Evaluating and reducing the distance between synthetic and real speech distributions Christoph Minixhofer et.al. 2211.16049 null
2023-07-27 AudioLM: a Language Modeling Approach to Audio Generation Zalán Borsos et.al. 2209.03143 null
2022-07-05 Computer-assisted Pronunciation Training – Speech synthesis is almost all you need Daniel Korzekwa et.al. 2207.00774 null
2022-06-22 Adversarial Audio Synthesis with Complex-valued Polynomial Networks Yongtao Wu et.al. 2206.06811 null
2024-06-06 Parallel Synthesis for Autoregressive Speech Generation Po-chun Hsu et.al. 2204.11806 null
2022-03-30 Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise Tuomo Raitio et.al. 2203.10637 null
2022-03-16 Attributable-Watermarking of Speech Generative Models Yongbaek Cho et.al. 2202.08900 null
2021-06-15 CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis Simon Rouard et.al. 2106.07431 null
2022-02-01 ItôTTS and ItôWave: Linear Stochastic Differential Equation Is All You Need For Audio Generation Shoule Wu et.al. 2105.07583 null
2021-08-02 VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding Javier Nistal et.al. 2105.01531 null
2022-02-25 Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis Erica Cooper et.al. 2104.12292 null
2021-04-01 DiffWave: A Versatile Diffusion Model for Audio Synthesis Zhifeng Kong et.al. 2009.09761 null
2022-06-29 DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks J. Nistal et.al. 2008.12073 null
2019-06-05 MelNet: A Generative Model for Audio in the Frequency Domain Sean Vasquez et.al. 1906.01083 null
2019-05-22 Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems Ohsung Kwon et.al. 1905.08486 null
2019-03-15 Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis Bajibabu Bollepalli et.al. 1903.05955 null
2019-02-20 Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks Hafiz Malik et.al. 1902.06782 null
2018-10-31 Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks Lauri Juvela et.al. 1810.12598 null
2017-09-26 Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks Yuki Saito et.al. 1709.08041 null

🤝 Contributing

Contributions are welcome! Feel free to open an issue or submit a pull request.

⭐ Star History

If you find this repository useful, please consider giving it a star!