Pattern Recognition

27th International Conference, ICPR 2024, Kolkata, India, December 1–5, 2024, Proceedings, Part VI

Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal (Herausgeber)

Buch | Softcover

XXXVII, 480 Seiten

2024
Springer International Publishing (Verlag)
9783031781711 (ISBN)

Artikel merken

The multi-volume set of LNCS books with volume numbers 15301-15333 constitutes the refereed proceedings of the 27th International Conference on Pattern Recognition, ICPR 2024, held in Kolkata, India, during December 1-5, 2024. The 963 papers presented in these proceedings were carefully reviewed and selected from a total of 2106 submissions. They deal with topics such as Pattern Recognition; Artificial Intelligence; Machine Learning; Computer Vision; Robot Vision; Machine Vision; Image Processing; Speech Processing; Signal Processing; Video Processing; Biometrics; Human-Computer Interaction (HCI); Document Analysis; Document Recognition; Biomedical Imaging; Bioinformatics.

TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back) using Taylor-Softmax.- Balancing Accuracy and Efficiency in Budget-Aware Early-Exiting Neural Networks.- An Evolutionary Search-Based Operator Fusion Method with Binary Representation for Deep Learning Inference Acceleration.- SemFaceEdit: Semantic Face Editing on Generative Radiance Manifolds.- (D^2)Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods.- Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt.- Freestyle 3D-Aware Portrait Synthesis Based on Compositional Generative Priors.- FUGAN: A GAN Based Facial Reconstructor For Accurate Unveiling Of Hidden Faces.- Text2Street: Controllable Text-to-image Generation for Street Views.- Make An Image Move: Few-shot based Video Generation Guided by CLIP.- A Framework For Image Synthesis Using Supervised Contrastive Learning.- TMCSPEECH: A CHINESE TV AND MOVIE SPEECH DATASET WITH CHARACTER DESCRIPTIONS AND A CHARACTER-BASED VOICE GENERATION MODEL.- Deterministic Synthesis of Defect Images using Null Optimization.- Adaptive Refiner based Few-Shot Font Generation.- Controllable 3D object Generation with Single Image Prompt.- Beyond Labels: Aligning Large Language Models with Human-like Reasoning.- HindiLLM: Large Language Model for Hindi.- StableTalk: Advancing Audio-to-Talking Face Generation with Stable Diffusion And Vision Transformer.- Can LLMs perform structured graph reasoning tasks?.- Improved Zero-Shot Image Editing via Null-Toon and Directed Delta Denoising Score.- Texture Spectral Decorrelation Criteria.- A Low Rank Gaussian Mixture Latent Model for Face Generation.- Domain Adaptation for Machinery Fault Diagnosis Based on Critic Classifier GAN.- Data Augmentation Pipeline for Enhanced UAV Surveillance.- Generative Adversarial Networks for Imputing Sparse Learning Performance.- SWave: Improving Vocoder Efficiency by Straightening the Waveform Generation Path.- Outdoor Scene Relighting with Diffusion Models.- Matching aggregate posteriors in the variational autoencoder.- Efficient Nonlinear DAG Learning under Projection Framework.- GCompletor: A Graph-based Deep Learning Method for Traffic State Imputation on Urban Road Networks.

Erscheinungsdatum	03.12.2024
Reihe/Serie	Lecture Notes in Computer Science
Zusatzinfo	XXXVII, 480 p. 176 illus., 165 illus. in color.
Verlagsort	Cham
Sprache	englisch
Maße	155 x 235 mm
Themenwelt	Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik
Schlagworte	Artificial Intelligence • Bioinformatics • Biomedical Imaging • biometrics • computer vision • Document Analysis • document recognition • Human-Computer Interaction (HCI) • Image Processing • machine learning • Machine vision • pattern recognition • Robot vision • Signal Processing • Speech processing • Video Processing
ISBN-13	9783031781711 / 9783031781711
Zustand	Neuware