Zum Hauptinhalt springen
Nicht aus der Schweiz? Besuchen Sie lehmanns.de
Document Analysis and Recognition – ICDAR 2025 Workshops -

Document Analysis and Recognition – ICDAR 2025 Workshops

Wuhan, China, September 20–21, 2025, Proceedings, Part II
Buch | Softcover
XV, 396 Seiten
2025
Springer International Publishing (Verlag)
978-3-032-09370-7 (ISBN)
CHF 194,70 inkl. MwSt
  • Noch nicht erschienen - erscheint am 31.12.2025
  • Versandkostenfrei
  • Auch auf Rechnung
  • Artikel merken

The two-volume set LNCS 16225 + 16226 constitutes the proceedings of International Workshops co-located with the 19th International Conference on Document Analysis and Recognition, ICDAR 2025, held in Wuhan, China, during September 2025. 

The 46 full papers included in these proceedings were carefully reviewed and selected from a total of 74 submissions. The contributions stem from the following workshops:

Part I: The Fifth ICDAR International Workshop on Machine Learning (WML 2025); ICDAR 2025 Workshop on Multi-Modal Mathematical Reasoning in Documents (M3RD 2025);

Part II: The 16th IAPR International Workshop on Graphics Recognition (GREC 2025); ICDAR 2025 Workshop on Visual Text Generation and Text Image Processing VT-TIP 2025); ICDAR 2025 Workshop on Documents Analysis of Low-resource Languages (DALL 2025)

 

.- The 16th IAPR International Workshop on Graphics Recognition (GREC 2025)
.- Hierarchical structure understanding in complex tables with VLLMs: a benchmark and experiments.
.- Archival Faces: Detection of Faces in Digitized Historical Documents.
.- AnonED: Complex Region Anonymisation in Electrical Diagrams using Hybrid Density-Based Spatial Clustering.
.- AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization.
.- GAN-based Content-Conditioned Generation of Handwritten Musical Symbols.
.- ICDAR 2025 Workshop on Visual Text Generation and Text Image Processing (VT-TIP 2025)
.- SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction.
.- BengaliDiff: Diffusion Model for Few-Shot Bengali Font Generation.
.- DAA-Net: Dynamic Adaptive Aggregation Network for Document Image Rectification.
.- Visual Text Generation in Khmer Language: Challenges and Trends with Diffusion Models.
.- EroPT: Benchmarking Robustness of OCR Methods on Eroded Printed Text.
.- BiNet: A Deep Encoder-Decoder Network for Binarizing Degraded Ancient Manuscripts.
.- Modular OCR Using Web Scraping Data.
.- Semi-Supervised Writing Style Classification in Medieval Hebrew Manuscripts.
.- ICDAR 2025 Workshop on Documents Analysis of Low-resource Languages (DALL 2025)
.- Enhancing Khmer-English Machine Translation via Document Analysis Techniques.
.- PALM-LAY: A Multi-Script Cross-Regional Dataset for Layout Analysis of Palm Leaf Manuscripts.
.- Open Set Oracle Character Recognition via Adaptive Decision Boundary.
.- TMAWS: A Manchu Archives Word Spotting Method Supporting Both Image and String Query Modes.
.- The Research on End-to-End Tibetan Text Detection and Recognition in Natural Scenes.
.- Multi-Type Tibetan Ancient Book Text Line Recognition Based on Adapter Fine-Tuning.
.- ClapperText: A Benchmark for Text Recognition in Low-Resource Archival Documents.
.- Cross-Lingual Learning for Low-Resource Khmer Scene Text Detection and Recognition.
.- Text Enhancement of Degraded

Erscheinungsdatum
Reihe/Serie Lecture Notes in Computer Science
Zusatzinfo XV, 396 p. 187 illus., 134 illus. in color.
Verlagsort Cham
Sprache englisch
Maße 155 x 235 mm
Themenwelt Informatik Grafik / Design Digitale Bildverarbeitung
Schlagworte Artificial Intelligence • benchmark dataset • Deep learning • Document Analysis and Recognition • Document Analysis Systems • Document Image Processing • document understanding • Graphics, Diagram, and Math Recognition • Graphics Recognition • handwriting analysis and recognition • Handwriting Recognition • Historical document • historical document analysis • low-resource language processing • machine learning • multimedia document analysis • multi-modal mathematical reasoning • Neural networks • NLP for Document Understanding • Optical Character Recognition • Scene Text Detection and Recognition • visual text generation
ISBN-10 3-032-09370-8 / 3032093708
ISBN-13 978-3-032-09370-7 / 9783032093707
Zustand Neuware
Informationen gemäß Produktsicherheitsverordnung (GPSR)
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Das umfassende Handbuch

von Jürgen Wolf

Buch | Hardcover (2025)
Rheinwerk (Verlag)
CHF 69,85