Abstract: Large-scale text-to-image (T2I) diffusion models have showcased incredible capabilities in generating coherent images based on textual descriptions, enabling vast applications in content ...
Scene text image super-resolution (STISR) aims to improve the visual clarity of the text in low-resolution scene images. Due to the intrinsic lack of detailed text appearance information in the ...