Test environment running 7.6.3
 

Handwriting Transformers

dc.contributor.authorBhunia, Ankan Kumaren
dc.contributor.authorKhan, Salmanen
dc.contributor.authorCholakkal, Hishamen
dc.contributor.authorAnwer, Rao Muhammaden
dc.contributor.authorKhan, Fahad Shahbazen
dc.contributor.authorShah, Mubaraken
dc.date.accessioned2025-03-17T07:06:17Z
dc.date.available2025-03-17T07:06:17Z
dc.date.issued2021en
dc.description.abstractWe propose a novel transformer-based styled handwritten text image generation approach, HWT, that strives to learn both style-content entanglement as well as global and local style patterns. The proposed HWT captures the long and short range relationships within the style examples through a self-attention mechanism, thereby encoding both global and local style patterns. Further, the proposed transformer-based HWT comprises an encoder-decoder attention that enables style-content entanglement by gathering the style features of each query character. To the best of our knowledge, we are the first to introduce a transformer-based network for styled handwritten text generation. Our proposed HWT generates realistic styled handwritten text images and outperforms the state-of-the-art demonstrated through extensive qualitative, quantitative and human-based evaluations. The proposed HWT can handle arbitrary length of text and any desired writing style in a few-shot setting. Further, our HWT generalizes well to the challenging scenario where both words and writing style are unseen during training, generating realistic styled handwritten text images. Code is available at: https://github.com/ankanbhunia/Handwriting-Transformers.en
dc.description.statustrueen
dc.format.extent9en
dc.identifier.isbn9781665428125en
dc.identifier.issn1550-5499en
dc.identifier.otherScopus:85127184225en
dc.identifier.otherARIES:a383154xPUB29547en
dc.identifier.urihttps://dspace-test.anu.edu.au/handle/1885/733721001
dc.identifier.urlhttp://www.scopus.com/inward/record.url?scp=85127184225&partnerID=8YFLogxKen
dc.language.isoEnglishen
dc.relation.ispartofseriesProceedings of the IEEE International Conference on Computer Visionen
dc.rightsPublisher Copyright: © 2021 IEEEen
dc.titleHandwriting Transformersen
dc.typeConference contributionen
local.bibliographicCitation.lastpage1074en
local.bibliographicCitation.startpage1066en
local.contributor.affiliationBhunia, Ankan Kumar; Mohamed Bin Zayed University of Artificial Intelligenceen
local.contributor.affiliationKhan, Salman; School of Computing, ANU College of Systems and Society, The Australian National Universityen
local.contributor.affiliationCholakkal, Hisham; Mohamed Bin Zayed University of Artificial Intelligenceen
local.contributor.affiliationAnwer, Rao Muhammad; Mohamed Bin Zayed University of Artificial Intelligenceen
local.contributor.affiliationKhan, Fahad Shahbaz; Mohamed Bin Zayed University of Artificial Intelligenceen
local.contributor.affiliationShah, Mubarak; University of Central Floridaen
local.identifier.doi10.1109/ICCV48922.2021.00112en
local.identifier.purecec902eb-2fa2-4c57-81ff-a032856c764aen
local.type.statusPublisheden

Downloads