Multimodal Pre-Training Based on Graph Attention Network for Document Understanding.

AllImages Videos Shopping Maps News Books

[2203.13530] Multimodal Pre-training Based on Graph Attention Network ...

Mar 25, 2022 · In this paper, we present the GraphDoc, a multimodal graph attention-based model for various document understanding tasks.

Scholarly articles for Multimodal Pre-Training Based on Graph Attention Network for Document Understanding.

scholar.google.com › citations

… graph attention network for document understanding
Zhang · Cited by 41

Multimodal Pre-Training Based on Graph Attention Network for ...

ieeexplore.ieee.org › iel7

We do the multimodal feature fusion of each node by the gate fusion layer. The contextualization between each node is modeled by the graph attention layer.

Multimodal Pre-Training Based on Graph Attention Network for ...

dl.acm.org › doi › tmm.2022.3214102

In this paper, we present the GraphDoc, a multimodal graph attention-based model for various document understanding tasks. GraphDoc is pre-trained in a ...

Multimodal Pre-training Based on Graph Attention Network for ...

ieeexplore.ieee.org › iel7

In our work, we inject the graph structure in a document into the attention mechanism to form the graph attention layer instead of the original Transformer ...

ZZR8066/GraphDoc - GitHub

github.com › ZZR8066 › GraphDoc

The source code for Multimodal Pre-training Based on Graph Attention Network for Document Understanding.

[PDF] Multimodal Pre-Training Based on Graph Attention Network for ...

www.semanticscholar.org › paper

The GraphDoc is a multimodal graph attention-based model for various document understanding tasks that learns a generic representation from only 320k ...

Multimodal Pre-training Based on Graph Attention Network ... - YouTube

m.youtube.com › watch

Apr 13, 2023 · Have you ever wondered how computers can read and understand documents? It's a tough task because documents come in all sorts of formats and ...

Multimodal Pre-training Based on Graph Attention ... - arxiv-sanity

arxiv-sanity-lite.com › ...

In this paper, we present the GraphDoc, a multimodal graph attention-based model for various document understanding tasks. GraphDoc is pre-trained in a ...

Multimodal Pre-Training Based on Graph Attention Network for ...

www.researchgate.net › ... › Graphs

In this paper, we present the GraphDoc, a multimodal graph attention-based model for various document understanding tasks. GraphDoc is pre-trained in a ...

(PDF) Multimodal Pre-training Based on Graph Attention Network for ...

www.researchgate.net › publication › 35...

Mar 25, 2022 · In this paper, we present the GraphDoc, a multimodal graph attention-based model for various document understanding tasks. GraphDoc is pre- ...