Feature Enhancement with Text-specific Region Contrast for Scene Text Detection.- Learning Efficient Representations for Patent Drawing Retrieval.- HelixNet: Dual Helix Cooperative Decoders for Scene Text Removal.- Semantic-information Space Sharing Interaction Network for Arbitrary Shape Text Detection.- AIE-KB: Information Extraction Technology with Knowledge Base for Chinese Archival Scenario.- Deep Hough Transform For Gaussian Semantic Box-Lines Alignment.- Chinese-Vietnamese Cross-lingual Event Causality Identification Based on Syntactic Graph Convolution.- MCKIE: Multi-Class Key Information Extraction from Complex Documents based on Graph Convolutional Network.- A Pre-trained Model For Chinese Medical Record Punctuation Restoration.- "English and Spanish Bilinguals’ Language Processing: An ALE-based Meta-analysis of Neuroimaging Studies".- Robust Subspace Learning with Double Graph Embedding Unsupervised Feature Selection via Nonlinear Representation and Adaptive Structure Preservation.- Text Causal Discovery Based on Sequence Structure Information.- MetaSelection: A Learnable Masked AutoEncoder for Multimodal Sentiment Feature Selection.- Image Manipulation Localization based on Multiscale Convolutional Attention.- Bi-Stream Multiscale Hamhead Networks with Contrastive Learning for Image forgery Localization.- Fuse Tune: Hierarchical Decoder Towards Efficient Transfer Learning.- Industrial-SAM with Interactive Adapter.- Mining Temporal Inconsistency with 3D Face Model for Deepfake Video Detection.- DT-TransUNet: A Dual-task Model for Deepfake Detection and Segmentation.- Camouflaged Object Detection via Global-edge Context and Mixed-scale Refinement.- Enhancing CLIP-Based Text-Person Retrieval by Leveraging Negative Samples.- Global Selection and Local Attention Network for Referring Image Segmentation.- MTQ-Caps: A Multi-Task Capsule Network for Blind Image Quality Assessment.- VCD: Visual Causality Discovery for Cross-Modal Question Reasoning.- Multimodal Topic and Sentiment Recognition for Chinese Data Based on Pre-trained Encoders.- Multi-Feature Fusion-Based Central Similarity Deep Supervised Hashing.- VVA: Video Values Analysis.- Dynamic Multi-modal Prompting for Efficient Visual Grounding.- A Graph-involved Lightweight Semantic Segmentation Network.- User-Aware Prefix-Tuning is a Good Learner for Personalized Image Captioning.- An End-to-End Transformer with Progressive Tri-modal Attention for Multi-modal Emotion Recognition.- Target-oriented Multi-criteria Band Selection for Hyperspectral Image.- Pairwise Negative Sample Mining for Human-Object Interaction Detection.- An Evolutionary Multiobjective Optimization Algorithm based on Manifold Learning.- Path Planning of Automatic Parking System by A Point-based Genetic Algorithm
Penalty-Aware Memory Loss for Deep Metric Learning.- Central and Directional Muti-Neck Knowledge Distillation.- Online Class-incremental Learning in Image Classification based on Attention
Online airline baggage packing based on hierarchical tree A2C-reinforcement learning framework