Portrait
Wei Shi
Staff AI Researcher, Meituan M17
About Me

Wei Shi is currently a staff AI researcher at Meituan M17. Prior to that, he received his Ph.D. Magna Cum Laude in December 2020 from Saarland University in Germany, where he worked on Natural Language Processing under the supervision of Prof. Dr. Vera Demberg at the Department of Language Science and Technology and Collaborative Research Center SFB-1102.

After graduation, he joined DAMO Academy, Alibaba Group as a Senior Algorithm Engineer in 2021, and later MiniMax Inc. in 2022, focusing on Large Language Models and Multimodal AI.

His research interests include Large Language Models, Multimodal AI, Discourse Relation Parsing, Sentiment Analysis, Text Generation, and Natural Language Understanding.

Education
  • Saarland University
    Ph.D. in Computational Linguistics
    Advisor: Prof. Dr. Vera Demberg
    2016 - 2020
  • Chinese Academy of Sciences
    Institute of Automation
    M.Sc. in Computer Science
    2013 - 2016
  • Wuhan University
    B.Eng. in Automation
    2008 - 2012
Experience
  • Meituan M17
    Staff AI Researcher
    Feb. 2024 - present
  • MiniMax Inc.
    LLM Algorithm Engineer
    Jun. 2022 - Jan. 2024
  • DAMO Academy, Alibaba Group
    Senior Algorithm Engineer
    Jan. 2021 - Jun. 2022
News
2024
Wei Shi has left MiniMax Inc., but is still working on LLMs.
Jan 14
2022
Wei Shi has left Alibaba and joined MiniMax Inc., a startup company focusing on LLM and Multimodality models.
May 31
2021
Wei Shi joined DAMO Academy, Alibaba Group as a Senior Algorithm Engineer.
Feb 28
2020
Wei Shi received his Ph.D. Magna Cum Laude from Saarland University.
Dec 14
Selected Publications (view all )
Longcat-Flash-Thinking-2601 Technical Report

MLC Team

arXiv preprint 2026

Technical report of Longcat-Flash-Thinking-2601, an advanced reasoning model by the MLC Team.

Longcat-Flash-Thinking-2601 Technical Report

MLC Team

arXiv preprint 2026

Technical report of Longcat-Flash-Thinking-2601, an advanced reasoning model by the MLC Team.

LongCat-Flash Technical Report

MLC Team

arXiv preprint 2025

Technical report of LongCat-Flash, a large language model developed by the MLC Team.

LongCat-Flash Technical Report

MLC Team

arXiv preprint 2025

Technical report of LongCat-Flash, a large language model developed by the MLC Team.

Entity Enhancement for Implicit Discourse Relation Classification in the Biomedical Domain

Wei Shi, Vera Demberg

Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP) 2021

We propose an entity-enhanced model for implicit discourse relation classification in the biomedical domain, leveraging entity information to improve relation detection.

Entity Enhancement for Implicit Discourse Relation Classification in the Biomedical Domain

Wei Shi, Vera Demberg

Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP) 2021

We propose an entity-enhanced model for implicit discourse relation classification in the biomedical domain, leveraging entity information to improve relation detection.

Next Sentence Prediction helps Implicit Discourse Relation Classification within and across Domains

Wei Shi, Vera Demberg

Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) 2019

We show that next sentence prediction, as used in BERT pre-training, can effectively help implicit discourse relation classification both within and across domains.

Next Sentence Prediction helps Implicit Discourse Relation Classification within and across Domains

Wei Shi, Vera Demberg

Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) 2019

We show that next sentence prediction, as used in BERT pre-training, can effectively help implicit discourse relation classification both within and across domains.

Attention-based Bidirectional Long Short-term Memory Networks for Relation Classification

Peng Zhou, Wei Shi, Jun Tian, Zhenyu Qi, Bingchen Li, Hongwei Hao, Bo Xu

54th Annual Meeting of the Association for Computational Linguistics (ACL) 2016

We propose an attention-based bidirectional LSTM model for relation classification, which can capture the most important semantic information in a sentence.

Attention-based Bidirectional Long Short-term Memory Networks for Relation Classification

Peng Zhou, Wei Shi, Jun Tian, Zhenyu Qi, Bingchen Li, Hongwei Hao, Bo Xu

54th Annual Meeting of the Association for Computational Linguistics (ACL) 2016

We propose an attention-based bidirectional LSTM model for relation classification, which can capture the most important semantic information in a sentence.

All publications