Yunfan Shao
Pre-trained models for natural language processing: A survey
X Qiu, T Sun, Y Xu, Y Shao, N Dai, X Huang
Science China Technological Sciences 63 (10), 1872-1897, 2020
Q Guo, X Qiu, P Liu, Y Shao, X Xue, Z Zhang
arXiv preprint arXiv:1902.09113, 2019
Black-box tuning for language-model-as-a-service
T Sun, Y Shao, H Qian, X Huang, X Qiu
International Conference on Machine Learning, 20841-20855, 2022
CoLAKE: Contextualized language and knowledge embedding
T Sun, Y Shao, X Qiu, Q Guo, Y Hu, X Huang, Z Zhang
arXiv preprint arXiv:2010.00309, 2020
InternLM: A multilingual language model with progressively enhanced capabilities
ILM Team
2023-01-06)[2023-09-27]. https://github.com/InternLM/InternLM, 2023
Learning sparse sharing architectures for multiple tasks
T Sun, Y Shao, X Li, P Liu, H Yan, X Qiu, X Huang
Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 8936-8943, 2020
CPT: A pre-trained unbalanced transformer for both Chinese language understanding and generation
Y Shao, Z Geng, Y Liu, J Dai, H Yan, F Yang, Z Li, H Bao, X Qiu
Science China Information Sciences 67 (5), 152102, 2024
Character-LLM: A trainable agent for role-playing
Y Shao, L Li, J Dai, X Qiu
arXiv preprint arXiv:2310.10158, 2023
InternLM2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
MOSS: Training conversational language models from synthetic data
T Sun, X Zhang, Z He, P Li, Q Cheng, H Yan, X Liu, Y Shao, Q Tang, ...
arXiv preprint arXiv:2307.15020, 2023
Accelerating BERT inference for sequence labeling via early-exit
X Li, Y Shao, T Sun, H Yan, X Qiu, X Huang
arXiv preprint arXiv:2105.13878, 2021
Generating adversarial examples in Chinese texts using sentence-pieces
L Li, Y Shao, D Song, X Qiu, X Huang
arXiv preprint arXiv:2012.14769, 2020
InternLM-Math: Open math large language models toward verifiable reasoning
H Ying, S Zhang, L Li, Z Zhou, Y Shao, Z Fei, Y Ma, J Hong, K Liu, Z Wang, ...
arXiv preprint arXiv:2402.06332, 2024
PerturbScore: Connecting Discrete and Continuous Perturbations in NLP
L Li, K Ren, Y Shao, P Wang, X Qiu
arXiv preprint arXiv:2310.08889, 2023
MOSS: An Open Conversational Large Language Model
T Sun, X Zhang, Z He, P Li, Q Cheng, X Liu, H Yan, Y Shao, Q Tang, ...
Machine Intelligence Research, 1-18, 2024
Balanced Data Sampling for Language Model Training with Clustering
Y Shao, L Li, Z Fei, H Yan, D Lin, X Qiu
arXiv preprint arXiv:2402.14526, 2024
Query of CC: Unearthing Large Scale Domain-Specific Knowledge from Public Corpora
Z Fei, Y Shao, L Li, Z Zeng, H Yan, X Qiu, D Lin
arXiv preprint arXiv:2401.14624, 2024
Unified Active Retrieval for Retrieval Augmented Generation
Q Cheng, X Li, S Li, Q Zhu, Z Yin, Y Shao, L Li, T Sun, H Yan, X Qiu
arXiv preprint arXiv:2406.12534, 2024