chinese-corpus
Here are 16 public repositories matching this topic...
Pre-trained Wikipedia corpus by MITIE
Updated Sep 9, 2018
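A tool for converting Sogou cell lexicons to plain text and extracting their contents. It extracts vocabulary lists for use in deep learning, for data generation and dictionary features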
Updated Dec 3, 2018 - Python
Pretrained model for Chinese Scientific Text
Updated May 26, 2020
Predicting Audience’s Response from Sketch Comedy and Crosstalk Scripts (A Corpus Supporting Comedy Writers)
Updated Nov 16, 2020
Chinese question-and-answer corpus from the PTT Gossiping board
Updated Jan 18, 2021 - Jupyter Notebook
Corpus creator for Chinese Wikipedia
Updated Jun 30, 2021 - Python
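Names that appeared in Weibo trending searches between 2020-11-24 and 2022-07-10 (mainly celebrities, politicians, public figures, internet influencers, entrepreneurs, etc.)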
Updated Jul 10, 2022
A Lite BERT for Self-Supervised Learning of Language Representations: large-scale pre-trained Chinese ALBERT models
Updated Nov 21, 2022 - Python
2019 Chinese Wikipedia corpus annotated with a 4-tag scheme, labeled using HanLP
Updated Jan 17, 2023 - Python
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus, and leaderboard
Updated Feb 18, 2023 - Python
A curated corpus of modern Chinese poetry: 3,489 poets, 81.7K poems, 15.43M characters. Continuously expanding...
Updated Aug 1, 2023 - Python
An implementation of 'Attention Is All You Need' with a Chinese corpus
Updated May 14, 2024 - Python
Large Scale Chinese Corpus for NLP
Updated May 23, 2024