pretrain

Here are 20 public repositories matching this topic...

brightmart / nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

nlp news wiki text-classification word2vec corpus dataset question-answering chinese chinese-nlp language-model bert chinese-corpus pretrain chinese-dataset

Updated May 23, 2024

keyu-tian / SparK

Star

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Updated Jan 23, 2024
Python

CLUEbenchmark / CLUECorpus2020

Star

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

nlp corpus chinese datasets albert bert chinese-corpus roberta pretrain

Updated Oct 17, 2022

microsoft / UniVL

Star

An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

video localization caption alignment segmentation coin multimodality joint multimodal-sentiment-analysis pretrain pretraining msrvtt video-text-retrieval video-text video-language youcookii retrieval-task caption-task

Updated Nov 28, 2022
Python

yangjianxin1 / Firefly-LLaMA2-Chinese

Star

Firefly中文LLaMA-2大模型，支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型

bloom falcon firefly llama lora pretrain baichuan llm chatglm qlora internlm baichuan-13b llama2 llama-2 qwen xverse baichaun2

Updated Oct 21, 2023
Python

thunlp / RE-Context-or-Names

Star

Bert-based models(BERT, MTB, CP) for relation extraction.

pytorch bert relation-extraction pretrain contrastive-learning

Updated Jun 1, 2022
Python

THUNLP-AIPoet / BERT-CCPoem

Star

BERT-CCPoem is an BERT-based pre-trained model particularly for Chinese classical poetry

poetry bert pretrain

Updated Mar 1, 2022
Python

xcfcode / What-I-Have-Read

Star

Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers

Updated Jun 12, 2022

huzongxiang / MatDGL

Star

MatDGL is a neural network package that allows researchers to train custom models for crystal modeling tasks. It aims to accelerate the research and application of material science.

machine-learning deep-learning materials graph tensorflow transformer neural-networks pretrain massagepassing

Updated Oct 10, 2022
Python

SalesforceAIResearch / pretrain-time-series-cloudops

Star

Official code repository for the paper "Pushing the Limits of Pre-training for Time Series Forecasting in the CloudOps Domain"

time-series forecasting cloudops pretrain

Updated Jun 7, 2024
Python

bayartsogt-ya / albert-mongolian

Star

ALBERT trained on Mongolian text corpus

transformers language-model albert mongolian pretrain pretrained-model masked-autoencoder

Updated Jan 10, 2021
Jupyter Notebook

CoinCheung / MFM

Star

code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)

ssl mfm frequency fft self-supervised-learning pretrain

Updated Feb 3, 2023
Python

yongzhuo / MacroGPT-Pretrain

Star

macrogpt大模型全量预训练(1b3,32层), 多卡deepspeed/单卡adafactor

micro gpt macro pretrain deepspeed llm 1b3

Updated Nov 30, 2023
Python

mrzjy / hoyo_public_wiki_parser

Star

Parsing Hoyoverse game text corpus from public wikipedia

game nlp wiki dialogue corpus conversation pretrain mihoyo genshin-impact hoyoverse llm honkai-star-rail

Updated Feb 18, 2024
Python

arrrrrmin / albert-guide

Star

Understanding "A Lite BERT". An Transformer approach for learning self-supervised Language Models.

nlp guide language-modeling pretrain pretraining albert-models albert-guide

Updated Jan 28, 2023
Python

pskliff / vtb-data-fusion

Star

This repository provides code solution for Data Fusion Contest task 1

nlp classification retail bert receipts fine-tuning pretrain huggingface rubert distilbert

Updated Mar 31, 2021
Jupyter Notebook

afogarty85 / applied_nlp_demos

Star

nlp natural-language-processing chatbot transformers pytorch accelerate lora bert pretrain deepspeed t5-model

Updated Oct 17, 2023
Python

nancheng58 / SSL4SR

Star

[CCIR 2023] Self-supervised learning for Sequential Recommender Systems

baseline recommender-system recommendation self-supervised-learning pretrain sequential-recommendation

Updated Nov 7, 2023
Python

stoneyang / cv-arxiv-daily

Star

🎓Automatically Update CV Papers Daily using Github Actions (Update Every 24th hours)

pretrained pretrain pretraining

Updated Jul 17, 2024
Python

tianhao-ai / Detecting-Machine-Generated-Text-COMP90051-2023S1-Project-1

Star

This project is about to detecting the text generated by different LLM given prompt. The instance is labeled by Human and Machine, and this project utilised both traditional machine learning method and deep learning method to classify the instance.

pytorch bidirectional-gru domain-adaptation fine-tuning pretrain pretraining attention-gru lgbmclassifier comp90051

Updated Jul 12, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the pretrain topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pretrain topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pretrain

Here are 20 public repositories matching this topic...

brightmart / nlp_chinese_corpus

keyu-tian / SparK

CLUEbenchmark / CLUECorpus2020

microsoft / UniVL

yangjianxin1 / Firefly-LLaMA2-Chinese

thunlp / RE-Context-or-Names

THUNLP-AIPoet / BERT-CCPoem

xcfcode / What-I-Have-Read

huzongxiang / MatDGL

SalesforceAIResearch / pretrain-time-series-cloudops

bayartsogt-ya / albert-mongolian

CoinCheung / MFM

yongzhuo / MacroGPT-Pretrain

mrzjy / hoyo_public_wiki_parser

arrrrrmin / albert-guide

pskliff / vtb-data-fusion

afogarty85 / applied_nlp_demos

nancheng58 / SSL4SR

stoneyang / cv-arxiv-daily

tianhao-ai / Detecting-Machine-Generated-Text-COMP90051-2023S1-Project-1

Improve this page

Add this topic to your repo