Skip to content
View beyondguo's full-sized avatar
🎨
🎨
Block or Report

Block or report beyondguo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
beyondguo/README.md

Hi there 👋

This is Biyang Guo!

Homepage: Beyond@SimpleAI

I'm a PhD student at SUFE. My research interests lie in NLP (Natural Language Processing) and DCAI (Data-Centric AI).

Here are some research projects I'm in charge of:

Project Paper Description Code
LLM-Tuning -- Tuning LLMs without tears.
ChatGPT-Comparison-Detection LLM@IJCAI-23 The first Human-ChatGPT comparison corpus and detection tools.
GENIUS: Generating text using sketches! Arxiv 2023 A novel pre-training model for sketch-based text generation and data augmentation
Selective Text Augmentation Arxiv 2022 A simple but effective data augmentation method based on Word Roles
Label Confusion Learning AAAI 2021 Label confusion learning for more robust model training

Pinned Loading

  1. Hello-SimpleAI/chatgpt-comparison-detection Hello-SimpleAI/chatgpt-comparison-detection Public

    Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥

    Python 1.2k 117

  2. LLM-Tuning LLM-Tuning Public

    Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

    HTML 942 98

  3. genius genius Public

    💡GENIUS – generating text using sketches! A strong text generation & data augmentation tool.

    Python 175 17

  4. label_confusion_learning label_confusion_learning Public

    Official implementation of AAAI-21 paper "Label Confusion Learning to Enhance Text Classification Models"

    Python 111 23

  5. TrainingDynamics TrainingDynamics Public

    Compute training dynamics, plot data cartography, analysing data quality...

    Jupyter Notebook 38 7

  6. STA STA Public

    Selective Text Augmentation with Word Roles

    Python 9 1