🌟 Scott Miner's GitHub portfolio showcasing personal projects, coding skills, and expertise in Software Development/Data Analytics/AI/ML. Get in touch for collaboration!
-
Updated
Jul 16, 2024
🌟 Scott Miner's GitHub portfolio showcasing personal projects, coding skills, and expertise in Software Development/Data Analytics/AI/ML. Get in touch for collaboration!
Fast Augmentation library for NLP
A PyPI package for augmenting text data using NLP techniques directly in your pandas dataframe.
Augmenty is an augmentation library based on spaCy for augmenting texts.
[WIP] Fast text augmentation for small text corvus
MSc Thesis Code
Bangla Text Augmentation
Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.
Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
This repo offers a Python script using NLPAug library & RTT to augment text datasets. It processes TXT files in "data/" folder, translating text and creating augmented versions. Augmented data enhances NLP tasks like chatbot training & text classification. Includes overview of techniques, applications & implementation.
💡GENIUS – generating text using sketches! A strong text generation & data augmentation tool.
This repository contains the data and code for the paper "Self-training with Two-phase Self-augmentation for Few-shot Dialogue Generation" (EMNLP2022-Findings).
AAAI Knowledge NLP Submission
Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
Chinese Characters Visualization & Chinese Text Augmentation.
Source Code, data, and results for my paper titled Linguistic Knowledge in Data Augmentation for Natural Language Processing: An Example on Chinese Question Matching.
ANSI and Unicode are encoding standards used across the world by writers and common users. ANSI is an older encoding version and is used in operating systems like Windows 95/ 98 and much older systems. Unicode is a newer version of encoding used in the current day operating systems
Dritributed Text Augmentation Techniques (Appeared AAAI 2023)
Feature space Augmentation
Text augmentation, deep learning, and aspect-based sentiment analysis.
Add a description, image, and links to the text-augmentation topic page so that developers can more easily learn about it.
To associate your repository with the text-augmentation topic, visit your repo's landing page and select "manage topics."