This repository is used to collect papers and code in the field of AI.
Implementation of the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"
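As background for the entry above, here is a minimal NumPy sketch of standard rotary position embedding (RoPE), the mechanism LongRoPE rescales to extend the context window; the paper's non-uniform interpolation search is not shown, and the function name is illustrative.

```python
import numpy as np

def rope(x, base=10000.0):
    """Apply rotary position embedding to x of shape (seq_len, d), d even."""
    seq_len, d = x.shape
    # Per-pair rotation frequency: theta_i = base^(-2i/d)
    inv_freq = base ** (-np.arange(0, d, 2) / d)               # (d/2,)
    angles = np.arange(seq_len)[:, None] * inv_freq[None, :]   # (seq_len, d/2)
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, 0::2], x[:, 1::2]                            # split into pairs
    out = np.empty_like(x)
    out[:, 0::2] = x1 * cos - x2 * sin                         # rotate each pair
    out[:, 1::2] = x1 * sin + x2 * cos                         # by its position angle
    return out
```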
A comprehensive paper list on Vision Transformer/attention, including papers, code, and related websites
Implementation of Vision Transformers (ViT) with a token merging mechanism
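A hedged sketch of one common token-merging idea, averaging the most cosine-similar pair of tokens (in the spirit of ToMe); this illustrates the mechanism only and is not that repository's code.

```python
import numpy as np

def merge_most_similar(tokens):
    """Reduce (n, d) tokens to (n-1, d) by averaging the most similar pair."""
    normed = tokens / np.linalg.norm(tokens, axis=1, keepdims=True)
    sim = normed @ normed.T                      # pairwise cosine similarity
    np.fill_diagonal(sim, -np.inf)               # ignore self-similarity
    i, j = np.unravel_index(np.argmax(sim), sim.shape)
    merged = (tokens[i] + tokens[j]) / 2         # average the closest pair
    keep = [k for k in range(len(tokens)) if k not in (i, j)]
    return np.vstack([tokens[keep], merged[None, :]])
```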
LinkOrgs: An R package for linking records on organizations using half a billion open-collaborated records from LinkedIn
This is the project repo associated with the paper "Disentangling and Integrating Relational and Sensory Information in Transformer Architectures" by Awni Altabaa, John Lafferty
This repository hosts BonyadAI, a Persian question-answering AI model. We developed an initial web crawler and scraper to gather the dataset; the second phase involved building a machine learning model based on word embeddings and NLP techniques. The model operates end-to-end, taking the user's voice input and responding in Persian voice.
This repository contains the code for the Transformer-Representation Neural Topic Model (TNTM) based on the paper "Probabilistic Topic Modelling with Transformer Representations" by Arik Reuter, Anton Thielmann, Christoph Weisser, Benjamin Säfken and Thomas Kneib
An introduction to attention mechanisms and the vision transformer
Multi-Step Retrosynthesis Tool based on Augmented Disconnection Aware Triple Transformer Loop Predictions
An implementation of the base GPT-3 model architecture from OpenAI's paper "Language Models are Few-Shot Learners"
This project analyzes the relationship between Nvidia and AI technologies. The notebook covers data pre-processing steps, including importing the necessary libraries and loading the data, and provides further analysis and insights into Nvidia's impact on AI and related trends.
toyGPT - a hands-on project for learning the Transformer and GPT model architectures
Basic Gesture Recognition Using mmWave Sensor - TI AWR1642
A NumPy implementation of the Transformer model in "Attention Is All You Need"
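The core of that paper, scaled dot-product attention, fits in a few lines of NumPy; the sketch below is illustrative, not the repository's actual code.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V, per the paper."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                   # (n_q, n_k) similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # row-wise softmax
    return weights @ V                                # weighted sum of values
```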
Official PyTorch implementation of the Vectorized Conditional Neural Field.
A simple PyTorch implementation of LongViT, using my previous implementation of LongNet as a foundation.
You are welcome to cite our published papers; the code has been uploaded.
Yet Another Transformer Implementation
Educational code for understanding attention mechanisms. You will build good intuition for K, Q, and V, which are key to modern Transformer architectures.
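To make that intuition concrete, here is a toy NumPy example showing that Q, K, and V are simply learned linear projections of the same token embeddings (all shapes and names are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
n, d_model, d_k = 4, 8, 8            # 4 tokens, toy dimensions
X = rng.normal(size=(n, d_model))    # token embeddings

# Q, K, V are learned linear projections of the same input X
W_q, W_k, W_v = (rng.normal(size=(d_model, d_k)) for _ in range(3))
Q, K, V = X @ W_q, X @ W_k, X @ W_v

# Each query scores every key; the softmax weights then mix the values
scores = Q @ K.T / np.sqrt(d_k)
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights /= weights.sum(axis=-1, keepdims=True)
print((weights @ V).shape)           # (4, 8): one mixed value per token
```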