Short and Sweet: Summarize App Reviews with NLP

About

Short and Sweet is a web application that utilizes machine learning techniques to summarize recurring themes in reviews from the Google Play Store. It presents these themes in bar plots and sentiments of the reviews in pie charts.

Usage

Start the Application: In the /src directory, run docker-compose up.
Access the Web Interface: Visit http://0.0.0.0:3000 in your browser.
Enter App URL: Paste the URL of a Google Play Store app into the form field.
Set Scraping Parameters: Choose review count and star filter options.
View Results: Interact with the plots for detailed insights.

Setup

The application runs in a Docker container, simplifying dependency installation. The machine learning models run on CPUs, requiring powerful hardware (e.g., AMD Ryzen 5 6-Core). For less powerful systems, see the cloud deployment instructions.

Local Setup

Clone the Repository:

git clone [email protected]:bjpietrzak/short_and_sweet.git

Navigate to the Project Directory:

cd short_and_sweet/src/

Build the Docker Images:

docker-compose build

Cloud Deployment

Lacking a powerful CPU? Deploy the machine learning models on Google Cloud Platform (GCP).

Create a GCP Account: Sign up for GCP.
Set Up a GCP Project: Create a new project and note the project ID.
Enable Services: Activate Container Registry, Cloud Build, and Cloud Run.
Install gcloud CLI: Download here.
Configure gcloud:

gcloud config set project <project-ID>

Build and Deploy Models: In src/ai/bertopic or src/ai/distillbert, run:

gcloud builds submit --tag gcr.io/<project-ID>/inference

and

gcloud run deploy inference --image=gcr.io/<project-ID>/inference:latest --execution-environment=gen2 --region=<region> --project=<project-ID> && gcloud run services update-traffic inference --to-latest

Update Endpoints: Insert the URL from Cloud Run into src/backend/app/configs/endpoints.json.
Comment Out Local AI Services in src/docker-compose.yaml.
Launch the Application:

docker-compose up

Common Errors and Solutions

Here are common errors you might encounter and how to resolve them.

1. Time Out During Cloud Run Deployment

Problem: Deploying the Docker image to the Cloud Run service times out, possibly due to insufficient memory allocation.

Solution:

Increase Memory Allocation: Navigate to the Container Registry, select the Docker image you wish to deploy. In the deployment settings, allocate more memory to the container. Setting it to 8GB RAM can be effective.

Steps:
1. Go to the Container Registry and select your image.
2. During deployment, adjust the memory settings to 8GB.

2. Time Out During Inference

Problem: Experiencing a timeout during the inference phase of the project.

Solution:

Increase Backend Timeout Threshold: Edit the timeout value in the src/backend/app/configs/backend.json file. Increasing this value allows more time for the inference process to complete without timing out.

Modification Example:
```
"timeout": 420
```
Increase Cloud Run Endpoint Timeout: In the deployment settings of the Cloud Run service, increase the timeout value to allow more time for inference operations.

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
images		images
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Short and Sweet: Summarize App Reviews with NLP

About

Usage

Setup

Local Setup

Cloud Deployment

Common Errors and Solutions

1. Time Out During Cloud Run Deployment

2. Time Out During Inference

About

Languages

License

bjpietrzak/short_and_sweet

Folders and files

Latest commit

History

Repository files navigation

Short and Sweet: Summarize App Reviews with NLP

About

Usage

Setup

Local Setup

Cloud Deployment

Common Errors and Solutions

1. Time Out During Cloud Run Deployment

2. Time Out During Inference

About

Topics

Resources

License

Stars

Watchers

Forks

Languages