GPT-2 Output Detector

What Is GPT

GPT stands for “Generative Pre-trained Transformer.” It is a type of language model developed by OpenAI that is trained to generate human-like text. The model is pre-trained on a large dataset & can then be fine-tuned for specific tasks such as translation, summarization, or question answering. GPT has been successful in many natural language processing tasks & has set several benchmarks in the field.

You need super computer with extreme configuration if you want to train the data, I am not going into details, but here are the steps

What Is GPT Output Detector

GPT output detectors are computer programs that analyze the output of a GPT (generative pre-trained transformer) model & detect when the output is not realistic. These programs use various techniques such as natural language processing, machine learning, & deep learning to identify patterns in the output that indicate when the text is not natural. They can also detect when the output is generated from a malicious source.

Huggingface GPT2 tutorial

I am providing steps with and without train data

I was able to setup my own GPT-2 Output Detector, on Debian 11, before you do anything make sure you run apt-get update && apt-get upgrade -y

apt install git wget docker.io -y
git clone https://github.com/openai/gpt-2-output-dataset && cd gpt-2-output-dataset
cd gpt-2-output-dataset
nano Dockerfile

++Insdie Dockerfile paste the following

FROM python:3.7

RUN pip3 install torch==1.1.0 torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cpu \
    pip install transformers==2.9.1 \
    pip install fire==0.2.1 \
    pip install tensorboard>=1.14.0 \
    pip install future

COPY . /code

WORKDIR /code

RUN wget https://openaipublic.azureedge.net/gpt-2/detector-models/v1/detector-base.pt && wget https://openaipublic.azureedge.net/gpt-2/detector-models/v1/detector-large.pt

RUN python -m detector.train

CMD python -m detector.server detector-base.pt

If you don’t have high end machine then I advise you to edit your Dockerfile with the following

FROM python:3.7

RUN pip3 install torch==1.1.0 torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cpu \
    pip install transformers==2.9.1 \
    pip install fire==0.2.1 \
    pip install tensorboard>=1.14.0 \
    pip install future

COPY . /code

WORKDIR /code

RUN wget https://openaipublic.azureedge.net/gpt-2/detector-models/v1/detector-base.pt && wget https://openaipublic.azureedge.net/gpt-2/detector-models/v1/detector-large.pt

CMD python -m detector.server detector-base.pt

Next step

curl -sSl https://get.docker.com | bash

docker build -t gpt .

Final Step

docker run --name gpt --restart always -d -p 80:8080 gpt

Once you ran all the above command, it will take may be 2 minutes and you can access your GPT Output detector, by visiting http://YOURIP:YOURPORT

Enjoy your very own GPT Output Detector

1 thought on “GPT-2 Output Detector”

Verified this detector works with GPT-2 generated content, such as:

https://us.ukessays.com/essays/chatgpt/corporate-wellness-progams-exploration-1963.php

Verified this detector does not work with GPT-3 generated content such as:

https://www.theguardian.com/commentisfree/2020/sep/08/robot-wrote-this-article-gpt-3

Did not train data locally, used the second method provided in this tutorial.
Did not test datasets other than the one provided in the above example.

Posts

What Is GPT

What Is GPT Output Detector

++Insdie Dockerfile paste the following

1 thought on “GPT-2 Output Detector”

Leave a Reply Cancel reply