Token Cutter 5000same model, two paths
grok-4-fast-non-reasoning

Token Cutter 5000

Beta testing for a new prompt-compression technology created by Justin Gund and Grok.

Backend-driven prompt metrics
Pure user prompt mode
Side-by-side worker answers
Temp 0 • Max 512

Prompt input

Type or copy/paste your prompt in here. 220 word limit. This site is a beta testing site for a new compression technology created by Justin Gund and Grok. Send all inquiries to justingund@level3id.com.

0 / 220 wordsTrim -220 wordsLive token preview

Normal worker response

Full prompt sent directly to grok-4-fast-non-reasoning as a single user message.

Input
--
Output
--
Total
--
Latency
--
Prompt --Completion --Finish after runFingerprint after run

The normal worker answer will appear here after the prompt is sent through the raw path.

TC5 worker response

Compressed prompt sent to the same model as a single user message.

Input
--
Output
--
Total
--
Latency
--
Prompt --Completion --Finish after runFingerprint after run

The TC5 worker answer will appear here after the compressed path finishes.

Normal Tokens In--

Preview estimate for the full prompt until you send it.

TC5 Tokens In--

Preview estimate for the compressed prompt until you send it.

Normal Tokens Out--

Appears after the live model call.

TC5 Tokens Out--

Appears after the live model call.

Pure user prompt mode — both sides use identical generation settings.
Words removed: --
Preview savings: -- tokens (--)
Live latency appears after you send the prompt.
Any difference in answer quality or length now comes purely from prompt compression.

Normal worker prompt tokens

Backend preview count for the untouched user prompt before it is sent to the model.

---- words

TC5 worker prompt tokens

Backend preview count for the compressed prompt that goes through the TC5 path.

---- words
Copyright Level3ID All rights reserved.