T

Text Machine

Powerful text tools, in your browser

Cipher Identifier

Not sure what cipher you are looking at? Paste the mystery text and this tool analyses its character set, letter frequencies, and Index of Coincidence to rank the most likely ciphers — then links you straight to the matching decoder. Everything runs in your browser.

Try a sample:

Ciphertext to identify

Paste some text above and the most likely ciphers will appear here, ranked by confidence.

How to use Cipher Identifier

  1. 1

    Paste your mystery text

    Copy the unknown code or ciphertext and paste it into the box. It can be letters, Morse dots and dashes, Base64, numbers, or almost anything.

  2. 2

    Read the analysis summary

    See the character count, how many letters it contains, the detected character set, and the Index of Coincidence, which hints at whether the cipher used one alphabet or several.

  3. 3

    Review the ranked guesses

    The most likely ciphers appear as cards, ordered by confidence from Very likely down to Unlikely, each with a short reason explaining the signal that fired.

  4. 4

    Open the matching decoder

    Click through to the dedicated decoder for the top candidate to finish decoding. If several are suggested, try them from the top of the list down.

  5. 5

    Share or clear

    Copy a shareable link that reopens the tool with your exact text, or clear the box to start again. Everything stays in your browser.

How to identify an unknown cipher

What is a cipher identifier?

A cipher identifier is a tool that takes a piece of mystery text and tells you which cipher or encoding most likely produced it. Instead of guessing blindly or trying every decoder in turn, you paste the ciphertext once and get a ranked list of candidates — Caesar, Vigenère, Base64, Morse, and more — each with a confidence rating and a direct link to the right decoder.

This is the natural starting point whenever you find encoded text without being told how it was made: a capture-the-flag challenge, an escape room clue, a geocaching puzzle, an alternate-reality game, or a coded note. Knowing the family of cipher narrows dozens of possibilities down to one or two, so you can stop searching and start decoding.

How the cipher identifier works

The identifier uses classical cryptanalysis, not machine learning, so every verdict is explainable. It runs your text through a series of tests, from the most obvious to the most subtle. First it looks at the character set: text made only of dots and dashes is Morse, only zeros and ones is binary, only hexadecimal digits is hex, and the Base64 alphabet with a length that is a multiple of four is Base64. Strings of small numbers point to Polybius squares, A1Z26 letter numbering, or ASCII codes.

When the text is ordinary letters, the tool switches to statistics. It measures the Index of Coincidence to decide whether one alphabet was used (monoalphabetic) or several (polyalphabetic), then runs a Caesar shift test and an Atbash test using chi-squared scoring against English letter frequencies. The combination of character set, frequency profile, and structural clues such as length and repeated patterns produces the ranked list you see.

The Index of Coincidence, explained

The Index of Coincidence, or IoC, measures how likely it is that two letters picked at random from the text are the same. Ordinary English text has an IoC of about 0.067 because some letters, like E and T, are far more common than others. Completely random text sits near 0.038, where every letter is equally likely.

This single number is the most useful clue for letter-based ciphers. Caesar, Atbash, and keyword substitution ciphers only swap one letter for another, so the lumpy English frequency profile survives and the IoC stays high, near 0.066. Polyalphabetic ciphers like Vigenère use several alphabets at once, which flattens the frequencies and drags the IoC down toward 0.04. So a high IoC says monoalphabetic, a low IoC says polyalphabetic, and a value in between is a hint to try both.

Reading the character-set clues

Many encodings give themselves away by their alphabet alone. Morse code uses only dots, dashes, and separators. Binary uses only the digits 0 and 1, usually in groups of eight. Hexadecimal uses the digits 0 to 9 and the letters A to F, with an even number of characters. Base64 uses upper and lower case letters, digits, plus and slash, often ending in one or two equals signs as padding, with a total length that is a multiple of four.

Numbers carry their own meaning. Pairs of digits between 1 and 5 are the coordinates of a Polybius square. Numbers that all fall between 1 and 26 are most likely A1Z26, where 1 is A and 26 is Z. Larger numbers in the 32 to 126 range are decimal ASCII character codes. A text built only from the letters A, D, F, G, V, and X is the unmistakable signature of the ADFGX or ADFGVX field cipher used in the First World War.

Monoalphabetic versus polyalphabetic ciphers

If the text is letters and the IoC is high, you are almost certainly looking at a monoalphabetic cipher, where each plaintext letter always maps to the same ciphertext letter. The simplest is the Caesar cipher, which shifts every letter by the same amount; the identifier confirms it when exactly one shift turns the text into English. Atbash is the special case that reverses the alphabet so A becomes Z. If neither a shift nor a reversal works but the IoC is still high, it is a general keyword or affine substitution that needs a substitution solver.

If the IoC is low, the cipher uses more than one alphabet. The classic example is Vigenère, which applies a repeating keyword so each position can use a different shift; its relatives include Beaufort, Gronsfeld, and Porta. A separate clue, an even length with no doubled letters inside pairs, points instead to a polygraphic cipher like Playfair, which encrypts two letters at a time.

What to do after you identify the cipher

Identification is only the first step. Each candidate in the results links to the dedicated decoder for that cipher, where you can finish the job. For a Caesar cipher, the decoder can brute-force all 25 shifts and pick the best with frequency analysis. For Base64, Morse, binary, or hex, the matching converter turns the code straight back into text. For Vigenère and substitution ciphers you will usually need the key or a solver, but knowing the family tells you exactly which technique to use.

When several candidates appear, work down the list from the most confident. The identifier deliberately surfaces more than one possibility when the evidence is ambiguous, because a short message rarely gives a definitive answer. Trying the top decoder almost always confirms or rules out the guess in seconds.

Limitations to keep in mind

No automatic identifier is perfect. Very short messages do not contain enough letters for the statistics to be reliable, so a five-letter sample may be ambiguous even when a paragraph would be obvious. Messages that have been encoded in several layers, such as a Caesar cipher then converted to Base64, will only reveal the outermost layer at first; decode it and run the identifier again on the result.

The tool focuses on classical ciphers and common encodings, the families that turn up in puzzles, games, and history. It does not attempt to recognise modern cryptography such as AES or RSA, whose output is designed to look completely random and carry no identifying structure at all. Treat the results as expert hints that point you to the right decoder, not as a guaranteed verdict.

Frequently asked questions

What is a cipher identifier?
A cipher identifier analyses a piece of unknown text and tells you which cipher or encoding most likely created it. Instead of trying every decoder by hand, you paste the ciphertext once and get a ranked list of candidates, each with a confidence rating and a link to the right decoder.
How does it know which cipher it is?
It uses classical cryptanalysis. First it checks the character set, since Morse, binary, hexadecimal, Base64, and numeric codes each use a distinctive alphabet. For letter-based text it measures letter frequencies and the Index of Coincidence, then runs Caesar and Atbash tests, combining all the evidence into a ranked list. It is fully rule-based, so every result comes with an explanation.
What is the Index of Coincidence?
The Index of Coincidence measures the chance that two letters picked at random from the text are the same. English is about 0.067 and random text about 0.038. Monoalphabetic ciphers keep the value high, near 0.066, while polyalphabetic ciphers like Vigenère flatten it toward 0.04, which is the single most useful clue for telling the two families apart.
Can it identify every cipher?
No. It covers the classical ciphers and common encodings found in puzzles, games, and history, more than fifteen families in all. It does not recognise modern cryptography such as AES or RSA, whose output is designed to look random and carry no structure. Treat the results as strong hints that point you to the right decoder rather than a guaranteed answer.
Why does it suggest more than one cipher?
When the evidence is ambiguous, the tool deliberately lists several possibilities ordered by confidence so you do not miss the answer. Short messages in particular rarely give a definitive verdict, so it is normal to see two or three candidates. Start with the most confident one and work down the list.
It says my text is plain text. What does that mean?
It means the letter distribution already matches ordinary English, so the text may not be encrypted at all, or it uses a cipher that preserves normal frequencies. If you expected a cipher, check whether the message is simply readable, or whether it has been encoded in a way that keeps English-like statistics.
Does the tool decode the message too?
The identifier focuses on telling you which cipher you have. To decode, click the link on the top candidate, which opens the dedicated decoder for that cipher. Some, like Caesar, Base64, Morse, binary, and hex, decode instantly; others, like Vigenère, usually need the key or a solver, but you will know exactly which tool to use.
Is my text uploaded to a server?
No. All analysis happens entirely in your browser, so your text is never uploaded, logged, or stored. Even a share link keeps your text in the part of the URL after the hash, which browsers never send to a server, so it stays private until you choose to share it.
What ciphers and encodings can it detect?
Among others it recognises Morse code, binary, hexadecimal, Base64, Baconian, Polybius squares, A1Z26 and ASCII number codes, ADFGX and ADFGVX, the Caesar and ROT shift ciphers, Atbash, keyword and affine substitution, Vigenère and other polyalphabetic ciphers, and Playfair-style digraph ciphers.
Why can it not identify very short messages?
Statistical tests like frequency analysis and the Index of Coincidence need enough letters to be reliable. A handful of characters does not contain enough information to separate one cipher from another, so short samples often come back as ambiguous or unrecognised. The longer the message, the more confident the identification.
How can I identify a cipher by hand?
Look at the alphabet first: dots and dashes mean Morse, zeros and ones mean binary, and an alphabet of A, D, F, G, V, X means ADFGX. For letters, check the frequencies; if one cipher letter dominates the way E does in English, it is probably a simple shift or substitution, and if the frequencies look flat, it is likely polyalphabetic. This tool automates exactly those checks.
What is the difference between a cipher and an encoding?
An encoding such as Base64, hexadecimal, or Morse simply represents text in another form and needs no secret key to reverse. A cipher such as Caesar or Vigenère deliberately hides a message and needs a key to decode. The identifier handles both, because when you face mystery text you first need to know which of the two you are dealing with.

Related tools

Keep going with these handy tools

Caesar Cipher

Vigenère Cipher

Atbash Cipher

Rail Fence Cipher

Playfair Cipher

Affine Cipher