Cipher Identifier
Not sure what cipher you are looking at? Paste the mystery text and this tool analyses its character set, letter frequencies, and Index of Coincidence to rank the most likely ciphers — then links you straight to the matching decoder. Everything runs in your browser.
Try a sample:
Paste some text above and the most likely ciphers will appear here, ranked by confidence.
How to use Cipher Identifier
- 1
Paste your mystery text
Copy the unknown code or ciphertext and paste it into the box. It can be letters, Morse dots and dashes, Base64, numbers, or almost anything.
- 2
Read the analysis summary
See the character count, how many letters it contains, the detected character set, and the Index of Coincidence, which hints at whether the cipher used one alphabet or several.
- 3
Review the ranked guesses
The most likely ciphers appear as cards, ordered by confidence from Very likely down to Unlikely, each with a short reason explaining the signal that fired.
- 4
Open the matching decoder
Click through to the dedicated decoder for the top candidate to finish decoding. If several are suggested, try them from the top of the list down.
- 5
Share or clear
Copy a shareable link that reopens the tool with your exact text, or clear the box to start again. Everything stays in your browser.
How to identify an unknown cipher
What is a cipher identifier?
A cipher identifier is a tool that takes a piece of mystery text and tells you which cipher or encoding most likely produced it. Instead of guessing blindly or trying every decoder in turn, you paste the ciphertext once and get a ranked list of candidates — Caesar, Vigenère, Base64, Morse, and more — each with a confidence rating and a direct link to the right decoder.
This is the natural starting point whenever you find encoded text without being told how it was made: a capture-the-flag challenge, an escape room clue, a geocaching puzzle, an alternate-reality game, or a coded note. Knowing the family of cipher narrows dozens of possibilities down to one or two, so you can stop searching and start decoding.
How the cipher identifier works
The identifier uses classical cryptanalysis, not machine learning, so every verdict is explainable. It runs your text through a series of tests, from the most obvious to the most subtle. First it looks at the character set: text made only of dots and dashes is Morse, only zeros and ones is binary, only hexadecimal digits is hex, and the Base64 alphabet with a length that is a multiple of four is Base64. Strings of small numbers point to Polybius squares, A1Z26 letter numbering, or ASCII codes.
When the text is ordinary letters, the tool switches to statistics. It measures the Index of Coincidence to decide whether one alphabet was used (monoalphabetic) or several (polyalphabetic), then runs a Caesar shift test and an Atbash test using chi-squared scoring against English letter frequencies. The combination of character set, frequency profile, and structural clues such as length and repeated patterns produces the ranked list you see.
The Index of Coincidence, explained
The Index of Coincidence, or IoC, measures how likely it is that two letters picked at random from the text are the same. Ordinary English text has an IoC of about 0.067 because some letters, like E and T, are far more common than others. Completely random text sits near 0.038, where every letter is equally likely.
This single number is the most useful clue for letter-based ciphers. Caesar, Atbash, and keyword substitution ciphers only swap one letter for another, so the lumpy English frequency profile survives and the IoC stays high, near 0.066. Polyalphabetic ciphers like Vigenère use several alphabets at once, which flattens the frequencies and drags the IoC down toward 0.04. So a high IoC says monoalphabetic, a low IoC says polyalphabetic, and a value in between is a hint to try both.
Reading the character-set clues
Many encodings give themselves away by their alphabet alone. Morse code uses only dots, dashes, and separators. Binary uses only the digits 0 and 1, usually in groups of eight. Hexadecimal uses the digits 0 to 9 and the letters A to F, with an even number of characters. Base64 uses upper and lower case letters, digits, plus and slash, often ending in one or two equals signs as padding, with a total length that is a multiple of four.
Numbers carry their own meaning. Pairs of digits between 1 and 5 are the coordinates of a Polybius square. Numbers that all fall between 1 and 26 are most likely A1Z26, where 1 is A and 26 is Z. Larger numbers in the 32 to 126 range are decimal ASCII character codes. A text built only from the letters A, D, F, G, V, and X is the unmistakable signature of the ADFGX or ADFGVX field cipher used in the First World War.
Monoalphabetic versus polyalphabetic ciphers
If the text is letters and the IoC is high, you are almost certainly looking at a monoalphabetic cipher, where each plaintext letter always maps to the same ciphertext letter. The simplest is the Caesar cipher, which shifts every letter by the same amount; the identifier confirms it when exactly one shift turns the text into English. Atbash is the special case that reverses the alphabet so A becomes Z. If neither a shift nor a reversal works but the IoC is still high, it is a general keyword or affine substitution that needs a substitution solver.
If the IoC is low, the cipher uses more than one alphabet. The classic example is Vigenère, which applies a repeating keyword so each position can use a different shift; its relatives include Beaufort, Gronsfeld, and Porta. A separate clue, an even length with no doubled letters inside pairs, points instead to a polygraphic cipher like Playfair, which encrypts two letters at a time.
What to do after you identify the cipher
Identification is only the first step. Each candidate in the results links to the dedicated decoder for that cipher, where you can finish the job. For a Caesar cipher, the decoder can brute-force all 25 shifts and pick the best with frequency analysis. For Base64, Morse, binary, or hex, the matching converter turns the code straight back into text. For Vigenère and substitution ciphers you will usually need the key or a solver, but knowing the family tells you exactly which technique to use.
When several candidates appear, work down the list from the most confident. The identifier deliberately surfaces more than one possibility when the evidence is ambiguous, because a short message rarely gives a definitive answer. Trying the top decoder almost always confirms or rules out the guess in seconds.
Limitations to keep in mind
No automatic identifier is perfect. Very short messages do not contain enough letters for the statistics to be reliable, so a five-letter sample may be ambiguous even when a paragraph would be obvious. Messages that have been encoded in several layers, such as a Caesar cipher then converted to Base64, will only reveal the outermost layer at first; decode it and run the identifier again on the result.
The tool focuses on classical ciphers and common encodings, the families that turn up in puzzles, games, and history. It does not attempt to recognise modern cryptography such as AES or RSA, whose output is designed to look completely random and carry no identifying structure at all. Treat the results as expert hints that point you to the right decoder, not as a guaranteed verdict.
Frequently asked questions
What is a cipher identifier?
How does it know which cipher it is?
What is the Index of Coincidence?
Can it identify every cipher?
Why does it suggest more than one cipher?
It says my text is plain text. What does that mean?
Does the tool decode the message too?
Is my text uploaded to a server?
What ciphers and encodings can it detect?
Why can it not identify very short messages?
How can I identify a cipher by hand?
What is the difference between a cipher and an encoding?
Related tools
Keep going with these handy tools