Katakana

Katakana カタカナ

Type	Syllabary
Languages	Japanese, Okinawan, Ainu, Palauan^[1]
Time period	~800 AD to the present
Parent systems	Oracle Bone Script Seal Script Clerical Script Regular script (Kanji) Man'yōgana Katakana カタカナ
Sister systems	Hiragana, Hentaigana
Direction	Left-to-right
ISO 15924	`Kana, 411`
Unicode alias	Katakana
Unicode range	Katakana: U+30A0–U+30FF Katakana Phonetic Extensions: U+31F0–U+31FF Enclosed CJK Letters and Months: U+3200–U+32FF Halfwidth and Fullwidth Forms: U+FF00–U+FFEF Kana Supplement: U+1B000–U+1B0FF

Japanese writing

Components
Kanji Stroke order Radicals Kyōiku kanji Jōyō kanji Jinmeiyō kanji Hyōgai kanji List of kanji by stroke count List of kanji by concept
Kana Hiragana Katakana Hentaigana Man'yōgana Sogana Gojūon
Typographic symbols Japanese punctuation Iteration mark
Uses
Syllabograms Furigana Okurigana Braille
Romanization
Rōmaji Hepburn (colloquial) Kunrei (ISO) Nihon (ISO transliteration) JSL (transliteration) Wāpuro (keyboard)

Katakana (片仮名, カタカナ) is a Japanese syllabary, one component of the Japanese writing system along with hiragana,^[2] kanji, and in some cases the Latin script (known as romaji). The word katakana means "fragmentary kana", as the katakana characters are derived from components of more complex kanji. Katakana and hiragana are both kana systems. With one or two minor exceptions, each syllable (strictly mora) in the Japanese language is represented by one character, or kana, in each system. Each kana is either a vowel such as "a" (katakana ア); a consonant followed by a vowel such as "ka" (katakana カ); or "n" (katakana ン), a nasal sonorant which, depending on the context, sounds either like English m, n, or ng ([ŋ]), or like the nasal vowels of Portuguese.

In contrast to the hiragana syllabary, which is used for those Japanese language words and grammatical inflections which kanji does not cover, the katakana syllabary usage is quite similar to italics in English; specifically, it is used for transcription of foreign language words into Japanese and the writing of loan words (collectively gairaigo); for emphasis; to represent onomatopoeia; for technical and scientific terms; and for names of plants, animals, minerals, and often Japanese companies.

Katakana are characterized by short, straight strokes and sharp corners, and are the simplest of the Japanese scripts.^[3] There are two main systems of ordering katakana: the old-fashioned iroha ordering, and the more prevalent gojūon ordering.

Writing system

Script

Gojūon – Katakana characters with nucleus
	a	i	u	e	o
∅	ア	イ	ウ	エ	オ
K	カ	キ	ク	ケ	コ
S	サ	シ	ス	セ	ソ
T	タ	チ	ツ	テ	ト
N	ナ	ニ	ヌ	ネ	ノ
H	ハ	ヒ	フ	ヘ	ホ
M	マ	ミ	ム	メ	モ
Y	ヤ		ユ		ヨ
R	ラ	リ	ル	レ	ロ
W	ワ	ヰ		ヱ	ヲ

Katakana coda character
n	ン

Katakana diacritics
dakuten	゛
handakuten	゜

The complete katakana script consists of 48 characters, not counting functional and diacritic marks:

5 nucleus vowels
42 core or body (onset-nucleus) syllabograms, consisting of 9 consonants in combination with each of the 5 vowels, of which 3 possible combinations (yi, ye, wu) are not canonical
1 coda consonant

These are conceived as a 5×10 grid (gojūon, 五十音, literally "fifty sounds"), as shown in the adjacent table, read ア (a), イ (i), ウ (u), エ (e), オ (o), カ (ka), キ (ki), ク (ku), ケ (ke), コ (ko) and so on. The gojūon inherits its vowel and consonant order from Sanskrit practice. In vertical text contexts, which used to be the default case, the grid is usually presented as 10 columns by 5 rows, with vowels on the right hand side and ア (a) on top. Katakana glyphs in the same row or column do not share common graphic characteristics. Three of the syllabograms to be expected, yi, ye and wu, may have been used idiosyncratically with varying glyphs, but never became conventional in any language and are not present at all in modern Japanese.

The 50-sound table is often amended with an extra character, the nasal stop ン (n). This can appear in several positions, most often next to the N signs or, because it developed from one of many mu hentaigana, below the u column. It may also be appended to the vowel row or the a column. Here, it is shown in a table of its own.

The script includes two diacritic marks that change the initial sound of a syllabogram. Both appear mutually exclusive at the upper right of the base character. A double dot, called dakuten, indicates a primary alteration; most often it voices the consonant: k→g, s→z, t→d and h→b; for example, カ (ka) becomes ガ (ga). Secondary alteration, where possible, is shown by a circular handakuten: h→p; For example; ハ (ha) becomes パ (pa). Diacritics, though used for over a thousand years, only became mandatory in the Japanese writing system in the second half of the 20th century. Their application is strictly limited in proper writing systems, but may be more extensive in academic transcriptions.

Furthermore, some characters may have special semantics when used in smaller size after a normal one (see below), but this does not make the script truly bicameral.

The layout of the gojūon table promotes a systematic view of kana syllabograms as being always pronounced with the same single consonant followed by a vowel, but this is not exactly the case (and never has been). Existing schemes for the romanization of Japanese either are based on the systematic nature of the script, e.g. nihon-siki チ ti, or they apply some Western graphotactics, usually the English one, to the common Japanese pronunciation of the kana signs, e.g. Hepburn-shiki チ chi. Both approaches conceal the fact, though, that many consonant-based katakana signs, especially those canonically ending in u, can be used in coda position, too, where the vowel is unvoiced and therefore barely perceptible.

Japanese

Syllabary and orthography

Katakana used in Japanese orthography
	a	i	u	e	o
∅	ア	イ	ウ	エ	オ
K	カ	キ	ク	ケ	コ
G	ガ	ギ	グ	ゲ	ゴ
S	サ	シ	ス	セ	ソ
Z	ザ	ジ	ズ	ゼ	ゾ
T	タ	チ	ツ	テ	ト
D	ダ	ヂ	ヅ	デ	ド
N	ナ	ニ	ヌ	ネ	ノ
H	ハ	ヒ	フ	ヘ	ホ
B	バ	ビ	ブ	ベ	ボ
P	パ	ピ	プ	ペ	ポ
M	マ	ミ	ム	メ	モ
Y	ヤ		ユ		ヨ
R	ラ	リ	ル	レ	ロ
W	ワ	ヰ		ヱ	ヲ
n			ン

unused/obsolete

Katakana functional characters
sokuon	ッ
chōonpu	ー
iteration mark	ヽ

Of the 48 katakana syllabograms described above, only 46 are used in modern Japanese, and one of these is preserved for only a single use:

wi and we are pronounced as vowels in modern Japanese and are therefore obsolete, being supplanted by i and e respectively.
wo is now used only as a particle, and is normally pronounced the same as vowel オ o. As a particle, it is usually written in hiragana (を) and the katakana form, ヲ, is uncommon.

A small version of the katakana for ya, yu or yo (ャ, ュ or ョ respectively) may be added to katakana ending in i. This changes the i vowel sound to a glide (palatalization) to a, u or o, e.g. キャ (ki + ya) /kja/. Addition of the small y kana is called yōon.

Small versions of the five vowel kana are sometimes used to represent trailing off sounds (ハァ haa, ネェ nee), but in katakana they are more often used in yōon-like extended digraphs designed to represent phonemes not present in Japanese; examples include チェ (che) in チェンジ chenji ("change"), and ウィ (wi) and ディ (di) in ウィキペディア Wikipedia.

A character called a sokuon, which is visually identical to a small tsu ッ, indicates that the following consonant is geminated (doubled); this is represented in rōmaji by doubling the consonant that follows the sokuon. In Japanese this is an important distinction in pronunciation; for example, compare サカ saka "hill" with サッカ sakka "author". Geminated consonants are common in transliterations of foreign loanwords; for example English "bed" is represented as ベッド (beddo). The sokuon also sometimes appears at the end of utterances, where it denotes a glottal stop. However, it cannot be used to double the na, ni, nu, ne, no syllables' consonants – to double these, the singular n (ン) is added in front of the syllable. The sokuon may also be used to approximate a non-native sound; Bach is written バッハ (Bahha); Mach as マッハ (Mahha).

Both katakana and hiragana usually spell native long vowels with the addition of a second vowel kana, but katakana uses a vowel extender mark, called a chōonpu ("long vowel mark"), in foreign loanwords. This is a short line (ー) following the direction of the text, horizontal for yokogaki (horizontal text), and vertical for tategaki (vertical text). For example, メール mēru is the gairaigo for e-mail taken from the English word "mail"; the ー lengthens the e. There are some exceptions, such as ローソク (rōsoku (蝋燭^?, "candle") ) or ケータイ(kētai (携帯^?, "mobile phone") ), where Japanese words written in katakana use the elongation mark, too.

Standard and voiced iteration marks are written in katakana as ヽ and ヾ respectively.

Usage

Main article: Japanese writing system

All Katakana writing (in 1940)

In modern Japanese, katakana is most often used for transcription of words from foreign languages (other than words historically imported from Chinese), called gairaigo.^[4] For example, "television" is written テレビ (terebi). Similarly, katakana is usually used for country names, foreign places, and foreign personal names. For example, the United States is usually referred to as アメリカ Amerika, rather than in its ateji kanji spelling of 亜米利加 Amerika.

Katakana are also used for onomatopoeia,^[4] words used to represent sounds – for example, ピンポン (pinpon), the "ding-dong" sound of a doorbell.

Technical and scientific terms, such as the names of animal and plant species and minerals, are also commonly written in katakana.^[5] Homo sapiens (ホモ・サピエンス, Homo sapiensu), as a species, is written ヒト (hito), rather than its kanji 人.

Katakana are also often (but not always) used for transcription of Japanese company names. For example, Suzuki is written スズキ, and Toyota is written トヨタ. Katakana are also used for emphasis, especially on signs, advertisements, and hoardings (i.e., billboards). For example, it is common to see ココ koko ("here"), ゴミ gomi ("trash"), or メガネ megane ("glasses"). Words the writer wishes to emphasize in a sentence are also sometimes written in katakana, mirroring the European usage of italics.^[4]

Pre-World War II official documents mix katakana and kanji in the same way that hiragana and kanji are mixed in modern Japanese texts, that is, katakana were used for okurigana and particles such as wa or o.

Katakana were also used for telegrams in Japan before 1988, and for computer systems – before the introduction of multibyte characters – in the 1980s. Most computers in that era used katakana instead of kanji or hiragana for output.

Although words borrowed from ancient Chinese are usually written in kanji, loanwords from modern Chinese dialects which are borrowed directly use katakana instead.

Examples of modern Chinese loanwords in Japanese
Japanese	Rōmaji	Meaning	Chinese	Romanization	Source language
マージャン	mājan	mahjong	麻將	májiàng	Mandarin
ウーロン茶	ūroncha	Oolong tea	烏龍茶	wūlóngchá
チャーハン	chāhan	fried rice	炒飯	chǎofàn
チャーシュー	chāshū	barbecued pork	叉焼	cha siu	Cantonese
シューマイ	shūmai	shumai	焼賣	siu maai	Cantonese

The very common Chinese loanword rāmen, written in katakana as ラーメン , is rarely written with its kanji (拉麺).

There are rare instances where the opposite has occurred, with kanji forms created from words originally written in katakana. An example of this is コーヒー kōhī, ("coffee"), which can be alternatively written as 珈琲. This kanji usage is occasionally employed by coffee manufacturers or coffee shops for novelty.

Katakana are used to indicate the on'yomi (Chinese-derived readings) of a kanji in a kanji dictionary. For instance, the kanji 人 has a Japanese pronunciation, written in hiragana as ひと hito (person), as well as a Chinese derived pronunciation, written in katakana as ジン jin (used to denote groups of people). Katakana are sometimes used instead of hiragana as furigana to give the pronunciation of a word written in Roman characters, or for a foreign word, which is written as kanji for the meaning, but intended to be pronounced as the original.

In this travel warning, the kanji for "fog" (霧) has been written in katakana (キリ) to make it more immediately readable

Katakana are also sometimes used to indicate words being spoken in a foreign or otherwise unusual accent. For example, in a manga, the speech of a foreign character or a robot may be represented by コンニチワ konnichiwa ("hello") instead of the more typical hiragana こんにちは. Some Japanese personal names are written in katakana. This was more common in the past, hence elderly women often have katakana names. This was particularly common among women in the Meiji and Taishō periods, when many poor, illiterate parents were unwilling to pay a scholar to give their daughters names in kanji.^[6]

Words with difficult-to-read kanji are sometimes instead written in katakana (hiragana is also used for this purpose). This phenomenon is often seen with medical terminology. For example, in the word 皮膚科 hifuka ("dermatology"), the second kanji, 膚, is considered difficult to read, and thus the word hifuka is commonly written 皮フ科 or ヒフ科, mixing kanji and katakana. Similarly, the difficult-to-read kanji such as 癌 gan ("cancer") are often written in katakana or hiragana.

Katakana is also used for traditional musical notations, as in the Tozan-ryū of shakuhachi, and in sankyoku ensembles with koto, shamisen and shakuhachi.

Some instructors for Japanese as a foreign language "introduce katakana after the students have learned to read and write sentences in hiragana without difficulty and know the rules."^[7] Most students who have learned hiragana "do not have great difficulty in memorizing" katakana as well.^[8] Other instructors introduce the katakana first, because these are used with loanwords. This gives students a chance to practice reading and writing kana with meaningful words. This was the approach taken by the influential American linguistics scholar Eleanor Harz Jorden in Japanese: The Written Language (parallel to Japanese: The Spoken Language).^[9]

Ainu

Main article: Ainu language § Writing

Katakana is commonly used to write the Ainu language by Japanese linguists. In Ainu language katakana usage, the consonant that comes at the end of a syllable is represented by a small version of a katakana that corresponds to that final consonant and with an arbitrary vowel. For instance "up" is represented by ウㇷ゚ (ウプ [u followed by small pu]). Ainu also uses three handakuten modified katakana, セ゚ ([tse]), and ツ゚ or ト゚ ([tu̜]). In Unicode, the Katakana Phonetic Extensions block (U+31F0–U+31FF) exists for Ainu language support. These characters are used for the Ainu language only.

Taiwanese

Main article: Taiwanese kana

Taiwanese kana (タイヲァヌギイカアビェン) is a katakana-based writing system once used to write Holo Taiwanese, when Taiwan was under Japanese control. It functioned as a phonetic guide for Chinese characters, much like furigana in Japanese or Zhuyin fuhao in Chinese. There were similar systems for other languages in Taiwan as well, including Hakka and Formosan languages.

Unlike Japanese or Ainu, Taiwanese kana are used similarly to the Zhùyīn fúhào characters, with kana serving as initials, vowel medials and consonant finals, marked with tonal marks. A dot below the initial kana represented aspirated consonants, and チ, ツ, サ, セ, ソ, ウ and オ with a superpositional bar represented sounds found only in Taiwanese.

Okinawan

Main article: Okinawan scripts

Katakana is used as a phonetic guide for the Okinawan language, unlike the various other systems to represent Okinawan, which use hiragana with extensions. The system was devised by the Okinawa Center of Language Study of the University of the Ryukyus. It uses many extensions and yōon to show the many non-Japanese sounds of Okinawan.

Table of katakana

For modern digraph additions that are used mainly to transcribe other languages, see Transcription into Japanese.

This is a table of katakana together with their Hepburn romanization and rough IPA transcription for their use in Japanese. Katakana with dakuten or handakuten follow the gojūon kana without them.

Characters shi シ and tsu ツ, and so ソ and n(g) ン, look very similar in print except for the slant and stroke shape. These differences in slant and shape are more prominent when written with an ink brush.

Grey background indicates obsolete characters.

Katakana syllabograms
	Monographs (gojūon)					Digraphs (yōon)
	a	i	u	e	o	ya	yu	yo
∅	ア a [a]	イ i [i]	ウ u [u͍]	エ e [e]	オ o [o]
K	カ ka [ka]	キ ki [ki]	ク ku [ku͍]	ケ ke [ke]	コ ko [ko]	キャ kya [kʲa]	キュ kyu [kʲu͍]	キョ kyo [kʲo]
S	サ sa [sa]	シ shi [ɕi]	ス su [su͍]	セ se [se]	ソ so [so]	シャ sha [ɕa]	シュ shu [ɕu͍]	ショ sho [ɕo]
T	タ ta [ta]	チ chi [t͡ɕi]	ツ tsu [t͡su͍]	テ te [te]	ト to [to]	チャ cha [t͡ɕa]	チュ chu [t͡ɕu͍]	チョ cho [t͡ɕo]
N	ナ na [na]	ニ ni [nʲi]	ヌ nu [nu͍]	ネ ne [ne]	ノ no [no]	ニャ nya [ɲa]	ニュ nyu [ɲu͍]	ニョ nyo [ɲo]
H	ハ ha [ha]	ヒ hi [çi]	フ fu [ɸu͍]	ヘ he [he]	ホ ho [ho]	ヒャ hya [ça]	ヒュ hyu [çu͍]	ヒョ hyo [ço]
M	マ ma [ma]	ミ mi [mi]	ム mu [mu͍]	メ me [me]	モ mo [mo]	ミャ mya [mʲa]	ミュ myu [mʲu͍]	ミョ myo [mʲo]
Y	ヤ ya [ja]	^{[n 1]}	ユ yu [ju͍]	^{[n 1]}	ヨ yo [jo]
R	ラ ra [ɽa]	リ ri [ɽi]	ル ru [ɽu͍]	レ re [ɽe]	ロ ro [ɽo]	リャ rya [ɽʲa]	リュ ryu [ɽʲu͍]	リョ ryo [ɽʲo]
W	ワ wa [wa]	ヰ wi [i]^{[n 2]}	^{[n 1]}	ヱ we [e]^{[n 2]}	ヲ wo [o]^{[n 2]}

	Final nasal monograph			Functional graphemes
	ン n [n] [m] [ŋ] before stop consonants; n[ɴ] [ũ͍][ĩ] elsewhere			ッ (before geminate consonant)		ー (after long vowel)	ヽ (reduplicates and unvoices syllable)	ヾ (reduplicates and voices syllable)

	Monographs with diacritics: gojūon with (han)dakuten					Digraphs with diacritics: yōon with (han)dakuten
	a	i	u	e	o	ya	yu	yo
G	ガ ga [ɡa]	ギ gi [ɡi]	グ gu [ɡu͍]	ゲ ge [ɡe]	ゴ go [ɡo]	ギャ gya [ɡʲa]	ギュ gyu [ɡʲu͍]	ギョ gyo [ɡʲo]
Z	ザ za [za]	ジ ji [d͡ʑi]	ズ zu [zu͍]	ゼ ze [ze]	ゾ zo [zo]	ジャ ja [d͡ʑa]	ジュ ju [d͡ʑu͍]	ジョ jo [d͡ʑo]
D	ダ da [da]	ヂ ji [d͡ʑi]^{[n 3]}	ヅ zu [zu͍]^{[n 3]}	デ de [de]	ド do [do]	ヂャ ja [d͡ʑa]^{[n 3]}	ヂュ ju [d͡ʑu͍]^{[n 3]}	ヂョ jo [d͡ʑo]^{[n 3]}
B	バ ba [ba]	ビ bi [bi]	ブ bu [bu͍]	ベ be [be]	ボ bo [bo]	ビャ bya [bʲa]	ビュ byu [bʲu͍]	ビョ byo [bʲo]
P	パ pa [pa]	ピ pi [pi]	プ pu [pu͍]	ペ pe [pe]	ポ po [po]	ピャ pya [pʲa]	ピュ pyu [pʲu͍]	ピョ pyo [pʲo]

Notes

1 2 3
1 2 3
1 2 3 4 5

History

Katakana was developed in the 9th century (during the early Heian period) by Buddhist monks by taking parts of man'yōgana characters as a form of shorthand, hence this kana is so-called kata (片^?, ‘partial, fragmented’) .

For example, ka (カ) comes from the left side of ka (加^?, literally ‘increase’, but the original meaning is no longer applicable to kana) . The adjacent table shows the origins of each katakana: the red markings of the original Chinese character (used as man'yōgana) eventually became each corresponding symbol.^[10]

Early on, katakana was almost exclusively used by men for official text and text imported from China.^[11]

Recent findings by Yoshinori Kobayashi, professor of Japanese at Tokushima Bunri University, suggest the possibility that the katakana-like annotations used in reading guide marks (乎古止点 / ヲコト点, okototen) may have originated in 8th-century Korea – possibly Silla – and then introduced to Japan through Buddhist texts.^[12]^[13]

Stroke order

The following table shows the method for writing each katakana character. It is arranged in the traditional way, beginning top right and reading columns down. The numbers and arrows indicate the stroke order and direction respectively.

Computer encoding

In addition to fonts intended for Japanese text and Unicode catch-all fonts (like Arial Unicode MS), many fonts intended for Chinese (such as MS Song) and Korean (such as Batang) also include katakana.

Half-width kana

Main article: Half-width kana

In addition to the usual full-width (全角, zenkaku) display forms of characters, katakana has a second form, half-width (半角, hankaku) (there are no half-width hiragana or kanji). The half-width forms were originally associated with the JIS X 0201 encoding. Although their display form is not specified in the standard, in practice they were designed to fit into the same rectangle of pixels as Roman letters to enable easy implementation on the computer equipment of the day. This space is narrower than the square space traditionally occupied by Japanese characters, hence the name "half-width". In this scheme, diacritics (dakuten and handakuten) are separate characters. When originally devised, the half-width katakana were represented by a single byte each, as in JIS X 0201, again in line with the capabilities of contemporary computer technology.

In the late 1970s, two-byte character sets such as JIS X 0208 were introduced to support the full range of Japanese characters, including katakana, hiragana and kanji. Their display forms were designed to fit into an approximately square array of pixels, hence the name "full-width". For backwards compatibility, separate support for half-width katakana has continued to be available in modern multi-byte encoding schemes such as Unicode, by having two separate blocks of characters – one displayed as usual (full-width) katakana, the other displayed as half-width katakana.

Although often said to be obsolete, in fact the half-width katakana are still used in many systems and encodings. For example, the titles of mini discs can only be entered in ASCII or half-width katakana, and half-width katakana are commonly used in computerized cash register displays, on shop receipts, and Japanese digital television and DVD subtitles. Several popular Japanese encodings such as EUC-JP, Unicode and Shift JIS have half-width katakana code as well as full-width. By contrast, ISO-2022-JP has no half-width katakana, and is mainly used over SMTP and NNTP.

Unicode

Main articles: Katakana (Unicode block), Halfwidth and Fullwidth Forms (Unicode block), Enclosed CJK Letters and Months (Unicode block), Katakana Phonetic Extensions (Unicode block), and Kana Supplement (Unicode block)

Katakana was added to the Unicode Standard in October, 1991 with the release of version 1.0.

The Unicode block for (full-width) katakana is U+30A0–U+30FF.

Encoded in this block along with the katakana are the nakaguro word-separation middle dot, the chōon vowel extender, the katakana iteration marks, and a ligature of コト sometimes used in vertical writing.

Katakana^[1] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+30Ax	゠	ァ	ア	ィ	イ	ゥ	ウ	ェ	エ	ォ	オ	カ	ガ	キ	ギ	ク
U+30Bx	グ	ケ	ゲ	コ	ゴ	サ	ザ	シ	ジ	ス	ズ	セ	ゼ	ソ	ゾ	タ
U+30Cx	ダ	チ	ヂ	ッ	ツ	ヅ	テ	デ	ト	ド	ナ	ニ	ヌ	ネ	ノ	ハ
U+30Dx	バ	パ	ヒ	ビ	ピ	フ	ブ	プ	ヘ	ベ	ペ	ホ	ボ	ポ	マ	ミ
U+30Ex	ム	メ	モ	ャ	ヤ	ュ	ユ	ョ	ヨ	ラ	リ	ル	レ	ロ	ヮ	ワ
U+30Fx	ヰ	ヱ	ヲ	ン	ヴ	ヵ	ヶ	ヷ	ヸ	ヹ	ヺ	・	ー	ヽ	ヾ	ヿ
Notes 1.^ As of Unicode version 9.0

Half-width equivalents to the usual full-width katakana also exist in Unicode. These are encoded within the Halfwidth and Fullwidth Forms block (U+FF00–U+FFEF) (which also includes full-width forms of Latin characters, for instance), starting at U+FF65 and ending at U+FF9F (characters U+FF61–U+FF64 are half-width punctuation marks). This block also includes the half-width dakuten and handakuten. The full-width versions of these characters are found in the Hiragana block.

Katakana subset of Halfwidth and Fullwidth Forms^[1] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
...	(U+FF00–U+FF64 omitted)
U+FF6x						･	ｦ	ｧ	ｨ	ｩ	ｪ	ｫ	ｬ	ｭ	ｮ	ｯ
U+FF7x	ｰ	ｱ	ｲ	ｳ	ｴ	ｵ	ｶ	ｷ	ｸ	ｹ	ｺ	ｻ	ｼ	ｽ	ｾ	ｿ
U+FF8x	ﾀ	ﾁ	ﾂ	ﾃ	ﾄ	ﾅ	ﾆ	ﾇ	ﾈ	ﾉ	ﾊ	ﾋ	ﾌ	ﾍ	ﾎ	ﾏ
U+FF9x	ﾐ	ﾑ	ﾒ	ﾓ	ﾔ	ﾕ	ﾖ	ﾗ	ﾘ	ﾙ	ﾚ	ﾛ	ﾜ	ﾝ	ﾞ	ﾟ
...	(U+FFA0–U+FFEF omitted)
Notes 1.^ As of Unicode version 9.0

Circled katakana are code points U+32D0–U+32FE in the Enclosed CJK Letters and Months block (U+3200–U+32FF). A circled ン (n) is not included.

Katakana subset of Enclosed CJK Letters and Months^[1]^[2] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
...	(U+3200–U+32CF omitted)
U+32Dx	㋐	㋑	㋒	㋓	㋔	㋕	㋖	㋗	㋘	㋙	㋚	㋛	㋜	㋝	㋞	㋟
U+32Ex	㋠	㋡	㋢	㋣	㋤	㋥	㋦	㋧	㋨	㋩	㋪	㋫	㋬	㋭	㋮	㋯
U+32Fx	㋰	㋱	㋲	㋳	㋴	㋵	㋶	㋷	㋸	㋹	㋺	㋻	㋼	㋽	㋾
Notes 1.^ As of Unicode version 9.0 2.^ Grey areas indicate non-assigned code points

Extensions to Katakana for phonetic transcription of Ainu and other languages were added to the Unicode standard in March 2002 with the release of version 3.2.

The Unicode block for Katakana Phonetic Extensions is U+31F0–U+31FF:

Katakana Phonetic Extensions^[1] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+31Fx	ㇰ	ㇱ	ㇲ	ㇳ	ㇴ	ㇵ	ㇶ	ㇷ	ㇸ	ㇹ	ㇺ	ㇻ	ㇼ	ㇽ	ㇾ	ㇿ
Notes 1.^ As of Unicode version 9.0

Historic and variant forms of Japanese kana characters were added to the Unicode standard in October 2010 with the release of version 6.0.

The Unicode block for Kana Supplement is U+1B000–U+1B0FF:

Kana Supplement^[1]^[2] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+1B00x	𛀀	𛀁
U+1B01x
...	(omitted; not used yet)
U+1B0Fx
Notes 1.^ As of Unicode version 9.0 2.^ Grey areas indicate non-assigned code points

Katakana in other Unicode blocks:

Dakuten and handakuten diacritics are located in the Hiragana block:
- U+3099 COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK (non-spacing dakuten): ゙
- U+309A COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK (non-spacing handakuten): ゚
- U+309B KATAKANA-HIRAGANA VOICED SOUND MARK (spacing dakuten): ゛
- U+309C KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK (spacing handakuten): ゜
Two katakana-based emoji are in the Enclosed Ideographic Supplement block:
- U+1F201 SQUARED KATAKANA KOKO ('here' sign): 🈁
- U+1F202 SQUARED KATAKANA SA ('service' sign): 🈂
A katakana-based Japanese TV symbol from the ARIB STD-B24 standard is in the Enclosed Ideographic Supplement block:
- U+1F213 SQUARED KATAKANA DE ('data broadcasting service linked with a main program' symbol): 🈓

Furthermore, as of Unicode 9.0, the following combinatory sequences have been explicitly named, despite having no precomposed symbols in the katakana block. Font designers may want to optimize the display of these composed glyphs. Some of them are mostly used for writing the Ainu language, the others are called bidakuon in Japanese. Other, arbitrary combinations with U+309A handakuten are also possible, of course.

Katakana named sequences Unicode Named Character Sequences Database
Sequence name	Codepoints		Glyph
KATAKANA LETTER BIDAKUON NGA	U+30AB	U+309A	カ゚
KATAKANA LETTER BIDAKUON NGI	U+30AD	U+309A	キ゚
KATAKANA LETTER BIDAKUON NGU	U+30AF	U+309A	ク゚
KATAKANA LETTER BIDAKUON NGE	U+30B1	U+309A	ケ゚
KATAKANA LETTER BIDAKUON NGO	U+30B3	U+309A	コ゚
KATAKANA LETTER AINU CE	U+30BB	U+309A	セ゚
KATAKANA LETTER AINU TU	U+30C4	U+309A	ツ゚
KATAKANA LETTER AINU TO	U+30C8	U+309A	ト゚
KATAKANA LETTER AINU P	U+31F7	U+309A	ㇷ゚

References

↑ Thomas E. McAuley (2001) Language change in East Asia. Routledge. ISBN 0700713778. p. 90
↑ Roy Andrew Miller (1966) A Japanese Reader: Graded Lessons in the Modern Language, Rutland, Vermont: Charles E. Tuttle Company, Tokyo, Japan, p. 28, Lesson 7 : Katakana : a—no. "Side by side with hiragana, modern Japanese writing makes use of another complete set of similar symbols called the katakana."
↑ Miller, p. 28. "The katakana symbols, rather simpler, more angular and abrupt in their line than the hiragana..."
1 2 3 "The Japanese Writing System (2) Katakana", p. 29 in Yookoso! An Invitation to Contemporary Japanese. McGraw-Hill, 1993, ISBN 0070722935
↑ "Hiragana, Katakana & Kanji". Japanese Word Characters. Retrieved 15 October 2011.
↑ Tackett, Rachel. "Why old Japanese women have names in katakana". RocketNews24. Retrieved 19 September 2015.
↑ Mutsuko Endo Simon (1984) Section 3.3 "Katakana", p. 36 in A Practical Guide for Teachers of Elementary Japanese, Center for Japanese Studies, the University of Michigan. ISBN 0939512165
↑ Simon, p. 36
↑ Reading Japanese, Lesson 1. joyo96.org
↑ Japanese katakana. Omniglot.com
↑ Taku Sugimoto; James A. Levin (2000). "Global Literacies and the World-Wide Web". London: Routledge. p. 137. Missing or empty |url= (help); |access-date= requires |url= (help)
↑ Japan Times, "Katakana system may be Korean, professor says"
↑ Yoshinori Kobayashi, 日本のヲコト点の起源と古代韓国語の点吐との関係 ("Relationship between tento in Ancient Korean and the origin of Japan's okoto point)

External links

Wikimedia Commons has media related to Katakana.

Look up katakana in Wiktionary, the free dictionary.

Real Kana Practice katakana using different typefaces.
Katakana Unicode chart
Japanese, including "practice kana" links, at DMOZ
Learn Katakana with Audio Slideshow
KanaTeacher - Practice and learn Katakana online.
Japanese dictionary with Katakana, Hiragana and Kanji on-screen keyboards
Animated Katakana stroke orders with audio

Japanese language

Earlier forms

Dialects

Hokkaidō
Tōhoku
- Tsugaru
- Kesen
- Yamagata
Kantō
- Ibaraki
- Tokyo
Tōkai–Tōsan
- Nagaoka
- Nagoya
- Mikawa
- Mino
- Hida
Hokuriku
Kansai
Chūgoku
Umpaku
Shikoku
- Iyo
- Tosa
- Sanuki
Hōnichi
- Ōita
Hichiku
- Hakata
- Saga
- Tsushima
Satsugū
Okinawan Japanese

Japonic languages

Hachijō
Ryukyuan
- Amami Ōshima
- Kikai
- Kunigami
- Miyako
- Okinawan
- Okinoerabu
- Tokunoshima
- Yaeyama
- Yonaguni
- Yoron

Writing system

Logograms	Kanbun Kanji by concept by stroke count Kanji radicals by frequency by stroke count

Kana	Hiragana Katakana Furigana Okurigana Gojūon Man'yōgana Hentaigana Sogana

Orthography	Braille Kanji Punctuation Orthographic issues Kanazukai Historical kana Modern kana Jōdai Tokushu Kanazukai Yotsugana Transcription into Japanese

Grammar and
vocabulary

Phonology

Transliteration

Literature

Types of writing systems

Overview	History of writing Grapheme

Lists	Writing systems undeciphered inventors constructed Languages by writing system / by first written accounts

Types

Abjads

Numerals Aramaic Hatran Arabic Pitman shorthand Hebrew Ashuri Cursive Rashi Solitreo Libyco-Berber Manichaean Nabataean Old North Arabian Pahlavi Paleo-Hebrew Pegon Phoenician Proto-Sinaitic Psalter Punic Samaritan South Arabian Zabur Musnad Sogdian Syriac ʾEsṭrangēlā Serṭā Maḏnḥāyā Teeline Shorthand Ugaritic

Abugidas

Brahmic

Northern	Assamese-Bengali Bhaikshuki Bhujinmol Brāhmī Devanāgarī Dogra Gujarati Gupta Gurmukhī Kaithi Kalinga Khojki Khotanese Khudawadi Laṇḍā Lepcha Limbu Mahajani Marchen Marchung Meitei Mayek Modi Multani Nāgarī Nandinagari Odia 'Phags-pa Newar Pungs-chen Pungs-chung Ranjana Sharada Saurashtra Siddhaṃ Soyombo Sylheti Nagari Takri Tibetan Uchen Umê Tirhuta Tocharian Zanabazar Square

Southern	Ahom Balinese Batak Baybayin Bhattiprolu Buhid Burmese Chakma Cham Grantha Goykanadi Hanunó'o Javanese Kadamba Kannada Kawi Khmer Kulitan Lanna Lao Leke Lontara Malayalam Maldivian Dhives Akuru Eveyla Akuru Mon Old Sundanese Pallava Pyu Rejang Rencong Sinhala Sundanese Tagbanwa Tai Le Tai Tham Tai Viet Tamil Telugu Thai Tigalari Vatteluttu Kolezhuthu Malayanma Visayan

Others

Alphabets

Linear	Abkhaz Adlam Armenian Avestan Avoiuli Bassa Vah Borama Carian Caucasian Albanian Coorgi–Cox alphabet Coptic Cyrillic Deseret Duployan shorthand Chinook writing Early Cyrillic Eclectic shorthand Elbasan Etruscan Evenki Fox II Fraser Gabelsberger shorthand Garay Georgian Asomtavruli Nuskhuri Mkhedruli Glagolitic Gothic Gregg shorthand Greek Greco-Iberian alphabet Hangul IPA Kaddare Latin Beneventan Blackletter Carolingian minuscule Fraktur Gaelic Insular Kurrent Merovingian Sigla Sütterlin Tironian notes Visigothic Luo Lycian Lydian Manchu Mandaic Molodtsov Mongolian Mru Neo-Tifinagh New Tai Lue N'Ko Ogham Oirat Ol Chiki Old Hungarian Old Italic Old Permic Orkhon Old Uyghur Osage Osmanya Pau Cin Hau Rohingya Hanifi Runic Anglo-Saxon Cipher Dalecarlian Elder Futhark Younger Futhark Gothic Marcomannic Medieval Staveless Sidetic Shavian Somali Stokoe notation Tifinagh Vagindra Visible Speech Vithkuqi Zaghawa

Non-linear	Braille Maritime flags Morse code New York Point Semaphore line Flag semaphore Moon type

Ideograms/Pictograms

Adinkra Aztec Blissymbol Dongba Ersu Shaba Emoji IConji Isotype Kaidā Míkmaq Mixtec New Epoch Notation Painting Nsibidi Ojibwe Hieroglyphs Siglas poveiras SignWriting Testerian Yerkish Zapotec

Logograms

Chinese family of scripts

Chinese Characters	Simplified Traditional Oracle bone script Bronze Script Seal Script large small bird-worm Hanja Idu Kanji Chữ nôm Zhuang

Chinese-influenced	Jurchen Khitan large script Sui Tangut

Cuneiform

Other logo-syllabic

Logo-consonantal

Numerals

Semi-syllabaries

Full	Celtiberian Northeastern Iberian Southeastern Iberian Khom

Redundant	Espanca Pahawh Hmong Khitan small script Southwest Paleohispanic Zhùyīn fúhào

Syllabaries

Afaka Bamum Bété Byblos Cherokee Cypriot Cypro-Minoan Eskayan Geba Great Lakes Algonquian syllabics Iban Japanese Hiragana Katakana Man'yōgana Hentaigana Sogana Jindai moji Kikakui Kpelle Linear B Linear Elamite Lisu Loma Nüshu Nwagu Aneke script Old Persian Cuneiform Vai Woleai Yi (Modern) Yugtun

Braille ⠃⠗⠁⠊⠇⠇⠑

Braille cell

Braille scripts

French-ordered scripts (see for more)	Albanian Amharic Arabic Armenian Azerbaijani Belarusian Bharati Devanagari (Hindi / Marathi / Nepali) Bengali Punjabi Sinhalese Tamil Urdu etc. Bulgarian Burmese Cambodian Cantonese Catalan Chinese (Mandarin, mainland) Czech Dutch Dzongkha (Bhutanese) English (Unified English) Esperanto Estonian Faroese French Georgian German Ghanaian Greek Guarani Hawaiian Hebrew Hungarian Icelandic Inuktitut (reassigned vowels) Iñupiaq IPA Irish Italian Kazakh Kyrgyz Latvian Lithuanian Maltese Mongolian Māori Nigerian Northern Sami Persian Philippine Polish Portuguese Romanian Russian Samoan Scandinavian Slovak South African Spanish Tatar Taiwanese Mandarin (largely reassigned) Thai & Lao (Japanese vowels) Tibetan Turkish Ukrainian Vietnamese Welsh Yugoslav

Reordered scripts	Algerian Braille (obsolete)

Frequency-based scripts	American Braille (obsolete)

Independent scripts	Japanese Korean Two-Cell Chinese

Eight-dot scripts	Luxembourgish Kanji Gardner–Salinas braille codes (GS8)

Symbols in braille

Braille technology

Persons

Organisations

Other tactile alphabets

Katakana

Writing system

Script

Japanese

Syllabary and orthography

Usage

Ainu

Taiwanese

Okinawan

Table of katakana

History

Stroke order

Computer encoding

Half-width kana

Unicode

See also

References

External links