Unicode Tutorials - Herong's Tutorial Examples - v5.32, by Herong Yang
Character.getNumericValue() - Numeric Value of Code Point
This section provides tutorial example on how to test 'Character' class getNumericValue() static methods to obtain the numeric value associated with a given Unicode character.
One interesting static method offered in the "Character" class is the "getNumericValue(int codePoint)" method, which returns a numeric value represented by the given Unicode character.
Here is a tutorial example on how to use "getNumericValue()" and other related methods:
/* UnicodeCharacterNumeric.java
* Copyright (c) 2019 HerongYang.com. All Rights Reserved.
*/
import java.io.*;
import java.nio.*;
import java.nio.charset.*;
class UnicodeCharacterNumeric {
static int[] unicodeList = {0x37, 0x0667, 0x2166, 0x3286, 0x4E03,
0x1F108};
public static void main(String[] arg) {
try {
for (int i=0; i<unicodeList.length; i++) {
// Starting with the code point value
int codePoint = unicodeList[i];
// Dumping data in HEX numbers
System.out.print("\n");
System.out.print("\n Code point: "
+Integer.toHexString(codePoint).toUpperCase());
// Getting Unicode character numeric values
System.out.print("\n isDefined(): "
+Character.isDefined(codePoint));
System.out.print("\n getName(): "
+Character.getName(codePoint));
System.out.print("\n getNumericValue(): "
+Character.getNumericValue(codePoint));
// Getting Unicode character type
int intType = Character.getType(codePoint);
System.out.print("\n getType(): "
+intType);
System.out.print("\n is DECIMAL_DIGIT_NUMBER: "
+(intType==Character.DECIMAL_DIGIT_NUMBER));
System.out.print("\n isDigit(): "
+Character.isDigit(codePoint));
}
} catch (Exception e) {
System.out.print("\n"+e.toString());
}
}
}
Compile and run it with Java 11:
C:\herong>javac UnicodeCharacterNumeric.java
C:\herong>java UnicodeCharacterNumeric
Code point: 37
getName(): DIGIT SEVEN
getNumericValue(): 7
getType(): 9
is DECIMAL_DIGIT_NUMBER: true
isDigit(): true
Code point: 667
getName(): ARABIC-INDIC DIGIT SEVEN
getNumericValue(): 7
getType(): 9
is DECIMAL_DIGIT_NUMBER: true
isDigit(): true
Code point: 2166
getName(): ROMAN NUMERAL SEVEN
getNumericValue(): 7
getType(): 10
is DECIMAL_DIGIT_NUMBER: false
isDigit(): false
Code point: 3286
getName(): CIRCLED IDEOGRAPH SEVEN
getNumericValue(): 7
getType(): 11
is DECIMAL_DIGIT_NUMBER: false
isDigit(): false
Code point: 4E03
getName(): CJK UNIFIED IDEOGRAPHS 4E03
getNumericValue(): -1
getType(): 5
is DECIMAL_DIGIT_NUMBER: false
isDigit(): false
Code point: 1F108
getName(): DIGIT SEVEN COMMA
getNumericValue(): 7
getType(): 11
is DECIMAL_DIGIT_NUMBER: false
isDigit(): false
Some interesting notes on the output:
Table of Contents
ASCII Character Set and Encoding
GB2312 Character Set and Encoding
GB18030 Character Set and Encoding
JIS X0208 Character Set and Encodings
UTF-8 (Unicode Transformation Format - 8-Bit)
UTF-16, UTF-16BE and UTF-16LE Encodings
UTF-32, UTF-32BE and UTF-32LE Encodings
Python Language and Unicode Characters
►Java Language and Unicode Characters
Unicode Versions Supported in Java History
'int' and 'String' - Basic Data Types for Unicode
"Character" Class with Unicode Utility Methods
Character.toChars() - "char" Sequence of Code Point
►Character.getNumericValue() - Numeric Value of Code Point
"String" Class with Unicode Utility Methods
String.length() Is Not Number of Characters
String.toCharArray() Returns the UTF-16BE Sequence
String Literals and Source Code Encoding
Encoding Conversion Programs for Encoded Text Files
Using Notepad as a Unicode Text Editor
Using Microsoft Word as a Unicode Text Editor