KEMBAR78
Simple Java | PDF | Teaching Methods & Materials | Computers
0% found this document useful (0 votes)
136 views219 pages

Simple Java

The Java "HelloWorld" program demonstrates several fundamental concepts in Java: 1. All Java programs are built from classes, with methods and fields contained within classes. This reflects Java's object-oriented design. 2. The "main" method serves as the program entry point and is static, so it does not require object instantiation to be called. 3. Java code is compiled to bytecode, which is executed by the JVM. Bytecode can be examined using tools like "javap" to see low-level instructions and constant pool references.

Uploaded by

ednilson_campos
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
136 views219 pages

Simple Java

The Java "HelloWorld" program demonstrates several fundamental concepts in Java: 1. All Java programs are built from classes, with methods and fields contained within classes. This reflects Java's object-oriented design. 2. The "main" method serves as the program entry point and is static, so it does not require object instantiation to be called. 3. Java code is compiled to bytecode, which is executed by the JVM. Bytecode can be examined using tools like "javap" to see low-level instructions and constant pool references.

Uploaded by

ednilson_campos
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 219

1

Simple Java
X Wang

Version 1.0 Published In The Wild

CONTENTS

i ii 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27

freface

java questions 5 what can we learn from java helloworld ? 6 how to build your own java library ? 13 when and how a java class is loaded and initialized ? 16 how static type checking works in java ? 20 java double example 23 diagram to show java string s immutability 24 the substring () method in jdk 6 and jdk 7 27 why string is immutable in java ? 31 string is passed by reference in java 34 start from length & length () in java 38 what exactly is null in java ? 41 comparable vs comparator in java 43 java equals () and hashcode () contract 48 overriding and overloading in java with examples 52 what is instance initializer in java ? 55 why field can t be overridden ? 58 4 types of java inner classes 60 what is inner interface in java ? 63 constructors of sub and super classes in java ? 67 java access level for members : public , protected , private 71 when to use private constructors in java ? 72 2 examples to show how java exception handling works 73 diagram of exception hierarchy 75 java read a file line by line - how many ways ? 78 java write to a file - code example 81 fileoutputstream vs . filewriter 84 should . close () be put in finally block or not ? 86

Contents

28 how to use java properties file ? 88 29 monitors - the basic idea of java synchronization 90 30 the interface and class hierarchy diagram of java collec tions 93 31 a simple treeset example 97 32 deep understanding of arrays . sort () 100 33 arraylist vs . linkedlist vs . vector 106 34 hashset vs . treeset vs . linkedhashset 112 35 hashmap vs . treemap vs . hashtable vs . linkedhashmap 118 36 efficient counter in java 126 37 frequently used methods of java hashmap 133 38 java type erasure mechanism 136 39 why do we need generic types in java ? 139 40 set vs . set <?> 143 41 how to convert array to arraylist in java ? 146 42 yet another java passes by reference or by value ? 149 43 java reflection tutorial 152 44 how to design a java framework ? - a simple example 160 45 why do we need java web frameworks like struts 2 ? 163 46 jvm run - time data areas 166 47 how does java handle aliasing ? 169 48 what does a java array look like in memory ? 172 49 the introduction of memory leaks 175 50 what is servlet container ? 178 51 what is aspect - oriented programming ? 182 52 library vs . framework ? 185 53 java and computer science courses 187 54 how java compiler generate code for overloaded and over ridden methods ? 189 55 top 10 methods for java arrays 191 56 top 10 questions of java strings 194 57 top 10 questions for java regular expression 197 58 top 10 questions about java exceptions 203 59 top 10 questions about java collections 207 60 top 9 questions about java maps 213

Part I F R E FA C E

Contents

The creation of Program Creek was inspired by the belief that every developer should have a blog. The word creek" was picked because of the beautiful scenes of Arkansas, a central state of America where I studied and worked for three years. The blog has been used as my notes to track what I have done and my learning experience of programming. Unexpectedly, millions of people have visited Program Creek since I wrote the rst post ve years ago. The large amount of trafc indicates a more important fact than that my writing skills are good(which is not the case): Developers like to read simple learning materials and quick solutions. By analyzing the trafc data of blog posts, I learned which ways of explaining things developers prefer. Many people believe in that diagrams are easier to understand things. While visualization is a good way to understand and remember things, there are other ways to enhance a learning experience. One is by comparing different but related concepts. For example, by comparing ArrayList with LinkedList, one can better understand them and use them properly. Another way is to look at the frequently asked questions. For example, by reading Top 10 methods for Java arrays, one can quickly remember some useful methods and use the methods used by the majority. There are numerous blogs, books and tutorials available to learn Java. A lot of them receive large trafc by developers with a large variety of different interests. Program Creek is just one of them. This collection might be useful for two kinds of people: rst, the regular visitors of Program Creek will nd a convenient collection of most popular posts; second, developers who want to read something that is more than words. Repetition is the key of learning any programming language. Hopefully, this contributes another non-boring repetition for you. Since this collection is 100% from the blog, there is no good reason to keep two versions of it. The PDF book was converted automatically from the original blog posts. Every title in the book is linked back to the original blog. When the title is clicked, it opens the original post in your browser. If you nd any problems, please go to the post and leave your comment there. As it is an automatic conversion, there may be some formatting problems. Please leave a comment if you nd one. You can also contact me by email: contact@programcreek.com. Thank you for downloading this PDF! Chrismas Day 2013

Part II J AVA Q U E S T I O N S

1
W H AT C A N W E L E A R N F R O M J AVA H E L L O W O R L D ? This is the program every Java programmer knows. It is simple, but a simple start can lead to deep understanding of more complex stuff. In this post I will explore what can be learned from this simple program. Please leave your comments if hello world means more to you. HelloWorld.java
public c l a s s HelloWorld { / @param a r g s / public s t a t i c void main ( S t r i n g [ ] a r g s ) { / / TODO Auto g e n e r a t e d method s t u b System . out . p r i n t l n ( " Hello World " ) ; } }

1.1

why everything starts with a class ?

Java programs are built from classes, every method and eld has to be in a class. This is due to its object-oriented feature: everything is an object which is an instance of a class. Object-oriented programming languages have a lot of advantages over functional programming languages such as better modularity, extensibility, etc.

1.2. WHY THERE IS ALWAYS A MAIN METHOD?

1.2

why there is always a main method ?

The main method is the program entrance and it is static. static means that the method is part of its class, not part of objects. Why is that? Why dont we put a non-static method as program entrance? If a method is not static, then an object needs to be created rst to use the method. Because the method has to be invoked on an object. For an entrance, this is not realistic. Therefore, program entrance method is static. The parameter String[] args indicates that an array of strings can be sent to the program to help with program initialization.

1.3

bytecode of helloworld

To execute the program, Java le is rst compiled to java byte code stored in the .class le. What does the byte code look like? The byte code itself is not readable. If we use a hex editor, it looks like the following:

1.3. BYTECODE OF HELLOWORLD

We can see a lot of opcode(e.g. CA, 4C, etc) in the bytecode above, each of them has a corresponding mnemonic code (e.g., aload_0 in the example below). The opcode is not readable, but we can use javap to see the mnemonic form of a .class le. javap -c prints out disassembled code for each method in the class. Disassembled code means the instructions that comprise the Java bytecodes.
j a v a p c l a s s p a t h .

c HelloWorld

Compiled from " HelloWorld . j a v a " public c l a s s HelloWorld extends j a v a . lang . O b j e c t { public HelloWorld ( ) ; Code : 0: aload_ 0 1: invokespecial # 1 ; / / Method j a v a / l a n g / O b j e c t ." < i n i t > " : ( ) V 4: return public s t a t i c void main ( j a v a . lang . S t r i n g [ ] ) ; Code : 0: getstatic # 2 ; / / F i e l d j a v a / l a n g / System . o u t : L j a v a / i o / PrintStream ; 3: ldc # 3 ; / / S t r i n g H e l l o World

1.3. BYTECODE OF HELLOWORLD

10

5: 8: }

invokevirtual # 4 ; / / Method j a v a / i o / P r i n t S t r e a m . p r i n t l n : ( Ljava / lang / String ; )V return

The code above contains two methods: one is the default constructor, which is inferred by compiler; the other is main method. Below each method, there are a sequence of instructions, such as aload_0, invokespecial #1, etc. What each instruction does can be looked up in Java bytecode instruction listings. For instance, aload_0 loads a reference onto the stack from local variable 0, getstatic fetches a static eld value of a class. Notice the #2 after getstatic instruction points to the run-time constant pool. Constant pool is one of the JVM run-time data areas. This leads us to take a look at the constant pool, which can be done by using javap -verbose command. In addition, each instruction starts with a number, such as 0, 1, 4, etc. In the .class le, each method has a corresponding bytecode array. These numbers correspond to the index of the array where each opcode and its parameters are stored. Each opcode is 1 byte long and instructions can have 0 or multiple parameters. Thats why these numbers are not consecutive. Now we can use javap -verbose to take a further look of the class.
j a v a p c l a s s p a t h . verbose HelloWorld Compiled from " HelloWorld . j a v a " public c l a s s HelloWorld extends j a v a . lang . O b j e c t S o u r c e F i l e : " HelloWorld . j a v a " minor v e r s i o n : 0 major v e r s i o n : 50 Constant pool : c o n s t # 1 = Method #6.#15; / / j a v a / l a n g / O b j e c t ." < i n i t > " : ( ) V c o n s t #2 = F i e l d #16.#17; // j a v a / l a n g / System . o u t : Ljava / io / PrintStream ; c o n s t #3 = S t r i n g #18; // H e l l o World c o n s t # 4 = Method #19.#20; // java / io / PrintStream . p r i n t l n : ( Ljava / lang / String ; )V c o n s t #5 = c l a s s #21; // HelloWorld c o n s t #6 = c l a s s #22; // java / lang / Object c o n s t # 7 = Asciz < i n i t >; c o n s t # 8 = Asciz ( )V; c o n s t # 9 = Asciz Code ;

1.3. BYTECODE OF HELLOWORLD

11

const const const const const const const const const const const const const const const const const const const

# 10 # 11 # 12 # 13 # 14 # 15 # 16 # 17 # 18 # 19 # 20 # 21 # 22 # 23 # 24 # 25 # 26 # 27 # 28

= = = = = = = = = = = = = = = = = = =

Asciz Asciz Asciz Asciz Asciz NameAndType class NameAndType Asciz class NameAndType Asciz Asciz Asciz Asciz Asciz Asciz Asciz Asciz

LineNumberTable ; main ; ( [ Ljava/lang/ S t r i n g ; ) V ; SourceFile ; HelloWorld . j a v a ; # 7 : # 8 ; / / "< i n i t > " : ( ) V #23; // j a v a / l a n g / System #24:#25; / / out : Ljava / i o / PrintStream ; Hello World ; #26; // java / io / PrintStream #27:#28; / / p r i n t l n : ( Ljava / lang / String ; )V HelloWorld ; j a v a /lang/ O b j e c t ; j a v a /lang/System ; out ; Ljava/ i o / P r i n t S t r e a m ; ; java/io/PrintStream ; println ; ( Ljava/lang/ S t r i n g ; ) V ;

{ public HelloWorld ( ) ; Code : S t a c k = 1 , L o c a l s = 1 , A r g s _ s i z e =1 0: aload_ 0 1: invokespecial # 1 ; / / Method j a v a / l a n g / O b j e c t ." < i n i t > " : ( ) V 4: return LineNumberTable : line 2: 0

public s t a t i c void main ( j a v a . lang . S t r i n g [ ] ) ; Code : S t a c k = 2 , L o c a l s = 1 , A r g s _ s i z e =1 0: getstatic # 2 ; / / F i e l d j a v a / l a n g / System . o u t : L j a v a / i o / PrintStream ; 3: ldc # 3 ; / / S t r i n g H e l l o World 5: invokevirtual # 4 ; / / Method j a v a / i o / P r i n t S t r e a m . p r i n t l n : ( Ljava / lang / String ; )V 8: return LineNumberTable : line 9: 0 line 10: 8

1.4. HOW IS IT EXECUTED IN JVM?

12

From JVM specication: The run-time constant pool serves a function similar to that of a symbol table for a conventional programming language, although it contains a wider range of data than a typical symbol table. The #1 in the invokespecial #1 instruction points to #1 constant in the constant pool. The constant is Method #6.#15;. From the number, we can get the nal constant recursively. LineNumberTable provides information to a debugger to indicate which line of Java source code corresponds to which byte code instruction. For example, line 9 in the Java source code corresponds to byte code 0 in the main method and line 10 corresponds to byte code 8. If you want to know more about bytecode, you can create and compile a more complicated class to take a look. HelloWorld is really a start point of doing this.

1.4

how is it executed in jvm ?

Now the question is how JVM loads the class and invoke the main method? Before the main method is executed, JVM needs to 1) load, 2) link, and 3) initialize the class. 1) Loading brings binary form for a class/interface into JVM. 2) Linking incorporates the binary type data into the run-time state of JVM. Linking consists of 3 steps: verication, preparation, and optional resolution. Verication ensures the class/interface is structurally correct; preparation involves allocating memory needed by the class/interface; resolution resolves symbolic references. And nally 3) initialization assigns the class variables with proper initial values.

1.4. HOW IS IT EXECUTED IN JVM?

13

This loading job is done by Java Classloaders. When the JVM is started, three class loaders are used: Bootstrap class loader: loads the core Java libraries located in the /jre/lib directory. It is a part of core JVM, and is written in native code. Extensions class loader: loads the code in the extension directories(e.g., /jar/lib/ext). System class loader: loads code found on CLASSPATH. So HelloWorld class is loaded by system class loader. When the main method is executed, it will trigger loading, linking, and initialization of other dependent classes if they exist. Finally, the main() frame is pushed into the JVM stack, and program counter(PC) is set accordingly. PC then indicates to push println() frame to the JVM stack. When the main() method completes, it will popped up from the stack and execution is done.

2
H O W T O B U I L D Y O U R O W N J AVA L I B R A R Y ? Code reuse is one of the most important factors in software development. It is a VERY good idea to put frequently-used functions together and build a library for yourself. Whenever some method is used, just simply make a method invocation. For Java, its straightforward to manage such a library. Here a simple example in Eclipse. The library will contain only one add method for demo purpose. Step 1: Create a Java Project named as MyMath, and a simple add method under Simple class. Package structure is as follows:

Simple.java
public c l a s s Simple { public s t a t i c i n t add ( i n t a , i n t b ) { r e t u r n a+b ; } }

Step 2: Export as a .jar le. Right Click the project and select export, a window is show as follows.

14

15

Following the wizard, to get the .jar le. Step 3: Use the jar le. Right click the target project, and select Build Path->Add External Archives>following wizard to add the jar le. Now you can make a simple method call.
public c l a s s Main { public s t a t i c void main ( S t r i n g [ ] a r g s ) { System . out . p r i n t l n ( Simple . add ( 1 , 2 ) ) ; } }

Last, but not the least, the library should be constantly updated and optimized. Documentation is important. If the library is not documented well, you may totally forget a function you programmed a year ago. Proper package names should be used to indicate the function of classes and methods. For example, you can name rst layer of the packages by following the package names of standard Java library: programcreek.util, programcreek.io, programcreek.math, programcreek.text, etc. You domain specic knowledge then can be used in the next level. In addition, always do enough research rst and make sure there is no imple-

16

ments of what you want to do before you start program anything. Libraries from industry utilizes the power of thousands of smart developers.

3
W H E N A N D H O W A J AVA C L A S S I S L O A D E D A N D INITIALIZED? In Java, you rst write a .java le which is then compiled to .class le during compile time. Java is capable of loading classes at run time. The confusion is what is the difference between load and initialize. When and how is a Java class loaded and initialized? It can be clearly illustrated by using a simple example below.

3.1

what does it mean by saying load a class ?

C/C++ is compiled to native machine code rst and then it requires a linking step after compilation. What the linking does is combining source les from different places and form an executable program. Java does not do that. The linking-like step for Java is done when they are loaded into JVM. Different JVMs load classes in different ways, but the basic rule is only loading classes when they are needed. If there are some other classes that are required by the loaded class, they will also be loaded. The loading process is recursive.

3.2

when and how is a java class loaded ?

In Java, loading policies is handled by a ClassLoader. The following example shows how and when a class is loaded for a simple program. TestLoader.java
package compiler ;

17

3.2. WHEN AND HOW IS A JAVA CLASS LOADED?

18

public c l a s s TestLoader { public s t a t i c void main ( S t r i n g [ ] a r g s ) { System . out . p r i n t l n ( " t e s t " ) ; } }

A.java
package compiler ; public c l a s s A { public void method ( ) { System . out . p r i n t l n ( " i n s i d e o f A" ) ; } }

Here is the directory hierarchy in eclipse:

By running the following command, we can get information about each class loaded. The -verbose:class option displays information about each class loaded.
j a v a verbose : c l a s s c l a s s p a t h /home/ron/workspace/ U l t i m a t e T e s t / bin/ compiler . TestLoader

Part of output:
[ Loaded sun . misc . J a v a S e c u r i t y P r o t e c t i o n D o m a i n A c c e s s from /usr/ l o c a l / j a v a /j d k 1 . 6 . 0 _ 34 / j r e / l i b / r t . j a r ] [ Loaded j a v a . s e c u r i t y . ProtectionDomain$ 2 from /usr/ l o c a l / j a v a /j d k 1 . 6 . 0 _ 34 / j r e / l i b / r t . j a r ] [ Loaded j a v a . s e c u r i t y . ProtectionDomain$Key from /usr/ l o c a l / j a v a / j d k 1 . 6 . 0 _ 34 / j r e / l i b / r t . j a r ] [ Loaded j a v a . s e c u r i t y . P r i n c i p a l from /usr/ l o c a l / j a v a /j dk 1 . 6 . 0 _ 34 / jre/lib/rt . jar ] [ Loaded compiler . TestLoader from f i l e : /home/xiwang/workspace/ U l t i m a t e T e s t /bin /] test [ Loaded j a v a . lang . Shutdown from /usr/ l o c a l / j a v a /jd k 1 . 6 . 0 _ 34 / j r e / lib/rt . jar ]

3.2. WHEN AND HOW IS A JAVA CLASS LOADED?

19

[ Loaded j a v a . lang . Shutdown$Lock from /usr/ l o c a l / j a v a /j dk 1 . 6 . 0 _ 34 / jre/lib/rt . jar ]

Now If we change TestLoader.java to:


package compiler ; public c l a s s TestLoader { public s t a t i c void main ( S t r i n g [ ] a r g s ) { System . out . p r i n t l n ( " t e s t " ) ; A a = new A( ) ; a . method ( ) ; } }

And run the same command again, the output would be:
[ Loaded sun . misc . J a v a S e c u r i t y P r o t e c t i o n D o m a i n A c c e s s from /usr/ l o c a l / j a v a /j d k 1 . 6 . 0 _ 34 / j r e / l i b / r t . j a r ] [ Loaded j a v a . s e c u r i t y . ProtectionDomain$ 2 from /usr/ l o c a l / j a v a /j d k 1 . 6 . 0 _ 34 / j r e / l i b / r t . j a r ] [ Loaded j a v a . s e c u r i t y . ProtectionDomain$Key from /usr/ l o c a l / j a v a / j d k 1 . 6 . 0 _ 34 / j r e / l i b / r t . j a r ] [ Loaded j a v a . s e c u r i t y . P r i n c i p a l from /usr/ l o c a l / j a v a /j dk 1 . 6 . 0 _ 34 / jre/lib/rt . jar ] [ Loaded compiler . TestLoader from f i l e : /home/xiwang/workspace/ U l t i m a t e T e s t /bin /] test [ Loaded compiler .A from f i l e : /home/xiwang/workspace/ U l t i m a t e T e s t / bin /] inside of A [ Loaded j a v a . lang . Shutdown from /usr/ l o c a l / j a v a /jd k 1 . 6 . 0 _ 34 / j r e / lib/rt . jar ] [ Loaded j a v a . lang . Shutdown$Lock from /usr/ l o c a l / j a v a /j dk 1 . 6 . 0 _ 34 / jre/lib/rt . jar ]

We can see the difference highlighted in red. A.class is loaded only when it is used. In summary, a class is loaded: when the new bytecode is executed. For example, SomeClass f = new SomeClass(); when the bytecodes make a static reference to a class. For example, System.out.

3.3. WHEN AND HOW IS A JAVA CLASS INITIALIZED?

20

3.3

when and how is a java class initialized ?

A class is initialized when a symbol in the class is rst used. When a class is loaded it is not initialized. JVM will initialize superclass and elds in textual order, initialize static, nal elds rst, and give every eld a default value before initialization. Java Class Instance Initialization is an example that shows the order of execution for eld, static eld and constructor.

4
H O W S TAT I C T Y P E C H E C K I N G W O R K S I N J AVA ? From Wiki: Static type-checking is the process of verifying the type safety of a program based on analysis of a programs source code. Dynamic type-checking is the process of verifying the type safety of a program at runtime Java uses static type checking to analyze the program during compile-time to prove the absence of type errors. The basic idea is never let bad things happen at runtime. By understanding the following example, you should have a good understanding of how static type checking works in Java.

4.1

code example

Suppose we have the following classes, A and B. B extends A.


class A { A me ( ) { return this ; } public void doA ( ) { System . out . p r i n t l n ( "Do A" ) ; } } c l a s s B extends A { public void doB ( ) {

21

4.2. HOW STATIC TYPE CHECKING WORKS?

22

System . out . p r i n t l n ( "Do B " ) ; } }

First of all, what does new B().me() return? An A object or a B object? The me() method is declared to return an A, so during compile time, compiler only sees it return an A object. However, it actually returns a B object during run-time, since B inherits As methods and return this(itself).

4.2

how static type checking works ?

The following line will be illegal, even though the object is being invoked on is a B object. The problem is that its reference type is A. Compiler doesnt know its real type during compile-time, so it sees the object as type A.
// illegal new B ( ) . me ( ) . doB ( ) ;

So only the following method can be invoked.


// legal new B ( ) . me ( ) . doA ( ) ;

However, we can cast the object to type B, like the following:


// legal ( ( B ) new B ( ) . me ( ) ) . doB ( ) ;

If the following C class is added,


c l a s s C extends A{ public void doBad ( ) { System . out . p r i n t l n ( "Do C" ) ; } }

then the following statement is legal and can pass static type checking:
// legal ( ( C) new B ( ) . me ( ) ) . beBad ( ) ;

4.2. HOW STATIC TYPE CHECKING WORKS?

23

Compiler does not know its real time, but runtime will throw a cast exception since B can not be casted to C:
j a v a . lang . C l a s s C a s t E x c e p t i o n : B cannot be c a s t t o C

5
J AVA D O U B L E E X A M P L E Have you ever met the situation that you get an integer but you really want a double. For the following method, devide(2,3) will return 0.0.
public s t a t i c double devide ( i n t x , i n t y ) { r e t u r n x/y ; }

The problem is that x/y does int division. If you want it to do double division, you can cast one of the operand. Both (double)x/y and x/(double)y will work.
public s t a t i c double devide ( i n t x , i n t y ) { r e t u r n ( double ) x/y ; }

Very often you want to round a double. There are multiple ways to do it, the following is a commonly used and simple method. If you want to round the result to 2 digits, you can use the following code:
public s t a t i c double devide ( i n t x , i n t y ) { double z= ( double ) x/y ; double pro = Math . round ( z 1 0 0 ) ; r e t u r n pro / 1 0 0 ; }

24

6
D I A G R A M T O S H O W J AVA S T R I N G S I M M U TA B I L I T Y Here are a set of diagrams to explain Java Strings immutability. 6.1 declare a string
S t r i n g s = " abcd " ;

s stores the reference of the string object. The arrow below should be interpreted as store reference of.

6.2

assign a string variable to another string variable

25

6.3. CONCAT STRING

26

S t r i n g s2 = s ;

s2 stores the same reference value, since it is the same string object.

6.3

concat string

s = s . concat ( " ef " ) ;

s now stores the reference of newly created string object.

6.4. SUMMARY

27

6.4

summary

Once a string is created in memory(heap), it can not be changed. We should note that all methods of String do not change the string itself, but rather return a new String. If we need a string that can be modied, we will need StringBuffer or StringBuilder. Otherwise, there would be a lot of time wasted for Garbage Collection, since each time a new String is created. Here is an example of StringBuilder usage.

7
THE SUBSTRING() METHOD IN JDK 6 AND JDK 7 The substring(int beginIndex, int endIndex) method in JDK 6 and JDK 7 are different. Knowing the difference can help you better use them. For simplicity reasons, in the following substring() represent the substring(int beginIndex, int endIndex) method.

7.1

what substring () does ?

The substring(int beginIndex, int endIndex) method returns a string that starts with beginIndex and ends with endIndex-1.
S t r i n g x = " abcdef " ; x = x . substring (1 ,3) ; System . out . p r i n t l n ( x ) ;

Output:
bc

7.2

what happens when substring () is called ?

You may know that because x is immutable, when x is assigned with the result of x.substring(1,3), it points to a totally new string like the following:

28

7.3. SUBSTRING() IN JDK 6

29

However, this diagram is not exactly right or it represents what really happens in the heap. What really happens when substring() is called is different between JDK 6 and JDK 7.

7.3

substring () in jdk 6

String is supported by a char array. In JDK 6, the String class contains 3 elds: char value[], int offset, int count. They are used to store real character array, the rst index of the array, the number of characters in the String. When the substring() method is called, it creates a new string, but the strings value still points to the same array in the heap. The difference between the two Strings is their count and offset values.

7.4. A PROBLEM CAUSED BY SUBSTRING() IN JDK 6

30

The following code is simplied and only contains the key point for explain this problem.
/ / JDK 6 S t r i n g ( i n t o f f s e t , i n t count , char value [ ] ) { t h i s . value = value ; this . offset = offset ; t h i s . count = count ; } public S t r i n g s u b s t r i n g ( i n t beginIndex , i n t endIndex ) { / / c h e c k boundary r e t u r n new S t r i n g ( o f f s e t + beginIndex , endIndex beginIndex , value ) ; }

7.4

a problem caused by substring () in jdk 6

If you have a VERY long string, but you only need a small part each time by using substring(). This will cause a performance problem, since you need only a small part, you keep the whole thing. For JDK 6, the solution is using the following, which will make it point to a real sub string:

7.5. SUBSTRING() IN JDK 7

31

x = x . substring (x , y) + " "

7.5

substring () in jdk 7

This is improved in JDK 7. In JDK 7, the substring() method actually create a new array in the heap.

/ / JDK 7 public S t r i n g ( char value [ ] , i n t o f f s e t , i n t count ) { / / c h e c k boundary t h i s . value = Arrays . copyOfRange ( value , o f f s e t , o f f s e t + count ) ; } public S t r i n g s u b s t r i n g ( i n t beginIndex , i n t endIndex ) { / / c h e c k boundary i n t subLen = endIndex beginIndex ; r e t u r n new S t r i n g ( value , beginIndex , subLen ) ; }

Top 10 questions about Java String.

8
W H Y S T R I N G I S I M M U TA B L E I N J AVA ? This is an old yet still popular question. There are multiple reasons that String is designed to be immutable in Java. A good answer depends on good understanding of memory, synchronization, data structures, etc. In the following, I will summarize some answers. 8.1 requirement of string pool

String pool (String intern pool) is a special storage area in Method Area. When a string is created and if the string already exists in the pool, the reference of the existing string will be returned, instead of creating a new object and returning its reference. The following code will create only one string object in the heap.
S t r i n g s t r i n g 1 = " abcd " ; S t r i n g s t r i n g 2 = " abcd " ;

32

8.2. ALLOW STRING TO CACHE ITS HASHCODE

33

If string is not immutable, changing the string with one reference will lead to the wrong value for the other references.

8.2

allow string to cache its hashcode

The hashcode of string is frequently used in Java. For example, in a HashMap. Being immutable guarantees that hashcode will always the same, so that it can be cashed without worrying the changes.That means, there is no need to calculate hashcode every time it is used. This is more efcient. In String class, it has the following code:
p r i v a t e i n t hash ; / / t h i s i s u s e d t o c a c h e h a s h c o d e .

8.3

security

String is widely used as parameter for many java classes, e.g. network connection, opening les, etc. Were String not immutable, a connection or le would be changed and lead to serious security threat. The method thought it was connecting to one machine, but was not. Mutable strings could cause security problem in Reection too, as the parameters are strings. Here is a code example:
boolean c on n ec t ( s t r i n g s ) { if ( ! isSecure ( s ) ) {

8.3. SECURITY

34

throw new S e c u r i t y E x c e p t i o n ( ) ; } / / h e r e w i l l c a u s e p r o b l e m , i f s i s c h a n g e d b e f o r e t h i s by using o t h e r r e f e r e n c e s . causeProblem ( s ) ; }

In summary, the reasons include design, efciency, and security. Actually, this is also true for many other why questions in a Java interview.

9
S T R I N G I S PA S S E D B Y R E F E R E N C E I N J AVA This is a classic question of Java. Many similar questions have been asked on stackoverow, and there are a lot of incorrect/incomplete answers. The question is simple if you dont think too much. But it could be very confusing, if you give more thought to it.

9.1

a code fragment that is interesting & confusing

public s t a t i c void main ( S t r i n g [ ] a r g s ) { S t r i n g x = new S t r i n g ( " ab " ) ; change ( x ) ; System . out . p r i n t l n ( x ) ; } public s t a t i c void change ( S t r i n g x ) { x = " cd " ; }

It prints ab. In C++, the code is as follows:


void change ( s t r i n g &x ) { x = " cd " ; } i n t main ( ) { s t r i n g x = " ab " ; change ( x ) ;

35

9.2. COMMON CONFUSING QUESTIONS

36

cout << x << endl ; }

it prints cd. 9.2 common confusing questions

x stores the reference which points to the ab string in the heap. So when x is passed as a parameter to the change() method, it still points to the ab in the heap like the following:

Because java is pass-by-value, the value of x is the reference to ab. When the method change() gets invoked, it creates a new cd object, and x now is pointing to cd like the following:

9.3. WHAT THE CODE REALLY DOES?

37

It seems to be a pretty reasonable explanation. They are clear that Java is always pass-by-value. But what is wrong here?

9.3

what the code really does ?

The explanation above has several mistakes. To understand this easily, it is a good idea to briey walk though the whole process. When the string ab is created, Java allocates the amount of memory required to store the string object. Then, the object is assigned to variable x, the variable is actually assigned a reference to the object. This reference is the address of the memory location where the object is stored. The variable x contains a reference to the string object. x is not a reference itself! It is a variable that stores a reference(memory address). Java is pass-by-value ONLY. When x is passed to the change() method, a copy of value of x (a reference) is passed. The method change() creates another object cd and it has a different reference. It is the variable x that changes its reference(to cd), not the reference itself.

9.4

the wrong explanation

The problem raised from the rst code fragment is nothing related with string immutability. Even if String is replaced with StringBuilder, the result is still the

9.5. SOLUTION TO THIS PROBLEM

38

same. The key point is that variable stores the reference, but is not the reference itself!

9.5

solution to this problem

If we really need to change the value of the object. First of all, the object should be changeable, e.g., StringBuilder. Secondly, we need to make sure that there is no new object created and assigned to the parameter variable, because Java is passing-by-value only.
public s t a t i c void main ( S t r i n g [ ] a r g s ) { S t r i n g B u i l d e r x = new S t r i n g B u i l d e r ( " ab " ) ; change ( x ) ; System . out . p r i n t l n ( x ) ; } public s t a t i c void change ( S t r i n g B u i l d e r x ) { x . d e l e t e ( 0 , 2 ) . append ( " cd " ) ; }

10
S TA R T F R O M L E N G T H & L E N G T H ( ) I N J AVA First of all, can you quickly answer the following question? Without code autocompletion of any IDE, how to get the length of an array? And how to get the length of a String? I asked this question to developers of different levels: entry and intermediate. They can not answer the question correctly or condently. While IDE provides convenient code autocompletion, it also brings the problem of surface understanding. In this post, I will explain some key concepts about Java arrays. The answer:
i n t [ ] a r r = new i n t [ 3 ] ; System . out . p r i n t l n ( a r r . l e n g t h ) ; / / l e n g t h f o r a r r a y S t r i n g s t r = " abc " ; System . out . p r i n t l n ( s t r . l e n g t h ( ) ) ; / / l e n g t h ( ) f o r s t r i n g

The question is why array has the length eld but string does not? Or why string has the length() method while array does not?

10.1

q1 . why arrays have length property ?

First of all, an array is a container object that holds a xed number of values of a single type. After an array is created, its length never changes[1]. The arrays length is available as a nal instance variable length. Therefore, length can be considered as a dening attribute of an array.

39

10.2. Q2. WHY THERE IS NOT A CLASS ARRAY DEFINED SIMILARLY LIKE STRING?

40

An array can be created by two methods: 1) an array creation expression and 2) an array initializer. When it is created, the size is specied. An array creation expression is used in the example above. It species the element type, the number of levels of nested arrays, and the length of the array for at least one of the levels of nesting. This declaration is also legal, since it species one of the levels of nesting.
i n t [ ] [ ] a r r = new i n t [ 3 ] [ ] ;

An array initializer creates an array and provides initial values for all its components. It is written as a comma-separated list of expressions, enclosed by braces and . For example,
int [ ] arr = { 1 , 2 , 3 } ;

10.2

q2 . why there is not a class array defined similarly like string ?

Since an array is an object, the following code is legal.


O b j e c t o b j = new i n t [ 1 0 ] ;

An array contains all the members inherited from class Object(except clone). Why there is not a class denition of an array? We can not nd an Array.java le. A rough explanation is that theyre hidden from us. You can think about the question - if there IS a class Array, what would it look like? It would still need an array to hold the array data, right? Therefore, it is not a good idea to dene such a class. Actually we can get the class of an array by using the following code:
i n t [ ] a r r = new i n t [ 3 ] ; System . out . p r i n t l n ( a r r . g e t C l a s s ( ) ) ;

Output:
class [ I

"class [I" stands for the run-time type signature for the class object "array with component type int".

10.3. Q3. WHY STRING HAS LENGTH() METHOD?

41

10.3

q3 . why string has length () method ?

The backup data structure of a String is a char array. There is no need to dene a eld that is not necessary for every application. Unlike C, an Array of characters is not a String in Java.

11
W H AT E X A C T LY I S N U L L I N J AVA ? Lets start from the following statement:
Stri ng x = null ;

11.1

what exactly does this statement do ?

Recall what is a variable and what is a value. A common metaphor is that a variable is similar to a box. Just as you can use a box to store something, you can use a variable to store a value. When declaring a variable, we need to set its type. There are two major categories of types in Java: primitive and reference. Variables declared of a primitive type store values; variables declared of a reference type store references. In this case, the initialization statement declares a variables x. x stores String reference. It is null here. The following visualization gives a better sense about this concept.

42

11.2. WHAT EXACTLY IS NULL IN MEMORY?

43

11.2

what exactly is null in memory ?

What exactly is null in memory? Or What is the null value in Java? First of all, null is not a valid object instance, so there is no memory allocated for it. It is simply a value that indicates that the object reference is not currently referring to an object. From JVM Specications: The Java Virtual Machine specication does not mandate a concrete value encoding null. I would assume it is all zeros of something similar like itis on other C like languages.

11.3

what exactly is x in memory ?

Now we know what null is. And we know a variable is a storage location and an associated symbolic name (an identier) which contains some value. Where exactly x is in memory? From the diagram of JVM run-time data areas, we know that since each method has a private stack frame within the threads steak, the local variable are located on that frame.

12
C O M PA R A B L E V S C O M PA R AT O R I N J AVA Comparable and Comparator are two interfaces provided by Java Core API. From their names, you can tell that they may be used for comparing stuff in some way. But what exactly are they and what is the difference between them? The following are two examples for answering this question. The simple examples compare two HDTVs size. How to use Comparable vs. Comparator is obvious after reading the code.

12.1

comparable

Comparable is implemented by a class in order to be able to comparing object of itself with some other objects. The class itself must implement the interface in order to be able to compare its instance(s). The method required for implementation is compareTo(). Here is an example to show the usage:
c l a s s HDTV implements Comparable<HDTV> { private int size ; p r i v a t e S t r i n g brand ; public HDTV( i n t s i z e , S t r i n g brand ) { this . size = size ; t h i s . brand = brand ; } public i n t g e t S i z e ( ) { return siz e ; } public void s e t S i z e ( i n t s i z e ) {

44

12.1. COMPARABLE

45

this . size = size ; } public S t r i n g getBrand ( ) { r e t u r n brand ; } public void setBrand ( S t r i n g brand ) { t h i s . brand = brand ; } @Override public i n t compareTo (HDTV t v ) { i f ( t h i s . getSize ( ) > tv . getSize ( ) ) return 1 ; else i f ( t h i s . getSize ( ) < tv . getSize ( ) ) r e t u r n 1; else return 0 ; } } public c l a s s Main { public s t a t i c void main ( S t r i n g [ ] a r g s ) { HDTV t v 1 = new HDTV( 5 5 , " Samsung " ) ; HDTV t v 2 = new HDTV( 6 0 , " Sony " ) ; i f ( t v 1 . compareTo ( t v 2 ) > 0 ) { System . out . p r i n t l n ( t v 1 . getBrand ( ) + " i s better . " ) ; } else { System . out . p r i n t l n ( t v 2 . getBrand ( ) + " i s better . " ) ; } } } Sony i s b e t t e r .

12.2. COMPARATOR

46

12.2

comparator

Comparator is capable of comparing two DIFFERENT types of objects. The method required for implementation is compare(). Now lets use another way to compare those TVs size. The common use of Comparator is sorting. Both Collections and Arrays classes provide a sort method which use a Comparator.
import j a v a . u t i l . A r r a y L i s t ; import j a v a . u t i l . C o l l e c t i o n s ; import j a v a . u t i l . Comparator ; c l a s s HDTV { private int size ; p r i v a t e S t r i n g brand ; public HDTV( i n t s i z e , S t r i n g brand ) { this . size = size ; t h i s . brand = brand ; } public i n t g e t S i z e ( ) { return siz e ; } public void s e t S i z e ( i n t s i z e ) { this . size = size ; } public S t r i n g getBrand ( ) { r e t u r n brand ; } public void setBrand ( S t r i n g brand ) { t h i s . brand = brand ; } } c l a s s SizeComparator implements Comparator<HDTV> { @Override public i n t compare (HDTV tv 1 , HDTV t v 2 ) { int tv1Size = tv1 . getSize ( ) ; int tv2Size = tv2 . getSize ( ) ;

12.2. COMPARATOR

47

if ( tv1Size > tv2Size ) { return 1 ; } else if ( tv1Size < tv2Size ) { r e t u r n 1; } else { return 0 ; } } } public c l a s s Main { public s t a t i c void HDTV t v 1 = HDTV t v 2 = HDTV t v 3 =

main ( S t r i n g [ ] a r g s ) { new HDTV( 5 5 , " Samsung " ) ; new HDTV( 6 0 , " Sony " ) ; new HDTV( 4 2 , " Panasonic " ) ;

A r r a y L i s t <HDTV> a l = new A r r a y L i s t <HDTV> ( ) ; a l . add ( t v 1 ) ; a l . add ( t v 2 ) ; a l . add ( t v 3 ) ; C o l l e c t i o n s . s o r t ( a l , new SizeComparator ( ) ) ; f o r (HDTV a : a l ) { System . out . p r i n t l n ( a . getBrand ( ) ) ; } } }

Output:
Panasonic Samsung Sony

Often we may use Collections.reverseOrder() method to get a descending order Comparator. Like the following:
A r r a y L i s t < I n t e g e r > a l = new A r r a y L i s t < I n t e g e r > ( ) ; a l . add ( 3 ) ; a l . add ( 1 ) ; a l . add ( 2 ) ; System . out . p r i n t l n ( a l ) ; Collections . sort ( al ) ;

12.2. COMPARATOR

48

System . out . p r i n t l n ( a l ) ; Comparator< I n t e g e r > comparator = C o l l e c t i o n s . r e v e r s e O r d e r ( ) ; C o l l e c t i o n s . s o r t ( a l , comparator ) ; System . out . p r i n t l n ( a l ) ;

Output:
[3 ,1 ,2] [1 ,2 ,3] [3 ,2 ,1]

13
J AVA E Q U A L S ( ) A N D H A S H C O D E ( ) C O N T R A C T

The Java super class java.lang.Object has two very important methods dened:
public boolean e q u a l s ( O b j e c t o b j ) public i n t hashCode ( )

They have been proved to be extremely important to understand, especially when user-dened objects are added to Maps. However, even advanced-level developers sometimes cant gure out how they should be used properly. In this post, I will rst show an example of a common mistake, and then explain how equals() and hashCode contract works.

49

13.1. A COMMON MISTAKE

50

13.1

a common mistake

Common mistake is shown in the example below.


import j a v a . u t i l . HashMap ; public c l a s s Apple { private String color ; public Apple ( S t r i n g c o l o r ) { this . color = color ; } public boolean e q u a l s ( O b j e c t o b j ) { i f ( ! ( o b j i n s t a n c e o f Apple ) ) return false ; i f ( o b j == t h i s ) return true ; r e t u r n t h i s . c o l o r == ( ( Apple ) o b j ) . c o l o r ; } public s t a t i c void main ( S t r i n g [ ] a r g s ) { Apple a 1 = new Apple ( " green " ) ; Apple a 2 = new Apple ( " red " ) ; / / hashMap s t o r e s a p p l e t y p e and i t s q u a n t i t y HashMap<Apple , I n t e g e r > m = new HashMap<Apple , Integer >() ; m. put ( a 1 , 1 0 ) ; m. put ( a 2 , 2 0 ) ; System . out . p r i n t l n (m. g e t ( new Apple ( " green " ) ) ) ; } }

In this example, a green apple object is stored successfully in a hashMap, but when the map is asked to retrieve this object, the apple object is not found. The program above prints null. However, we can be sure that the object is stored in the hashMap by inspecting in the debugger:

13.2. PROBLEM CAUSED BY HASHCODE()

51

13.2

problem caused by hashcode ()

The problem is caused by the un-overridden method hashCode(). The contract between equals() and hasCode() is that: 1. If two objects are equal, then they must have the same hash code. 2. If two objects have the same hashcode, they may or may not be equal. The idea behind a Map is to be able to nd an object faster than a linear search. Using hashed keys to locate objects is a two-step process. Internally the Map stores objects as an array of arrays. The index for the rst array is the hashcode() value of the key. This locates the second array which is searched linearly by using equals() to determine if the object is found. The default implementation of hashCode() in Object class returns distinct integers for different objects. Therefore, in the example above, different objects(even with same type) have different hashCode. Hash Code is like a sequence of garages for storage, different stuff can be stored in different garages. It is more efcient if you organize stuff to different place instead of the same garage. So its a good practice to equally distribute the hashCode value. (Not the main point here though) The solution is to add hashCode method to the class. Here I just use the color strings length for demonstration.
public i n t hashCode ( ) { return this . color . length ( ) ;

13.2. PROBLEM CAUSED BY HASHCODE()

52

14
O V E R R I D I N G A N D O V E R L O A D I N G I N J AVA W I T H EXAMPLES 14.1 overriding vs . overloading Here are some important facts about Overriding and Overloading: 1. Real object type, not the reference variables type, determines which overridden method is used at runtime. 2. Reference type determines which overloaded method will be used at compile time. 3. Polymorphism applies to overriding, not to overloading.

14.2

example of overriding

Here is an example of overriding. After reading the code, guess the output. Easy!
c l a s s Dog { public void bark ( ) { System . out . p r i n t l n ( " woof " ) ; } } c l a s s Hound extends Dog { public void s n i f f ( ) { System . out . p r i n t l n ( " s n i f f " ) ; } public void bark ( ) { System . out . p r i n t l n ( " bowl " ) ;

53

14.2. EXAMPLE OF OVERRIDING

54

} } public c l a s s Main { public s t a t i c void main ( S t r i n g [ ] a r g s ) { new Main ( ) . go ( ) ; } void go ( ) { new Hound ( ) . bark ( ) ; ( ( Dog ) new Hound ( ) ) . bark ( ) ; / / ( ( Dog ) new Hound ( ) ) . s n i f f ( ) ; } }

Output? Yes, here you go.


bowl bowl

A better example:
c l a s s Animal { void s t i n k y ( ) { System . out . p r i n t l n ( " s t i n k y animal ! " ) ; } } c l a s s Dog extends Animal { public void s t i n k y ( ) { System . out . p r i n t l n ( " s t i n k y dog ! " ) ; } public void bark ( ) { System . out . p r i n t l n ( "wow wow" ) ; } } c l a s s Cow extends Animal { public void s t i n k y ( ) { System . out . p r i n t l n ( " s t i n k y cow ! " ) ; } } public c l a s s T e s t O v e r r i d i n g {

14.2. EXAMPLE OF OVERRIDING

55

public s t a t i c void main ( S t r i n g [ ] a r g s ) { Animal o b j = new Dog ( ) ; obj . stinky ( ) ; } }

When you create object like the above and call a method:
Animal o b j = new Dog ( ) ; obj . stinky ( ) ;

What compiler does is that it checks the class type of object which is Animal here. After that it checks whether the stinky() exist in Animal or not. Always remember that objects are created at run-time. So compiler has no way to know that the Dog class stinky() method is to be called. So at compile time class type of reference variable is checked to check such a method exist or not. Now at run-time, the JVM knows that though the class type of obj is Animal, at run time it is referring to the object of Dog. So it calls the stinky() of Dog class. This is called Dynamic Polymorphism.

15
W H AT I S I N S TA N C E I N I T I A L I Z E R I N J AVA ? In this post, an example is rst given to illustrate what are instance variable initializer, instance initializer and static initializer. Then how instance initializer works is explained. 15.1 execution order

Look at the following class, do you know which one gets executed rst?
public c l a s s Foo { // instance variable S t r i n g s = " abc " ; initializer

// constructor public Foo ( ) { System . out . p r i n t l n ( " c o n s t r u c t o r c a l l e d " ) ; } // static initializer static { System . out . p r i n t l n ( " s t a t i c i n i t i a l i z e r c a l l e d " ) ; } // instance i n i t i a l i z e r { System . out . p r i n t l n ( " i n s t a n c e i n i t i a l i z e r c a l l e d " ) ; }

56

15.2. HOW DOES JAVA INSTANCE INITIALIZER WORK?

57

public s t a t i c void main ( S t r i n g [ ] a r g s ) { new Foo ( ) ; new Foo ( ) ; } }

Output:
static i n i t i a l i z e r called instance i n i t i a l i z e r called constructor called instance i n i t i a l i z e r called constructor called

15.2

how does java instance initializer work ?

The instance initializer above contains a print statement. To understand how it works, we can think of it as a variable assignment statement(e.g. b = 0), then this would not difcult to understand. Instead of
int b = 0

, you could write


int b ; b = 0;

Therefore, instance initializer and instance variable initializer are pretty much the same.

15.3

when are instance initializers useful ?

The use of instance initializers are rare, but still it can be a useful alternative to instance variable initializers if: (1) initializer code must handle exceptions (2) perform calculations that cant be expressed with an instance variable initializer.

15.3. WHEN ARE INSTANCE INITIALIZERS USEFUL?

58

Of course, such code could be written in constructors. But if a class had multiple constructors, you would have to repeat the code in each constructor. With an instance initializer, you can just write the code once, and it will be executed no matter what constructor is used to create the object. (I guess this is just a concept, and it is not used often.) Another case in which instance initializers are useful is anonymous inner classes, which cant declare any constructors at all. (Will this be a good place to place a logging function?)

16
WHY FIELD CANT BE OVERRIDDEN? This article shows the basic object oriented concept in Java - Field Hiding. 16.1 can field be overridden in java ? Lets rst take a look at the following example which creates two Sub objects. One is assigned to a Sub reference, the other is assigned to a Super reference.
package oo ; c l a s s Super { S t r i n g s = " Super " ; } c l a s s Sub extends Super { S t r i n g s = " Sub " ; } public c l a s s F i e l d O v e r r i d i n g { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Sub c 1 = new Sub ( ) ; System . out . p r i n t l n ( c 1 . s ) ; Super c 2 = new Sub ( ) ; System . out . p r i n t l n ( c 2 . s ) ; } }

What is the output?

59

16.2. HIDING FIELDS INSTEAD OF OVERRIDING THEM

60

Sub Super

We did create two Sub objects, but why the second one prints out Super?

16.2

hiding fields instead of overriding them

In [1], there is a clear denition of Hiding Fields: Within a class, a eld that has the same name as a eld in the superclass hides the superclasss eld, even if their types are different. Within the subclass, the eld in the superclass cannot be referenced by its simple name. Instead, the eld must be accessed through super. Generally speaking, we dont recommend hiding elds as it makes code difcult to read. From this denition, member variables/class elds cannot be overridden like methods. When subclass denes a eld with same name, it just declares a new eld. Therefore, they can not be accessed polymorphically. They can not be overridden, which also means they are hidden and can be access though some ways.

16.3

ways to access hidden fields

1). By using parenting reference type, the hidden parent elds can be access, like the example above. 2). By casting you can access the hidden member in the superclass.
System . out . p r i n t l n ( ( ( Super ) c 1 ) . s ) ;

17
4 T Y P E S O F J AVA I N N E R C L A S S E S There are 4 different types of inner classes you can use in Java. The following gives their name and examples. 17.1 static nested classes

c l a s s Outer { s t a t i c c l a s s Inner { void go ( ) { System . out . p r i n t l n ( " I n n e r c l a s s r e f e r e n c e is : " + this ) ; } } } public c l a s s T e s t { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Outer . I n n e r n = new Outer . I n n e r ( ) ; n . go ( ) ; } } I n n e r c l a s s r e f e r e n c e i s : Outer$Inner@ 19 e 7 ce 87

17.2

member inner class

Member class is instance-specic. It has access to all methods, elds, and the Outers this reference.

61

17.3. METHOD-LOCAL INNER CLASSES

62

public c l a s s Outer { private int x = 100; public void makeInner ( ) { I n n e r i n = new I n n e r ( ) ; i n . seeOuter ( ) ; } c l a s s Inner { public void seeOuter ( ) { System . out . p r i n t l n ( " Outer x i s " + x ) ; System . out . p r i n t l n ( " I n n e r c l a s s r e f e r e n c e i s " + t h i s ) ; System . out . p r i n t l n ( " Outer c l a s s r e f e r e n c e i s " + Outer . this ) ; } } public s t a t i c void main ( S t r i n g [ ] a r g s ) { Outer o = new Outer ( ) ; I n n e r i = o . new I n n e r ( ) ; i . seeOuter ( ) ; } } Outer x i s 100 I n n e r c l a s s r e f e r e n c e i s Outer$Inner@ 4 dfd 9726 Outer c l a s s r e f e r e n c e i s Outer@ 43 ce 67 ca

17.3

method - local inner classes

public c l a s s Outer { private String x = " outer " ; public void d o S t u f f ( ) { c l a s s MyInner { public void seeOuter ( ) { System . out . p r i n t l n ( " x i s " + x ) ; } }

17.4. ANONYMOUS INNER CLASSES

63

MyInner i = new MyInner ( ) ; i . seeOuter ( ) ; } public s t a t i c void main ( S t r i n g [ ] a r g s ) { Outer o = new Outer ( ) ; o . doStuff ( ) ; } } x i s outer public c l a s s Outer { private s t a t i c String x = " s t a t i c outer " ; public s t a t i c void d o S t u f f ( ) { c l a s s MyInner { public void seeOuter ( ) { System . out . p r i n t l n ( " x i s " + x ) ; } } MyInner i = new MyInner ( ) ; i . seeOuter ( ) ; } public s t a t i c void main ( S t r i n g [ ] a r g s ) { Outer . d o S t u f f ( ) ; } } x i s s t a t i c outer

17.4

anonymous inner classes

This is frequently used when you add an action listener to a widget in a GUI application.
button . a d d A c t i o n L i s t e n e r ( new A c t i o n L i s t e n e r ( ) { public void acti onPerfor med ( ActionEvent e ) { comp . s e t T e x t ( " Button has been c l i c k e d " ) ; } }) ;

18
W H AT I S I N N E R I N T E R FA C E I N J AVA ? 18.1 what is inner interface in java ? Inner interface is also called nested interface, which means declare an interface inside of another interface. For example, the Entry interface is declared in the Map interface.
public i n t e r f a c e Map { i n t e r f a c e Entry { i n t getKey ( ) ; } void c l e a r ( ) ; }

18.2

why use inner interface ?

There are several compelling reasons for using inner interface: It is a way of logically grouping interfaces that are only used in one place. It increases encapsulation. Nested interfaces can lead to more readable and maintainable code. One example of inner interface used in java standard library is java.util.Map and Java.util.Map.Entry. Here java.util.Map is used also as a namespace. Entry does not belong to the global scope, which means there are many other entities that are

64

18.3. HOW INNER INTERFACE WORKS?

65

Entries and are not necessary Maps entries. This indicates that Entry represents entries related to the Map.

18.3

how inner interface works ?

To gure out how inner interface works, we can compare it with nested classes. Nested classes can be considered as a regular method declared in outer class. Since a method can be declared as static or non-static, similarly nested classes can be static and non-static. Static class is like a static method, it can only access outer class members through objects. Non-static class can access any member of the outer class.

18.4. A SIMPLE EXAMPLE OF INNER INTERFACE?

66

Because an interface can not be instantiated, the inner interface only makes sense if it is static. Therefore, by default inter interface is static, no matter you manually add static or not.

18.4

a simple example of inner interface ?

Map.java
public i n t e r f a c e Map {

18.4. A SIMPLE EXAMPLE OF INNER INTERFACE?

67

i n t e r f a c e Entry { i n t getKey ( ) ; } void c l e a r ( ) ; }

MapImpl.java
public c l a s s MapImpl implements Map {

c l a s s ImplEntry implements Map . Entry { public i n t getKey ( ) { return 0 ; } } @Override public void c l e a r ( ) { // clear } }

19
CONSTRUCTORS OF SUB AND SUPER CLASSES IN J AVA ? This post summarizes a commonly asked question about Java constructors. 19.1 why creating an object of the sub class invokes also the con structor of the super class ?

c l a s s Super { String s ; public Super ( ) { System . out . p r i n t l n ( " Super " ) ; } } public c l a s s Sub extends Super { public Sub ( ) { System . out . p r i n t l n ( " Sub " ) ; } public s t a t i c void main ( S t r i n g [ ] a r g s ) { Sub s = new Sub ( ) ; } }

It prints:
Super

68

19.2. A COMMON ERROR MESSAGE: IMPLICIT SUPER CONSTRUCTOR IS UNDEFINED FOR DEFA

Sub

When inheriting from another class, super() has to be called rst in the constructor. If not, the compiler will insert that call. This is why super constructor is also invoked when a Sub object is created. This doesnt create two objects, only one Sub object. The reason to have super constructor called is that if super class could have private elds which need to be initialized by its constructor. After compiler inserts the super constructor, the sub class constructor looks like the following:
public Sub ( ) { super ( ) ; System . out . p r i n t l n ( " Sub " ) ; }

19.2

a common error message : implicit super constructor is un defined for default constructor

This is a compilation error message seen by a lot of Java developers. Implicit super constructor is undened for default constructor. Must dene an explicit constructor

19.3. EXPLICITLY CALL SUPER CONSTRUCTOR IN SUB CONSTRUCTOR

70

This compilation error occurs because the default super constructor is undened. In Java, if a class does not dene a constructor, compiler will insert a default one for the class, which is argument-less. If a constructor is dened, e.g. Super(String s), compiler will not insert the default argument-less one. This is the situation for the Super class above. Since compiler tries to insert super() to the 2 constructors in the Sub class, but the Supers default constructor is not dened, compiler reports the error message. To x this problem, simply add the following Super() constructor to the Super class, OR remove the self-dened Super constructor.
public Super ( ) { System . out . p r i n t l n ( " Super " ) ; }

19.3

explicitly call super constructor in sub constructor

The following code is OK:

19.4. THE RULE

71

The Sub constructor explicitly call the super constructor with parameter. The super constructor is dened, and good to invoke.

19.4

the rule

In brief, the rules is: sub class constructor has to invoke super class instructor, either explicitly by programmer or implicitly by compiler. For either way, the invoked super constructor has to be dened.

19.5

the interesting question

Why Java doesnt provide default constructor, if class has a constructor with parameter(s)? Some answers: http://stackoverow.com/q/16046200/127859

20
J AVA A C C E S S L E V E L F O R M E M B E R S : P U B L I C , P R O T E C T E D , P R I VAT E Java access level contains two parts: class level and member level. For class level, it can be public or no explicit modier(package-private). For member access level, it can be public, private, protected, or package-private (no explicit modier). This table summarizes the access level of different modiers for members. Access level determines the accessibility of eld and method. It has 4 levels: public, private, protected, or package-private (no explicit modier).

72

21
W H E N T O U S E P R I VAT E C O N S T R U C T O R S I N J AVA ? If a method is private, it means that it can not be accessed from any class other than itself. This is the access control mechanism provided by Java. When it is used appropriately, it can produce security and functionality. Constructors, like regular methods, can also be declared as private. You may wonder why we need a private constructor since it is only accessible from its own class. When a class needs to prevent the caller from creating objects. Private constructors are suitable. Objects can be constructed only internally. One application is in the singleton design pattern. The policy is that only one object of that class is supposed to exist. So no other class than itself can access the constructor. This ensures the single instance existence of the class. Private constructors have been widely used in JDK, the following code is part of Runtime class.
public c l a s s Runtime { p r i v a t e s t a t i c Runtime currentRuntime = new Runtime ( ) ; public s t a t i c Runtime getRuntime ( ) { r e t u r n currentRuntime ; } / / Don t l e t a n y o n e e l s e i n s t a n t i a t e t h i s c l a s s p r i v a t e Runtime ( ) { } }

73

22
2 E X A M P L E S T O S H O W H O W J AVA E X C E P T I O N H A N D L I N G WORKS There are 2 examples below. One shows all caller methods also need to handle exceptions thrown by the callee method. The other one shows the super class can be used to catch or handle subclass exceptions. 22.1 caller method must handle exceptions thrown by the callee method

Here is a program which handles exceptions. Just test that, if an exception is thrown in one method, not only that method, but also all methods which call that method have to declare or throw that exception.
public c l a s s e x c e p t i o n T e s t { private s t a t i c Exception exception ; public s t a t i c void main ( S t r i n g [ ] a r g s ) throws E x c e p t i o n { callDoOne ( ) ; } public s t a t i c void doOne ( ) throws E x c e p t i o n { throw e x c e p t i o n ; } public s t a t i c void callDoOne ( ) throws E x c e p t i o n { doOne ( ) ; } }

74

22.2. THE SUPER CLASS CAN BE USED TO CATCH OR HANDLE SUBCLASS EXCEPTIONS

75

22.2

the super class can be used to catch or handle subclass ex ceptions

The following is also OK, because the super class can be used to catch or handle subclass exceptions:
c l a s s myException extends E x c e p ti o n { } public c l a s s e x c e p t i o n T e s t { private s t a t i c Exception exception ; p r i v a t e s t a t i c myException myexception ; public s t a t i c void main ( S t r i n g [ ] a r g s ) throws E x c e p t i o n { callDoOne ( ) ; } public s t a t i c void doOne ( ) throws myException { throw myexception ; } public s t a t i c void callDoOne ( ) throws E x c e p t i o n { doOne ( ) ; throw e x c e p t i o n ; } }

This is the reason that only one parent class in the catch clause is syntactically safe.

23
DIAGRAM OF EXCEPTION HIERARCHY In Java, exception can be checked or unchecked. They both t into a class hierarchy. The following diagram shows Java Exception classes hierarchy. Red colored are checked exceptions. Any checked exceptions that may be thrown in a method must either be caught or declared in the methods throws clause. Checked exceptions must be caught at compile time. Checked exceptions are so called because both the Java compiler and the Java virtual machine check to make sure this rule is obeyed. Green colored are uncheck exceptions. They are exceptions that are not expected to be recovered, such as null pointer, divide by 0, etc.

76

77

78

Check out top 10 questions about Java exceptions.

24
J AVA R E A D A F I L E L I N E B Y L I N E - H O W M A N Y WAY S ? The number of total classes of Java I/O is large, and it is easy to get confused when to use which. The following are two methods for reading a le line by line. Method 1:
p r i v a t e s t a t i c void r e a d F i l e 1 ( F i l e f i n ) throws IOException { F i l e I n p u t S t r e a m f i s = new F i l e I n p u t S t r e a m ( f i n ) ; / / C on st r uc t B u f f e r e d R e a d e r from InputStreamReader BufferedReader br = new BufferedReader ( new InputStreamReader ( f i s ) ) ; Stri ng l i n e = null ; while ( ( l i n e = br . readLine ( ) ) ! = n u l l ) { System . out . p r i n t l n ( l i n e ) ; } br . c l o s e ( ) ; }

Method 2:
p r i v a t e s t a t i c void r e a d F i l e 2 ( F i l e f i n ) throws IOException { / / C on st r uc t B u f f e r e d R e a d e r from F i l e R e a d e r BufferedReader br = new BufferedReader ( new F i l e R e a d e r ( f i n ) ); Stri ng l i n e = null ; while ( ( l i n e = br . readLine ( ) ) ! = n u l l ) { System . out . p r i n t l n ( l i n e ) ; }

79

80

br . c l o s e ( ) ; }

Use the following code:


/ / use . to get current d i r e c t o r y F i l e d i r = new F i l e ( " . " ) ; F i l e f i n = new F i l e ( d i r . g e t C a n o n i c a l P a t h ( ) + F i l e . s e p a r a t o r + " i n . txt " ) ; readFile1 ( fin ) ; readFile2 ( fin ) ;

Both works for reading a text le line by line. The difference between the two methods is what to use to construct a BufferedReader. Method 1 uses InputStreamReader and Method 2 uses FileReader. Whats the difference between the two classes? From Java Doc, An InputStreamReader is a bridge from byte streams to character streams: It reads bytes and decodes them into characters using a specied charset. InputStreamReader can handle other input streams than les, such as network connections, classpath resources, ZIP les, etc. FileReader is Convenience class for reading character les. The constructors of this class assume that the default character encoding and the default byte-buffer size are appropriate. FileReader does not allow you to specify an encoding other than the platform default encoding. Therefore, it is not a good idea to use it if the program will run on systems with different platform encoding. In summary, InputStreamReader is always a safer choice than FileReader. It is worth to mention here that instead of using a concrete / or for a path, you should always use File.separator which can ensure that the separator is always correct for different operating systems. Also the path used should be relative, and that ensures the path is always correct. Update: You can also use the following method which is available since Java 1.7. Essentially, it is the same with Method 1.
Charset c h a r s e t = Charset . forName ( "USASCII " ) ; t r y ( BufferedReader r e a d e r = F i l e s . newBufferedReader ( f i l e , c h a r s e t )) {

81

Stri ng l i n e = null ; while ( ( l i n e = r e a d e r . readLine ( ) ) ! = n u l l ) { System . out . p r i n t l n ( l i n e ) ; } } c a t c h ( IOException x ) { System . e r r . format ( " IOException : %s%n " , x ) ; }

The newBufferedReader method does the following:


public s t a t i c BufferedReader newBufferedReader ( Path path , Charset cs ) { CharsetDecoder decoder = c s . newDecoder ( ) ; Reader r e a d e r = new InputStreamReader ( newInputStream ( path ) , decoder ) ; r e t u r n new BufferedReader ( r e a d e r ) ; }

Reading the class hierarchy diagram is also very helpful for understanding those inputstream and reader related concept: http://www.programcreek.com/2012/05/javaio-class-hierarchy-diagram/.

25
J AVA W R I T E T O A F I L E - C O D E E X A M P L E This is Java code for writing something to a le. Every time after it runs, a new le is created, and the previous one is gone. This is different from appending content to a le.
public s t a t i c void w r i t e F i l e 1 ( ) throws IOException { F i l e f o u t = new F i l e ( " out . t x t " ) ; FileOutputStream f o s = new FileOutputStream ( f o u t ) ; B u f f e r e d W r i t e r bw = new B u f f e r e d W r i t e r ( new OutputStreamWriter ( f o s ) ) ; f o r ( i n t i = 0 ; i < 1 0 ; i ++) { bw . w r i t e ( " something " ) ; bw . newLine ( ) ; } bw . c l o s e ( ) ; }

This example use FileOutputStream, instead you can use FileWriter or PrintWriter which is normally good enough for a text le operations. Use FileWriter:
public s t a t i c void w r i t e F i l e 2 ( ) throws IOException { F i l e W r i t e r fw = new F i l e W r i t e r ( " out . t x t " ) ; f o r ( i n t i = 0 ; i < 1 0 ; i ++) { fw . w r i t e ( " something " ) ; }

82

83

fw . c l o s e ( ) ; }

Use PrintWriter:
public s t a t i c void w r i t e F i l e 3 ( ) throws IOException { P r i n t W r i t e r pw = new P r i n t W r i t e r ( new F i l e W r i t e r ( " out . t x t " ) ); f o r ( i n t i = 0 ; i < 1 0 ; i ++) { pw. w r i t e ( " something " ) ; } pw. c l o s e ( ) ; }

Use OutputStreamWriter:
public s t a t i c void w r i t e F i l e 4 ( ) throws IOException { F i l e f o u t = new F i l e ( " out . t x t " ) ; FileOutputStream f o s = new FileOutputStream ( f o u t ) ; OutputStreamWriter osw = new OutputStreamWriter ( f o s ) ; f o r ( i n t i = 0 ; i < 1 0 ; i ++) { osw . w r i t e ( " something " ) ; } osw . c l o s e ( ) ; }

From Java Doc: FileWriter is a convenience class for writing character les. The constructors of this class assume that the default character encoding and the default byte-buffer size are acceptable. To specify these values yourself, construct an OutputStreamWriter on a FileOutputStream. PrintWriter prints formatted representations of objects to a text-output stream. This class implements all of the print methods found in PrintStream. It does not contain methods for writing raw bytes, for which a program should use unencoded byte streams.

84

The main difference is that PrintWriter offers some additional methods for formatting such as println and printf. In addition, FileWriter throws IOException in case of any I/O failure. PrintWriter methods do not throws IOException, instead they set a boolean ag which can be obtained using checkError(). PrintWriter automatically invokes ush after every byte of data is written. In case of FileWriter, caller has to take care of invoking ush.

26
FILEOUTPUTSTREAM VS. FILEWRITER When we use Java to write something to a le, we can do it in the following two ways. One uses FileOutputStream, the other uses FileWriter. Using FileOutputStream:
F i l e f o u t = new F i l e ( f i l e _ l o c a t i o n _ s t r i n g ) ; FileOutputStream f o s = new FileOutputStream ( f o u t ) ; B u f f e r e d W r i t e r out = new B u f f e r e d W r i t e r ( new OutputStreamWriter ( f o s )); out . w r i t e ( " something " ) ;

Using FileWriter:
F i l e W r i t e r f s t r e a m = new F i l e W r i t e r ( f i l e _ l o c a t i o n _ s t r i n g ) ; B u f f e r e d W r i t e r out = new B u f f e r e d W r i t e r ( f s t r e a m ) ; out . w r i t e ( " something " ) ;

Both will work, but what is the difference between FileOutputStream and FileWriter? There are a lot of discussion on each of those classes, they both are good implements of le i/o concept that can be found in a general operating systems. However, we dont care how it is designed, but only how to pick one of them and why pick it that way. From Java API Specication: FileOutputStream is meant for writing streams of raw bytes such as image data. For writing streams of characters, consider using FileWriter.

85

86

If you are familiar with design patterns, FileWriter is a typical usage of Decorator pattern actually. I have use a simple tutorial to demonstrate the Decorator pattern, since it is very important and very useful for many designs. One application of FileOutputStream is converting a le to a byte array.

27
S H O U L D . C L O S E ( ) B E P U T I N F I N A L LY B L O C K O R N O T ? The following are 3 different ways to close a output writer. The rst one puts close() method in try clause, the second one puts close in nally clause, and the third one uses a try-with-resources statement. Which one is the right or the best?
/ / c l o s e ( ) i s in try c l a u s e try { P r i n t W r i t e r out = new P r i n t W r i t e r ( new B u f f e r e d W r i t e r ( new F i l e W r i t e r ( " out . t x t " , t r u e ) ) ) ; out . p r i n t l n ( " t h e t e x t " ) ; out . c l o s e ( ) ; } c a t c h ( IOException e ) { e . printStackTrace ( ) ; } / / c l o s e ( ) i s in f i n a l l y c l a u s e P r i n t W r i t e r out = n u l l ; try { out = new P r i n t W r i t e r ( new B u f f e r e d W r i t e r ( new F i l e W r i t e r ( " out . t x t " , t r u e ) ) ) ; out . p r i n t l n ( " t h e t e x t " ) ; } c a t c h ( IOException e ) { e . printStackTrace ( ) ; } finally { i f ( out ! = n u l l ) { out . c l o s e ( ) ; } }

87

27.1. ANSWER / / t r y with r e s o u r c e s t a t e m e n t t r y ( P r i n t W r i t e r out 2 = new P r i n t W r i t e r ( new B u f f e r e d W r i t e r ( new F i l e W r i t e r ( " out . t x t " , t r u e ) ) ) ) { out 2 . p r i n t l n ( " t h e t e x t " ) ; } c a t c h ( IOException e ) { e . printStackTrace ( ) ; }

88

27.1

answer

Because the Writer should be closed in either case (exception or no exception), close() should be put in nally clause. From Java 7, we can use try-with-resources statement.

28
H O W T O U S E J AVA P R O P E R T I E S F I L E ? For conguration purposes, using properties le is a good way of reusing. In this way, when the code is packaged to a jar le, other users can just put the different congurations in the cong.properties le. The following is a simple example of using properties le. 1. create the le hierarchy like the following. Mainly remember to put the cong.properties le under src package. Other testing code and database class are put in different package under src.

2. The following is the code.


package T e s t ; import j a v a . i o . IOException ; import j a v a . u t i l . P r o p e r t i e s ; public c l a s s T e s t { public s t a t i c void main ( S t r i n g [ ] a r g s ) { P r o p e r t i e s c o n f i g F i l e = new P r o p e r t i e s ( ) ; try {

89

90

c o n f i g F i l e . load ( T e s t . c l a s s . g e t C l a s s L o a d e r ( ) . getResourceAsStream ( " c o n f i g . properties " ) ) ; S t r i n g name = c o n f i g F i l e . g e t P r o p e r t y ( " name "); System . out . p r i n t l n ( name ) ; } c a t c h ( IOException e ) { e . printStackTrace ( ) ; } } }

3. The content in the conguration le following the format of key=value.

29
M O N I T O R S - T H E B A S I C I D E A O F J AVA S Y N C H R O N I Z AT I O N If you took operating system course in college, you might remember that monitor is an important concept of synchronization in operating systems. It is also used in Java synchronization. This post uses an analogy to explain the basic idea of monitor.

29.1

what is a monitor ?

A monitor can be considered as a building which contains a special room. The special room can be occupied by only one customer(thread) at a time. The room usually contains some data and code.

If a customer wants to occupy the special room, he has to enter the Hallway(Entry Set) to wait rst. Scheduler will pick one based on some criteria(e.g. FIFO). If he is suspended for some reason, he will be sent to the wait room, and be scheduled

91

29.2. HOW IS IT IMPLEMENTED IN JAVA?

92

to reenter the special room later. As it is shown in the diagram above, there are 3 rooms in this building.

In brief, a monitor is a facility which monitors the threads access to the special room. It ensures that only one thread can access the protected data or code.

29.2

how is it implemented in java ?

In the Java virtual machine, every object and class is logically associated with a monitor. To implement the mutual exclusion capability of monitors, a lock (sometimes called a mutex) is associated with each object and class. This is called a semaphore in operating systems books, mutex is a binary semaphore. If one thread owns a lock on some data, then no others can obtain that lock until the thread that owns the lock releases it. It would be not convenient if we need to write a semaphore all the time when we do multi-threading programming. Luckily, we dont need to since JVM does that for us automatically. To claim a monitor region which means data not accessible by more than one thread, Java provide synchronized statements and synchronized methods. Once the code is embedded with synchronized keyword, it is a monitor region. The locks are implemented in the background automatically by JVM.

29.3. IN JAVA SYNCHRONIZATION CODE, WHICH PART IS MONITOR?

93

29.3

in java synchronization code , which part is monitor ?

We know that each object/class is associated with a Monitor. I think it is good to say that each object has a monitor, since each object could have its own critical section, and capable of monitoring the thread sequence. To enable collaboration of different threads, Java provide wait() and notify() to suspend a thread and to wake up another thread that are waiting on the object respectively. In addition, there are 3 other versions:
wait ( long timeout , i n t nanos ) wait ( long timeout ) n o t i f i e d by o t h e r t h r e a d s or n o t i f i e d by timeout . notify ( al l )

Those methods can only be invoked within a synchronized statement or synchronized method. The reason is that if a method does not require mutual exclusion, there is no need to monitor or collaborate between threads, every thread can access that method freely. Here are some synchronization code examples.

30
T H E I N T E R FA C E A N D C L A S S H I E R A R C H Y D I A G R A M O F J AVA C O L L E C T I O N S 30.1 collection vs collections First of all, Collection and Collections are two different concepts. As you will see from the hierarchy diagram below, Collection is a root interface in the Collection hierarchy but Collections is a class which provide static methods to manipulate on some Collection types.

94

30.2. CLASS HIERARCHY OF COLLECTION

95

30.2

class hierarchy of collection

The following diagram demonstrates class hierarchy of Collection.

30.3

class hierarchy of map

Here is class hierarchy of Map.

30.4. SUMMARY OF CLASSES

96

30.4

summary of classes

30.5

code example

The following is a simple example to illustrate some collection types:


L i s t < S t r i n g > a 1 = new A r r a y L i s t < S t r i n g > ( ) ; a 1 . add ( " Program " ) ; a 1 . add ( " Creek " ) ; a 1 . add ( " J a v a " ) ; a 1 . add ( " J a v a " ) ; System . out . p r i n t l n ( " A r r a y L i s t Elements " ) ; System . out . p r i n t ( " \ t " + a 1 + " \n " ) ; L i s t < S t r i n g > l 1 = new L i n k e d L i s t < S t r i n g > ( ) ;

30.5. CODE EXAMPLE

97

l 1 . add ( " Program " ) ; l 1 . add ( " Creek " ) ; l 1 . add ( " J a v a " ) ; l 1 . add ( " J a v a " ) ; System . out . p r i n t l n ( " L i n k e d L i s t Elements " ) ; System . out . p r i n t ( " \ t " + l 1 + " \n " ) ; Set < S t r i n g > s 1 = new HashSet < S t r i n g > ( ) ; / / o r new T r e e S e t ( ) w i l l order the elements ; s 1 . add ( " Program " ) ; s 1 . add ( " Creek " ) ; s 1 . add ( " J a v a " ) ; s 1 . add ( " J a v a " ) ; s 1 . add ( " t u t o r i a l " ) ; System . out . p r i n t l n ( " S e t Elements " ) ; System . out . p r i n t ( " \ t " + s 1 + " \n " ) ; Map< S t r i n g , S t r i n g > m1 = new HashMap< S t r i n g , S t r i n g > ( ) ; / / o r new TreeMap ( ) w i l l o r d e r b a s e d on k e y s m1 . put ( " Windows " , " 2000 " ) ; m1 . put ( " Windows " , "XP" ) ; m1 . put ( " Language " , " J a v a " ) ; m1 . put ( " Website " , " programcreek . com" ) ; System . out . p r i n t l n ( "Map Elements " ) ; System . out . p r i n t ( " \ t " + m1 ) ;

Output:
A r r a y L i s t Elements [ Program , Creek , Java , J a v a ] L i n k e d L i s t Elements [ Program , Creek , Java , J a v a ] S e t Elements [ t u t o r i a l , Creek , Program , J a v a ] Map Elements { Windows=XP , Website=programcreek . com , Language=J a v a }

31
A SIMPLE TREESET EXAMPLE The following is a very simple TreeSet example. From this simple example, you will see: TreeSet is sorted How to iterate a TreeSet How to check empty How to retrieve rst/last element How to remove an element If you want to know more about Java Collection, check out the Java Collection hierarchy diagram.
import j a v a . u t i l . I t e r a t o r ; import j a v a . u t i l . T r e e S e t ; public c l a s s TreeSetExample { public s t a t i c void main ( S t r i n g [ ] a r g s ) { System . out . p r i n t l n ( " Tree S e t Example ! \ n " ) ; T r e e S e t < I n t e g e r > t r e e = new T r e e S e t < I n t e g e r > ( ) ; t r e e . add ( 1 2 ) ; t r e e . add ( 6 3 ) ; t r e e . add ( 3 4 ) ; t r e e . add ( 4 5 ) ; / / h e r e i t t e s t i t s s o r t e d , 63 i s t h e l a s t e l e m e n t . s e e output below I t e r a t o r <Integer > i t e r a t o r = tree . i t e r a t o r ( ) ;

98

99

System . out . p r i n t ( " Tree s e t data : " ) ; / / Displaying the Tree s e t data while ( i t e r a t o r . hasNext ( ) ) { System . out . p r i n t ( i t e r a t o r . n ex t ( ) + " " ) ; } System . out . p r i n t l n ( ) ; / / Check empty o r n o t i f ( t r e e . isEmpty ( ) ) { System . out . p r i n t ( " Tree S e t i s empty . " ) ; } else { System . out . p r i n t l n ( " Tree S e t s i z e : " + t r e e . s i z e ( ) ); } / / R e t r i e v e f i r s t d a t a from t r e e s e t System . out . p r i n t l n ( " F i r s t data : " + t r e e . f i r s t ( ) ) ; / / R e t r i e v e l a s t d a t a from t r e e s e t System . out . p r i n t l n ( " L a s t data : " + t r e e . l a s t ( ) ) ; i f ( t r e e . remove ( 4 5 ) ) { / / r e m o v e e l e m e n t by v a l u e System . out . p r i n t l n ( " Data i s removed from t r e e s e t " ); } else { System . out . p r i n t l n ( " Data doesn t e x i s t ! " ) ; } System . out . p r i n t ( "Now t h e t r e e s e t c o n t a i n : " ) ; iterator = tree . iterator () ; / / Displaying the Tree s e t data while ( i t e r a t o r . hasNext ( ) ) { System . out . p r i n t ( i t e r a t o r . n ex t ( ) + " " ) ; } System . out . p r i n t l n ( ) ; System . out . p r i n t l n ( "Now t h e s i z e o f t r e e s e t : " + t r e e . size () ) ; / / Remove a l l tree . clear () ; i f ( t r e e . isEmpty ( ) ) { System . out . p r i n t ( " Tree S e t i s empty . " ) ;

100

} else { System . out . p r i n t l n ( " Tree S e t s i z e : " + t r e e . s i z e ( ) ); } } }

Output:
Tree S e t Example ! Tree s e t data : 12 34 45 63 Tree S e t s i z e : 4 F i r s t data : 12 L a s t data : 63 Data i s removed from t r e e s e t Now t h e t r e e s e t c o n t a i n : 12 34 63 Now t h e s i z e o f t r e e s e t : 3 Tree S e t i s empty .

32
D E E P U N D E R S TA N D I N G O F A R R AY S . S O R T ( ) Arrays.sort(T[], Comparator <? super T >c) is a method for sorting user-dened object array. The ofcial Java Doc briey describe what it does, but not much for deep understanding. In this post, I will walk though the key information for deeper understanding of this method.

32.1

a simple example showing how to use arrays . sort ()

By reading the following example, you can quickly get an idea of how to use this method correctly. A Comparator is dened for comparing Dogs by size and then the Comparator is used as a parameter for the sort method.
import j a v a . u t i l . Arrays ; import j a v a . u t i l . Comparator ; c l a s s Dog { int size ; public Dog ( i n t s ) { size = s ; } } c l a s s DogSizeComparator implements Comparator<Dog>{ @Override public i n t compare ( Dog o 1 , Dog o 2 ) { r e t u r n o1 . s i z e o2 . s i z e ; } }

101

32.2. STRATEGY PATTERN USED

102

public c l a s s ArraySort { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Dog d1 = new Dog ( 2 ) ; Dog d2 = new Dog ( 1 ) ; Dog d3 = new Dog ( 3 ) ; Dog [ ] dogArray = { d1 , d2 , d3 } ; printDogs ( dogArray ) ; Arrays . s o r t ( dogArray , new DogSizeComparator ( ) ) ; printDogs ( dogArray ) ; } public s t a t i c void printDogs ( Dog [ ] dogs ) { f o r ( Dog d : dogs ) System . out . p r i n t ( d . s i z e + " " ) ; System . out . p r i n t l n ( ) ; } }

Output:
2 1 3 1 2 3

32.2

strategy pattern used

As this is a perfect example of Strategy pattern, it is worth to mention here why strategy pattern is good for this situation. In brief, Strategy pattern enables different algorithms get selected at run-time. In this case, by passing different Comparator, different algorithms can get selected. Based on the example above and now assuming you have another Comparator which compares Dogs by weight instead of by size, you can simply create a new Comparator like the following.
c l a s s Dog { int size ; i n t weight ;

32.2. STRATEGY PATTERN USED

103

public Dog ( i n t s , i n t w) { size = s ; weight = w; } } c l a s s DogSizeComparator implements Comparator<Dog>{ @Override public i n t compare ( Dog o 1 , Dog o 2 ) { r e t u r n o1 . s i z e o2 . s i z e ; } } c l a s s DogWeightComparator implements Comparator<Dog>{ @Override public i n t compare ( Dog o 1 , Dog o 2 ) { r e t u r n o 1 . weight o 2 . weight ; } } public c l a s s ArraySort { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Dog d1 = new Dog ( 2 , 5 0 ) ; Dog d2 = new Dog ( 1 , 3 0 ) ; Dog d3 = new Dog ( 3 , 4 0 ) ; Dog [ ] dogArray = { d1 , d2 , d3 } ; printDogs ( dogArray ) ; Arrays . s o r t ( dogArray , new DogSizeComparator ( ) ) ; printDogs ( dogArray ) ; Arrays . s o r t ( dogArray , new DogWeightComparator ( ) ) ; printDogs ( dogArray ) ; } public s t a t i c void printDogs ( Dog [ ] dogs ) { f o r ( Dog d : dogs ) System . out . p r i n t ( " s i z e = " +d . s i z e + " weight = " + d . weight + " " ) ;

32.3. WHY USE SUPER?

104

System . out . p r i n t l n ( ) ; } } s i z e = 2 weight = 50 s i z e = 1 weight = 30 s i z e = 3 weight = 40 s i z e = 1 weight = 30 s i z e = 2 weight = 50 s i z e = 3 weight = 40 s i z e = 1 weight = 30 s i z e = 3 weight = 40 s i z e = 2 weight = 50

Comparator is just an interface. Any Comparator that implements this interface can be used during run-time. This is the key idea of Strategy design pattern.

32.3

why use super ?

It is straightforward if Comparator <T >c is the parameter, but the second parameter is Comparator<? super T >c. <? super T >means the type can be T or its super types. Why it allows super types? The answer is: This approach allows using same comparator for all sub classes. This is almost obvious in the following example.
import j a v a . u t i l . Arrays ; import j a v a . u t i l . Comparator ; c l a s s Animal { int size ; } c l a s s Dog extends Animal { public Dog ( i n t s ) { size = s ; } } c l a s s Cat extends Animal { public Cat ( i n t s ) { size = s ; } } c l a s s AnimalSizeComparator implements Comparator<Animal >{

32.3. WHY USE SUPER?

105

@Override public i n t compare ( Animal o 1 , Animal o 2 ) { r e t u r n o1 . s i z e o2 . s i z e ; } / / i n t h i s way , a l l sub c l a s s e s o f Animal can u s e t h i s comparator . } public c l a s s ArraySort { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Dog d1 = new Dog ( 2 ) ; Dog d2 = new Dog ( 1 ) ; Dog d3 = new Dog ( 3 ) ; Dog [ ] dogArray = { d1 , d2 , d3 } ; printDogs ( dogArray ) ; Arrays . s o r t ( dogArray , new AnimalSizeComparator ( ) ) ; printDogs ( dogArray ) ; System . out . p r i n t l n ( ) ; / / when can Cat c 1 Cat c 2 Cat c 3 you h a v e an a r r a y o f Cat , same C o m p a r a t o r be used . = new Cat ( 2 ) ; = new Cat ( 1 ) ; = new Cat ( 3 ) ;

Cat [ ] c a t A r r a y = { c 1 , c 2 , c 3 } ; printDogs ( c a t A r r a y ) ; Arrays . s o r t ( catArray , new AnimalSizeComparator ( ) ) ; printDogs ( c a t A r r a y ) ; } public s t a t i c void printDogs ( Animal [ ] animals ) { f o r ( Animal a : animals ) System . out . p r i n t ( " s i z e = " +a . s i z e + " " ) ; System . out . p r i n t l n ( ) ; } }

32.4. SUMMARY

106

s i z e =2 s i z e =1 s i z e =3 s i z e =1 s i z e =2 s i z e =3 s i z e =2 s i z e =1 s i z e =3 s i z e =1 s i z e =2 s i z e =3

32.4

summary

To summarize, the takeaway messages from Arrays.sort(): generic - super strategy pattern merge sort - nlog(n) time complexity Java.util.Collections#sort(List <T >list, Comparator <? super T >c) has similar idea with Arrays.sort.

33
A R R AY L I S T V S . L I N K E D L I S T V S . V E C T O R 33.1 list overview List, as its name indicates, is an ordered sequence of elements. When we talk about List, it is a good idea to compare it with Set. A Set is a set of unique and unordered elements. The following is the class hierarchy diagram of Collection. From that you have a general idea of what Im talking about.

107

33.2. ARRAYLIST VS. LINKEDLIST VS. VECTOR

108

33.2

arraylist vs . linkedlist vs . vector

From the hierarchy diagram, they all implement List interface. They are very similar to use. Their main difference is their implementation which causes different performance for different operations. ArrayList is implemented as a resizable array. As more elements are added to ArrayList, its size is increased dynamically. Its elements can be accessed directly by using the get and set methods, since ArrayList is essentially an array. LinkedList is implemented as a double linked list. Its performance on add and remove is better than Arraylist, but worse on get and set methods. Vector is similar with ArrayList, but it is synchronized. ArrayList is a better choice if your program is thread-safe. Vector and ArrayList require space as more elements are added. Vector each time doubles its array size, while ArrayList grow 50 LinkedList, however, also implements Queue interface which adds more methods than ArrayList and Vector, such as offer(), peek(), poll(), etc. Note: The default initial capacity of an ArrayList is pretty small. It is a good habit to construct the ArrayList with a higher initial capacity. This can avoid the resizing cost.

33.3

arraylist example

A r r a y L i s t < I n t e g e r > a l = new A r r a y L i s t < I n t e g e r > ( ) ; a l . add ( 3 ) ; a l . add ( 2 ) ; a l . add ( 1 ) ; a l . add ( 4 ) ; a l . add ( 5 ) ; a l . add ( 6 ) ; a l . add ( 6 ) ; I t e r a t o r <Integer > i t e r 1 = al . i t e r a t o r ( ) ; while ( i t e r 1 . hasNext ( ) ) { System . out . p r i n t l n ( i t e r 1 . n ex t ( ) ) ; }

33.4. LINKEDLIST EXAMPLE

109

33.4

linkedlist example

L i n k e d L i s t < I n t e g e r > l l = new L i n k e d L i s t < I n t e g e r > ( ) ; l l . add ( 3 ) ; l l . add ( 2 ) ; l l . add ( 1 ) ; l l . add ( 4 ) ; l l . add ( 5 ) ; l l . add ( 6 ) ; l l . add ( 6 ) ; I t e r a t o r <Integer > i t e r 2 = l l . i t e r a t o r ( ) ; while ( i t e r 2 . hasNext ( ) ) { System . out . p r i n t l n ( i t e r 2 . n ex t ( ) ) ; }

As shown in the examples above, they are similar to use. The real difference is their underlying implementation and their operation complexity. 33.5 vector

Vector is almost identical to ArrayList, and the difference is that Vector is synchronized. Because of this, it has an overhead than ArrayList. Normally, most Java programmers use ArrayList instead of Vector because they can synchronize explicitly by themselves.

33.6

performance of arraylist vs . linkedlist

* add() in the table refers to add(E e), and remove() refers to remove(int index)

33.6. PERFORMANCE OF ARRAYLIST VS. LINKEDLIST

110

ArrayList has O(n) time complexity for arbitrary indices of add/remove, but O(1) for the operation at the end of the list. LinkedList has O(n) time complexity for arbitrary indices of add/remove, but O(1) for operations at end/beginning of the List. I use the following code to test their performance:
A r r a y L i s t < I n t e g e r > a r r a y L i s t = new A r r a y L i s t < I n t e g e r > ( ) ; L i n k e d L i s t < I n t e g e r > l i n k e d L i s t = new L i n k e d L i s t < I n t e g e r > ( ) ; / / A r r a y L i s t add long s t a r t T i m e = System . nanoTime ( ) ; f o r ( i n t i = 0 ; i < 1 0 0 0 0 0 ; i ++) { a r r a y L i s t . add ( i ) ; } long endTime = System . nanoTime ( ) ; long d u r a t i o n = endTime s t a r t T i m e ; System . out . p r i n t l n ( " A r r a y L i s t add : " + d u r a t i o n ) ; / / L i n k e d L i s t add s t a r t T i m e = System . nanoTime ( ) ; f o r ( i n t i = 0 ; i < 1 0 0 0 0 0 ; i ++) { l i n k e d L i s t . add ( i ) ; } endTime = System . nanoTime ( ) ; d u r a t i o n = endTime s t a r t T i m e ; System . out . p r i n t l n ( " L i n k e d L i s t add : " + d u r a t i o n ) ; / / ArrayList get s t a r t T i m e = System . nanoTime ( ) ; f o r ( i n t i = 0 ; i < 1 0 0 0 0 ; i ++) { arrayList . get ( i ) ; } endTime = System . nanoTime ( ) ; d u r a t i o n = endTime s t a r t T i m e ; System . out . p r i n t l n ( " A r r a y L i s t g e t : / / LinkedList get s t a r t T i m e = System . nanoTime ( ) ;

" + duration ) ;

33.6. PERFORMANCE OF ARRAYLIST VS. LINKEDLIST

111

f o r ( i n t i = 0 ; i < 1 0 0 0 0 ; i ++) { linkedList . get ( i ) ; } endTime = System . nanoTime ( ) ; d u r a t i o n = endTime s t a r t T i m e ; System . out . p r i n t l n ( " L i n k e d L i s t g e t : " + d u r a t i o n ) ;

/ / A r r a y L i s t remove s t a r t T i m e = System . nanoTime ( ) ; f o r ( i n t i = 9 9 9 9 ; i >= 0 ; i ) { a r r a y L i s t . remove ( i ) ; } endTime = System . nanoTime ( ) ; d u r a t i o n = endTime s t a r t T i m e ; System . out . p r i n t l n ( " A r r a y L i s t remove :

" + duration ) ;

/ / L i n k e d L i s t remove s t a r t T i m e = System . nanoTime ( ) ; f o r ( i n t i = 9 9 9 9 ; i >= 0 ; i ) { l i n k e d L i s t . remove ( i ) ; } endTime = System . nanoTime ( ) ; d u r a t i o n = endTime s t a r t T i m e ; System . out . p r i n t l n ( " L i n k e d L i s t remove : " + d u r a t i o n ) ;

And the output is:


A r r a y L i s t add : 13265642 L i n k e d L i s t add : 9550057 A r r a y L i s t g e t : 1543352 L i n k e d L i s t g e t : 85085551 A r r a y L i s t remove : 199961301 L i n k e d L i s t remove : 85768810

The difference of their performance is obvious. LinkedList is faster in add and remove, but slower in get. Based on the complexity table and testing results, we

33.6. PERFORMANCE OF ARRAYLIST VS. LINKEDLIST

112

can gure out when to use ArrayList or LinkedList. In brief, LinkedList should be preferred if: there are no large number of random access of element there are a large number of add/remove operations

34
HASHSET VS. TREESET VS. LINKEDHASHSET A Set contains no duplicate elements. That is one of the major reasons to use a set. There are 3 commonly used implementations of Set: HashSet, TreeSet and LinkedHashSet. When and which to use is an important question. In brief, if you need a fast set, you should use HashSet; if you need a sorted set, then TreeSet should be used; if you need a set that can be store the insertion order, LinkedHashSet should be used.

34.1

set interface

Set interface extends Collection interface. In a set, no duplicates are allowed. Every element in a set must be unique. You can simply add elements to a set, and duplicates will be removed automatically.

113

34.2. HASHSET VS. TREESET VS. LINKEDHASHSET

114

34.2

hashset vs . treeset vs . linkedhashset

HashSet is Implemented using a hash table. Elements are not ordered. The add, remove, and contains methods has constant time complexity O(1). TreeSet is implemented using a tree structure(red-black tree in algorithm book). The elements in a set are sorted, but the add, remove, and contains methods has time complexity of O(log (n)). It offers several methods to deal with the ordered set like rst(), last(), headSet(), tailSet(), etc. LinkedHashSet is between HashSet and TreeSet. It is implemented as a hash table with a linked list running through it, so it provides the order of insertion. The time complexity of basic methods is O(1).

34.3

treeset example

T r e e S e t < I n t e g e r > t r e e = new T r e e S e t < I n t e g e r > ( ) ; t r e e . add ( 1 2 ) ; t r e e . add ( 6 3 ) ; t r e e . add ( 3 4 ) ;

34.3. TREESET EXAMPLE

115

t r e e . add ( 4 5 ) ; I t e r a t o r <Integer > i t e r a t o r = tree . i t e r a t o r ( ) ; System . out . p r i n t ( " Tree s e t data : " ) ; while ( i t e r a t o r . hasNext ( ) ) { System . out . p r i n t ( i t e r a t o r . nex t ( ) + " " ) ; }

Output is sorted as follows:


Tree s e t data : 12 34 45 63

Now lets dene a Dog class as follows:


c l a s s Dog { int size ; public Dog ( i n t s ) { size = s ; } public S t r i n g t o S t r i n g ( ) { return siz e + " " ; } }

Lets add some dogs to TreeSet like the following:


import j a v a . u t i l . I t e r a t o r ; import j a v a . u t i l . T r e e S e t ; public c l a s s T e s t T r e e S e t { public s t a t i c void main ( S t r i n g [ ] a r g s ) { T r e e S e t <Dog> d s e t = new T r e e S e t <Dog > ( ) ; d s e t . add ( new Dog ( 2 ) ) ; d s e t . add ( new Dog ( 1 ) ) ; d s e t . add ( new Dog ( 3 ) ) ; I t e r a t o r <Dog> i t e r a t o r = d s e t . i t e r a t o r ( ) ; while ( i t e r a t o r . hasNext ( ) ) { System . out . p r i n t ( i t e r a t o r . nex t ( ) + " " ) ; } } }

34.4. HASHSET EXAMPLE

116

Compile ok, but run-time error occurs:


E x c e p t i o n i n t h r e a d " main " j a v a . lang . C l a s s C a s t E x c e p t i o n : c o l l e c t i o n . Dog cannot be c a s t t o j a v a . lang . Comparable a t j a v a . u t i l . TreeMap . put ( Unknown Source ) a t j a v a . u t i l . T r e e S e t . add ( Unknown Source ) a t c o l l e c t i o n . T e s t T r e e S e t . main ( T e s t T r e e S e t . j a v a : 2 2 )

Because TreeSet is sorted, the Dog object need to implement java.lang.Comparables compareTo() method like the following:
c l a s s Dog implements Comparable<Dog>{ int size ; public Dog ( i n t s ) { size = s ; } public S t r i n g t o S t r i n g ( ) { return siz e + " " ; } @Override public i n t compareTo ( Dog o ) { return siz e o . si ze ; } }

The output is:


1 2 3

34.4

hashset example

HashSet <Dog> d s e t = new HashSet <Dog > ( ) ; d s e t . add ( new Dog ( 2 ) ) ; d s e t . add ( new Dog ( 1 ) ) ; d s e t . add ( new Dog ( 3 ) ) ; d s e t . add ( new Dog ( 5 ) ) ; d s e t . add ( new Dog ( 4 ) ) ; I t e r a t o r <Dog> i t e r a t o r = d s e t . i t e r a t o r ( ) ; while ( i t e r a t o r . hasNext ( ) ) { System . out . p r i n t ( i t e r a t o r . n ex t ( ) + " " ) ; }

34.5. LINKEDHASHSET EXAMPLE

117

Output:
5 3 2 1 4

Note the order is not certain. 34.5 linkedhashset example

LinkedHashSet <Dog> d s e t = new LinkedHashSet <Dog > ( ) ; d s e t . add ( new Dog ( 2 ) ) ; d s e t . add ( new Dog ( 1 ) ) ; d s e t . add ( new Dog ( 3 ) ) ; d s e t . add ( new Dog ( 5 ) ) ; d s e t . add ( new Dog ( 4 ) ) ; I t e r a t o r <Dog> i t e r a t o r = d s e t . i t e r a t o r ( ) ; while ( i t e r a t o r . hasNext ( ) ) { System . out . p r i n t ( i t e r a t o r . n ex t ( ) + " " ) ; }

The order of the output is certain and it is the insertion order:


2 1 3 5 4

34.6

performance testing

The following method tests the performance of the three class on add() method.
public s t a t i c void main ( S t r i n g [ ] a r g s ) { Random r = new Random ( ) ; HashSet <Dog> hashSet = new HashSet <Dog > ( ) ; T r e e S e t <Dog> t r e e S e t = new T r e e S e t <Dog > ( ) ; LinkedHashSet <Dog> l i n k e d S e t = new LinkedHashSet <Dog > ( ) ; / / s t a r t time long s t a r t T i m e = System . nanoTime ( ) ; f o r ( i n t i = 0 ; i < 1 0 0 0 ; i ++) { int x = r . nextInt (1000 10) + 10; hashSet . add ( new Dog ( x ) ) ; } / / end t i m e

34.6. PERFORMANCE TESTING

118

long endTime = System . nanoTime ( ) ; long d u r a t i o n = endTime s t a r t T i m e ; System . out . p r i n t l n ( " HashSet : " + d u r a t i o n ) ; / / s t a r t time s t a r t T i m e = System . nanoTime ( ) ; f o r ( i n t i = 0 ; i < 1 0 0 0 ; i ++) { int x = r . nextInt (1000 10) + 10; t r e e S e t . add ( new Dog ( x ) ) ; } / / end t i m e endTime = System . nanoTime ( ) ; d u r a t i o n = endTime s t a r t T i m e ; System . out . p r i n t l n ( " T r e e S e t : " + d u r a t i o n ) ; / / s t a r t time s t a r t T i m e = System . nanoTime ( ) ; f o r ( i n t i = 0 ; i < 1 0 0 0 ; i ++) { int x = r . nextInt (1000 10) + 10; l i n k e d S e t . add ( new Dog ( x ) ) ; } / / end t i m e endTime = System . nanoTime ( ) ; d u r a t i o n = endTime s t a r t T i m e ; System . out . p r i n t l n ( " LinkedHashSet : " + d u r a t i o n ) ; }

From the output below, we can clearly wee that HashSet is the fastest one.
HashSet : 2244768 T r e e S e t : 3549314 LinkedHashSet : 2263320

* The test is not precise, but can reect the basic idea.

35
H A S H M A P V S . T R E E M A P V S . H A S H TA B L E V S . LINKEDHASHMAP Map is one of the most important data structures. In this tutorial, I will show you how to use different maps such as HashMap, TreeMap, HashTable and LinkedHashMap.

35.1

map overview

There are 4 commonly used implementations of Map in Java SE - HashMap, TreeMap, Hashtable and LinkedHashMap. If we use one sentence to describe each implementation, it would be the following:

119

35.2. HASHMAP

120

HashMap is implemented as a hash table, and there is no ordering on keys or values. TreeMap is implemented based on red-black tree structure, and it is ordered by the key. LinkedHashMap preserves the insertion order Hashtable is synchronized, in contrast to HashMap.

35.2

hashmap

If key of the HashMap is self-dened objects, then equals() and hashCode() contract need to be followed.
c l a s s Dog { String color ; Dog ( S t r i n g c ) { color = c ; } public S t r i n g t o S t r i n g ( ) { r e t u r n c o l o r + " dog " ; } } public c l a s s TestHashMap { public s t a t i c void main ( S t r i n g [ ] a r g s ) { HashMap<Dog , I n t e g e r > hashMap = new HashMap<Dog , Integer >() ; Dog d1 = new Dog ( " red " ) ; Dog d2 = new Dog ( " b l a c k " ) ; Dog d3 = new Dog ( " white " ) ; Dog d4 = new Dog ( " white " ) ; hashMap . put ( d1 , hashMap . put ( d2 , hashMap . put ( d3 , hashMap . put ( d4 , 10) ; 15) ; 5) ; 20) ;

// print size System . out . p r i n t l n ( hashMap . s i z e ( ) ) ;

35.2. HASHMAP

121

/ / l o o p HashMap f o r ( Entry <Dog , I n t e g e r > e n t r y : hashMap . e n t r y S e t () ) { System . out . p r i n t l n ( e n t r y . getKey ( ) . t o S t r i n g ( ) + " " + e n t r y . getValue ( ) ) ; } } }

Output:
4 white dog b l a c k dog red dog white dog

5 15 10 20

Note here, we add white dogs twice by mistake, but the HashMap takes it. This does not make sense, because now we are confused how many white dogs are really there. The Dog class should be dened as follows:
c l a s s Dog { String color ; Dog ( S t r i n g c ) { color = c ; } public boolean e q u a l s ( O b j e c t o ) { r e t u r n ( ( Dog ) o ) . c o l o r == t h i s . c o l o r ; } public i n t hashCode ( ) { return color . length ( ) ; } public S t r i n g t o S t r i n g ( ) { r e t u r n c o l o r + " dog " ; } }

35.3. TREEMAP

122

Now the output is:


3 red dog 10 white dog 20 b l a c k dog 15

The reason is that HashMap doesnt allow two identical elements. By default, the hashCode() and equals() methods implemented in Object class are used. The default hashCode() method gives distinct integers for distinct objects, and the equals() method only returns true when two references refer to the same object. Check out the hashCode() and equals() contract if this is not obvious to you. Check out the most frequently used methods for HashMap, such as iteration, print, etc.

35.3

treemap

A TreeMap is sorted by keys. Lets rst take a look at the following example to understand the sorted by keys idea.
c l a s s Dog { String color ; Dog ( S t r i n g c ) { color = c ; } public boolean e q u a l s ( O b j e c t o ) { r e t u r n ( ( Dog ) o ) . c o l o r == t h i s . c o l o r ; } public i n t hashCode ( ) { return color . length ( ) ; } public S t r i n g t o S t r i n g ( ) { r e t u r n c o l o r + " dog " ; } } public c l a s s TestTreeMap { public s t a t i c void main ( S t r i n g [ ] a r g s ) {

35.3. TREEMAP

123

Dog Dog Dog Dog

d1 d2 d3 d4

= = = =

new new new new

Dog ( " red " ) ; Dog ( " b l a c k " ) ; Dog ( " white " ) ; Dog ( " white " ) ;

TreeMap<Dog , I n t e g e r > treeMap = new TreeMap<Dog , Integer >() ; treeMap . put ( d1 , 1 0 ) ; treeMap . put ( d2 , 1 5 ) ; treeMap . put ( d3 , 5 ) ; treeMap . put ( d4 , 2 0 ) ; f o r ( Entry <Dog , I n t e g e r > e n t r y : treeMap . e n t r y S e t () ) { System . out . p r i n t l n ( e n t r y . getKey ( ) + " " + e n t r y . getValue ( ) ) ; } } }

Output:
E x c e p t i o n i n t h r e a d " main " j a v a . lang . C l a s s C a s t E x c e p t i o n : c o l l e c t i o n . Dog cannot be c a s t t o j a v a . lang . Comparable a t j a v a . u t i l . TreeMap . put ( Unknown Source ) a t c o l l e c t i o n . TestHashMap . main ( TestHashMap . j a v a : 3 5 )

Since TreeMaps are sorted by keys, the object for key has to be able to compare with each other, thats why it has to implement Comparable interface. For example, you use String as key, because String implements Comparable interface. Lets change the Dog, and make it comparable.
c l a s s Dog implements Comparable<Dog>{ String color ; int size ; Dog ( S t r i n g c , i n t s ) { color = c ; size = s ; } public S t r i n g t o S t r i n g ( ) { r e t u r n c o l o r + " dog " ;

35.3. TREEMAP

124

} @Override public i n t compareTo ( Dog o ) { return o . si ze this . si ze ; } } public c l a s s TestTreeMap { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Dog d1 = new Dog ( " red " , 3 0 ) ; Dog d2 = new Dog ( " b l a c k " , 2 0 ) ; Dog d3 = new Dog ( " white " , 1 0 ) ; Dog d4 = new Dog ( " white " , 1 0 ) ; TreeMap<Dog , I n t e g e r > treeMap = new TreeMap<Dog , Integer >() ; treeMap . put ( d1 , 1 0 ) ; treeMap . put ( d2 , 1 5 ) ; treeMap . put ( d3 , 5 ) ; treeMap . put ( d4 , 2 0 ) ; f o r ( Entry <Dog , I n t e g e r > e n t r y : treeMap . e n t r y S e t () ) { System . out . p r i n t l n ( e n t r y . getKey ( ) + " " + e n t r y . getValue ( ) ) ; } } }

Output:
red dog 10 b l a c k dog 15 white dog 20

It is sorted by key, i.e., dog size in this case. If Dog d4 = new Dog(white, 10); is replaced with Dog d4 = new Dog(white, 40);, the output would be:
white dog 20 red dog 10 b l a c k dog 15

35.4. HASHTABLE white dog 5

125

The reason is that TreeMap now uses compareTo() method to compare keys. Different sizes make different dogs!

35.4

hashtable

From Java Doc: The HashMap class is roughly equivalent to Hashtable, except that it is unsynchronized and permits nulls.

35.5

linkedhashmap

LinkedHashMap is a subclass of HashMap. That means it inherits the features of HashMap. In addition, the linked list preserves the insertion-order. Lets replace the HashMap with LinkedHashMap using the same code used for HashMap.
c l a s s Dog { String color ; Dog ( S t r i n g c ) { color = c ; } public boolean e q u a l s ( O b j e c t o ) { r e t u r n ( ( Dog ) o ) . c o l o r == t h i s . c o l o r ; } public i n t hashCode ( ) { return color . length ( ) ; } public S t r i n g t o S t r i n g ( ) { r e t u r n c o l o r + " dog " ; } } public c l a s s TestHashMap {

35.5. LINKEDHASHMAP

126

public s t a t i c void main ( S t r i n g [ ] a r g s ) { Dog Dog Dog Dog d1 d2 d3 d4 = = = = new new new new Dog ( " red " ) ; Dog ( " b l a c k " ) ; Dog ( " white " ) ; Dog ( " white " ) ;

LinkedHashMap<Dog , I n t e g e r > linkedHashMap = new LinkedHashMap<Dog , I n t e g e r > ( ) ; linkedHashMap . put ( d1 , 1 0 ) ; linkedHashMap . put ( d2 , 1 5 ) ; linkedHashMap . put ( d3 , 5 ) ; linkedHashMap . put ( d4 , 2 0 ) ; f o r ( Entry <Dog , I n t e g e r > e n t r y : linkedHashMap . entrySet ( ) ) { System . out . p r i n t l n ( e n t r y . getKey ( ) + " " + e n t r y . getValue ( ) ) ; } } }

Output is:
red dog 10 b l a c k dog 15 white dog 20

The difference is that if we use HashMap the output could be the following - the insertion order is not preserved.
red dog 10 white dog 20 b l a c k dog 15

36
E F F I C I E N T C O U N T E R I N J AVA You may often use HashMap as a counter to understand the frequency of something from database or text. This articles compares the 3 different approaches to implement a counter by using HashMap. 36.1 the naive counter

If you use such a counter, your code may look like the following:
S t r i n g s = " one two t h r e e two t h r e e t h r e e " ; S t r i n g [ ] sArr = s . s p l i t ( " " ) ; / / naive approach HashMap< S t r i n g , I n t e g e r > c o u n t e r = new HashMap< S t r i n g , I n t e g e r > ( ) ; f o r ( S t r i n g a : sArr ) { i f ( c o u n t e r . containsKey ( a ) ) { i n t oldValue = c o u n t e r . g e t ( a ) ; c o u n t e r . put ( a , oldValue + 1 ) ; } else { c o u n t e r . put ( a , 1 ) ; } }

In each loop, you check if the key exists or not. If it does, increment the old value by 1, if not, set it to 1. This approach is simple and straightforward, but it is not the most efcient approach. This method is considered less efcient for the following reasons:

127

36.2. THE BETTER COUNTER

128

containsKey(), get() are called twice when a key already exists. That means searching the map twice. Since Integer is immutable, each loop will create a new one for increment the old value

36.2

the better counter

Naturally we want a mutable integer to avoid creating many Integer objects. A mutable integer class is dened as follows:
c l a s s MutableInteger { private int val ; public M u t a b l e I n t e g e r ( i n t v a l ) { this . val = val ; } public i n t g e t ( ) { return val ; } public void s e t ( i n t v a l ) { this . val = val ; } / / used to p r i n t value c o n v i n e n t l y public S t r i n g t o S t r i n g ( ) { return Integer . toString ( val ) ; } }

And the counter is improved and changed to the following:


HashMap< S t r i n g , MutableInteger > newCounter = new HashMap< S t r i n g , MutableInteger > ( ) ; f o r ( S t r i n g a : sArr ) { i f ( newCounter . containsKey ( a ) ) { M u t a b l e I n t e g e r oldValue = newCounter . g e t ( a ) ; oldValue . s e t ( oldValue . g e t ( ) + 1 ) ;

36.3. THE EFFICIENT COUNTER

129

} else { newCounter . put ( a , new M u t a b l e I n t e g e r ( 1 ) ) ; } }

This seems better because it does not require creating many Integer objects any longer. However, the search is still twice in each loop if a key exists.

36.3

the efficient counter

The HashMap.put(key, value) method returns the keys current value. This is useful, because we can use the reference of the old value to update the value without searching one more time!
HashMap< S t r i n g , MutableInteger > e f f i c i e n t C o u n t e r = new HashMap< S t r i n g , MutableInteger > ( ) ; f o r ( S t r i n g a : sArr ) { M u t a b l e I n t e g e r i n i t V a l u e = new M u t a b l e I n t e g e r ( 1 ) ; M u t a b l e I n t e g e r oldValue = e f f i c i e n t C o u n t e r . put ( a , initValue ) ; i f ( oldValue ! = n u l l ) { i n i t V a l u e . s e t ( oldValue . g e t ( ) + 1 ) ; } }

36.4

performance difference

To test the performance of the three different approaches, the following code is used. The performance test is on 1 million times. The raw results are as follows: Naive Approach : 222796000 Better Approach: 117283000 Efcient Approach: 96374000 The difference is signicant - 223 vs. 117 vs. 96. There is very much difference between Naive and Better, which indicates that creating objects are expensive!

36.4. PERFORMANCE DIFFERENCE

130

S t r i n g s = " one two t h r e e two t h r e e t h r e e " ; S t r i n g [ ] sArr = s . s p l i t ( " " ) ; long s t a r t T i m e = 0 ; long endTime = 0 ; long d u r a t i o n = 0 ; / / naive approach s t a r t T i m e = System . nanoTime ( ) ; HashMap< S t r i n g , I n t e g e r > c o u n t e r = new HashMap< S t r i n g , I n t e g e r > ( ) ; f o r ( i n t i = 0 ; i < 1 0 0 0 0 0 0 ; i ++) f o r ( S t r i n g a : sArr ) { i f ( c o u n t e r . containsKey ( a ) ) { i n t oldValue = c o u n t e r . g e t ( a ) ; c o u n t e r . put ( a , oldValue + 1 ) ; } else { c o u n t e r . put ( a , 1 ) ; } } endTime = System . nanoTime ( ) ; d u r a t i o n = endTime s t a r t T i m e ; System . out . p r i n t l n ( " Naive Approach :

" + duration ) ;

/ / b e t t e r approach s t a r t T i m e = System . nanoTime ( ) ; HashMap< S t r i n g , MutableInteger > newCounter = new HashMap< S t r i n g , MutableInteger > ( ) ; f o r ( i n t i = 0 ; i < 1 0 0 0 0 0 0 ; i ++) f o r ( S t r i n g a : sArr ) { i f ( newCounter . containsKey ( a ) ) { M u t a b l e I n t e g e r oldValue = newCounter . g e t ( a ); oldValue . s e t ( oldValue . g e t ( ) + 1 ) ; } else { newCounter . put ( a , new M u t a b l e I n t e g e r ( 1 ) ) ; } } endTime = System . nanoTime ( ) ; d u r a t i o n = endTime s t a r t T i m e ;

36.5. COMMENT FROM KEITH(FROM COMMENT LIST BELOW)

131

System . out . p r i n t l n ( " B e t t e r Approach : / / e f f i c i e n t approach s t a r t T i m e = System . nanoTime ( ) ;

" + duration ) ;

HashMap< S t r i n g , MutableInteger > e f f i c i e n t C o u n t e r = new HashMap< S t r i n g , MutableInteger > ( ) ; f o r ( i n t i = 0 ; i < 1 0 0 0 0 0 0 ; i ++) f o r ( S t r i n g a : sArr ) { M u t a b l e I n t e g e r i n i t V a l u e = new M u t a b l e I n t e g e r ( 1 ) ; M u t a b l e I n t e g e r oldValue = e f f i c i e n t C o u n t e r . put ( a , initValue ) ; i f ( oldValue ! = n u l l ) { i n i t V a l u e . s e t ( oldValue . g e t ( ) + 1 ) ; } } endTime = System . nanoTime ( ) ; d u r a t i o n = endTime s t a r t T i m e ; System . out . p r i n t l n ( " E f f i c i e n t Approach :

" + duration ) ;

When you use a counter, you probably also need a function to sort the map by value. You can check out the frequently used method of HashMap.

36.5

comment from keith ( from comment list below )

One of the best comments Ive ever received! Added a couple tests: 1) Refactored better approach to just call get instead of containsKey. Usually, the elements you want are in the HashMap so that reduces from two searches to one. 2) Added a test with AtomicInteger, which michal mentioned. 3) Compared to singleton int array, which uses less memory according to http://amzn.com/0748614079 I ran the test program 3x and took the min to remove variance from other programs. Note that you cant do this within the program or the results are affected too much, probably due to gc.

36.5. COMMENT FROM KEITH(FROM COMMENT LIST BELOW)

132

Naive: 201716122 Better Approach: 112259166 Efcient Approach: 93066471 Better Approach (without containsKey): 69578496 Better Approach (without containsKey, with AtomicInteger): 94313287 Better Approach (without containsKey, with int[]): 65877234 Better Approach (without containsKey):
HashMap< s t r i n g , m u t a b l e i n t e g e r = " " > e f f i c i e n t C o u n t e r 2 = new HashMap <string , mutableinteger=" " >() ; f o r ( i n t i = 0 ; i < NUM_ITERATIONS ; i ++) f o r ( S t r i n g a : sArr ) { M u t a b l e I n t e g e r value = e f f i c i e n t C o u n t e r 2 . g e t ( a ) ; i f ( value ! = n u l l ) { value . s e t ( value . g e t ( ) + 1 ) ; } else { e f f i c i e n t C o u n t e r 2 . put ( a , new M u t a b l e I n t e g e r ( 1 ) ) ; } }

Better Approach (without containsKey, with AtomicInteger):


HashMap< s t r i n g , a t o m i c i n t e g e r = " " > atomicCounter = new HashMap< string , atomicinteger=" " >() ; f o r ( i n t i = 0 ; i < NUM_ITERATIONS ; i ++) f o r ( S t r i n g a : sArr ) { AtomicInteger value = atomicCounter . g e t ( a ) ; i f ( value ! = n u l l ) { value . incrementAndGet ( ) ; } else { atomicCounter . put ( a , new AtomicInteger ( 1 ) ) ; } }

Better Approach (without containsKey, with int[]):


HashMap< s t r i n g , i n t [ ] = " " > i n t C o u n t e r = new HashMap< s t r i n g , i n t [ ] = " " >() ; f o r ( i n t i = 0 ; i < NUM_ITERATIONS ; i ++) f o r ( S t r i n g a : sArr ) { i n t [ ] valueWrapper = i n t C o u n t e r . g e t ( a ) ;

36.6. CONCLUSION SO FAR

133

i f ( valueWrapper == n u l l ) { i n t C o u n t e r . put ( a , new i n t [ ] { 1 } ) ; } else { valueWrapper [ 0 ] + + ; } }

Guavas MultiSet is probably faster still.

36.6

conclusion so far

The winner is the last one which uses int arrays.

37
F R E Q U E N T LY U S E D M E T H O D S O F J AVA H A S H M A P HashMap is very useful when a counter is required.
HashMap< S t r i n g , I n t e g e r > countMap = new HashMap< S t r i n g , I n t e g e r > ( ) ; / / . . . . a lot of a s l i k e the following i f ( countMap . keySet ( ) . c o n t a i n s ( a ) ) { countMap . put ( a , countMap . g e t ( a ) + 1 ) ; } else { countMap . put ( a , 1 ) ; }

37.1

loop through hashmap

I t e r a t o r i t = mp. e n t r y S e t ( ) . i t e r a t o r ( ) ; while ( i t . hasNext ( ) ) { Map . Entry p a i r s = (Map . Entry ) i t . n ex t ( ) ; System . out . p r i n t l n ( p a i r s . getKey ( ) + " = " + p a i r s . getValue ( ) ) ; } Map< I n t e g e r , I n t e g e r > map = new HashMap< I n t e g e r , I n t e g e r > ( ) ; f o r (Map . Entry < I n t e g e r , I n t e g e r > e n t r y : map . e n t r y S e t ( ) ) { System . out . p r i n t l n ( " Key = " + e n t r y . getKey ( ) + " , Value = " + e n t r y . getValue ( ) ) ; }

37.2

print hashmap

134

37.3. SORT HASHMAP BY VALUE

135

public s t a t i c void printMap (Map mp) { I t e r a t o r i t = mp. e n t r y S e t ( ) . i t e r a t o r ( ) ; while ( i t . hasNext ( ) ) { Map . Entry p a i r s = (Map . Entry ) i t . n ex t ( ) ; System . out . p r i n t l n ( p a i r s . getKey ( ) + " = " + p a i r s . getValue () ) ; i t . remove ( ) ; / / a v o i d s a C o n c u r r e n t M o d i f i c a t i o n E x c e p t i o n } }

37.3

sort hashmap by value

The following code example take advantage of a constructor of TreeMap here.


c l a s s ValueComparator implements Comparator< S t r i n g > { Map< S t r i n g , I n t e g e r > base ; public ValueComparator (Map< S t r i n g , I n t e g e r > base ) { t h i s . base = base ; } public i n t compare ( S t r i n g a , S t r i n g b ) { i f ( base . g e t ( a ) >= base . g e t ( b ) ) { r e t u r n 1; } else { return 1 ; } / / r e t u r n i n g 0 would merge k e y s } } HashMap< S t r i n g , I n t e g e r > countMap = new HashMap< S t r i n g , I n t e g e r > ( ) ; / / add a l o t o f e n t r i e s countMap . put ( " a " , 1 0 ) ; countMap . put ( " b " , 2 0 ) ; ValueComparator vc = new ValueComparator ( countMap ) ; TreeMap< S t r i n g , I n t e g e r > sortedMap = new TreeMap< S t r i n g , I n t e g e r >( vc ); sortedMap . p u t A l l ( countMap ) ;

37.3. SORT HASHMAP BY VALUE

136

printMap ( sortedMap ) ;

There are different ways of sorting HashMap, this way has been voted the most in stackoverow.

38
J AVA T Y P E E R A S U R E M E C H A N I S M Java Generics is a feature introduced from JDK 5. It allows us to use type parameter when dening class and interface. It is extensively used in Java Collection framework. The type erasure concept is one of the most confusing part about Generics. This article illustrates what it is and how to use it.

38.1

a common mistake

In the following example, the method accept accepts a list of Object as its parameter. In the main method, it is called by passing a list of String. Does this work?
public c l a s s Main { public s t a t i c void main ( S t r i n g [ ] a r g s ) throws IOException { A r r a y L i s t < S t r i n g > a l = new A r r a y L i s t < S t r i n g > ( ) ; a l . add ( " a " ) ; a l . add ( " b " ) ; accept ( al ) ; } public s t a t i c void a c c e p t ( A r r a y L i s t <Object > a l ) { for ( Object o : al ) System . out . p r i n t l n ( o ) ; } }

137

38.2. LIST<OBJECT >VS. LIST<STRING >

138

It seems ne since Object is a super type of String obviously. However, that will not work. Compilation will not pass, and give you an error at the line of accept(al);:
The method a c c e p t ( A r r a y L i s t < O b j e c t > ) i n t h e type Main i s not a p p l i c a b l e f o r t h e arguments ( A r r a y L i s t )

38.2

list < object > vs . list < string >

The reason is type erasure. REMEMBER: Java generics is implemented on the compilation level. The byte code generated from compiler does not contain type information of generic type for the run-time execution. After compilation, both List of Object and List of String become List, and the Object/String type is not visible for JVM. During compilation stage, compiler nds out that they are not the same, then gives a compilation error.

38.3

wildcards and bounded wildcards

List<? >- List can contain any type


public s t a t i c void main ( S t r i n g a r g s [ ] ) { A r r a y L i s t <Object > a l = new A r r a y L i s t <Object > ( ) ; a l . add ( " abc " ) ; test ( al ) ; } public s t a t i c void t e s t ( A r r a y L i s t <?> a l ) { f o r ( O b j e c t e : a l ) { / / no m a t t e r what t y p e , i t w i l l b e O b j e c t System . out . p r i n t l n ( e ) ; / / i n t h i s method , b e c a u s e we don t know what t y p e ? i s , we can n o t add a n y t h i n g t o a l . } }

Always remember that Generic is a concept of compile-time. In the example above, since we dont know ?, we can not add anything to al. To make it work, you can use wildcards.

38.4. COMPARISONS L i s t < O b j e c t > L i s t can c o n t a i n O b j e c t or i t s subtype L i s t < ? extends Number > L i s t can c o n t a i n Number or i t s subtypes . L i s t < ? super Number > L i s t can c o n t a i n Number or i t s s u p e r t y p e s .

139

38.4

comparisons

Now we know that ArrayList <String >is NOT a subtype of ArrayList <Object >. As a comparison, you should know that if two generic types have the same parameter, their inheritance relation is true for the types. For example, ArrayList <String >is subtype of Collection<String>. Arrays are different. They know and enforce their element types at runtime. This is called reication. For example, Object[] objArray is a super type of String[] strArr. If you try to store a String into an array of integer, youll get an ArrayStoreException during run-time.

39
W H Y D O W E N E E D G E N E R I C T Y P E S I N J AVA ? Generic types are extensively used in Java collections. The fundamental question is why we need Generic Types in Java? Understanding this question will help you better understand related concepts. I will use a short and simple example to show why Generic is introduced to Java.

39.1

overview of generics

The goal of implementing Generics is nding bugs in compilation stage, other than waiting for run-time. This can reduce a lot of time for debugging java program, because compile-time bugs are much easier to nd and x. From the beginning, we need to keep in mind that generic types only exist in compile-time. A lot of misunderstanding and confusion come from this.

39.2

what if there is no generics ?

In the following program, the Room class denes a member object. We can pass any object to it, such as String, Integer, etc.
c l a s s Room { private Object o b j e c t ; public void add ( O b j e c t o b j e c t ) { this . object = object ; }

140

39.3. WHEN GENERICS IS USED

141

public O b j e c t g e t ( ) { return o b j e c t ; } } public c l a s s Main { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Room room = new Room ( ) ; room . add ( 6 0 ) ; / / room . add ( " 6 0 " ) ; / / t h i s w i l l c a u s e a runt i m e error I n t e g e r i = ( I n t e g e r ) room . g e t ( ) ; System . out . p r i n t l n ( i ) ; } }

The program runs totally ne when we add an integer and cast it. But if a user accidentally add a string 60 to it, compiler does not know it is a problem. When the program is run, it will get a ClassCastException.
E x c e p t i o n i n t h r e a d " main " j a v a . lang . C l a s s C a s t E x c e p t i o n : j a v a . lang . S t r i n g cannot be c a s t t o j a v a . lang . I n t e g e r a t c o l l e c t i o n . Main . main ( Main . j a v a : 2 1 )

You may wonder why not just declare the eld type to be Integer instead of Object. If so, then the room is not so much useful because it can only store one type of thing.

39.3

when generics is used

If generic type is used, the program becomes the following.


c l a s s Room<T> { private T t ; public void add ( T t ) { this . t = t ; }

39.4. SUMMARY

142

public T g e t ( ) { return t ; } } public c l a s s Main { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Room< I n t e g e r > room = new Room< I n t e g e r > ( ) ; room . add ( 6 0 ) ; I n t e g e r i = room . g e t ( ) ; System . out . p r i n t l n ( i ) ; } }

Now if someone adds room.add(60), a compile-time error will be shown like the following:

We can easily see how this works. In addition, there is no need to cast the result any more from room.get() since compile knows get() will return an Integer.

39.4

summary

To sum up, the reasons to use Generics are as follows: Stronger type checking at compile time. Elimination of explicit cast. Enabling better code reusability such as implementation of generic algorithms

39.4. SUMMARY

143

Java Generic type is only a compile-time concept. During run-time, all types information is erased, and this is call Type Erasure. Here is an interesting example to show how to avoid the common mistakes of Type Erasure.

40
SET VS. SET<?> You may know that an unbounded wildcard Set<?>can hold elements of any type, and a raw type Set can also hold elements of any type. Then what is the difference between them? 40.1 two facts about set <?>

Item 1: Since the question mark ? stands for any type. Set<?>is capable of holding any type of elements. Item 2: Because we dont know the type of ?, we cant put any element into Set<?> So a Set<?>can hold any type of element(Item 1), but we cant put any element into it(Item 2). Do the two statements conict to each other? Of course they are not. This can be clearly illustrated by the following two examples: Item 1 means the following situation:
/ / L e g a l Code public s t a t i c void main ( S t r i n g [ ] a r g s ) { HashSet < I n t e g e r > s 1 = new HashSet < I n t e g e r >( Arrays . a s L i s t (1 , 2 , 3) ) ; p r i n t S e t ( s1 ) ; } public s t a t i c void p r i n t S e t ( Set <?> s ) { for ( Object o : s ) { System . out . p r i n t l n ( o ) ; } }

144

40.2. SET VS. SET<?>

145

Since Set<?>can hold any type of elements, we simply use Object in the loop. Item 2 means the following situation which is illegal:
/ / I l l e g a l Code public s t a t i c void p r i n t S e t ( Set <?> s ) { s . add ( 1 0 ) ; / / t h i s l i n e i s i l l e g a l for ( Object o : s ) { System . out . p r i n t l n ( o ) ; } }

Because we dont know the type of <?>exactly, we can not add any thing to it other than null. For the same reason, we can not initialize a set with Set<?>. The following is illegal:
/ / I l l e g a l Code Set <?> s e t = new HashSet <? >() ;

40.2

set vs . set <?>

Whats the difference between raw type Set and unbounded wildcard Set<?>? This method declaration is ne:
public s t a t i c void p r i n t S e t ( S e t s ) { s . add ( " 2 " ) ; for ( Object o : s ) { System . out . p r i n t l n ( o ) ; } }

because raw type has no restrictions. However, this will easily corrupt the invariant of collection. In brief, wildcard type is safe and the raw type is not. We can not put any element into a Set<?>.

40.3. WHEN SET<?>IS USEFUL?

146

40.3

when set <?> is useful ?

When you want to use a generic type, but you dont know or care what the actual type the parameter is, you can use <?>[1]. It can only be used as parameters. For example:
public s t a t i c void main ( S t r i n g [ ] a r g s ) { HashSet < I n t e g e r > s 1 = new HashSet < I n t e g e r >( Arrays . a s L i s t (1 ,2 ,3) ) ; HashSet < I n t e g e r > s 2 = new HashSet < I n t e g e r >( Arrays . a s L i s t (4 ,2 ,3) ) ; System . out . p r i n t l n ( getUnion ( s 1 , s 2 ) ) ; } public s t a t i c i n t getUnion ( Set <?> s 1 , Set <?> s 2 ) { i n t count = s 1 . s i z e ( ) ; for ( Object o : s2 ) { i f ( ! s1 . c o n t a i n s ( o ) ) { count ++; } } r e t u r n count ; }

41
H O W T O C O N V E R T A R R AY T O A R R AY L I S T I N J AVA ? This is a question that is worth to take a look for myself, because it is one of the top viewed and voted questions in stackoverow. The one who accidentally asks such a question could gain a lot of reputation which would enable him to do a lot of stuff on stackoverow. This does not make sense so much for me, but lets take a look at the question rst. The question asks how to convert the following array to an ArrayList.
Element [ ] a r r a y = { new Element ( 1 ) , new Element ( 2 ) , new Element ( 3 ) } ;

41.1

most popular and accepted answer

The most popular and the accepted answer is the following:


A r r a y L i s t <Element > a r r a y L i s t = new A r r a y L i s t <Element >( Arrays . a s L i s t ( array ) ) ;

First, lets take a look at the Java Doc for the constructor method of ArrayList. ArrayList - ArrayList(Collection c) Constructs a list containing the elements of the specied collection, in the order they are returned by the collections iterator. So what the constructor does is the following: 1. Convert the collection c to an array 2. Copy the array to ArrayLists own back array called elementData If the add() method is invoked NOW, the size of the elementData array is not large enough to home one more element. So it will be copied to a new larger array. As the code below indicates, the size grows 1.5 times of old array.

147

41.2. NEXT POPULAR ANSWER

148

public void e n s u r e C a p a c i t y ( i n t minCapacity ) { modCount++; i n t o l d C a p a c i t y = elementData . l e n g t h ; i f ( minCapacity > o l d C a p a c i t y ) { O b j e c t oldData [ ] = elementData ; i n t newCapacity = ( o l d C a p a c i t y 3 ) / 2 + 1 ; i f ( newCapacity < minCapacity ) newCapacity = minCapacity ; / / minCapacity i s usually c l o s e to size , so t h i s i s a win : elementData = Arrays . copyOf ( elementData , newCapacity ) ; } }

41.2

next popular answer

The next popular answer is:


L i s t <Element > l i s t = Arrays . a s L i s t ( a r r a y ) ;

It is not the best, because the size of the list returned from asList() is xed. We know ArrayList is essentially implemented as an array, and the list returned from asList() is a xed-size list backed by the original array. In this way, if add or remove elements from the returned list, an UnsupportedOperationException will be thrown.
l i s t . add ( new Element ( 4 ) ) ; E x c e p t i o n i n t h r e a d " main " j a v a . lang . C l a s s C a s t E x c e p t i o n : j a v a . u t i l . A r r a y s $ A r r a y L i s t cannot be c a s t t o j a v a . u t i l . A r r a y L i s t a t c o l l e c t i o n . ConvertArray . main ( ConvertArray . j a v a : 2 2 )

41.3

indications of the question

The problem is not hard, and kind of interesting. Every Java programmer knows ArrayList, it is simple but easy to make such a mistake. I guess that is why this question is so popular. If a similar question asked about a Java library in a specic domain, it would be less likely to become so popular.

41.3. INDICATIONS OF THE QUESTION

149

There are several answers that basically indicate the same solution. This is true for a lot of questions, I guess people just dont care, they like answering!

42
Y E T A N O T H E R J AVA PA S S E S B Y R E F E R E N C E O R B Y VA L U E ? This is a classic interview question which confuses novice Java developers. In this post I will use an example and some diagram to demonstrate that: Java is pass-by-value.

42.1

some definitions

Pass by value: make a copy in memory of the actual parameters value that is passed in. Pass by reference: pass a copy of the address of the actual parameter. Java is always pass-by-value. Primitive data types and object reference are just values.

42.2

passing primitive type variable

Since Java is pass-by-value, its not hard to understand the following code will not swap anything.
swap ( Type arg 1 , Type arg 2 ) { Type temp = arg 1 ; arg 1 = arg 2 ; arg 2 = temp ; }

150

42.3. PASSING OBJECT VARIABLE

151

42.3

passing object variable

Java manipulates objects by reference, and all object variables are references. However, Java doesnt pass method arguments by reference, but by value. Question is: why the member value of the object can get changed?

Code:
c l a s s Apple { public S t r i n g c o l o r = " red " ; } public c l a s s Main { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Apple apple = new Apple ( ) ; System . out . p r i n t l n ( apple . c o l o r ) ; changeApple ( apple ) ; System . out . p r i n t l n ( apple . c o l o r ) ; } public s t a t i c void changeApple ( Apple apple ) { apple . c o l o r = " green " ; } }

42.3. PASSING OBJECT VARIABLE

152

Since the orignal and copied reference refer the same object, the member value gets changed.

Output:
red green

43
J AVA R E F L E C T I O N T U T O R I A L What is reection, why is it useful, and how to use it? 43.1 what is reflection ?

Reection is commonly used by programs which require the ability to examine or modify the runtime behavior of applications running in the Java virtual machine. This concept is often mixed with introspection. The following are their denitions from Wiki: Introspection is the ability of a program to examine the type or properties of an object at runtime. Reection is the ability of a program to examine and modify the structure and behavior of an object at runtime. From their denitions, introspection is a subset of reection. Some languages support introspection, but do not support reection, e.g., C++.

153

43.2. WHY DO WE NEED REFLECTION?

154

Introspection Example: The instanceof operator determines whether an object belongs to a particular class.
i f ( o b j i n s t a n c e o f Dog ) { Dog d = ( Dog ) o b j ; d . bark ( ) ; }

Reection Example: The Class.forName() method returns the Class object associated with the class/interface with the given name(a string and full qualied name). The forName method causes the class with the name to be initialized.
/ / with r e f l e c t i o n Class <?> c = C l a s s . forName ( " c l a s s p a t h . and . classname " ) ; O b j e c t dog = c . newInstance ( ) ; Method m = c . getDeclaredMethod ( " bark " , new Class < ? > [ 0 ] ) ; m. invoke ( dog ) ;

In Java, reection is more about introspection, because you can not change structure of an object. There are some APIs to change accessibilities of methods and elds, but not structures.

43.2

why do we need reflection ?

Reection enables us to: Examine an objects class at runtime Construct an object for a class at runtime

43.3. HOW TO USE REFLECTION?

155

Examine a classs eld and method at runtime Invoke any method of an object at runtime Change accessibility ag of Constructor, Method and Field etc. Reection is the common approach of famework. For example, JUnit use reection to look through methods tagged with the @Test annotation, and then call those methods when running the unit test. (Here is a set of examples of how to use JUnit.) For web frameworks, product developers dene their own implementation of interfaces and classes and put is in the conguration les. Using reection, it can quickly dynamically initialize the classes required. For example, Spring uses bean conguration such as:
<bean i d= " someID " c l a s s = "com . programcreek . Foo " > <p r o p e r t y name= " someField " value= " someValue " /> </bean>

When the Spring context processes this <bean >element, it will use Class.forName(String) with the argument com.programcreek.Foo to instantiate that Class. It will then again use reection to get the appropriate setter for the <property >element and set its value to the specied value. The same mechanism is also used for Servlet web applications:
<servlet > < s e r v l e t name>someServlet </ s e r v l e t name> < s e r v l e t c l a s s >com . programcreek . W h y R e f l e c t i o n S e r v l e t </ s e r v l e t class > <servlet >

43.3

how to use reflection ?

How to use reection API can be shown by using a small set of typical code examples. Example 1: Get class name from object

43.3. HOW TO USE REFLECTION?

156

package m y r e f l e c t i o n ; import j a v a . lang . r e f l e c t . Method ; public c l a s s R e f l e c t i o n H e l l o W o r l d { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Foo f = new Foo ( ) ; System . out . p r i n t l n ( f . g e t C l a s s ( ) . getName ( ) ) ; } } c l a s s Foo { public void p r i n t ( ) { System . out . p r i n t l n ( " abc " ) ; } }

Output:
m y r e f l e c t i o n . Foo

Example 2: Invoke method on unknown object For the code example below, image the types of an object is unknown. By using reection, the code can use the object and nd out if the object has a method called print and then call it.
package m y r e f l e c t i o n ; import j a v a . lang . r e f l e c t . Method ; public c l a s s R e f l e c t i o n H e l l o W o r l d { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Foo f = new Foo ( ) ; Method method ; try { method = f . g e t C l a s s ( ) . getMethod ( " p r i n t " , new Class < ? > [ 0 ] ) ; method . invoke ( f ) ; } catch ( Exception e ) { e . printStackTrace ( ) ; } } }

43.3. HOW TO USE REFLECTION?

157

c l a s s Foo { public void p r i n t ( ) { System . out . p r i n t l n ( " abc " ) ; } } abc

Example 3: Create object from Class instance


package m y r e f l e c t i o n ; public c l a s s R e f l e c t i o n H e l l o W o r l d { public s t a t i c void main ( S t r i n g [ ] a r g s ) { / / create instance of " Class " Class <?> c = n u l l ; try { c= C l a s s . forName ( " m y r e f l e c t i o n . Foo " ) ; } catch ( Exception e ) { e . printStackTrace ( ) ; } / / c r e a t e i n s t a n c e o f " Foo " Foo f = n u l l ; try { f = ( Foo ) c . newInstance ( ) ; } catch ( Exception e ) { e . printStackTrace ( ) ; } f . print ( ) ; } } c l a s s Foo { public void p r i n t ( ) { System . out . p r i n t l n ( " abc " ) ; } }

Example 4: Get constructor and create instance


package m y r e f l e c t i o n ;

43.3. HOW TO USE REFLECTION?

158

import j a v a . lang . r e f l e c t . C o n s t r u c t o r ; public c l a s s R e f l e c t i o n H e l l o W o r l d { public s t a t i c void main ( S t r i n g [ ] a r g s ) { / / create instance of " Class " Class <?> c = n u l l ; try { c= C l a s s . forName ( " m y r e f l e c t i o n . Foo " ) ; } catch ( Exception e ) { e . printStackTrace ( ) ; } / / c r e a t e i n s t a n c e o f " Foo " Foo f 1 = n u l l ; Foo f 2 = n u l l ; / / get a l l constructors C o n s t r u c t o r <?> cons [ ] = c . g e t C o n s t r u c t o r s ( ) ; try { f 1 = ( Foo ) cons [ 0 ] . newInstance ( ) ; f 2 = ( Foo ) cons [ 1 ] . newInstance ( " abc " ) ; } catch ( Exception e ) { e . printStackTrace ( ) ; } f1 . print ( ) ; f2 . print ( ) ; } } c l a s s Foo { String s ; public Foo ( ) { } public Foo ( S t r i n g s ) { t h i s . s=s ; } public void p r i n t ( ) { System . out . p r i n t l n ( s ) ;

43.3. HOW TO USE REFLECTION?

159

} }

Output:
null abc

In addition, you can use Class instance to get implemented interfaces, super class, declared eld, etc. Example 5: Change array size though reection
package m y r e f l e c t i o n ; import j a v a . lang . r e f l e c t . Array ; public c l a s s R e f l e c t i o n H e l l o W o r l d { public s t a t i c void main ( S t r i n g [ ] a r g s ) { int [ ] intArray = { 1 , 2 , 3 , 4 , 5 } ; i n t [ ] newIntArray = ( i n t [ ] ) changeArraySize ( intArray , 1 0 ) ; p r i n t ( newIntArray ) ; String [ ] a t r = { " a " , "b" , " c " , "d" , " e " } ; S t r i n g [ ] s t r 1 = ( S t r i n g [ ] ) changeArraySize ( a t r , 10) ; print ( str1 ) ; } / / change array s i z e public s t a t i c O b j e c t changeArraySize ( O b j e c t obj , i n t l e n ) { Class <?> a r r = o b j . g e t C l a s s ( ) . getComponentType ( ) ; O b j e c t newArray = Array . newInstance ( a r r , l e n ) ; / / do a r r a y c o p y i n t co = Array . getLength ( o b j ) ; System . arraycopy ( obj , 0 , newArray , 0 , co ) ; r e t u r n newArray ; } // print public s t a t i c void p r i n t ( O b j e c t o b j ) {

43.4. SUMMARY

160

Class <?> c = o b j . g e t C l a s s ( ) ; i f ( ! c . isArray ( ) ) { return ; } System . out . p r i n t l n ( " \nArray l e n g t h : " + Array . getLength ( o b j ) ) ; f o r ( i n t i = 0 ; i < Array . getLength ( o b j ) ; i ++) { System . out . p r i n t ( Array . g e t ( obj , i ) + " " ) ; } } }

Output:
Array 1 2 3 Array a b c l e n g t h : 10 4 5 0 0 0 0 0 l e n g t h : 10 d e null null null null null

43.4

summary

The above code examples shows a very small set of functions provided by Java reection. Reading those examples may only give you a taste of Java reection, you may want to Read more information on Oracle website.

44
H O W T O D E S I G N A J AVA F R A M E W O R K ? - A S I M P L E EXAMPLE You may be curious about how framework works? A simple framework example will be made here to demonstrate the idea of frameworks. 44.1 goal of a framework

First of all, why do we need a framework other than just a normal library? The goal of framework is dening a process which let developers implement certain functions based on individual requirements. In other words, framework denes the skeleton and developers ll in the ash when using it.

44.2

the simplest framework

In the following example, the rst 3 classes are dened as a part of framework and the 4th class is the client code of the framework. Main.java is the entry point of the framework. This can not be changed.
/ / i m a g i n e t h i s i s t h e e n t r y p o i n t f o r a f r a m e w o r k , i t can n o t b e changed public c l a s s Main { public s t a t i c void main ( S t r i n g [ ] a r g s ) { Human h = new Human( new Walk ( ) ) ; h . doMove ( ) ; } }

161

44.2. THE SIMPLEST FRAMEWORK

162

Move.java is the Hook. A hook is where developers can dene / extend functions based on their own requirements.
public a b s t r a c t c l a s s Move { public a b s t r a c t void a c t i o n ( ) ; }

Human.java is the Template, which reects the idea of how the framework works.
public c l a s s Human { p r i v a t e Move move ; public Human( Move m) { t h i s . move = m; } public void doMove ( ) { t h i s . move . a c t i o n ( ) ; } }

This simple framework allows and requires developers to extend Move class. Actually, in this simple framework, action() method is the only thing developers are able to change. Inside of the implementation, different action can be programmed to different purpose. E.g. the example below print 5 miles per hour, of course, you can redene it as 50 miles per hour.
public c l a s s Walk extends Move { @Override public void a c t i o n ( ) { / / TODO Auto g e n e r a t e d method s t u b System . out . p r i n t l n ( " 5 m i l e s per hour i t i s slow ! "); } }

44.3. CONCLUSION

163

44.3

conclusion

The example here just shows how a simple Template and Hook works. A real framework is more complicated than this. Not only does it contain other relations like template-temple relation, but also very complex process about how to efciently improve performance and programming usability.

45
W H Y D O W E N E E D J AVA W E B F R A M E W O R K S L I K E S T R U T S 2? There are various kinds of Java web frameworks, such as Spring MVC, JavaServer Faces, Struts 2, etc. For a newbie programmer, there is an exponential learning curve. Why do I need Java web frameworks like Struts 2? This question can be answered by starting from answering how the Servlet API works. Here is a post which contains code about how to simply program with Servlet API. You would never use this to really program a large project, but its good to take a look how it looks like originally. Here is a simple Servlet which process request from client and generate response html.
import import import import import import import j a v a . i o . IOException ; java . io . PrintWriter ; javax . s e r v l e t . ServletConfig ; javax . s e r v l e t . ServletException ; javax . s e r v l e t . http . HttpServlet ; javax . s e r v l e t . http . HttpServletRequest ; javax . s e r v l e t . http . HttpServletResponse ;

public c l a s s WelcomeServlet extends H t t p S e r v l e t { @Override public void i n i t ( S e r v l e t C o n f i g c o n f i g ) throws ServletException { super . i n i t ( c o n f i g ) ; }

164

165

p r o t e c t e d void doPost ( H t t p S e r v l e t R e q u e s t r e q u e s t , H t t p S e r v l e t R e s p o n s e response ) throws S e r v l e t E x c e p t i o n , IOException { / / Get t h e v a l u e o f f o r m p a r a m e t e r S t r i n g name = r e q u e s t . getParameter ( " name " ) ; S t r i n g welcomeMessage = " Welcome " +name ; / / S e t t h e c o n t e n t t y p e (MIME Type ) o f t h e r e s p o n s e . response . setContentType ( " t e x t /html " ) ; P r i n t W r i t e r out = response . g e t W r i t e r ( ) ; / / W r i t e t h e HTML t o t h e r e s p o n s e out . p r i n t l n ( " <html > " ) ; out . p r i n t l n ( " <head> " ) ; out . p r i n t l n ( " < t i t l e > A very simple s e r v l e t example </ t i t l e > " ) ; out . p r i n t l n ( " </head> " ) ; out . p r i n t l n ( " <body> " ) ; out . p r i n t l n ( " <h1 > " +welcomeMessage+ " </h1 > " ) ; out . p r i n t l n ( " <a h r e f = " / s e r v l e t e x a m p l e /pages/form . html " > " + " C l i c k here t o go back t o i np ut page " + " </a> " ) ; out . p r i n t l n ( " </body> " ) ; out . p r i n t l n ( " </html > " ) ; out . c l o s e ( ) ; }

public void d e s t r o y ( ) { } }

This is very simple, real usage wont be easy like this. A real servlet has more work to do as summarized below: Binding request parameters to Java types. String name = request.getParameter("name"); Validating data. E.g. There should not be numbers in peoples name.

166

Making calls to business logic. E.g. Process the name for some purposes. Communicate with the data layer. E.g. Store user data. Rendering presentation layer (HTML, and so on). E.g. Return results for client browser. Of course, we can do all of those by ourselves, which is totally possible. However, that would take a lot of time. And very often, those functions are common features which can be implemented in some certain approach. Struts 2 is such an approach. It provides a standard way to implement those common functions following MVC design patterns. Here is my previous post about a simple Struts2 application.

46
J V M R U N - T I M E D ATA A R E A S This is my note of reading JVM specication. I draw a diagram which helps me understand.

46.1

data areas for each individual thread ( not shared )

Data Areas for each individual thread include program counter register, JVM Stack, and Native Method Stack. They are all created when a new thread is created.

167

46.2. DATA AREAS SHARED BY ALL THREADS

168

A Program Counter Register: it is used to control each execution of each thread. A JVM Stack: It contains frames which is demonstrated in the diagram below. Native Method Stack: it is used to support native methods, i.e., non-Java language A methods. A

46.2

data areas shared by all threads

All threads share Heap and Method Area. Heap: it is the area that we most frequently deal with. It stores arrays and objects, created when JVM starts up. Garbage Collection works in this area. Method Area: it stores run-time constant pool, eld and method data, and meth A C A ods and constructors codeA Runtime Constant Pool: It is a per-class or per-interface run-time representation of the constant_pool table in a class le. It contains several kinds of constants, ranging from numeric literals known at compile-time to method and eld references that must be resolved at run-time.

46.2. DATA AREAS SHARED BY ALL THREADS

169

Stack contains Frames, and a frame is pushed to the stack when a method is invoked. A frame contains local variable array, Operand Stack, Reference to Constant Pool. For more information, please go to the ofcal JVM specication site.

47
H O W D O E S J AVA H A N D L E A L I A S I N G ? 47.1 what is java aliasing ? Aliasing means there are multiple aliases to a location that can be updated, and these aliases have different types. In the following example, a and b are two variable names that have two different types A and B. B extends A.
B [ ] b = new B [ 1 0 ] ; A[ ] a = b ; a [ 0 ] = new A( ) ; b [ 0 ] . methodParent ( ) ;

In memory, they both refer to the same location.

The pointed memory location are pointed by both a and b. During run-time, the actual object stored determines which method to call.

170

47.2. HOW DOES JAVA HANDLE ALIASING PROBLEM?

171

47.2

how does java handle aliasing problem ?

If you copy this code to your eclipse, there will be no compilation errors.
class A { public void methodParent ( ) { System . out . p r i n t l n ( " method i n P a r e n t " ) ; } } c l a s s B extends A { public void methodParent ( ) { System . out . p r i n t l n ( " o v e r r i d e method i n Child " ) ; } public void methodChild ( ) { System . out . p r i n t l n ( " method i n Child " ) ; } } public c l a s s Main { public s t a t i c void main ( S t r i n g [ ] a r g s ) { B [ ] b = new B [ 1 0 ] ; A[ ] a = b ; a [ 0 ] = new A( ) ; b [ 0 ] . methodParent ( ) ; } }

But if you run the code, the output would be:


E x c e p t i o n i n t h r e a d " main " j a v a . lang . A r r a y S t o r e E x c e p t i o n : a l i a s i n g t e s t .A a t a l i a s i n g t e s t . Main . main ( Main . j a v a : 2 6 )

The reason is that Java handles aliasing during run-time. During run-time, it knows that the rst element should be a B object, instead of A. Therefore, it only runs correctly if it is changed to:
B [ ] b = new B [ 1 0 ] ;

47.2. HOW DOES JAVA HANDLE ALIASING PROBLEM?

172

A[ ] a = b ; a [ 0 ] = new B ( ) ; b [ 0 ] . methodParent ( ) ;

and the output is:


o v e r r i d e method i n Child

48
W H AT D O E S A J AVA A R R AY L O O K L I K E I N M E M O R Y ? e) Arrays in Java store one of two things: either primitive values (int, char, A or references (a.k.a pointers). When an object is creating by using new, memory is allocated on the heap and a reference is returned. This is also true for arrays, since arrays are objects.

48.1

single - dimension array

i n t a r r [ ] = new i n t [ 3 ] ;

The int[] arr is just the reference to the array of 3 integer. If you create an array with 10 integer, it is the same - an array is allocated and a reference is returned.

48.2

two - dimensional array

How about 2-dimensional array? Actually, we can only have one dimensional arrays in Java. 2D arrays are basically just one dimensional arrays of one dimensional arrays.
i n t [ ] [ ] a r r = new i n t [ 3 ] [ ] ;

173

48.3. WHERE ARE THEY LOCATED IN MEMORY?

174

a r r [ 0 ] = new i n t [ 3 ] ; a r r [ 1 ] = new i n t [ 5 ] ; a r r [ 2 ] = new i n t [ 4 ] ;

Multi-dimensional arrays use the name rules.

48.3

where are they located in memory ?

Arrays are also objects in Java, so how an object looks like in memory applies to an array. As we know that JVM runtime data areas include heap, JVM stack, and others. For a simple example as follows, lets see where the array and its reference are stored.
class A { int x ; int y ; } ... public void m1 ( ) { int i = 0;

48.3. WHERE ARE THEY LOCATED IN MEMORY?

175

m2 ( ) ; } public void m2 ( ) { A a = new A( ) ; } ...

With the above declaration, lets invoke m1() and see what happens: When m1 is invoked, a new frame (Frame-1) is pushed into the stack, and local variable i is also created in Frame-1. Then m2 is invoked inside of m1, another new frame (Frame-2) is pushed into the stack. In m2, an object of class A is created in the heap and reference variable is put in Frame-2. Now, at this point, the stack and heap looks like the following:

Arrays are treated the same way like objects, so how array locates in memory is straight-forward.

49
T H E I N T R O D U C T I O N O F M E M O RY L E A K S One of the most signicant advantages of Java is its memory management. You simply create objects and Java Garbage Collector takes care of allocating and freeing memory. However, the situation is not as simple as that, because memory leaks frequently occur in Java applications. This tutorial illustrates what is memory leak, why it happens, and how to prevent them.

49.1

what are memory leaks ?

Denition of Memory Leak: objects are no longer being used by the application, but Garbage Collector can not remove them because they are being referenced. To understand this denition, we need to understand objects status in memory. The following diagram illustrates what is unused and what is unreferenced.

176

49.2. WHY MEMORY LEAKS HAPPEN?

177

From the diagram, there are referenced objects and unreferenced objects. Unreferenced objects will be garbage collected, while referenced objects will not be garbage collected. Unreferenced objects are surely unused, because no other objects refer to it. However, unused objects are not all unreferenced. Some of them are being referenced! Thats where the memory leaks come from.

49.2

why memory leaks happen ?

Lets take a look at the following example and see why memory leaks happen. In the example below, object A refers to object B. As lifetime (t1 - t4) is much longer than Bs (t2 - t3). When B is no longer being used in the application, A still holds a reference to it. In this way, Garbage Collector can not remove B from memory. This would possibly cause out of memory problem, because if A does the same thing for more objects, then there would be a lot of objects that are uncollected and consume memory space. It is also possible that B hold a bunch of references of other objects. Those objects referenced by B will not get collected either. All those unused objects will consume precious memory space.

49.3. HOW TO PREVENT MEMORY LEAKS?

178

49.3

how to prevent memory leaks ?

The following are some quick hands-on tips for preventing memory leaks. Pay attention to Collection classes, such as HashMap, ArrayList, etc., as they are common places to nd memory leaks. When they are declared static, their life time is the same as the life time of the application. Pay attention to event listeners and callbacks. A memory leak may occur if a listener is registered but not unregistered when the class is not being used any longer. If a class manages its own memory, the programer should be alert for memory leaks.[1] Often times member variables of an object that point to other objects need to be null out.

49.4

a little quiz : why substring () method in jdk 6 can cause mem ory leaks ?

To answer this question, you may want to read Substring() in JDK 6 and 7.

50
W H AT I S S E R V L E T C O N TA I N E R ? In this post, I write a little bit about the basic ideas of web server, Servlet container and its relation with JVM. I want to show that Servlet container is nothing more than a Java program. 50.1 what is a web server ?

To know what is a Servlet container, we need to know what is a Web Server rst.

A web server uses HTTP protocol to transfer data. In a simple situation, a user type in a URL (e.g. www.programcreek.com/static.html) in browser (a client), and get a web page to read. So what the server does is sending a web page to the client. The transformation is in HTTP protocol which species the format of request and response message.

179

50.2. WHAT IS A SERVLET CONTAINER?

180

50.2

what is a servlet container ?

As we see here, the user/client can only request static webpage from the server. This is not good enough, if the user wants to read the web page based on his input. The basic idea of Servlet container is using Java to dynamically generate the web page on the server side. So servlet container is essentially a part of a web server that interacts with the servlets.

Servlet container is the container for Servlets.

50.3

what is a servlet ?

Servlet is an interface dened in javax.servlet package. It declares three essential methods for the life cycle of a servlet - init(), service(), and destroy(). They are implemented by every servlet(dened in SDK or self-dened) and are invoked at specic times by the server. The init() method is invoked during initialization stage of the servlet life cycle. It is passed an object implementing the javax.servlet.ServletCong interface, which allows the servlet to access initialization parameters from the web application. The service() method is invoked upon each request after its initialization. Each request is serviced in its own separate thread. The web container calls the service() method of the servlet for every request. The service() method determines the kind of request being made and dispatches it to an appropriate method to handle the request.

50.4. HOW SERVLET CONTAINER AND WEB SERVER PROCESS A REQUEST?

181

The destroy() method is invoked when the servlet object should be destroyed. It releases the resources being held. From the life cycle of a servlet object, we can see that servlet classes are loaded to container by class loader dynamically. Each request is in its own thread, and a servlet object can serve multiple threads at the same time(thread not safe). When it is no longer being used, it should be garbage collected by JVM. Like any Java program, the servlet runs within a JVM. To handle the complexity of HTTP requests, the servlet container comes in. The servlet container is responsible for servlets creation, execution and destruction.

50.4

how servlet container and web server process a request ?

Web server receives HTTP request Web server forwards the request to servlet container The servlet is dynamically retrieved and loaded into the address space of the container, if it is not in the container. The container invokes the init() method of the servlet for initialization(invoked once when the servlet is loaded rst time) The container invokes the service() method of the servlet to process the HTTP request, i.e., read data in the request and formulate a response. The servlet remains in the containers address space and can process other HTTP requests. Web server return the dynamically generated results to the correct location The six steps are marked on the following diagram:

50.5. THE ROLE OF JVM

182

50.5

the role of jvm

Using servlets allows the JVM to handle each request within a separate Java thread, and this is one of the key advantage of Servlet container. Each servlet is a Java class with special elements responding to HTTP requests. The main function of Servlet contain is to forward requests to correct servlet for processing, and return the dynamically generated results to the correct location after the JVM has processed them. In most cases servlet container runs in a single JVM, but there are solutions when container need multiple JVMs.

51
W H AT I S A S P E C T - O R I E N T E D P R O G R A M M I N G ? What is Aspect-Oriented Programming(AOP)? By using the diagram below, the concept can be understood in a few seconds. 51.1 the cross - cutting concerns problem

First take a took at the diagram below, and think about what could be the problem.

183

51.1. THE CROSS-CUTTING CONCERNS PROBLEM

184

1. Code tangling: the logging code is mixed with business logic. 2. Code scattering: caused by identical code put in every module. The logging function is a called cross-cutting concern. That is, a function that is used in many other modules, such as authentication, logging, performance, error checking, data persistence, storage management, to name just a few. By using Object-Oriented Programming (OOP), we can dene low coupling and high cohesion system. However, when it comes to cross-cutting concerns, it does not handle it well for the reason that it does not relation between handle core concerns and cross-cutting concerns.

51.2. SOLUTION FROM AOP

185

51.2

solution from aop

52
L I B R A RY V S . F R A M E W O R K ? What is the difference between a Java Library and a framework? The two concepts are important but sometimes confusing for Java developers. 52.1 key difference and definition of library and framework

The key difference between a library and a framework is Inversion of Control. When you call a method from a library, you are in control. But with a framework, the control is inverted: the framework calls you.

A library is just a collection of class denitions. The reason behind is simply code reuse, i.e. get the code that has already been written by other developers. The classes and methods normally dene specic operations in a domain specic area. For example, there are some libraries of mathematics which can let developer just call the function without redo the implementation of how an algorithm works.

186

52.2. THEIR RELATION

187

In framework, all the control ow is already there, and theres a bunch of predened white spots that you should ll out with your code. A framework is normally more complex. It denes a skeleton where the application denes its own features to ll out the skeleton. In this way, your code will be called by the framework when appropriately. The benet is that developers do not need to worry about if a design is good or not, but just about implementing domain specic functions.

52.2

their relation

Both of them dened API, which is used for programmers to use. To put those together, we can think of a library as a certain function of an application, a framework as the skeleton of the application, and an API is connector to put those together. A typical development process normally starts with a framework, and ll out functions dened in libraries through API.

52.3

examples

1. How to make a Java library?? 2. How to design a framework??

53
J AVA A N D C O M P U T E R S C I E N C E C O U R S E S A good programmer does not only know how to program a task, but also knows why it is done that way and how to do it efciently. Indeed, we can nd almost any code by using Google, knowing why it is done that way is much more difcult than knowing how to do it, especially when something goes wrong. To understand Java design principles behind, Computer Science(CS) courses are helpful. Here is the diagram showing the relation between Java and Operating System, Networks, Articial Intelligence, Compiler, Algorithm, and Logic.

188

189

54
H O W J AVA C O M P I L E R G E N E R AT E C O D E F O R O V E R L O A D E D AND OVERRIDDEN METHODS? Here is a simple Java example showing Polymorphism: overloading and overriding. Polymorphism means that functions assume different forms at different times. In case of compile time it is called function overloading. Overloading allows related methods to be accessed by use of a common name. It is sometimes called ad hoc polymorphism, as opposed to the parametric polymorphism.
class A { public void M( i n t i ) { System . out . p r i n t l n ( " i n t " ) ; } public void M( S t r i n g s ) { / / t h i s i s an o v e r l o a d i n g method System . out . p r i n t l n ( " s t r i n g " ) ; } } c l a s s B extends A{ public void M( i n t i ) { / / t h i s i s o v e r r i d i n g method System . out . p r i n t l n ( " o v e r r i d e n i n t " ) ; } } public s t a t i c void main ( S t r i n g [ ] a r g s ) { A a = new A( ) ; a .M( 1 ) ;

190

191

a .M( " abc " ) ; A b = new B ( ) ; b .M( 1 2 3 4 ) ; }

From the compiler perspective, how is code generated for the correct function calls? Static overloading is not hard to implement. When processing the declaration of an overloaded function, a new binding maps it to a different implementation. During the type checking process, compiler analyzes the parameters real type to determine which function to use. Dynamic overloading allows different implementations of a function to be chosen on the run-time type of an actual parameter. It is a form of dynamic dispatch. Dynamic dispatch is also used to implement method overriding. The overridden method are determined by real object type during run-time. To understand dynamic dispatch, there is a post about object layout in memory.

55
T O P 1 0 M E T H O D S F O R J AVA A R R AY S The following are top 10 methods for Java Array. They are the most voted questions from stackoverow. 55.1 declare an array

S t r i n g [ ] aArray = new S t r i n g [ 5 ] ; S t r i n g [ ] bArray = { " a " , " b " , " c " , " d " , " e " } ; S t r i n g [ ] cArray = new S t r i n g [ ] { " a " , " b " , " c " , " d " , " e " } ;

55.2

print an array in java

int [ ] intArray = { 1 , 2 , 3 , 4 , 5 } ; S t r i n g i n t A r r a y S t r i n g = Arrays . t o S t r i n g ( i n t A r r a y ) ; / / print d i r e c t l y will print r e f e r e n c e value System . out . p r i n t l n ( i n t A r r a y ) ; / / [ I@ 7150 bd 4 d System . out . p r i n t l n ( i n t A r r a y S t r i n g ) ; / / [1 , 2 , 3 , 4 , 5]

55.3

create an arraylist from an array

String [ ] stringArray = { " a " , "b" , " c " , "d" , " e " } ; A r r a y L i s t < S t r i n g > a r r a y L i s t = new A r r a y L i s t < S t r i n g >( Arrays . a s L i s t ( stringArray ) ) ;

192

55.4. CHECK IF AN ARRAY CONTAINS A CERTAIN VALUE

193

System . out . p r i n t l n ( a r r a y L i s t ) ; // [a , b , c , d , e]

55.4

check if an array contains a certain value

String [ ] stringArray = { " a " , "b" , " c " , "d" , " e " } ; boolean b = Arrays . a s L i s t ( s t r i n g A r r a y ) . c o n t a i n s ( " a " ) ; System . out . p r i n t l n ( b ) ; / / true

55.5

concatenate two arrays

int [ ] intArray = { 1 , 2 , 3 , 4 , 5 } ; i n t [ ] i n t A r r a y 2 = { 6 , 7 , 8 , 9 , 10 } ; / / Apache Commons Lang l i b r a r y i n t [ ] combinedIntArray = A r r a y U t i l s . addAll ( intArray , i n t A r r a y 2 ) ;

55.6

declare an array inline

method ( new S t r i n g [ ] { " a " , " b " , " c " , " d " , " e " } ) ;

55.7

joins the elements of the provided array into a single string

/ / containing the provided l i s t of elements / / Apache common l a n g S t r i n g j = S t r i n g U t i l s . j o i n ( new S t r i n g [ ] { " a " , " b " , " c " } , " , " ) ; System . out . p r i n t l n ( j ) ; // a , b , c

55.8

covnert an arraylist to an array

String [ ] stringArray = { " a " , "b" , " c " , "d" , " e " } ; A r r a y L i s t < S t r i n g > a r r a y L i s t = new A r r a y L i s t < S t r i n g >( Arrays . a s L i s t ( stringArray ) ) ; S t r i n g [ ] s t r i n g A r r = new S t r i n g [ a r r a y L i s t . s i z e ( ) ] ; a r r a y L i s t . toArray ( s t r i n g A r r ) ; for ( String s : stringArr ) System . out . p r i n t l n ( s ) ;

55.9. CONVERT AN ARRAY TO A SET

194

55.9

convert an array to a set

Set < S t r i n g > s e t = new HashSet < S t r i n g >( Arrays . a s L i s t ( s t r i n g A r r a y ) ) ; System . out . p r i n t l n ( s e t ) ; // [d , e , b , c , a]

55.10

reverse an array

int [ ] intArray = { 1 , 2 , 3 , 4 , 5 } ; ArrayUtils . reverse ( intArray ) ; System . out . p r i n t l n ( Arrays . t o S t r i n g ( i n t A r r a y ) ) ; / / [5 , 4 , 3 , 2 , 1]

55.11

. remove element of an array

int [ ] intArray = { 1 , 2 , 3 , 4 , 5 } ; i n t [ ] removed = A r r a y U t i l s . removeElement ( intArray , 3 ) ; / / c r e a t e a new a r r a y System . out . p r i n t l n ( Arrays . t o S t r i n g ( removed ) ) ;

55.12

one more - convert int to byte array

byte [ ] b y t e s = B y t e B u f f e r . a l l o c a t e ( 4 ) . p u t I n t ( 8 ) . a r r a y ( ) ; f o r ( byte t : b y t e s ) { System . out . format ( " 0 x%x " , t ) ; }

56
T O P 1 0 Q U E S T I O N S O F J AVA S T R I N G S The following are top 10 frequently asked questions about Java Strings. 56.1 how to compare strings ? use == or use equals ()?

In brief, == tests if references are equal and equals() tests if values are equal. Unless you want to check if two strings are the same object, you should always use equals(). It would be better if you know the concept of string interning.

56.2

why is char [] preferred over string for security sensitive in formation ?

Strings are immutable, which means once they are created, they will stay unchanged until Garbage Collector kicks in. With an array, you can explicitly change its elements. In this way, security sensitive information(e.g. password) will not be present anywhere in the system.

56.3

can we use string for switch statement ?

Yes to version 7. From JDK 7, we can use string as switch condition. Before version 6, we can not use string as switch condition.
/ / java 7 only !

195

56.4. HOW TO CONVERT STRING TO INT?

196

switch ( s t r . toLowerCase ( ) ) { case " a " : value = 1 ; break ; case " b " : value = 2 ; break ; }

56.4

how to convert string to int ?

i n t n = I n t e g e r . p a r s e I n t ( " 10 " ) ;

Simple, but so frequently used and sometimes ignored. 56.5 how to split a string with white space characters ?

We can simple do split using regular expression. stands for white space characters such as , , , .
S t r i n g [ ] s t r A r r a y = a S t r i n g . s p l i t ( " \\s+ " ) ;

56.6

what substring () method really does ?

In JDK 6, the substring() method gives a window to an array of chars which represents the existing String, but do not create a new one. To create a new string represented by a new char array, you can do add an empty string like the following:
s t r . s u b s t r i n g (m, n ) + " "

This will create a new char array that represents the new string. The above approach sometimes can make your code faster, because Garbage Collector can collect the unused large string and keep only the sub string. In Oracle JDK 7, substring() creates a new char array, not uses the existing one. Check out the diagram for showing substring() difference between JDK 6 and JDK 7.

56.7. STRING VS STRINGBUILDER VS STRINGBUFFER

197

56.7

string vs stringbuilder vs stringbuffer

String vs StringBuilder: StringBuilder is mutable, which means you can modify it after its creation. StringBuilder vs StringBuffer: StringBuffer is synchronized, which means it is thread-safe but slower than StringBuilder.

56.8

how to repeat a string ?

In Python, we can just multiply a number to repeat a string. In Java, we can use the repeat() method of StringUtils from Apache Commons Lang package.
S t r i n g s t r = " abcd " ; String repeated = S t r i n g U t i l s . repeat ( str , 3 ) ; / / abcdabcdabcd

56.9

how to convert string to date ?

S t r i n g s t r = " Sep 1 7 , 2013 " ; Date date = new SimpleDateFormat ( "M M M M d , yy " , L o c a l e . ENGLISH) . parse ( s t r ) ; System . out . p r i n t l n ( date ) ; / / Tue Sep 17 0 0 : 0 0 : 0 0 EDT 2013

56.10

. how to count # of occurrences of a character in a string ?

Use StringUtils from apache commons lang.


i n t n = S t r i n g U t i l s . countMatches ( " 11112222 " , " 1 " ) ; System . out . p r i n t l n ( n ) ;

56.11

one more do you know how to detect if a string contains only uppercase letter ?

57
T O P 1 0 Q U E S T I O N S F O R J AVA R E G U L A R E X P R E S S I O N This post summarizes the top questions asked about Java regular expressions. As they are most frequently asked, you may nd that they are also very useful. 1. How to extract numbers from a string? One common question of using regular expression is to extract all the numbers into an array of integers. In Java, m . eans a range of digits (0-9). Using the predened classes whenever possible will make your code easier to read and eliminate errors introduced by malformed character classes. Please refer to Predened character classes for more details. Please note the rst backslash in . . If you are using an escaped construct within a string literal, you must precede the backslash with another backslash for the string to compile. Thats why we need to use d.
L i s t < I n t e g e r > numbers = new L i n k e d L i s t < I n t e g e r > ( ) ; P a t t e r n p = P a t t e r n . compile ( " \\d+ " ) ; Matcher m = p . matcher ( s t r ) ; while (m. f i n d ( ) ) { numbers . add ( I n t e g e r . p a r s e I n t (m. group ( ) ) ) ; }

2. How to split Java String by newlines? There are at least three different ways to enter a new line character, dependent on the operating system you are working on. \r represents CR (Carriage Return), which is used in Unix \n means LF (Line Feed), used in Mac OS

198

199

\r\n means CR + LF, used in Windows Therefore the most straightforward way to split string by new lines is
S t r i n g l i n e s [ ] = S t r i n g . s p l i t ( " \\r ?\\n " ) ;

But if you dont want empty lines, you can use, which is also my favourite way:
S t r i n g . s p l i t ( " [\\ r\\n]+ " )

A more robust way, which is really system independent, is as follows. But remember, you will still get empty lines if two newline characters are placed side by side.
S t r i n g . s p l i t ( System . g e t P r o p e r t y ( " l i n e . s e p a r a t o r " ) ) ;

3. Importance of Pattern.compile() A regular expression, specied as a string, must rst be compiled into an instance of Pattern class. Pattern.compile() method is the only way to create a instance of object. A typical invocation sequence is thus
P a t t e r n p = P a t t e r n . compile ( " a b " ) ; Matcher matcher = p . matcher ( " aaaaab " ) ; a s s e r t matcher . matches ( ) == t r u e ;

Essentially, Pattern.compile() is used to transform a regular expression into an Finite state machine (see Compilers: Principles, Techniques, and Tools (2nd Edition)). But all of the states involved in performing a match resides in the matcher. By this way, the Pattern p can be reused. And many matchers can share the same pattern.
Matcher anotherMatcher = p . matcher ( " aab " ) ; a s s e r t anotherMatcher . matches ( ) == t r u e ;

Pattern.matches() method is dened as a convenience for when a regular expression is used just once. This method still uses compile() to get the instance of a Pattern implicitly, and matches a string. Therefore,
boolean b = P a t t e r n . matches ( " a b " , " aaaaab " ) ;

is equivalent to the rst code above, though for repeated matches it is less efcient since it does not allow the compiled pattern to be reused.

200

4. How to escape text for regular expression? In general, regular expression uses to escape constructs, but it is painful to precede the backslash with another backslash for the Java string to compile. There is another way for users to pass string Literals to the Pattern, like $5. Instead of writing $5 or [$]5, we can type
P a t t e r n . quote ( " $ 5 " ) ;

5. Why does String.split() need pipe delimiter to be escaped? String.split() splits a string around matches of the given regular expression. Java expression supports special characters that affect the way a pattern is matched, which is called metacharacter. | is one metacharacter which is used to match a single regular expression out of several possible regular expressions. For example, A|B means either A or B. Please refer to Alternation with The Vertical Bar or Pipe Symbol for more details. Therefore, to use | as a literature, you need to escape it by adding in front of it, like |. 6. How can we match anbn with Java regex? This is the language of all non-empty strings consisting of some number of as followed by an equal number of bs, like ab, aabb, and aaabbb. This language can be show to be context-free grammar S aSb | ab, and therefore a non-regular language. However, Java regex implementations can recognize more than just regular languages. That is, they are not regular by formal language theory denition. Using lookahead and self-reference matching will achieve it. Here I will give the nal regular expression rst, then explain it a little bit. For a comprehensive expla bn with Java regex. nation, I would refer you to read How can we match an
P a t t e r n p = P a t t e r n . compile ( " ( ? x ) ( ? : a ( ? = a (\\ 1 ?+b ) ) ) +\\1 " ) ; / / true System . out . p r i n t l n ( p . matcher ( " aaabbb " ) . matches ( ) ) ; // false System . out . p r i n t l n ( p . matcher ( " aaaabbb " ) . matches ( ) ) ; // false System . out . p r i n t l n ( p . matcher ( " aaabbbb " ) . matches ( ) ) ; // false System . out . p r i n t l n ( p . matcher ( " caaabbb " ) . matches ( ) ) ;

201

Instead of explaining the syntax of this complex regular expression, I would rather say a little bit how it works. In the rst iteration, it stops at the rst a then looks ahead (after skipping some as by using a*) whether there is a b. This was achieved by using (?:a(?= a*(\\1?+b))). If it matches, \1, the self-reference matching, will matches the very inner parenthesed elements, which is one single b in the rst iteration. In the second iteration, the expression will stop at the second a, then it looks ahead (again skipping as) to see if there will be b. But this time, \\1+b is actually equivalent to bb, therefore two bs have to be matched. If so, \1 will be changed to bb after the second iteration. In the nth iteration, the expression stops at the nth a and see if there are n bs ahead. By this way, the expression can count the number of as and match if the number of bs followed by a is same. 7. How to replace 2 or more spaces with single space in string and delete leading spaces only? String.replaceAll() replaces each substring that matches the given regular expression with the given replacement. 2 or more spaces can be expressed by regular expression [ ]+. Therefore, the following code will work. Note that, the solution wont ultimately remove all leading and trailing whitespaces. If you would like to have them deleted, you can use String.trim() in the pipeline.
S t r i n g l i n e = " aa bbbbb ccc d "; / / " aa bbbbb c c c d " System . out . p r i n t l n ( l i n e . r e p l a c e A l l ( " [\\ s ]+ " , " " ) ) ;

8. How to determine if a number is a prime with regex?


public s t a t i c // false System . out . / / true System . out . / / true System . out . / / true System . out . // false void main ( S t r i n g [ ] a r g s ) { p r i n t l n ( prime ( 1 ) ) ; p r i n t l n ( prime ( 2 ) ) ; p r i n t l n ( prime ( 3 ) ) ; p r i n t l n ( prime ( 5 ) ) ;

202

System . out . / / true System . out . // false System . out . // false System . out . }

p r i n t l n ( prime ( 8 ) ) ; p r i n t l n ( prime ( 1 3 ) ) ; p r i n t l n ( prime ( 1 4 ) ) ; p r i n t l n ( prime ( 1 5 ) ) ;

public s t a t i c boolean prime ( i n t n ) { r e t u r n ! new S t r i n g ( new char [ n ] ) . matches ( " . ? | ( . . + ? ) \\1+ " ) ; }

The function rst generates n number of characters and tries to see if that string matches .?|(..+?) 1+. If it is prime, the expression will return false and the ! will reverse the result. The rst part .? just tries to make sure 1 is not primer. The magic part is the second part where backreference is used. (..+?) 1+ rst try to matches n length of characters, then repeat it several times by 1+. By denition, a prime number is a natural number greater than 1 that has no positive divisors other than 1 and itself. That means if a=n*m then a is not a prime. n*m can be further explained repeat n m times, and that is exactly what the regular expression does: matches n length of characters by using (..+?), then repeat it m times by using 1+. Therefore, if the pattern matches, the number is not prime, otherwise it is. Remind that ! will reverse the result. 9. How to split a comma-separated string but ignoring commas in quotes? You have reached the point where regular expressions break down. It is better and more neat to write a simple splitter, and handles special cases as you wish. Alternative, you can mimic the operation of nite state machine, by using a switch statement or if-else. Attached is a snippet of code.
public s t a t i c void main ( S t r i n g [ ] a r g s ) { S t r i n g l i n e = " aaa , bbb , \ " c , c \ " , dd ; dd , \ " e , e " ; L i s t < S t r i n g > t o k s = splitComma ( l i n e ) ; for ( String t : toks ) {

57.1. . HOW TO USE BACKREFERENCES IN JAVA REGULAR EXPRESSIONS

203

System . out . p r i n t l n ( " > " + t ) ; } } p r i v a t e s t a t i c L i s t < S t r i n g > splitComma ( S t r i n g s t r ) { int s t a r t = 0; L i s t < S t r i n g > t o k s = new A r r a y L i s t < S t r i n g > ( ) ; boolean withinQuote = f a l s e ; f o r ( i n t end = 0 ; end < s t r . l e n g t h ( ) ; end++) { char c = s t r . charAt ( end ) ; switch ( c ) { case , : i f ( ! withinQuote ) { t o k s . add ( s t r . s u b s t r i n g ( s t a r t , end ) ) ; s t a r t = end + 1 ; } break ; c a s e \" : withinQuote = ! withinQuote ; break ; } } i f ( s t a r t < s t r . length ( ) ) { t o k s . add ( s t r . s u b s t r i n g ( s t a r t ) ) ; } return toks ; }

57.1

. how to use backreferences in java regular expressions

Backreferences is another useful feature in Java regular expression.

58
T O P 1 0 Q U E S T I O N S A B O U T J AVA E X C E P T I O N S This article summarizes the top 10 frequently asked questions and answers about Java exceptions. For example, whats the best practice for exception management? 58.1 checked vs . unchecked

In brief, checked exceptions must be explicitly caught in a method or declared in the methods throws clause. Unchecked exceptions are caused by problems that can not be solved, such as dividing by zero, null pointer, etc. Checked exceptions are especially important because you expect other developers who use your API to know how to handle the exceptions. For example, IOException is a commonly used checked exception and RuntimeException is an unchecked exception. You can check out the exception hierarchy diagram before reading the rest.

58.2

best practice for exception management

If an exception can be properly handled then it should be caught, otherwise, it should be thrown.

204

58.3. WHY VARIABLES DEFINED IN TRY CAN NOT BE USED IN CATCH OR FINALLY?

205

58.3

why variables defined in try can not be used in catch or fi nally ?

In the following code, the string s declared in try block can not be used in catch clause. The code does not pass compilation.
try { F i l e f i l e = new F i l e ( " path " ) ; F i l e I n p u t S t r e a m f i s = new F i l e I n p u t S t r e a m ( f i l e ) ; String s = " inside " ; } c a t c h ( FileNotFoundException e ) { e . printStackTrace ( ) ; System . out . p r i n t l n ( s ) ; }

The reason is that you dont know where in the try block the exception would be thrown. It is quite possible that the exception is thrown before the object is declared. This is true for this particular example.

58.4

why do double . parsedouble ( null ) and integer . parseint ( null ) throw different exceptions ?

They actually throw different exceptions. This is a problem of JDK, so it does not worth too much thinking.
Integer . parseInt ( null ) ; / / throws j a v a . lang . NumberFormatException : n u l l Double . parseDouble ( n u l l ) ; / / throws java . lang . NullPointerException

58.5

commonly used runtime exceptions in java

Here are just some of them. IllegalArgumentException ArrayIndexOutOfBoundsException They can be used in if statement when the condition is not satised as follows:

58.6. CAN WE CATCH MULTIPLE EXCEPTIONS IN THE SAME CATCH CLAUSE?

206

i f ( o b j == n u l l ) { throw new I l l e g a l A r g u m e n t E x c e p t i o n ( " o b j can not be n u l l " ) ;

58.6

can we catch multiple exceptions in the same catch clause ?

The answer is YES. As long as those exceptions can trace back to the same node in the hierarchy, you can use that one only.

58.7

can constructor throw exceptions in java ?

The answer is YES. Constructor is a special kind of method. Here is a code example.

58.8

throw exception in final clause

It is legal to do the following:


public s t a t i c void main ( S t r i n g [ ] a r g s ) { F i l e f i l e 1 = new F i l e ( " path 1 " ) ; F i l e f i l e 2 = new F i l e ( " path 2 " ) ; try { F i l e I n p u t S t r e a m f i s = new F i l e I n p u t S t r e a m ( f i l e 1 ) ; } c a t c h ( FileNotFoundException e ) { e . printStackTrace ( ) ; } finally { try { F i l e I n p u t S t r e a m f i s = new F i l e I n p u t S t r e a m ( file2 ) ; } c a t c h ( FileNotFoundException e ) { e . printStackTrace ( ) ; } } }

But to have better code readability, you should wrap the embedded try-catch block as a new method, and then put the method invocation in the nally clause.

58.9. CAN RETURN BE USED IN FINALLY BLOCK

207

public s t a t i c void main ( S t r i n g [ ] a r g s ) { F i l e f i l e 1 = new F i l e ( " path 1 " ) ; F i l e f i l e 2 = new F i l e ( " path 2 " ) ; try { F i l e I n p u t S t r e a m f i s = new F i l e I n p u t S t r e a m ( f i l e 1 ) ; } c a t c h ( FileNotFoundException e ) { e . printStackTrace ( ) ; } finally { methodThrowException ( ) ; } }

58.9

can return be used in finally block

Yes, it can.

58.10

. why developers consume exception silently ?

There are so many time code segments like the following occur. If properly handling exceptions are so important, why developers keep doing that?
try { ... } catch ( Exception e ) { e . printStackTrace ( ) ; }

Ignoring is just easy. Frequent occurrence does not mean correctness.

59
T O P 1 0 Q U E S T I O N S A B O U T J AVA C O L L E C T I O N S The following are the most popular questions of Java collections asked and discussed on Stackoverow. Before you look at those questions, its a good idea to see the class hierarchy diagram. 59.1 when to use linkedlist over arraylist ?

ArrayList is essentially an array. Its elements can be accessed directly by index. But if the array is full, a new larger array is needed to allocate and moving all elements to the new array will take O(n) time. Also adding or removing an element needs to move existing elements in an array. This might be the most disadvantage to use ArrayList. LinkedList is a double linked list. Therefore, to access an element in the middle, it has to search from the beginning of the list. On the other hand, adding and removing an element in LinkedList is quicklier, because it only changes the list locally. In summary, the worst case of time complexity comparison is as follows:
| A r r a y l i s t | LinkedList g e t ( index ) | O( 1 ) | O( n ) add ( E ) | O( n ) | O( 1 ) add ( E , index ) | O( n ) | O( n ) remove ( index ) | O( n ) | O( n ) I t e r a t o r . remove ( ) | O( n ) | O( 1 ) I t e r a t o r . add ( E ) | O( n ) | O( 1 )

208

59.2. EFFICIENT EQUIVALENT FOR REMOVING ELEMENTS WHILE ITERATING THE COLLECTIO

Despite the running time, memory usage should be considered too especially for large lists. In LinkedList, every node needs at least two extra pointers to link the previous and next nodes; while in ArrayList, only an array of elements is needed. More comparisons between list.

59.2

efficient equivalent for removing elements while iterating the collection

The only correct way to modify a collection while iterating is using Iterator.remove()(). For example,
I t e r a t o r <Integer > i t r = l i s t . i t e r a t o r ( ) ; while ( i t r . hasNext ( ) ) { / / do s o m e t h i n g i t r . remove ( ) ; }

One most frequent incorrect code is


for ( Integer i : l i s t ) { l i s t . remove ( i ) ; }

You will get a ConcurrentModicationException by running the above code. This is because an iterator has been generated (in for statement) to traverse over the list, but at the same time the list is changed by Iterator.remove(). In Java, it is not generally permissible for one thread to modify a collection while another thread is iterating over it.

59.3

how to convert list to int []?

The easiest way might be using ArrayUtils in Apache Commons Lang library.
i n t [ ] a r r a y = A r r a y U t i l s . t o P r i m i t i v e ( l i s t . toArray ( new I n t e g e r [ 0 ] ) ) ;

In JDK, there is no short-cut. Note that you can not use List.toArray(), because that will convert List to Integer[]. The correct way is following,

59.4. HOW TO CONVERT INT[] INTO LIST?

210

i n t [ ] a r r a y = new i n t [ l i s t . s i z e ( ) ] ; f o r ( i n t i = 0 ; i < l i s t . s i z e ( ) ; i ++) { array [ i ] = l i s t . get ( i ) ; }

59.4

how to convert int [] into list ?

The easiest way might still be using ArrayUtils in Apache Commons Lang library, like below.
L i s t l i s t = Arrays . a s L i s t ( A r r a y U t i l s . t o O b j e c t ( a r r a y ) ) ;

In JDK, there is no short-cut either.


int [ ] array = { 1 , 2 , 3 , 4 , 5 } ; L i s t < I n t e g e r > l i s t = new A r r a y L i s t < I n t e g e r > ( ) ; for ( int i : array ) { l i s t . add ( i ) ; }

59.5

what is the best way to filter a collection ?

Again, you can use third-party package, like Guava or Apache Commons Lang to fulll this function. Both provide a lter() method (in Collections2 of Guava and in CollectionUtils of Apache). The lter() method will return elements that match a given Predicate. In JDK, things become harder. A good news is that in Java 8, Predicate will be added. But for now you have to use Iterator to traverse the whole collection.
I t e r a t o r <Integer > i t r = l i s t . i t e r a t o r ( ) ; while ( i t r . hasNext ( ) ) { i n t i = i t r . ne xt ( ) ; i f ( i > 5) { / / f i l t e r a l l i n t s b i g g e r than 5 i t r . remove ( ) ; } }

59.6. EASIEST WAY TO CONVERT A LIST TO A SET?

211

Of course you can mimic the way of what Guava and Apache did, by introducing a new interface Predicate. That might also be what most advanced developers will do.
public i n t e r f a c e P r e d i c a t e <T> { boolean t e s t ( T o ) ; } public s t a t i c <T> void f i l t e r ( C o l l e c t i o n <T> c o l l e c t i o n , P r e d i c a t e < T> p r e d i c a t e ) { i f ( ( c o l l e c t i o n ! = n u l l ) && ( p r e d i c a t e ! = n u l l ) ) { I t e r a t o r <T> i t r = c o l l e c t i o n . i t e r a t o r ( ) ; while ( i t r . hasNext ( ) ) { T o b j = i t r . ne xt ( ) ; i f ( ! predicate . t e s t ( obj ) ) { i t r . remove ( ) ; } } } }

Then we can use the following code to lter a collection:


f i l t e r ( l i s t , new P r e d i c a t e < I n t e g e r > ( ) { public boolean t e s t ( I n t e g e r i ) { r e t u r n i <= 5 ; } }) ;

59.6

easiest way to convert a list to a set ?

There are two ways to do so, depending on how you want equal dened. The rst piece of code puts a list into a HashSet. Duplication is then identied mostly by hashCode(). In most cases, it will work. But if you need to specify the way of comparison, it is better to use the second piece of code where you can dene your own comparator.
Set < I n t e g e r > s e t = new HashSet < I n t e g e r >( l i s t ) ; Set < I n t e g e r > s e t = new T r e e S e t < I n t e g e r >( aComparator ) ; s e t . addAll ( l i s t ) ;

59.7. HOW DO I REMOVE REPEATED ELEMENTS FROM ARRAYLIST?

212

59.7

how do i remove repeated elements from arraylist ?

This question is quite related to the above one. If you dont care the ordering of the elements in the ArrayList, a clever way is to put the list into a set to remove duplication, and then to move it back to the list. Here is the code
A r r a y L i s t l i s t = . . . / / i n i t i a l a l i s t w i t h d u p l i c a t e e l e m e n t s Set < I n t e g e r > s e t = new HashSet < I n t e g e r >( l i s t ) ; l i s t . clear () ; l i s t . addAll ( s e t ) ;

If you DO care about the ordering, there is no short-cut way. Two loops are needed at least.

59.8

sorted collection

There are a couple of ways to maintain a sorted collection in Java. All of them provide a collection in natural ordering or by the specied comparator. By natural ordering, you also need to implement the Comparable interface in the elements. Collections.sort() can sort a List. As specied in the javadoc, this sort is stable, and guarantee n log(n) performance. PriorityQueue provides an ordered queue. The difference between PriorityQueue and Collections.sort() is that, PriorityQueue maintain an order queue at all time, but you can only get the head element from the queue. You can not randomly access its element like PriorityQueue.get(4). If there is no duplication in the collection, TreeSet is another choice. Same as PriorityQueue, it maintains the ordered set at all time. You can get the lowest and highest elements from the TreeSet. But you still cannot randomly access its element. In a short, Collections.sort() provides a one-time ordered list. PriorityQueue and TreeSet maintain ordered collections at all time, in the cost of no indexed access of elements.

59.9. COLLECTIONS.EMPTYLIST() VS NEW INSTANCE

213

59.9

collections . emptylist () vs new instance

The same question applies to emptyMap() and emptySet(). Both methods return an empty list, but Collections.emptyList() returns a list that is immutable. This mean you cannot add new elements to the empty list. At the background, each call of Collections.emptyList() actually wont create a new instance of an empty list. Instead, it will reuse the existing empty instance. If you are familiar Singleton in the design pattern, you should know what I mean. So this will give you better performance if called frequently.

59.10

collections . copy

There are two ways to copy a source list to a destination list. One way is to use ArrayList constructor
A r r a y L i s t < I n t e g e r > d s t L i s t = new A r r a y L i s t < I n t e g e r >( s r c L i s t ) ;

The other is to use Collections.copy() (as below). Note the rst line, we allocate a list at least as long as the source list, because in the javadoc of Collections, it says The destination list must be at least as long as the source list.
A r r a y L i s t < I n t e g e r > d s t L i s t = new A r r a y L i s t < I n t e g e r >( s r c L i s t . s i z e ( ) ); C o l l e c t i o n s . copy ( d s t L i s t , s r c L i s t ) ;

Both methods are shallow copy. So what is the difference between these two methods? First, Collections.copy() wont reallocate the capacity of dstList even if dstList does not have enough space to contain all srcList elements. Instead, it will throw an IndexOutOfBoundsException. One may question if there is any benet of it. One reason is that it guarantees the method runs in linear time. Also it makes suitable when you would like to reuse arrays rather than allocate new memory in the constructor of ArrayList. Collections.copy() can only accept List as both source and destination, while ArrayList accepts Collection as the parameter, therefore more general.

60
T O P 9 Q U E S T I O N S A B O U T J AVA M A P S In general, Map is a data structure consisting of a set of key-value pairs, and each key can only appears once in the map. This post summarizes Top 9 FAQ of how to use Java Map and its implemented classes. For sake of simplicity, I will use generics in examples. Therefore, I will just write Map instead of specic Map. But you can always assume that both the K and V are comparable, which means K extends Comparable and V extends Comparable.

60.1

convert a map to list

In Java, Map interface provides three collection views: key set, value set, and keyvalue set. All of them can be converted to List by using a constructor or addAll() method. The following snippet of code shows how to construct an ArrayList from a map.
/ / key l i s t L i s t k e y L i s t = new A r r a y L i s t (map . keySet ( ) ) ; / / value l i s t L i s t v a l u e L i s t = new A r r a y L i s t (map . v a l u e S e t ( ) ) ; / / key v a l u e l i s t L i s t e n t r y L i s t = new A r r a y L i s t (map . e n t r y S e t ( ) ) ;

60.2

iterate over each entry in a map

Iterating over every pair of key-value is the most basic operation to traverse a map. In Java, such pair is stored in the map entry called Map.Entry. Map.entrySet()

214

60.3. SORT A MAP ON THE KEYS

215

returns a key-value set, therefore the most efcient way of going through every entry of a map is
f o r ( Entry e n t r y : map . e n t r y S e t ( ) ) { / / get key K key = e n t r y . getKey ( ) ; / / get value V value = e n t r y . getValue ( ) ; }

Iterator can also be used, especially before JDK 1.5


I t e r a t o r i t r = map . e n t r y S e t ( ) . i t e r a t o r ( ) ; while ( i t r . hasNext ( ) ) { Entry e n t r y = i t r . n ext ( ) ; / / get key K key = e n t r y . getKey ( ) ; / / get value V value = e n t r y . getValue ( ) ; }

60.3

sort a map on the keys

Sorting a map on the keys is another frequent operation. One way is to put Map.Entry into a list, and sort it using a comparator that sorts the value.
L i s t l i s t = new A r r a y L i s t (map . e n t r y S e t ( ) ) ; C o l l e c t i o n s . s o r t ( l i s t , new Comparator ( ) { @Override public i n t compare ( Entry e 1 , Entry e 2 ) { r e t u r n e 1 . getKey ( ) . compareTo ( e 2 . getKey ( ) ) ; } }) ;

The other way is to use SortedMap, which further provides a total ordering on its keys. Therefore all keys must either implement Comparable or be accepted by the comparator.

60.4. SORT A MAP ON THE VALUES

216

One implementing class of SortedMap is TreeMap. Its constructor can accept a comparator. The following code shows how to transform a general map to a sorted map.
SortedMap sortedMap = new TreeMap ( new Comparator ( ) { @Override public i n t compare (K k 1 , K k 2 ) { r e t u r n k 1 . compareTo ( k 2 ) ; } }) ; sortedMap . p u t A l l (map) ;

60.4

sort a map on the values

Putting the map into a list and sorting it works on this case too, but we need to compare Entry.getValue() this time. The code below is almost same as before.
L i s t l i s t = new A r r a y L i s t (map . e n t r y S e t ( ) ) ; C o l l e c t i o n s . s o r t ( l i s t , new Comparator ( ) { @Override public i n t compare ( Entry e 1 , Entry e 2 ) { r e t u r n e 1 . getValue ( ) . compareTo ( e 2 . getValue ( ) ) ; } }) ;

We can still use a sorted map for this question, but only if the values are unique too. Under such condition, you can reverse the key=value pair to value=key. This solution has very strong limitation therefore is not really recommended by me.

60.5

initialize a static / immutable map

When you expect a map to remain constant, its a good practice to copy it into an immutable map. Such defensive programming techniques will help you create not only safe for use but also safe for thread maps.

60.6. DIFFERENCE BETWEEN HASHMAP, TREEMAP, AND HASHTABLE

217

To initialize a static/immutable map, we can use a static initializer (like below). The problem of this code is that, although map is declared as static nal, we can still operate it after initialization, like Test.map.put(3,"three");. Therefore it is not really immutable. To create an immutable map using a static initializer, we need an extra anonymous class and copy it into a unmodiable map at the last step of initialization. Please see the second piece of code. Then, an UnsupportedOperationException will be thrown if you run Test.map.put(3,"three");.
public c l a s s T e s t { p r i v a t e s t a t i c f i n a l Map map ; static { map = new HashMap ( ) ; map . put ( 1 , " one " ) ; map . put ( 2 , " two " ) ; } } public c l a s s T e s t { p r i v a t e s t a t i c f i n a l Map map ; static { Map aMap = new HashMap ( ) ; aMap . put ( 1 , " one " ) ; aMap . put ( 2 , " two " ) ; map = C o l l e c t i o n s . unmodifiableMap ( aMap ) ; } }

Guava libraries also support different ways of intilizaing a static and immutable collection. To learn more about the benets of Guavas immutable collection utilities, see Immutable Collections Explained in Guava User Guide.

60.6

difference between hashmap, treemap, and hashtable

There are three main implementations of Map interface in Java: HashMap, TreeMap, and Hashtable. The most important differences include: The order of iteration. HashMap and HashTable make no guarantees as to the order of the map; in particular, they do not guarantee that the order

60.7. A MAP WITH REVERSE VIEW/LOOKUP

218

will remain constant over time. But TreeMap will iterate the whole entries according the natural ordering of the keys or by a comparator. key-value permission. HashMap allows null key and null values. HashTable does not allow null key or null values. If TreeMap uses natural ordering or its comparator does not allow null keys, an exception will be thrown. Synchronized. Only HashTable is synchronized, others are not. Therefore, if a thread-safe implementation is not needed, it is recommended to use HashMap in place of HashTable. A more complete comparison is
| HashMap | HashTable | TreeMap i t e r a t i o n order | no | no | yes n u l l keyvalue | yes yes | yes yes | noyes synchronized | no | yes | no time performance | O( 1 ) | O( 1 ) | O( l o g n ) implementation | buckets | buckets | redb l a c k t r e e

Read more about HashMap vs. TreeMap vs. Hashtable vs. LinkedHashMap.

60.7

a map with reverse view / lookup

Sometimes, we need a set of key-key pairs, which means the maps values are unique as well as keys (one-to-one map). This constraint enables to create an inverse lookup/view of a map. So we can lookup a key by its value. Such data structure is called bidirectional map, which unfortunetely is not supported by JDK.

60.8

both apache common collections and guava provide imple mentation of bidirectional map, called bidimap and bimap, respectively. both enforce the restriction that there is a 1 : 1 relation between keys and values . 7 . shallow copy of a map

Most implementation of a map in java, if not all, provides a constructor of copy of another map. But the copy procedure is not synchronized. That means when

60.9. FOR THIS REASON, I WILL NOT EVEN TELL YOU HOW TO USE CLONE() METHOD TO COP

one thread copies a map, another one may modify it structurally. To [prevent accidental unsynchronized copy, one should use Collections.synchronizedMap() in advance.
Map copiedMap = C o l l e c t i o n s . synchronizedMap (map) ;

Another interesting way of shallow copy is by using clone() method. However it is NOT even recommended by the designer of Java collection framework, Josh Bloch. In a conversation about Copy constructor versus cloning, he said I often provide a public clone method on concrete classes because people expect it. e Its a shame that Cloneable is broken, but it happens. A e Cloneable is a weak A spot, and I think people should be aware of its limitations.

60.9

for this reason , i will not even tell you how to use clone () method to copy a map. 8 . create an empty map

If the map is immutable, use


map = C o l l e c t i o n s . emptyMap ( ) ;

Otherwise, use whichever implementation. For example


map = new HashMap ( ) ;

THE END

You might also like