Java String Handling Guide
Java String Handling Guide
A string is a sequence of character and Java implements strings as objects of type String.
Implementing strings as built-in objects allows Java to provide a full complement of features
that make string handling convenient. For example, Java has methods to compare two strings,
search for a substring, concatenate two strings, and change the case of letters within a string.
Also, String objects can be constructed a number of ways, making it easy to obtain a string
when needed. Once a String object has been created, you cannot change the characters of that
string. Whenever we need any modifications, a new string object containing modifications
has to be created. However, a variable declared as String reference can point to some other
String object, and hence can be changed
3. To create a string object that contains same characters as another string object:
String(String strObj);
For example,
String(char chars[])
For example,
• Even though Java’s char type uses 16 bits to represent the basic Unicode character set, the
typical format for strings on the Internet uses arrays of 8-bit bytes constructed from the
ASCII character set. Because 8-bit ASCII strings are common, the String class provides
constructors that initialize a string when given a byte array.
String(byte asciiChars[ ])
For example,
• JDK 5 and higher versions have two more constructors. The first one supports the extended
Unicode character set.
String(StringBuilder strBuildObj)
STRING LENGTH
The length of a string is the number of characters that it contains. To obtain this value,
call the length() method. For example,
System.out.println(s.length()); //prints 5
SPECIAL STRING OPERATIONS
Java supports many string operations. Though there are several string handling methods are
available, for the use of programmer, Java does many operations automatically without
requiring a call for separate method. This adds clarity to the program. We will now see few of
such operations.
• String Literals: Instead of using character arrays and new operator for creating string
instance,
or
A String object will be created for every string literal and hence, we can even use,
System.out.println(“Hello”.length()); //prints 5
• String Concatenation: Java does not allow any other operator than + on strings.
Concatenation of two or more String objects can be achieved using + operator. For example,
One practical use of string concatenation is found when you are creating very long strings.
Instead of letting long strings wrap around within your source code, you can break them into
String longStr = "This could have been " +"a very long line that would have " +"wrapped
around. But string concatenation " +"prevents this.";
System.out.println(longStr);
String Concatenation with Other Data Types: We can concatenate String with other data
types.
For example,
int age = 9;
Here, the int value in age is automatically converted into its string representation within a
String object. The compiler will convert an operand to its string equivalent whenever the
other operand of the + is an instance of String. But, we should be careful while mixing data
types:
String s= “Four : ” + 2 + 2;
This is because, “Four :” is concatenated with 2 first, then the resulting string is again
• String Conversion and toString(): Java uses valueOf( ) method for converting data into its
String toString( )
To implement toString( ), simply return a String object that contains the human-readable
string that appropriately describes an object of our class.
class Box
{ width = w;
height = h;
depth = d;
}
class StringDemo
System.out.println(b);
Output:
Note: Observe that, Box’s toString( ) method is automatically invoked when a Box object is
The String class provides different ways for extracting characters from a string object.
Though a String object is not a character array, many of the String methods use an index into
a string object for their operation.
• charAt() : This method is used to extract a single character from a String. It has this
generalform:
char ch;
• getChars() : If you need to extract more than one character at a time, you can use this
method. It has the following general form:
sourceStart specifies the index of the beginning of the substring sourceEnd specifies an index
that is one past the end of the desired substring. (i.e. the substring contains the characters
from sourceStart through sourceEnd–1) target specifies the array which receives the substring
targetStart is the index within target at which the substring will be copied Care must be taken
to assure that the target array is large enough to hold the number of characters in the specified
substring.
class StringDemo1
System.out.println(buf);
Output:
demo
uses the default character-to-byte conversions provided by the platform. Here is its simplest
form:
byte[ ] getBytes( )
Other forms of getBytes( ) are also available. getBytes( ) is most useful when you are
exporting a String value into an environment that does not support 16-bit Unicode characters.
For example, most Internet protocols and text file formats use 8-bit ASCII for all text
interchange.
• toCharArray() : If you want to convert all the characters in a String object into a character
array, the easiest way is to call toCharArray( ). It returns an array of characters for the entire
string. It has this general form:
char[ ] toCharArray( )
String s1="hello";
char[] ch=s1.toCharArray();
for(int i=0;i<ch.length;i++)
System.out.print(ch[i]);
• equals() and equalsIgnoreCase(): To compare two strings for equality, we have two
methods:
Here, str is the String object being compared with the invoking String object. The first
method is case sensitive and returns true, if two strings are equal. The second method returns
true if two strings are same, whatever may be their case.
String s1 = "Hello";
String s2 = "Hello";
String s3 = "Good-bye";
String s4 = "HELLO";
System.out.println(s1.equals(s2)); //true
System.out.println(s1.equals(s3)); //false
System.out.println(s1.equals(s4)); //false
System.out.println(s1.equalsIgnoreCase(s4)); //true
str2StartIndex The index at which the comparison will start within str2.
ignoreCase used in second version. If it is true, the case of the characters is ignored.
System.out.println(s1.regionMatches(6,s2,0,3)); //false
System.out.println(s1.regionMatches(true,6,s2,0,3)); //true
• startsWith( ) and endsWith(): These are the specialized versions of the regionMatches()
method. The startsWith() method determines whether a given String begins with a specified
string. The endsWith() method determines whether the String in question ends with a
specified string. They have the following general forms:
Ex:
"Foobar".endsWith("bar") //true
"Foobar".startsWith("Foo") //true
Here, startIndex specifies the index into the invoking string at which point the search will
begin.
• equals( ) v/s == : The equals( ) method compares the characters inside a String object. The
== operator compares two object references to see whether they refer to the same instance.
String s1 = "Hello";
System.out.println(s1.equals(s2)); //true
• compareTo(): This method is used to check whether a string is less than, greater than or
equal to the other string. The meaning of less than, greater than refers to the dictionary order
(based on Unicode). It has this general form:
This method will return 0, if both the strings are same. Otherwise, it will return the difference
between the ASCII values of first non-matching character. If you want to ignore case
differences when comparing two strings, use compareToIgnoreCase(), as shown here:
Ex:
Method Purpose
int indexOf(String str) To search for the first or last occurrence of a substring
int indexOf(int ch, int startIndex) Used to specify a starting point for the search. Here,
startIndex specifies the index at which point the search begins. For indexOf() method, the
search runs from startIndex to the end of the string. For lastIndexOf( ) method, the search
runs from startIndex to zero.
class Demo
String s = "Now is the time for all good men to come to the aid of their
country.";
System.out.println(s.indexOf('t')); //7
System.out.println(s.lastIndexOf('t')); //65
System.out.println(s.indexOf("the")); //7
System.out.println(s.lastIndexOf("the")); //55
MODIFYING A STRING
Since String objects cannot be changed, whenever we want to modify a String, we must
either copy it into a StringBuffer or StringBuilder, or use one of the following String
methods, which will construct a new copy of the string with our modifications complete.
substring(): Used to extract a substring from a given string. It has two formats:
o String substring(int startIndex): Here, startIndex specifies the index at which the
substring will begin. This form returns a copy of the substring that begins at startIndex and
runs to the end of the invoking string.
o String substring(int startIndex, int endIndex): Here, startIndex specifies the beginning
index, and endIndex specifies the stopping point. The string returned contains all the
characters from the beginning index, up to, but not including, the ending index.
Ex:
String result ;
result=org.substring(5);
result=org.substring(5, 7);
System.out.println(result); //is
This method creates a new object that contains the invoking string with the contents of str
String s1 = "one";
String s2 = s1.concat("two");
is same as
String s1 = "one";
String s2 = s1 + "two";
replace():The first form of this method replaces all occurrences of one character in the
invoking
For example,
The second form of replace( ) replaces one character sequence with another.
trim():The trim( ) method returns a copy of the invoking string from which any leading
and trailing white-space has been removed. It has this general form:
String trim( )
Here is an example:
This puts the string “Hello World” into s by eliminating white-spaces at the beginning and at
the end.
int value=30;
String s1=String.valueOf(value);
String toLowerCase()
String toUpperCase()
For example,
String s1 = str.toUpperCase();
Method Description
int codePointAt(int i) Returns the Unicode code point at the location specified by i.
int codePointBefore(int i) Returns the Unicode code point at the location that precedes
that specified by i.
int codePointCount(int start, int end) Returns the number of code points in the portion of
the invoking String that is between start and end–1.boolean contains(CharSequence str)
Returns true if the invoking object contains the string specifiedby str. Returns false,
otherwise. boolean contentEquals(CharSequence str) Returns true if the invoking string
contains the same string asstr. Otherwise, returns false.
boolean contentEquals(StringBuffer str) Returns true if the invoking string contains the
same string as str. Otherwise, returns false.
static String format(String fmtstr,Object ... args) Returns a string formatted as specified
by fmtstr.
static String format(Locale loc,String fmtstr, Object ... args) Returns a string formatted as
specified by fmtstr. Formatting is governed by the locale specified by loc.
boolean matches(string regExp) Returns true if the invoking string matches the regular
expression passed in regExp. Otherwise, returns false.
int offsetByCodePoints(int start, int num) Returns the index with the invoking string that is
num code pointsbeyond the starting index specified by start.
String replaceFirst(String regExp, String newStr) Returns a string in which the first
substring that matches the regular expression specified by regExp is replaced by newStr.
String replaceAll(String regExp, String newStr) Returns a string in which all substrings
that match the regular expression specified by regExp are replaced by newStr.
String[ ] split(String regExp) Decomposes the invoking string into parts and returns an
array that contains the result. Each part is delimited by the regular expression passed in
regExp.
String[ ] split(String regExp,int max) Decomposes the invoking string into parts and
returns an array that contains the result. Each part is delimited by the regular expression
passed in regExp. The number of pieces is specified by max. If max is negative, then the
invoking string is fully decomposed. Otherwise, if max contains a nonzero value, the last
entry in the returned array contains the remainder of the invoking string. If max is zero, the
invoking string is fully decomposed.
STRINGBUFFER CLASS
We know that, String represents fixed-length, immutable character sequences. In contrast,
StringBuffer represents growable and writeable character sequences. We can insert characters
in the middle or append at the end using this class. StringBuffer will automatically grow to
make room for such additions and often has more characters pre-allocated than are actually
needed, to allow room for growth.Constructors:
StringBuffer(int size) : accepts an integer argument that explicitly sets the size of the
buffer
StringBuffer(String str) : accepts a String argument that sets the initial contents of the
StringBuffer object and reserves room for 16 more characters without reallocation.
StringBuffer class provides various methods to perform certain tasks, which are mainly
focused on changing the content of the string (Remember, String class is immutable – means
content of the String class objects cannot be modified). Some of them are discussed
hereunder:
length() and capacity(): These two methods can be used to find the length and total
allocatedcapacity of StringBuffer object. As an empty object of StringBuffer class gets 16
character space, the capacity of the object will be sum of 16 and the length of string value
allocated to that object.
Example:
ensureCapacity(): If you want to preallocate room for a certain number of characters after
a StringBuffer has been constructed, you can use this method to set the size of the buffer.
This is useful if you know in advance that you will be appending a large number of small
strings to a StringBuffer. The method ensureCapacity() has this general form:
• charAt() and setCharAt(): The value of a single character can be obtained from a
StringBuffer via the charAt() method. You can set the value of a character within a
StringBuffer using setCharAt(). Their general forms are shown here:
For charAt(), where specifies the index of the character being obtained. For setCharAt(),
where specifies the index of the character being set, and ch specifies the new value of that
character.
Example:
StringBuffer sb = new StringBuffer("Hello");
sb.setCharAt(1, 'i');
sb.setLength(2);
Output would be –
charAt(1) before = e
buffer after = Hi
charAt(1) after = i
Here, sourceStart specifies the index of the beginning of the substring, and sourceEnd
specifies an index that is one past the end of the desired substring. This means that the
substring contains the characters from sourceStart through sourceEnd–1. The array that will
receive the characters is specified by target. The index within target at which the substring
will be copied is passed in targetStart. Care must be taken to assure that the target array is
large enough to hold the number of characters in the specified substring.
• append(): The append() method concatenates the string representation of any other type of
data to the end of the invoking StringBuffer object. It has several overloaded versions. Here
are a few of its forms:
String.valueOf() is called for each parameter to obtain its string representation. The result is
appended to the current StringBuffer object. The buffer itself is returned by each version of
int a = 42;
s = sb.append("a=").append(a).append("!").toString();
• insert(): The insert() method inserts one string into another. It is overloaded to accept
values of all the simple types, plus Strings, Objects, and CharSequences. Like append(), it
calls
String.valueOf( ) to obtain the string representation of the value it is called with. This string
is then inserted into the invoking StringBuffer object. Few forms are:
Here, index specifies the index at which point the string will be inserted into the invoking
System.out.println(s); //abcdef
s.reverse();
System.out.println(s); //fedcba
• delete() and deleteCharAt(): You can delete characters within a StringBuffer by using the
It deletes a sequence of characters from the invoking object. Here, startIndex specifies the
index of the first character to remove, and endIndex specifies an index one past the last
character to remove. Thus, the substring deleted runs from startIndex to endIndex–1.The
resulting
It deletes the character at the index specified by loc. It returns the resulting StringBuffer
object.
Example:
sb.delete(4, 7);
sb.deleteCharAt(0);
• replace(): You can replace one set of characters with another set inside a StringBuffer
object by calling replace( ). Its signature is shown here:
The substring being replaced is specified by the indexes startIndex and endIndex. Thus, the
sb.replace(5, 7, "was");
• substring() : You can obtain a portion of a StringBuffer by calling substring(). It has the
The first form returns the substring that starts at startIndex and runs to the end of the invoking
StringBuffer object. The second form returns the substring that starts at startIndex and runs
through endIndex–1. These methods work just like those defined for String that were
described
earlier.
Method Description
Appends a Unicode code point to the end of the invoking object. A reference to the object is
returned.
int codePointAt(int i)
int codePointBefore(int i)
Returns the Unicode code point at the location that precedes that specified by i.
Returns the number of code points in the portion of the invoking String that is between start
and end–1.
Searches the invoking StringBuffer for the first occurrence of str. Returns the index of the
match, or –1 if no match is found.
Searches the invoking StringBuffer for the first occurrence of str, beginning at startIndex.
Returns the index of the match, or –1 if no match is found.
Searches the invoking StringBuffer for the last occurrence of str. Returns the index of the
match, or –1 if no match is found.
Searches the invoking StringBuffer for the last occurrence of str, beginning at startIndex.
Returns the index of the match, or –1 if no match is found.
Returns a substring of the invoking string, beginning at startIndex and stopping at stopIndex.
This method is required by the CharSequence interface, which is now implemented by
Reduces the size of the character buffer for the invoking object to exactly fit the current
contents.
STRINGBUILDER CLASS
J2SE 5 adds a new string class to Java’s already powerful string handling capabilities. This
new class is called StringBuilder. It is identical to StringBuffer except for one important
difference: it is not synchronized, which means that it is not thread-safe. The advantage of
StringBuilder is faster performance. However, in cases in which you are using
multithreading, you must use StringBuffer rather than StringBuilder.
Other Topics
Using instanceof
Sometimes, we need to check type of the object during runtime of the program. We may
create multiple
classes and objects to these classes in a program. In such situations, the instanceof operator is
useful
class A
int i, j;
class B
int i, j;
class C extends A
{
int k;
class InstanceOfEx
A a = new A();
B b = new B();
C c = new C();
A ob;
ob = c;
Static Import
The statement static import expands the capabilities of the import keyword. By following
import with the keyword static, an import statement can be used to import the static members
of a class or interface.
When using static import, it is possible to refer to static members directly by their names,
without having to qualify them with the name of their class. This simplifies and shortens the
syntax required to use a static member.
We have observed earlier that when we need to use some Math functions, we need to use
Math.sqrt(), Math.pow() etc. Using static import feature, we can just use sqrt(), pow() etc. as
shown below –
class Hypot
double hypot;
side1 = 3.0;
side2 = 4.0;
When working with overloaded constructors, it is sometimes useful for one constructor to
invoke another.
In Java, this is accomplished by using another form of the this keyword. The general form is
shown here:
this(arg-list)
When this() is executed, the overloaded constructor that matches the parameter list specified
by arg-list is executed first. Then, if there are any statements inside the original constructor,
they are executed. The call to this() must be the first statement within the constructor.
class MyClass
int a, b;
MyClass(int i, int j)
a = i;
b = j;
MyClass(int i)
MyClass( )
void disp()
System.out.println(“a=”+a + “ b=”+b);
class thisDemo
{
MyClass m1 = new MyClass();
m1.disp();
m2.disp();
m3.disp();
Output:
a= 0 b=0
a= 8 b=8
a= 2 b=3
IMPORTANT QUESTIONS:
1. List and explain any four constructors of String class with suitable examples.
(i) charAt()
(ii) toCharArray()
(iii) regionMatches()
(v) replace()
(vi) trim()
(vii) substring()
3. Explain various forms of indexOf() and lastIndexOf() methods with a code snippet.
(i) setCharAt()
(ii) append()
(iii) insert()
(iv) reverse()
(v) delete()
(vi) deleteCharAt()
6. Write a note on