Unit 29: Nested Class
Learning Objectives
Students should
- understand the need for nested class.
- understand the behavior of the different kinds of nested class.
- be able to write nested classes.
Matryoshka Doll
So far, we have defined a class only at the "top-level" of our program. Java allows us to define a class within another class, or within a method.
A nested class is a class defined within another containing class. For example, the following declaration declares a private nested class named B
within the class A
.
1 2 3 4 5 |
|
Nested classes are used to group logically relevant classes together. Typically, a nested class is tightly coupled with the container class and would have no use outside of the container class. Nested classes can be used to encapsulate information within a container class, for instance, when the implementation of the container class becomes too complex. As such, it is useful for "helper" classes that serve specific purposes.
A nested class is a field of the containing class and can access fields and methods of the container class, including those declared as private
. We can keep the nested class within the abstraction barrier by declaring the nested class as private
if there is no need for it to be exposed to the client outside the barrier.
Since the nested class can access the private fields of the container class, we should introduce a nested class only if the nested class belongs to the same encapsulation as the container class. Otherwise, the container class would leak its implementation details to the nested class.
Take the HashMap<K,V>
class for instance. The implementation of HashMap<K,V>
contains one top-level class HashMap<K,V>
(at Line 124) and several nested classes, including the HashIterator<E>
abstract class (at Line 178), which implement an Iterator<E>
interface for iterating through the key and value pairs in the map, and a static Entry<K,V>
class (at Line 687), which encapsulates a key-value pair in the map. Some of these classes are declared private
if they are only used within the HashMap<K,V>
class.
Example from CS2030S This Semester
We can take another example from your labs on coffee shop simulation. In one of many possible designs, the subclasses of Event
: ArrivalEvent
, DepartureEvent
, etc. are not used anywhere outside of CoffeeSimulation
. They can be safely encapsulated within CoffeeSimulation
as inner classes, so that these classes can access the fields within the CoffeeSimulation
class, simplifying their implementation.
A nested class can be either static or non-static. Just like static fields and static methods, a static nested class is associated with the containing class, NOT an instance. So, it can only access static fields and static methods of the containing class. A non-static nested class, on the other hand, can access all fields and methods of the containing class. A non-static nested class is also known as an inner class.
Static vs Non-Static
Recap the following access behavior:
From | access Static | access Non-Static |
---|---|---|
Static | ||
Non-Static |
The example below shows a container class A
with two nested classes, a non-static inner class B
, and a static nested class C
. B
can access instance fields, instance methods, class fields, and class methods in A
. C
can only access the class fields and class methods in A
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 |
|
Recall that we recommend that all access to instance fields be done through the this
reference. In the example above, however, we can't access this.x
from within B
.
1 2 3 4 5 6 7 8 9 |
|
Since this.x
is called within a method of B
, this
would refer to the instance of B
, rather than the instance of A
. Java has a piece of syntax called qualified this
to resolve this. A qualified this
reference is prefixed with the enclosing class name, to differentiate between the this
of the inner class and the this
of the enclosing class. In the example above, we can access x
from A
through the A.this
reference.
1 2 3 4 5 6 7 8 9 |
|
Hiding Nested Classes
When we use nested classes for encapsulating the implementation details of the enclosing class, it is better to declare them as private
nested classes, so that they are not exposed across the abstract barrier, adhering to the information hiding principle. For example, consider the following
1 2 3 4 5 6 7 8 |
|
We cannot access A.B
or A.C
.
1 2 |
|
Note that it is still possible to expose instances of private nested classes outside the enclosing class. For example, let's say we have:
1 2 3 4 5 6 7 8 9 |
|
We are allowed to call A::foo
and obtain a reference to an instance of B
, as long as the type B
(or rather, A.B
) is not used directly.
1 2 3 |
|
Since the type A.B
is private to within A
, we cannot call methods of B
outside of A
as well.
1 2 |
|
We can still expose the method in the private
nested class if the class is a subtype of a non-private
type. Let us illustrate this with an interface. Instead of returning B
, our method foo
now returns I
.
1 2 3 4 5 6 7 8 9 10 11 12 |
|
More on Static Nested Class
Recap that from static context we cannot access non-static elements. As the instance fields of the outer class is a non-static element, we cannot access them. But also recap that to access the instance field x
of the outer class where there is a conflicting name x
with a field of the inner class, we require the use of qualified name A.this.x
.
The two implies that A.this
is not going to be used at all by the static nested class. As such, we should omit them from the stack and heap diagram as well. To illustrate this difference, consider the following class reproduced from above.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 |
|
Now consider the following code snippet.
1 2 |
|
The stack and heap diagram at the line marked Line A is shown below. The diagram below also illustrates the use of meta space to store the class field A.y
.
Also, A.this
is always captured, so A.this
is added to the bottom of the dashed line.
Local Class
We can also declare a class within a function, just like a local variable.
To motivate this, let's consider how one would use the java.util.Comparator
interface.
The Comparator
interface allows us to specify how to compare two elements, by implementing this interface with a customized compare()
method. compare(o1,o2)
should return
Return Value | Meaning |
---|---|
A negative integer | o1 is "less than" o2 |
The integer 0 |
o1 is "equal" o2 |
A positive integer | o1 is "greater than" o2 |
Suppose we have a list of strings, and we want to sort them in the order of their length, we can write the following method:
1 2 3 4 5 6 7 8 9 10 |
|
This makes the code easier to read since we keep the definition of the class and its usage closer together.
Classes like NameComparator
that are declared inside a method (or to be more precise, inside a block of code between {
and }
, but not directly inside a class) is called a local class. Just like a local variable, a local class is scoped within the method. Like a nested class, a local class has access to the variables of the enclosing class through the qualified this
reference. Further, it can access the local variables of the enclosing method.
For example,
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 |
|
Here, B
is a local class defined in method f()
. It has access to all the local variables accessible from within f
, as well as the fields of its enclosing class.
Variable Capture
Recall that when a method returns, all local variables of the methods are removed from the stack. But, an instance of that local class might still exist. Consider the following example:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 |
|
Calling
1 2 3 |
|
will give us a reference to an object of type B
now. But, if we call b.g()
, what is the value of y
? Without variable capture, the stack and heap diagram is something like the following. Notice how we have no access to y
anymore!
For this reason, even though a local class can access the local variables in the enclosing method, the local class makes a copy of local variables inside itself. We say that a local class captures the local variables. Note that local variables are variables declared within a method. These variables are local to the method. Fields can always be accessed and need not be captured through this means.
Visually, we will include the captured variables after the fields in the stack and heap diagram. We will use a dashed line to separate the fields and captured variables. Note that captured variables are NOT part of the fields, so it cannot be accessed with the dot operator (e.g., this.y
).
Variables are only captured when they are (i) local to the method and (ii) used in local class. Consider the following modification to the previous example:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 |
|
We have added another local variable z
. However, this variable is not used within the local class B
. As such, the instance of B
does not capture the variable z
and only captures the variable y
. This capture can only be shown after we have discussed the next subsection on effectively final
.
Effectively final
Variable captures can be confusing. Consider the following code:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 |
|
Will sort
sorts in ascending order (i.e., use the value of ascendingOrder
when the class is declared) or descending order (i.e., use the value of ascendingOrder
when the class is instantiated)?
To complicate matter further, in the example above, we are only creating a single instance of NameComparator
. What if we have a thousand of those? Should the statement ascendingOrder = false
modify all thousand variables that are captured? What if there are no instance of NameComparator
?
To avoid confusing code like this, Java only allows a local class to access variables that are explicitly declared final
or implicitly final (a.k.a. effectively final). An implicitly final variable cannot be re-assigned after initialization. Therefore, Java saves us from such a hair-pulling situation above and disallows such code -- ascendingOrder
is effectively final so the assignment ascendingOrder = false
will cause compilation error.
Breaking the Limitation of Effectively final
. The limitation of effectively final only happen because the value is of a primitive type. So, if we captures the value and forbids re-assigning the value, there is nothing we can do to change primitive value.
On the other hand, reference type can be mutated without assignment statement! So if we use our own implementation of Bool
class below instead of boolean
primitive type, we can modify the code above to allow the "value" in variable ascendingOrder
to be changed. However, this change is via mutation and not re-assignment to the variable.
This also saves us the problem of figuring out what will happen if we have a thousand instances of NameComparator
. In the previous case, because the variables are captured, we need to dynamically add a thousand different assignment statements to assign the new value to all thousand different instances.
On the other hand, using reference type Bool
, the value of the captured variable is a reference to a memory location in the heap. Therefore, all thousand instances of NameComparator
captures this reference. This is where aliasing helps us. The thousand instances has the same alias to this memory location. So, changes in this memory location can have effect on the instance.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 |
|
The code above does compile but now we are no longer save from such a hair-pulling situation. So please exercise this with extreme caution.
Variable Capture in Javascript
Those of you who did CS1101S or otherwise familiar with Javascript might want to note that this is different from Javascript, which does not enforce the final/effectively final restriction in variable captures. This is because there is no concept of primitive value in Javascript.
Every single primitive type is automatically boxed in Javascript. The unboxed variant is not available to the programmer directly. So, if we write x = 1
in Javascript, the value 1
is boxed and put into the heap. Then, the variable x
in the stack points to this box in the heap unlike Java primitive type.
Anonymous Class
An anonymous class is one where you declare a local class and instantiate it in a single statement. This is a syntactic sugar to allow programmers to declare quickly a "single-use" class -- a class that we use only once and never need again. We don't even need to give the class a name (hence, anonymous).
1 2 3 4 5 |
|
The example above removes the need to declare a class just to compare two strings.
An anonymous class has the following format: new X (arguments) { body }
, where:
- X is a class that the anonymous class extends or an interface that the anonymous class implements. X cannot be empty. This syntax also implies an anonymous class cannot extend another class and implement an interface at the same time. Furthermore, an anonymous class cannot implement more than one interface. Put it simply, you cannot have
extends
andimplements
keyword in betweenX
and(arguments)
. - arguments are the arguments that you want to pass into the constructor of the anonymous class. If the anonymous class is extending an interface, then there is no constructor, but we still need
()
. - body is the body of the class as per normal, except that we cannot have a constructor for an anonymous class.
The syntax might look overwhelming at the beginning, but we can also write it as:
1 2 3 4 5 6 |
|
Line 1 above looks just like what we do when we instantiate a class, except that we are instantiating an interface with a { .. }
body.
An anonymous class is just like a local class, it captures the variables of the enclosing scope as well -- the same rules to variable access as local classes applies.