Introduction
We as software programmers knows what is object oriented programming. But sometimes we need to spend more time to decide what are the properties needed for a particular class. Otherwise we had to suffer in at a later point, if the class holds wrong state attributes. Here we are going to discuss about what type of class should be stateful and what stateless.
What is mean by state of an object?
Before we discuss about Stateless or Stateful classes we should have better understanding about what is mean by the state of an object. Its same as the English meaning "the particular condition that someone or something is in at a specific time" of state.
When we come to programming and think about the condition of object at a specific time, its nothing but the value of it's properties or member variables at a given point of time. Who decides what are the properties of objects. Its the class. Who decides what are the properties and members inside a class?Its programmer who coded that class. Who is programmer? Everybody who reads this article including me who is writing this. Are we all experts in taking decision on what are the properties needed for each class?
I don't think so. At least its true in case of programmers in India who come into the software industry by only looking at the salary and programming as a daily job. First of all its not something can be taught in colleges like how other engineering disciplines works. It needs to come via experience because programming is in its early stages compared to other engineering and its more like art than engineering. Engineering can sometimes have hard rules but art cannot. Even after being in the programming for around 15 years (sorry I count my college days as well in programming experience) I still take considerable amount of time to decide what are the properties needed for a class and the name of the class itself.
Can we bring some rules to what are the properties needed? In other words what properties, should the state of an object include? Or should the objects be stateless always. Below are some thoughts on this area.
Entity classes / Business Objects
There are multiple names such as entity classes , business objects etc...given to classes which are representing a clear state of something. If we take example of Employee class, it's sole purpose is to hold the state of an employee. What that state probably can contain? EmpId, Company, Designation, JoinedDate etc...I hope there would be no confusions till this point. Everybody agrees that this type of classes should be stateful without much arguments, because this is taught in college.
But how we should do salary calculation?
-
Should the CalculateSalary()
needs to be a method inside the Employee class?
-
Should there be a SalaryCalculator
class and that class should contain the Calculate() method
-
In case there is SalaryCalculator
class
-
Whether it should have properties such as BasicPay,DA HRA etc?
-
Or the Employee
object needs to be a private member variable in that SalaryCalculator which is injected via constructor?
-
Or SalaryCalculator
should expose Employee public property (Get&SetEmployee
methods in Java)
Helper / Operation / Manipulator classes
This is the type of classes which do a task. SalaryCalculator falls into this type. There are many names to this type where classes do actions and can be found in programs with many prefix and suffixes such as
-
class SomethingCalculator
eg:SalaryCalculator
-
class SomethingHelper
eg: DBHelper
-
class SomethingController
eg: DBController
-
class SomethingManager
-
class SomethingExecutor
-
class SomethingProvider
-
class SomethingWorker
-
class SomethingBuilder
-
class SomethingAdapter
-
class SomethingGenerator
A long list can be found here. People have different opinion in using which suffix for what situation. But our interest is something else.
Whether can we add state to this type of classes? I would suggest stateless. Lets examine why I am saying 'no', in rest of this article.
Hybrid classes
According to wikipedia encapsulation in object oriented programming is "...the packing of data and functions into a single component". Does this mean all the methods which manipulate that object should be there in the entity class? I don't think so. The entity class can have state accessor methods such as GetName()
, SetName()
, GetJoiningDate
, GetSalary()
etc...
But CalculateSalary()
should be outside. Why its so?
According to the SOLID - Single Responsibility Principle "A class should change only for one reason". If we keep CalculateSalary()
method inside the Employee class that class will change for any of the below 2 reasons which is a violation.
I hope its clear. Now we have 2 classes in this context. Employee class and SalaryCalculator class. How do they connect each other. There are multiple ways. One is to create object of SalaryCalculator class inside the GetSalary method and call the Calculate() to set the salary variable of Employee class. If we do so it became hybrid because it is acting like entity class and it initiate operation like helper class. I really don't encourage this type of hybrid classes. But in situations such as Save entity method, this is kind of OK with some sort of delegation of operation.
"Whenever you feel that your class is falling in this hybrid category, think about re-factoring. if you feel that your classes are not falling in any of the above categories stop coding."
State in Helper / Manipulator class
What is the problem if our helper classes keep state? Before that lets look at what are the different combination of state values a SalaryCalculator
class can take? Below are some examples
Scenario 1 - Primitive values
class SalaryCalculator
{
public double Basic { get; set; }
public double DA { get; set; }
public string Designation { get; set; }
public double Calculate()
{
}
}
Cons
There are chances that the Basic salary can be of a "Accountant" and the Designation can be "Director" which is not at all matching.There is no enforced way to make sure that the SalaryCalculator can work independently.
Similarly if this executes in threaded environment, it will fail.
Scenario 2 - Object as state
class SalaryCalculator
{
public Employee Employee { get; set; }
public double Calculate()
{
}
}
Cons
If one SalaryCalculator object is shared by 2 threads and each thread is for different employee, the sequence of execution might be as follows which cause logical errors.
-
Thread 1 sets employee1
object
-
Thread 2 sets employee2
object
-
Thread 1 calls Calculate
method and gets Salary
for employee2
We can argue that the Employee
dependency can be injected via constrictor and make the property read only. Then we need to create SalaryCalculator
objects for each and every employee object. So better do not design your helper classes in this way.
Scenario 3 - No state
class SalaryCalculator
{
public double Calculate(Employee input)
{
}
}
This is near perfect situation. But here we can argue that, if all the methods are not using any member variable what is the use of keeping it as non static class.
The second principle in SOLID principles says "Open for extension and closed for modification". What does it mean? When we write a class, it should be complete. There should be no reason to modify it. But should be extensible via sub classing and overriding. So how would our final one looks like?
interface ISalaryCalculator
{
double Calculate(Employee input);
}
class SimpleSalaryCalculator:ISalaryCalculator
{
public virtual double Calculate(Employee input)
{
return input.Basic + input.HRA;
}
}
class TaxAwareSalaryCalculator : SimpleSalaryCalculator
{
public override double Calculate(Employee input)
{
return base.Calculate(input)-GetTax(input);
}
private double GetTax(Employee input)
{
throw new NotImplementedException();
}
}
As I mentioned many times in my blog, always program to interface. In the above code snippet, I implemented interface methods implicitly. That is to reduce the space here. Always implement explicitly. The Logic of calculation should be kept in a protected function so that the inherited classes can call that function in case required.
Below is the way how the Calculator
class(es) should be consumed
class SalaryCalculatorFactory
{
internal static ISalaryCalculator GetCalculator()
{
return new SimpleSalaryCalculator();
}
}
class PaySlipGenerator
{
void Generate()
{
Employee emp = new Employee() { };
double salary =SalaryCalculatorFactory.GetCalculator().Calculate(emp);
}
}
The Factory class encapsulate the logic of deciding which child class to be used. It can be static as above or dynamic using reflection. As far as the reason for change in this class is object creation, we are not violating the "Single Responsibility Principle"
In case you are going for Hybrid class, you may invoke calculation from the Employee.Salary
property or Employee.GetSalary()
as below.
class Employee
{
public string Name { get; set; }
public int EmpId { get; set; }
public double Basic { get; set; }
public double HRA { get; set; }
public double Salary
{
get{return SalaryCalculatorFactory.GetCalculator().Calculate(this);}
}
}
Conclusion
"Don't code when we are thinking. Don't think when we are coding". This principle will give us enough freedom to think whether the class should be stateless or stateful. If stateful, what the state of an object should expose.
- Make entity classed stateful.
- Helper / Operation classes should be stateless.
- Make sure the Helper classes are not static.
- Even if there is hybrid class, make sure it's not violating the SRP.
- Spent some time on class design before coding. Show the class diagram to 2-3 fellow programmers and get their opinions.
- Name the class wisely. The names will help us to decide state. There is no hard rule for naming. Below are some I am following.
- Entity classes should be named with nouns which represents a type of object - eg:
Employee
- Helper / Worker class names should be reflecting that its a worker. eg:
SalaryCalculator
, PaySlipGenerator
etc... - Verb should never be used as class name - eg:class
CalculateSalary{}
Points of Interest
- Most of the classes we are writing falls into the hybrid category which violates SRP. Please comment if there are any scenario which cannot be coded without hybrid classes
History
- Inital version - 30Oct2014