Session Initiation Protocol (SIP) is considered as a signaling protocol for
IP multimedia subsystem (IMS). IMS is introduced by 3rd generation partnership
project (3GPP) as signaling foundation in next generation networks (NGN). Despite
having the features such as: text based, IP based, independent of the data transmission,
support for mobility and end-to-end, the SIP protocol has not suitable
mechanism to deal with overload. Therefore, many mechanisms are proposed to
control overload in SIP networks. One of their most famous is occupancy CPU
(OCC) that is used in many researches. In traditional OCC, the value of parameters
is indicated and they are used in subsequent documents. In this paper, optimal
parameters value is obtained by Q-learning algorithm. Because modeling a large
SIP network is impossible by mathematical relationships and it is a heuristic
problem, Q-learning is the best method to compute the parameters. The simulation
results demonstrate that the Q-learning output is comparable with traditional OCC.