This article emphasises the benefits of joint dynamic pricing, as opposed to pricing products or services individually. By pricing jointly, the algorithm can use information about several products or services to increase the profit obtained from pricing all items in a coordinated manner. This enables faster learning when the demands for the different products or services are strongly correlated. However, as the number of jointly priced products increases, the speed of convergence decreases exponentially. When the number of jointly priced products becomes large, the decision maker could consider grouping products that follow a similar demand pattern, or jointly pricing only the most highly correlated products. Moreover, we analyse the behaviour of the Q-learning with eligibility traces algorithm under different conditions, without any explicit knowledge of customer buying behaviour.
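The two technical points above, the growth of the joint price space and the credit propagation provided by eligibility traces, can be illustrated with a minimal sketch. It assumes a tabular representation, accumulating traces, and Watkins's variant of Q(λ), which may differ from the exact variant used in this article. The names `joint_actions` and `q_lambda_update`, the price levels, the rewards, and the parameter values are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from itertools import product


def joint_actions(price_levels, n_products):
    """Enumerate every joint price vector (one price per product)."""
    return list(product(price_levels, repeat=n_products))


def q_lambda_update(Q, E, s, a, r, s_next, a_next, actions,
                    alpha=0.1, gamma=0.9, lam=0.8):
    """Apply one backup of Watkins's Q(lambda) (assumed variant).

    Q and E map (state, joint_action) -> float. The learner only sees
    the observed reward r, not a model of customer behaviour.
    """
    best_value = max(Q[(s_next, b)] for b in actions)
    delta = r + gamma * best_value - Q[(s, a)]   # temporal-difference error
    E[(s, a)] += 1.0                             # accumulating trace
    # Traces survive only while the chosen next action is greedy.
    greedy_next = Q[(s_next, a_next)] == best_value
    for key in list(E):
        Q[key] += alpha * delta * E[key]
        E[key] = gamma * lam * E[key] if greedy_next else 0.0
    return delta


# Hypothetical example: three price levels per product.
prices = [8, 10, 12]
actions_2 = joint_actions(prices, 2)   # 3**2 joint price vectors
actions_3 = joint_actions(prices, 3)   # 3**3 joint price vectors

Q = defaultdict(float)
E = defaultdict(float)

# Two consecutive selling periods with made-up rewards (profits).
q_lambda_update(Q, E, s=0, a=(8, 10), r=5.0, s_next=1, a_next=(10, 10),
                actions=actions_2)
q_lambda_update(Q, E, s=1, a=(10, 10), r=2.0, s_next=2, a_next=(12, 12),
                actions=actions_2)

# The second reward also raises the value of the first period's prices,
# because that state-action pair still carries a non-zero trace.
first_period_value = Q[(0, (8, 10))]
```

With `k` price levels and `n` products the table holds `k**n` joint actions per state, which is one way to see why learning slows sharply as more products are priced together, and why grouping products with similar demand reduces the burden.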