The exponential growth of the Internet of Things has led to the explosive growth of smart devices connected to the internet, generating vast amounts of data. However, this growth has brought forth a set of challenges, including limited bandwidth, slow response times, and elevated security concerns, particularly within conventional cloud computing models. To tackle the demand for faster and more efficient data processing, a concept known as edge clouds has emerged. In contrast to traditional cloud networks, edge networks are physically closer to end-users. This geographic proximity empowers them to handle certain user processing needs without the necessity of transferring data to a central cloud layer. This approach not only alleviates the load on the central cloud but also provides users connected to the edge with faster and higher-quality services. Nevertheless, edge networks surrounded by inherent limitations, including constraints on processing resources, storage capacity, and communication bandwidth. User-required services are organized into chains of functions, forming specific sequences that serve as requests within the edge network. Given these constraints, it becomes vital to establish these service chains with minimal delay while efficiently utilizing the available edge resources. This entails scheduling virtual functions to run on edge resources in a manner that expedites resource release and reduces the overall response time for incoming requests. This research presents an innovative approach grounded in Deep Q-learning for Service Function Chain management. It selects the most suitable nodes on which the Virtual Network Functions have already been deployed and are ready for chaining into service function chains (SFCs). Then to achieve an optimum Quality of Experience an scheduling mechanism will be used to reorder the functions within the chains to achieve the shortest total completion time of chains. Going beyond traditional SFC chaining, our study explores simultaneous scheduling for multiple SFCs, setting it apart from prior efforts. Significantly, this research pioneers the use of Deep Q-Learning for joint resolution of SFC chaining and scheduling of multiple SFCs, offering a comprehensive analysis in comparison with two distinct scenarios; focusing solely on chaining or concentrating exclusively on scheduling. Employing a wide set of synthetic SFC requests, evaluation results shows an average of 58% reduction in total completion time of SFCs, 43% reduction in request rejection ratio, and almost 23% better resource utilization which is crucial in edge computing. [ABSTRACT FROM AUTHOR]