Efficient Serving of Large Language Models with OpenVINO — INKHUB