{"id":16961,"date":"2026-04-14T14:28:41","date_gmt":"2026-04-14T14:28:41","guid":{"rendered":"https:\/\/dmsretail.com\/RetailNews\/how-cisco-cx-scaled-ai-without-sacrificing-security-or-cost\/"},"modified":"2026-04-14T14:28:41","modified_gmt":"2026-04-14T14:28:41","slug":"how-cisco-cx-scaled-ai-without-sacrificing-security-or-cost","status":"publish","type":"post","link":"https:\/\/dmsretail.com\/RetailNews\/how-cisco-cx-scaled-ai-without-sacrificing-security-or-cost\/","title":{"rendered":"How Cisco CX scaled AI without sacrificing security or cost"},"content":{"rendered":"<p> <p><a href=\"https:\/\/dmsretail.com\/online-workshops-list\/\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-496\" src=\"https:\/\/dmsretail.com\/RetailNews\/wp-content\/uploads\/2022\/05\/RETAIL-ONLINE-TRAINING-728-X-90.png\" alt=\"Retail Online Training\" width=\"729\" height=\"91\" srcset=\"https:\/\/dmsretail.com\/RetailNews\/wp-content\/uploads\/2022\/05\/RETAIL-ONLINE-TRAINING-728-X-90.png 729w, https:\/\/dmsretail.com\/RetailNews\/wp-content\/uploads\/2022\/05\/RETAIL-ONLINE-TRAINING-728-X-90-300x37.png 300w\" sizes=\"auto, (max-width: 729px) 100vw, 729px\" \/><\/a><\/p><br \/>\n<\/p>\n<div>\n<p><em>Discover how Cisco Customer Experience (CX) leveraged AI-ready infrastructure\u2014including networking, compute, and observability\u2014to secure sensitive data, control costs, and maximize ROI on agentic AI workloads.\u00a0<\/em><\/p>\n<p>AI isn\u2019t just moving; it\u2019s sprinting.<\/p>\n<p><span data-contrast=\"none\">By strategically leveraging AI, we create proactive, personalized, and predictive customer experiences that enhance satisfaction and loyalty \u2014 without sacrificing security or budget. As capabilities rapidly evolve, Cisco CX is transforming a critical part of its backend infrastructure to support advanced AI workloads. <\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<h2><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<strong>Tackling security and cost challenges<\/strong><\/span><\/h2>\n<p><span data-contrast=\"none\">Although cloud platforms excel at flexibility and speed, they introduce some data sovereignty challenges and consumption-based cost volatility. For AI workloads processing sensitive customer information, we decided that trade-off was untenable.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<p><b><span data-contrast=\"none\">Security: <\/span><\/b><span data-contrast=\"none\">Cloud environments may distribute data across multiple networks and even geographically disperse locations, with access controls through distinct third parties \u2014 expanding the attack surface in an intensifying threat landscape. On-premises deployment eliminates these intermediaries. Cisco infrastructure met our data sovereignty needs, fuses security across every layer of the stack, and reduces exposure to distributed threats.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<p><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><b><span data-contrast=\"none\">Cost: <\/span><\/b><span data-contrast=\"none\">On-premises deployment also grants greater control over operational expenses. In the cloud, AI use cases with unpredictable inferencing demands cause token costs to skyrocket. For example, a chatbot handling customer inquiries may process 10 million tokens during peak season but only 2 million during off-peak periods \u2014 resulting in 5x cost variance month-to-month. This unpredictability makes budgeting difficult and compounds as AI workloads scale. When we process tokens locally, we replace that ingress and egress volatility with cost certainty, converting variable expenses into predictable capital investments.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<p><span class=\"TextRun SCXW131784918 BCX4\" lang=\"EN-US\" xml:lang=\"EN-US\" data-contrast=\"none\"><span class=\"NormalTextRun SCXW131784918 BCX4\">On-premises infrastructure mitigates two critical business risks: data sovereignty loss and budget unpredictability. By <\/span><span class=\"NormalTextRun SCXW131784918 BCX4\">eliminating<\/span><span class=\"NormalTextRun SCXW131784918 BCX4\"> third-party intermediaries and variable token costs, <\/span><span class=\"NormalTextRun SCXW131784918 BCX4\">we<\/span><span class=\"NormalTextRun SCXW131784918 BCX4\"> gain<\/span><span class=\"NormalTextRun SCXW131784918 BCX4\">ed<\/span><span class=\"NormalTextRun SCXW131784918 BCX4\"> both <\/span><span class=\"NormalTextRun SCXW131784918 BCX4\">digital<\/span><span class=\"NormalTextRun SCXW131784918 BCX4\"> resilience and financial predictability to responsibly scale AI.<\/span><\/span><span class=\"EOP SCXW131784918 BCX4\" data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<h2><strong>Deploying AI workloads flexibly<\/strong><\/h2>\n<p><span data-contrast=\"none\">Since each AI use case has distinct requirements for data security and computational scale, a rigid \u201cone-size-fits-all\u201d deployment strategy undermined our security and cost objectives. Instead, we guided our decisions using pre-defined criteria for AI use case prioritization, investment, and deployment. The three key objectives were:<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<ul>\n<li><b><span data-contrast=\"none\">Smart and scalable:<\/span><\/b><span data-contrast=\"none\"> Maximize the value of customer investments in Cisco technologies and solutions.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/li>\n<li><b><span data-contrast=\"none\">Customizable experience: <\/span><\/b><span data-contrast=\"none\">Tailor a proactive, predictive, and personalized journey.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/li>\n<li><b><span data-contrast=\"none\">Digital resilience and security: <\/span><\/b><span data-contrast=\"none\">Create a resilient, reliable, and secure customer environment.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/li>\n<\/ul>\n<p><span data-contrast=\"none\">These objectives shaped where we deployed each workload, either on-premises, in the cloud, or hybrid. Take our Customer Sentiment Analysis Agent as an example. It analyzes signals to drive customer renewals by processing a wide scope of sensitive data from customer adoption journeys, support interactions, and the Cisco install base. Because of its data sensitivity and scale requirements, on-premises deployment was both the secure and cost-effective choice \u2014 allowing us to maintain full control over customer renewal data while avoiding unpredictable token costs during peak analysis periods.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"auto\">With the support from this and other agents, Cisco CX had 30% better accessibility to adoption metrics versus<\/span><span data-contrast=\"none\"> manual assessments and <\/span><span data-contrast=\"auto\">eliminated daily administrative friction up to 40%.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/p>\n<h2><strong>Harnessing Cisco compute and networking<\/strong><\/h2>\n<p>To scale AI workloads while maintaining data sovereignty and operational cost predictability, we leveraged the following Cisco components:<\/p>\n<ul>\n<li><span data-contrast=\"none\">Cisco Unified Computing System (UCS) Servers<\/span> <span data-contrast=\"none\">handle the compute demands of AI workloads, such as model tuning, application inferencing, and task automation. The unified architecture simplifies scaling and management, enabling our team to grow AI capabilities without the budgeting uncertainty that accompanies cloud-based inferencing.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/li>\n<li><span data-contrast=\"none\">Cisco Nexus 9000 Series Switches<\/span><span data-contrast=\"auto\"> with <\/span><span data-contrast=\"none\">Silicon One<\/span><span data-contrast=\"auto\"> ASICs (Application-Specific Integrated Circuits) <\/span><span data-contrast=\"none\">provide the low-latency, high-throughput networking required for intensive AI operations. Their programmable design reduces operational overhead during scaling events, ensuring our infrastructure can adapt to workload demands without introducing new security vectors or complexity.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/li>\n<li><span data-contrast=\"none\">Splunk Cloud Platform<\/span> <span data-contrast=\"none\">delivers real-time visibility and infrastructure health across the entire stack. This visibility is essential for maintaining security posture and operational efficiency \u2014 so we can effectively detect anomalies, optimize resource utilization, and ensure predictable performance as workloads scale.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/li>\n<\/ul>\n<h2><strong>Best practices and learnings<\/strong><\/h2>\n<p><span data-contrast=\"none\">By deploying Cisco compute, networking, and observability solutions in tandem with CX agentic capabilities, Cisco CX ensures the end-to-end customer lifecycle remains secure, seamless, and cost-effective. As we continue to scale AI workloads, here\u2019s what we\u2019ve learned:<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<ul>\n<li><b><span data-contrast=\"auto\">Align on highest value use cases: <\/span><\/b><span data-contrast=\"auto\">Prioritization isn\u2019t subjective.<\/span> <span data-contrast=\"auto\">Establish clear criteria to evaluate the value of use cases and deploy accordingly, so you don\u2019t have to compromise security or cost.\u00a0<\/span><span data-ccp-props=\"{}\">\u00a0<\/span><\/li>\n<li><b><span data-contrast=\"auto\">Prioritize reusability of AI infrastructure:<\/span><\/b><span data-contrast=\"auto\"> Design your AI infrastructure as a shared platform, not siloed resources. The on-premises cluster that powers our Renewals Agents also supports CiscoIQ, eliminating redundant hardware investments and accelerating time-to-value for new agentic workflows. This \u201cbuild once, deploy many\u201d approach maximizes ROI and enables rapid scaling without proportional infrastructure costs.<\/span><span data-ccp-props=\"{}\">\u00a0<\/span><\/li>\n<li><b><span data-contrast=\"none\">Embrace continuous evaluation of your deployment model: <\/span><\/b><span data-contrast=\"none\">Just as your infrastructure needs flexibility, your teams must regularly assess and adapt processes to optimize performance and cost. Recognize that high-value use cases evolve with market conditions and customer needs \u2014 your infrastructure strategy should too.<\/span><span data-ccp-props=\"{}\">\u00a0<\/span><\/li>\n<li><b><span data-contrast=\"auto\">Accelerate time-to-market:<\/span><\/b><span data-contrast=\"auto\"> Design your infrastructure for reusability and flexibility to reduce deployment cycles for new AI workflows. Instead of building custom infrastructure for each use case, teams can quickly provision new workloads, creating more time for experimentation.\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/li>\n<\/ul>\n<p><span data-contrast=\"none\">By investing in the right infrastructure and mindset, Cisco CX created space for both our team and customers to innovate and thrive.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<p><strong>Related Links:<br \/><\/strong><\/p>\n<p><span data-contrast=\"none\">Cisco Support Services<\/span><span data-contrast=\"none\">\u00a0<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559739&quot;:0,&quot;335559740&quot;:276}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">More Cisco CX blogs<\/span><\/p>\n<\/p><\/div>\n<p><p><a href=\"https:\/\/dmsretail.com\/online-workshops-list\/\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-496\" src=\"https:\/\/dmsretail.com\/RetailNews\/wp-content\/uploads\/2022\/05\/RETAIL-ONLINE-TRAINING-728-X-90.png\" alt=\"Retail Online Training\" width=\"729\" height=\"91\" srcset=\"https:\/\/dmsretail.com\/RetailNews\/wp-content\/uploads\/2022\/05\/RETAIL-ONLINE-TRAINING-728-X-90.png 729w, https:\/\/dmsretail.com\/RetailNews\/wp-content\/uploads\/2022\/05\/RETAIL-ONLINE-TRAINING-728-X-90-300x37.png 300w\" sizes=\"auto, (max-width: 729px) 100vw, 729px\" \/><\/a><\/p><br \/><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Discover how Cisco Customer Experience (CX) leveraged AI-ready infrastructure\u2014including networking, compute, and observability\u2014to secure sensitive data, control costs, and maximize ROI on agentic AI workloads.\u00a0 [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":16962,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5],"tags":[],"class_list":["post-16961","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology"],"_links":{"self":[{"href":"https:\/\/dmsretail.com\/RetailNews\/wp-json\/wp\/v2\/posts\/16961","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dmsretail.com\/RetailNews\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dmsretail.com\/RetailNews\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dmsretail.com\/RetailNews\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/dmsretail.com\/RetailNews\/wp-json\/wp\/v2\/comments?post=16961"}],"version-history":[{"count":0,"href":"https:\/\/dmsretail.com\/RetailNews\/wp-json\/wp\/v2\/posts\/16961\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dmsretail.com\/RetailNews\/wp-json\/wp\/v2\/media\/16962"}],"wp:attachment":[{"href":"https:\/\/dmsretail.com\/RetailNews\/wp-json\/wp\/v2\/media?parent=16961"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dmsretail.com\/RetailNews\/wp-json\/wp\/v2\/categories?post=16961"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dmsretail.com\/RetailNews\/wp-json\/wp\/v2\/tags?post=16961"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}