aster.cloud
  • Computing
  • Design

The Cost And Sustainability Of Generative AI

  • relay
  • February 23, 2023
  • 4 minute read

All the people using DALL-E to create images or letting ChatGPT write their term papers are eating up a lot of cloud resources. Who’s going to pay for all this?

AI is resource intensive for any platform, including public clouds. Most AI technology requires numerous inference calculations that add up to higher processor, network, and storage requirements—and higher power bills, infrastructure costs, and carbon footprints.

The rise of generative AI systems, such as ChatGPT, has brought this issue to the forefront again. Given the popularity of this technology and the likely massive expansion of its use by companies, governments, and the public, we could see the power consumption growth curve take on a concerning arc.

AI has been viable since the 1970s but did not have much business impact initially, given the amount of resources a full-blown AI system needed to work. I remember designing AI-enabled systems in my 20s that would have required more than $40 million in hardware, software, and data center space to get them running. Spoiler alert: That project and many other AI projects never saw a release date. The business cases just did not work.

Cloud changed all of that. What once was unapproachable is now cost-efficient enough to be possible with public clouds. In fact, the rise of cloud, as you may have guessed, was roughly aligned with the rise of AI in the past 10 to 15 years. I would say that now they are tightly coupled.

Cloud resource sustainability and cost

You really don’t need to do much research to predict what’s going to happen here. Demand will skyrocket for AI services, such as the generative AI systems that are driving interest now as well as other AI and machine learning systems. This surge will be led by businesses that are looking for an innovative advantage, such as intelligent supply chains, and even by thousands of college students who want a generative AI system to write their term papers.


More demand for AI means more demand for the resources these AI systems use, such as public clouds and the services they provide. This demand will most likely be met with more data centers housing power-hungry servers and networking equipment.

Public cloud providers are like any other utility resource provider and will increase prices as demand rises, much like we see household power bills go up seasonally (also based on demand). As a result, we normally curtail usage, running the air conditioning at 74 degrees Fahrenheit rather than 68 in the summer.

However, higher cloud computing costs may not have the same effect on enterprises. Businesses may find that these AI systems are not optional and are needed to drive certain critical business processes. In many cases, they may try to save money within the business, perhaps by reducing the number of employees in order to offset the cost of AI systems. It’s no secret that generative AI systems will displace many information workers soon.

What can be done?

If the demand for resources to run AI systems leads to higher computing costs and carbon output, what can we do? The answer may lie in finding more efficient ways for AI to use resources such as processing, networking, and storage.

Sampling and pipelining, for instance, can speed up deep learning by reducing the amount of data processed. Research done at MIT and IBM shows that you can reduce the resources needed for running a neural network on large data sets with this approach. However, it also limits accuracy, which could be acceptable for some business use cases but not all.
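To make the sampling trade-off concrete, here is a minimal sketch in Python. It is not the method from the MIT and IBM research; it only illustrates the general idea of processing a uniform random subset instead of the full data set and accepting a small loss of accuracy. The helper name `sample_rows`, the synthetic data, and the 5 percent fraction are all assumptions chosen for illustration.

```python
import random
import statistics


def sample_rows(rows, fraction, seed=0):
    """Return a uniform random subset holding roughly `fraction` of the rows."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    k = max(1, round(len(rows) * fraction))
    # A seeded generator keeps the subset reproducible between runs.
    return random.Random(seed).sample(rows, k)


# Hypothetical data set: 100,000 synthetic measurements (not real data).
data_rng = random.Random(42)
full = [data_rng.gauss(50.0, 10.0) for _ in range(100_000)]

# Process only 5 percent of the data.
subset = sample_rows(full, fraction=0.05)

full_mean = statistics.fmean(full)
subset_mean = statistics.fmean(subset)

print(f"rows processed: {len(subset)} of {len(full)}")
print(f"estimate error: {abs(full_mean - subset_mean):.4f}")
```

The subset costs about one twentieth of the work to process, and the estimate drifts slightly from the full-data answer. Whether that drift is acceptable depends on the business use case, which is the same trade-off described above.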


Another approach that is already in use in other technology spaces is in-memory computing. This architecture can speed up AI processing by avoiding the constant movement of data between memory and the processor. Instead, AI calculations run directly within the memory module, which speeds things up significantly.

Other approaches are being developed, such as changes to physical processors—using coprocessors for AI calculations to make things speedier—or next-generation computing models, such as quantum. You can expect plenty of announcements from the larger public cloud providers about technology that will be able to solve many of these problems.

What should you do?

The message here is not to avoid AI to get a lower cloud computing bill or to save the planet. AI is a fundamental approach to computing that most businesses can leverage for a great deal of value.

I’m advising you to go into an AI-enablement or net-new AI system development project with a clear understanding of the costs and the impact on sustainability, which are directly linked. You’ll have to make a cost/benefit choice, and this really goes back to what value you can bring back to the business for the cost and risk required. Nothing new here.
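One way to see how cost and sustainability move together is a back-of-the-envelope estimate like the sketch below. It assumes a simple model in which cost scales with accelerator hours, and emissions scale with the energy those hours consume. Every name and number here (`Workload`, `estimate`, the hourly price, power draw, PUE, and grid carbon intensity) is a hypothetical placeholder, not a figure from any provider.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    gpu_hours: float            # accelerator hours per month
    price_per_gpu_hour: float   # USD per accelerator hour
    avg_power_kw: float         # average draw per accelerator, in kW
    pue: float                  # data center power usage effectiveness
    grid_kg_co2_per_kwh: float  # carbon intensity of the local grid


def estimate(w: Workload) -> dict:
    """Rough monthly cost, energy use, and emissions for one workload."""
    # Facility energy = IT energy scaled by PUE (cooling, power delivery).
    energy_kwh = w.gpu_hours * w.avg_power_kw * w.pue
    return {
        "cost_usd": w.gpu_hours * w.price_per_gpu_hour,
        "energy_kwh": energy_kwh,
        "co2_kg": energy_kwh * w.grid_kg_co2_per_kwh,
    }


# Hypothetical inputs, chosen only to make the arithmetic easy to follow.
example = Workload(
    gpu_hours=1000,
    price_per_gpu_hour=2.0,
    avg_power_kw=0.5,
    pue=1.5,
    grid_kg_co2_per_kwh=0.4,
)
result = estimate(example)
for name, value in result.items():
    print(f"{name}: {value:.1f}")
```

Because cost and emissions both scale with the same accelerator hours in this model, halving the hours halves both, which is why the two concerns are worth weighing together against the value the system brings to the business.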

I do believe that much of this issue will be solved with innovation, whether it’s in-memory or quantum computing or something we’ve yet to see. Both the AI technology providers and the cloud computing providers are keen to make AI more cost-efficient and green. That’s the good news.

Source: Cyberpogo


Related Topics
  • Artificial Intelligence
  • ChatGPT
  • DALL-E
  • InfoWorld