Alibaba Cloud Bailian announces price cut for context caching on some models

26 Aug 2025 18:25
On August 26, Bailian, Alibaba Cloud's large model service platform, announced a price cut for context caching on some models. After the adjustment, when a request to one of these models hits the cache, the input tokens that hit are billed as cached_token, and their unit price drops from 40% of the input_token unit price to 20% of it. Input tokens that miss the cache are still billed at the standard input_token price.
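
The billing rule described in the notice can be sketched as simple arithmetic. The snippet below is a minimal illustration, not Bailian's actual billing code: the function name `input_cost`, the per-million-token pricing unit, and the example price of 1.0 are all hypothetical, and only the 40% and 20% ratios come from the notice.

```python
def input_cost(total_input_tokens: int,
               cached_tokens: int,
               input_price_per_million: float,
               cached_ratio: float = 0.20) -> float:
    """Estimate the input-side cost of one request.

    cached_ratio is the cached_token unit price as a fraction of the
    input_token unit price (0.40 before the adjustment, 0.20 after).
    The per-million pricing unit is an assumption for illustration.
    """
    if not 0 <= cached_tokens <= total_input_tokens:
        raise ValueError("cached_tokens must be between 0 and total_input_tokens")

    # Tokens that miss the cache are billed at the standard input_token price.
    uncached_tokens = total_input_tokens - cached_tokens
    # Tokens that hit the cache are billed at the discounted cached_token price.
    cached_price = input_price_per_million * cached_ratio

    total = uncached_tokens * input_price_per_million + cached_tokens * cached_price
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 10,000 input tokens, 8,000 of which hit the cache,
    # at a made-up input_token price of 1.0 per million tokens.
    before = input_cost(10_000, 8_000, 1.0, cached_ratio=0.40)
    after = input_cost(10_000, 8_000, 1.0, cached_ratio=0.20)
    print(f"before adjustment: {before:.6f}")
    print(f"after adjustment:  {after:.6f}")
```

Under these assumed numbers, the cost of the cached portion halves, while the cost of the uncached portion is unchanged, so the overall saving depends on how much of each request hits the cache.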
