What happened?
In #28770 we have enabled caching for state api, which improves performance of iterable side inputs. Unfortunately, this change can result in a performance regression for certain pipelines that use fully materialized side inputs, such as side inputs that use AsDict
, AsList
views.
As a precaution, we recommend users of 2.52.0, 2.53.0 and 2.54.0 Beam SDK to supply an additional pipeline option --max_cache_memory_usage_mb=0
to restore the prior behavior, particularly if the pipelines use large materialized side inputs.
Issue Priority
Priority: 2 (default / most bugs should be filed as P2)
Issue Components