Current version of FastNetMon uses single CPU core to calculate traffic speed. Even on fastest CPUs we can handle around ~1m+ active hosts with traffic. We need to be able to process even more active hosts by scaling speed processing by multiple CPU cores. Our current plan to shard all speed counters into multiple segments and process them independently.