Andi has been doing some awesome performance testing of Openfire and finds that there is a major regression of scalability from Openfire 3.9.3 to 3.10.1. Here are some Tsung reports
Andi then did a test with simply replacing MINA in 3.10.1 from version 2.0.9 to 2.0.7
Here's the test result for 3.10.1 with mina 2.0.7. Very similar to the 3.9.3 performance. Also got to around 15k users with memory being stable around 300-400MB. https://www.dropbox.com/s/23ucnvhtvqze6m4/15ksimul3.10.1-mina2.0.7-20150617-1252.zip?dl=0
This MINA Jira issue may be related
It may be time to consider downgrading Openfire to MINA 2.0.7
I'll grab this one. Looks like we need to (at least) revert this commit:
and maybe several others related to NIOConnection.
PR submitted against master (3.11.0), but can also be merged into 3.10 branch as needed to support a 3.10.2 patch release.
(06/18/2015 10:37:06 PM) Andi: Alright so the performance tests from today are looking pretty good. 3.10.1 with Tom's changes to revert mina back to 2.0.7 are on par with the performance of 3.9.3.
(06/18/2015 10:38:06 PM) Andi: The test we ran today was to get about 19k users logged in, have them IM with each other as well as create and join some of about 250 rooms.
(06/18/2015 10:38:38 PM) Andi: Some of these rooms ended up with 200 to 350 users, all of which are posting messages. Not at crazy rates but somewhat realistic scenarios
(06/18/2015 10:39:00 PM) Andi: So overall a pretty decent load of connected users, IM messages as well as MUC activity going on.
(06/18/2015 10:41:09 PM) Andi: Also the earlier thing I mentioned about higher mem usage of 3.10.1 is a non issue. while at the end after the test, 3.10.1 ended up with more 'used' memory, most of that was unreachable objects and just didn't get garbage collected yet. Here's the heapdump MAT analysis of that: https://www.dropbox.com/s/mgy6yd23rn9fo6z/3.10.1_heapdump_20150618_high_mem_usage.zip?dl=0
(06/18/2015 10:41:58 PM) Andi: and here are the 4 Tsung reports for the 2 runs of the test against 3.9.3 and 2 runs of the test against 3.10.1 with Tom's changes. Also Openfire was not restarted between the 2 3.10.1 runs.
(06/18/2015 10:42:16 PM) Andi: https://www.dropbox.com/s/uj01y91ulfssoqg/muc20k-3.9.3-20150618-1005.zip?dl=0
(06/18/2015 10:42:29 PM) Andi: https://www.dropbox.com/s/gffzu5ak9tlsgop/muc20k-3.9.3-20150618-1126.zip?dl=0
(06/18/2015 10:42:42 PM) Andi: https://www.dropbox.com/s/7wku413ddq09fnh/muc20k-3.10.2-20150618-1748.zip?dl=0
(06/18/2015 10:42:55 PM) Andi: https://www.dropbox.com/s/vikfr7lvbztqvu4/muc20k-3.10.2-20150618-1936.zip?dl=0