[METRON-996] ASA Parser Performance improvement - ASF JIRA

Details

Type: Improvement
Status: Done
Priority: Major
Resolution: Done
Affects Version/s: 0.4.0
Fix Version/s: 0.4.0
Labels: None

Description

I got mildly interested in parser performance as a result of some recent work on tuning, and did some very quick benchmarking with Perfidix on the ASA parser (which I hadn't really looked at before, since its volume has been relatively low).

That said, it's not exactly perf optimised. I did 3 runs of 1000 iterations on my laptop as a micro-benchmark in Perfidix (I know, scientific, right), with some changes: basically pushing all the grok statements up to pre-compile in init. The parser currently uses one grok to do the syslog bit and figure out which grok it needs for the second half, so pre-compiling makes for a large number of Grok objects upfront, which I think we can live with.
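The shape of that change can be sketched roughly as below. This is a minimal illustration, not the actual patch: it uses `java.util.regex.Pattern` as a stand-in for Grok, and the class name, the map, and the two message patterns are all hypothetical. The point is only that every pattern is compiled once in `init()` and looked up by message tag at parse time, rather than being compiled per message.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch: plain regex stands in for Grok, names are illustrative only.
final class PrecompiledParserSketch {
    // First stage: the "syslog bit", compiled once. It extracts the message tag
    // that decides which second-stage pattern is needed.
    private static final Pattern SYSLOG =
        Pattern.compile("^<(?<priority>\\d+)>.*?%(?<tag>ASA-\\d-\\d+): (?<message>.*)$");

    // Second stage: one precompiled pattern per message tag, held for the parser's lifetime.
    private final Map<String, Pattern> byTag = new LinkedHashMap<>();

    // Compile everything upfront instead of on each parse call.
    void init() {
        byTag.put("ASA-6-302013",
            Pattern.compile("^Built (?<direction>\\w+) TCP connection (?<connId>\\d+) .*$"));
        byTag.put("ASA-6-302014",
            Pattern.compile("^Teardown TCP connection (?<connId>\\d+) .*$"));
    }

    // Parse one line; returns empty if the header or the body does not match.
    Optional<Map<String, String>> parse(String line) {
        Matcher header = SYSLOG.matcher(line);
        if (!header.matches()) {
            return Optional.empty();
        }
        Pattern body = byTag.get(header.group("tag")); // map lookup, no compilation
        if (body == null) {
            return Optional.empty();
        }
        Matcher m = body.matcher(header.group("message"));
        if (!m.matches()) {
            return Optional.empty();
        }
        Map<String, String> fields = new LinkedHashMap<>();
        fields.put("tag", header.group("tag"));
        fields.put("connId", m.group("connId"));
        return Optional.of(fields);
    }

    public static void main(String[] args) {
        PrecompiledParserSketch parser = new PrecompiledParserSketch();
        parser.init();
        System.out.println(parser.parse(
            "<166>Jan 5 2016 08:52:35: %ASA-6-302013: Built inbound TCP connection 212805593 for outside:10.22.8.12/51404"));
    }
}
```

The trade-off is the one raised in this ticket: all patterns stay resident in memory for the life of the parser in exchange for skipping compilation on the hot path.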

Do you think we should do this benchmarking properly, and extend it? Does anyone have thoughts about how to build parser benchmarks into our test suite properly?

Also, since these are showing approx 20 times improvement on the P95 interval, do we think it's worth the memory (not measured, but 39 Grok objects hanging around)? If so I'll get it JIRAed up and push my new version.

Run results:

Baseline (current master as is)

= Benchmark ==================================================================================
                 unit  sum      min    max     avg    stddev  conf95         runs
========================================= TimeMeter ==========================================
. AsaBenchmark ...............................................................................
parserBenchmark  ms    5597.98  04.90  159.02  05.60  04.89   [05.01-06.20]  1000.00
parserBenchmark  ms    5503.91  04.82  149.60  05.50  04.59   [05.00-05.90]  1000.00
parserBenchmark  ms    5620.90  04.80  152.83  05.62  04.71   [04.98-06.73]  1000.00
==============================================================================================

Syslog element of Grok pulled out and pre-compiled

= Benchmark ==================================================================================
                 unit  sum      min    max     avg    stddev  conf95         runs
========================================= TimeMeter ==========================================
. AsaBenchmark ...............................................................................
parserBenchmark  ms    4299.91  03.29  120.06  04.30  03.89   [03.36-07.10]  1000.00
parserBenchmark  ms    4206.98  03.31  129.41  04.21  04.07   [03.46-05.44]  1000.00
parserBenchmark  ms    3843.05  03.28  119.39  03.84  03.79   [03.33-04.55]  1000.00
==============================================================================================

With all precompiled in a hash map (more memory use, but not by a lot)

= Benchmark ==================================================================================
                 unit  sum      min    max     avg    stddev  conf95         runs
========================================= TimeMeter ==========================================
. AsaBenchmark ...............................................................................
parserBenchmark  ms    514.68   00.22  112.35  00.51  03.55   [00.24-00.79]  1000.00
parserBenchmark  ms    472.42   00.22  105.19  00.47  03.32   [00.23-00.70]  1000.00
parserBenchmark  ms    484.40   00.21  103.71  00.48  03.27   [00.24-00.76]  1000.00
==============================================================================================
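
For reference, the speedup between the baseline and the fully precompiled version can be checked from the first run of each table. The sketch below just divides the reported figures; the class and method names are illustrative, and the numbers are copied from the tables above rather than re-measured.

```java
// Hypothetical helper: computes speedup ratios from the reported Perfidix figures.
final class SpeedupSketch {
    // Ratio of baseline time to improved time (higher means faster).
    static double ratio(double baselineMs, double improvedMs) {
        return baselineMs / improvedMs;
    }

    public static void main(String[] args) {
        // First run of "Baseline" versus first run of "all precompiled in a hash map".
        System.out.printf("avg:          %.1fx%n", ratio(5.60, 0.51));
        System.out.printf("min:          %.1fx%n", ratio(4.90, 0.22));
        System.out.printf("conf95 lower: %.1fx%n", ratio(5.01, 0.24));
        System.out.printf("conf95 upper: %.1fx%n", ratio(6.20, 0.79));
    }
}
```

On those figures the average improves by roughly 11 times and the lower conf95 bound by roughly 21 times, while the max column barely moves between the three tables.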

Issue Links

links to

GitHub Pull Request #617

People

Assignee: Unassigned

Reporter: Simon Elliston Ball

Votes: 0

Watchers: 3

Dates

Created: 11/Jun/17 12:42

Updated: 07/Nov/18 03:26

Resolved: 07/Nov/18 03:26