Python Lambda logging duplication workaround

sventech · March 29, 2017, 11:14pm

We’ve been building some complex Python code on the Serverless framework w/AWS Lambda. Recently we spent a month resolving a Lambda malfunction with Amazon support. There is a very unexpected behavior. Because the “logger” object is global to the interpreter in Python, log handlers can hang around and multiply for multiple executions of a Lambda because Amazon reuses the same environment and leaves all global variables in place – even if you try to NULL out the object you’ll find that it stays. We stored some log metadata in our handler and found that it persisted from one execution to the next, so that we had data from triggers (message UUIDs) hanging around, and also a single log message would show up multiple times.

Here is what solved it:

import logging
from logstash_formatter import LogstashFormatterV1
logger = logging.getLogger(function_name)
logger.handlers = [] # <== SOLUTION HERE
handler = logging.StreamHandler(sys.stdout)
log_fmt = "" # omitted for simplicity
formatter = LogstashFormatterV1(log_fmt)
handler.setFormatter(formatter)
logger.addHandler(handler)

This is going to be common for anyone doing log shipping to ElasticSearch with CloudWatch Logs, etc.
I’d like to see this make it into documentation so others don’t suffer as we did. I’m not sure where to direct a pull request or what would be appropriate.

kitos9112 · February 4, 2018, 11:46am

This AWS Lambda function scheme was driving me crazy… many thanks for those tips!

I did not know what to do to overcome this misfunctioning!

bill · February 5, 2018, 1:56am

Thanks for sharing, @sventech

Is it same to the problem I discovered recently?

2 weeks ago I enabled Custom Access Logging in api gateway’s stage (for example, dev), I found the logs are collected across in a lot of log steams (UUID) in same log group. They are not always the latest logs saved in latest log stream, in each log stream, the logs can be 2 weeks old to present.

boarik · February 28, 2018, 9:40am

@sventech the solution you suggested doesn’t work for us. Can you confirm that the solving line
logger.handlers = [] # <== SOLUTION HERE

Should actually be:
logging.getLogger().handlers = []

Which effectively resets the handlers of the root logger.

Thx

sventech · February 28, 2018, 10:43am

Yes, I was doing the standard format

logger = logging.getLogger()

logger.handlers = []

boarik · February 28, 2018, 1:16pm

Excellent.

Just for future knowledge or anyone else coming across this issue, here is the cause:

Lambda sets up its own form of root logger with a handler attached.
Python’s logging module creates loggers in an hierarchical manner such that each logger is a descendant of the root logger.
With this given, every logger created in the Lambda code will be a descendant of that pre-set root logger.
Python loggers also behave such that a logger would first handle the log message with its attached log handlers, and then propagate the message up the logging hierarchy to its parent. In turn each ancestor up the chain would also issue the message to its handlers and then pass the call up to its parent and so on.

The solution for this can be either deleting the root logger’s handlers altogether
OR
setting the logger’s propagate attribute to False, such that it won’t propagate the log message to its parent and prevent the ancestor’s handlers from being called.

That is:

import logging
import sys

logger = logging.getLogger("my_module")
handler = logging.StreamHandler(sys.stdout)
formatter = logging.Formatter("(%(name)s) %(message)s")
handler.setFormatter(formatter)
logger.addHandler(handler)
logger.propagate = False

def lambda_handler(event, context):
    logger.warn("Hello from Lambda")
    
    return "Hello from Lambda"

sventech · February 28, 2018, 5:59pm

Hi Bill, that is interesting but I think unrelated – would be more about Amazon infrastructure log handling vs. Python interpreter in Lambda.

oshriPP · November 10, 2019, 9:31am

Hi all,

I also cross this issue, and I wonder if there is an allegiant way to remove only the AWS handler from the handlers list?
I tried to run on the handlers-list and find the AWS handler and remove it, but the get_name function in the handler class return None.

Thanks…

Topic		Replies	Views
Lambda, APIG and Cloudwatch Serverless Framework	0	424	June 8, 2019
Log4j Logger problem Serverless Framework aws	2	1568	September 13, 2018
Seperate logs stream for single lambda function Serverless Framework aws , lambda , cloudformation , api-gateway	0	385	December 31, 2020
Unable to update log format or level in lambdas defined with serverless framework Serverless Framework aws , lambda , api-gateway	1	161	April 17, 2025
Seperate logs stream based on the requests/functions Serverless Framework aws , lambda	0	291	July 30, 2020

Python Lambda logging duplication workaround

Related topics