Hi, I want to merge multiple small files into a single multi-GB file using Lambda. I understand Lambda has limits on memory size and run time. Can this be done with Lambda, or what is the best way to do this?
Thanks in advance!
This domain-specific bioinformatics paper might give you an idea on how to proceed:
Just look at the (simple) method:
- Pull from S3 to EC2.
- Split files.
- Upload the split files to S3.
- Run lambdas on the split files.
- Pull results from the lambdas to EC2 for merging.
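For what it's worth, here is a rough sketch of just the split and merge steps as they might run on the EC2 side. It is not taken from the paper. The function names `split_file` and `merge_files`, the `.partNNNNN` naming scheme, and the chunk size are all my own assumptions, and the S3 transfers and the Lambda invocations are left out entirely. The merge streams each part to disk, so memory use does not grow with the size of the final file.

```python
import shutil
from pathlib import Path


def split_file(src, out_dir, chunk_size):
    """Split src into numbered chunks of at most chunk_size bytes.

    Hypothetical helper: the naming scheme is an assumption, not a standard.
    Returns the chunk paths in order.
    """
    src = Path(src)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    parts = []
    with src.open("rb") as f:
        index = 0
        while True:
            data = f.read(chunk_size)
            if not data:
                break
            part = out_dir / f"{src.name}.part{index:05d}"
            part.write_bytes(data)
            parts.append(part)
            index += 1
    return parts


def merge_files(parts, dest):
    """Concatenate parts into dest, in the order given.

    copyfileobj copies in small buffers, so a multi-GB result never has
    to fit in memory at once.
    """
    with open(dest, "wb") as out:
        for part in parts:
            with open(part, "rb") as f:
                shutil.copyfileobj(f, out)
    return dest
```

Something like `merge_files(split_file("big.dat", "parts", 64 * 1024 * 1024), "merged.dat")` should round-trip a file unchanged (the file names here are just placeholders). In the real workflow the parts being merged would be the results pulled back from the lambdas rather than the raw chunks.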
Hope that helps!