ashish adhav: May 2014

LOG ROTATION

Many of you might have faced this issue.Lets say we have a script try.sh and we run it as follows

sh try.sh > output

or consider we have a web server which sends its logs to some log file .

Now everything is fine until the size of the log File is small.

But once it rises you may face disk full issues.

I faced this issue with my webserver.

Now if you try to manually truncate the file it will not work.

Since it has been opened for writing and is currently being written.

Deleting it and creating new file of the same name will not work since the node is attached to the script .

Then how can you rotate the log file and compress it.

Answer is use the linux utility logrotate.

For more details do

man logrotate

It needs a config file to get the log file to be rotated.

add a cron job as follows.

/usr/sbin/logrotate /etc/logrotate.conf

the block u must append in /etc/logrotate.conf

/home/script/output {

daily

rotate 12

size 1G

compress

missingok

notifempty

copytruncate

}

here /home/script/output is the path of the output file to be rotated.

other fields mean the following

run it daily

keep log file of up to 12 days

rotate if size more than 1 GB

compress the log file after rotation.

do not worry if file is missing

notify if the file is empty

create a copy of it , and truncate the original file.

with this the original issue has been solved.

You can now save the disk space and rotate logs.

TCMALLOC

Installation steps:

packages needed

1] Libunwind

$ get latest tar from http://download.savannah.gnu.org/releases/libunwind/

$ tar -zxvf libunwind-1.1.tar.gz

$ cd libunwind-1.1

$ ./configure

$ make

$ make install

2 ] gperftools

$ get latest tar from https://code.google.com/p/gperftools/downloads/list

$ tar -zxvf gperftools-2.1.tar.gz

$ cd gperftools-2.1

$ ./configure

$ make

$ make install

WHY IS IT MORE EFFICIENT ??

A good memory allocator needs to balance a number of goals. Two of these most prominent goals are minimizing time and minimizing space usage. Speed is important for any malloc implementation because if malloc can get faster, thousands of existing applications having bottlenecks on dynamic memory allocation will get significant performance boost without the need to change any code.

Disadvantages of Traditional malloc and an improvement:

Lots of wasted space especially for small allocation objects: each Header / Footer occupies 4 bytes (in a 32 bit machine), if coalescing are adopted (without coalescing, only Header is required), every object must be surrounded by Header and Footer, so N- 8 byte objects will account for 16*N bytes.

If the size of a free object is bigger than required, but not big enough to carve into smaller objects, then there will be an internal fragmentation.

No mechanism to separate small allocations with large (eg. requesting more than 200 MB memory) ones. Cannot adjust the bin size to speed up all kinds of allocations.
In a multithreaded application, these data structures need to be protected with locks. As memory is being allocated concurrently in multiple threads, all the threads must wait in line while requests are handled once at a time. Therefore, all threads are competing for access to the same heap causing a problem known as heap contention. Adding a Second Layer (as a thread-cache) on top of the “Base Allocator” can greatly increase the scalability of the allocator .

TCMalloc’s design to counter glibc malloc’s shortages:

Memories are allocated by a run of pages instead of arbitrary sizes, thus greatly reduced sbrk or mmap system call overhead.

Instead of using Header / Footer, a global page map was used to map between a given page to the location containing info about this page. For 64-bit architecture with 4K pages, 2^52 items have to be mapped. Therefore, a three-level radix tree was used to minimize memory cost, at the beginning, about 4.5 MB are used for the mapping.

Separating the small allocation (<= 32KB) with big ones. Also, each thread gets a private thread cache, lock free multi-threaded allocation can be achieved if there is enough space in thread cache.

Large object allocations are satisfied by the central page heap, the central heap is NOT thread-safe, so a spinlock has to be taken when allocating from central page heap.

ashish adhav

Wednesday, May 28, 2014

Problem of the dangling output log file

TCMALLOC and glibc malloc