[prev in list] [next in list] [prev in thread] [next in thread]
List: illumos-discuss
Subject: Re: [discuss] netstat -a hanging on domain sockets
From: "Jim Klimov" <jimklimov () cos ! ru>
Date: 2016-04-22 13:09:58
Message-ID: 70246F2F-F852-4356-BAF3-452A6F9BCBBC () cos ! ru
[Download RAW message or body]
21 апреля 2016 г. 20:24:37 CEST, Gabriele Bulfon <gbulfon@sonicle.com> \
пишет:
> I see...but I feel cpu was normal, memory was ok, disk space too...what
> may be affecting unix domain sockets and hangs?
> BTW, this is a sample output from the same zone, now that is responding
> fine.
> Would you please give me some insights to what are the interesting
> numbers in these cases?
> sonicle@cloudserver:~$ prstat -mL PID USERNAME USR SYS TRP TFL DFL
> LCK SLP LAT VCX ICX SCL SIG PROCESS/LWPID 26840 sonicle 5.3 0.0 0.0
> 0.0 0.0 0.0 95 0.0 6 3 111 0 imapd/1 23455 sonicle 3.1 0.2 0.0
> 0.0 0.0 0.0 97 0.0 82 3 843 0 imapd/1 5952 sonicle 0.2 0.5 0.0
> 0.0 0.0 99 0.1 0.0 1K 1 13K 0 slapd/7sonicle@cloudserver:~$
> vmstat kthr memory page disk faults
> cpu r b w swap free re mf pi po fr de sr s0 s1 s2 s3 in sy
> cs us sy id 0 0 0 20345404 1559380 416 1750 0 0 0 0 0 -0 18 4 99 5798
> 49801 17818 3 2 95sonicle@cloudserver:~$ mpstatCPU minf mjf xcal intr
> ithr csw icsw migr smtx srw syscl usr sys wt idl 0 190 0 32
> 592 124 1826 11 101 81 0 2519 2 3 0 95 1 113 0
> 48 318 32 1411 3 69 71 0 4228 4 2 0 94 2 96
> 0 44 537 34 1157 3 61 67 0 4213 4 3 0 93 3
> 89 0 39 338 78 1220 3 68 75 0 3165 3 2 0 95
> 4 272 0 17 269 31 1174 5 48 64 0 2047 1 2 0
> 97 5 115 0 43 327 46 1271 3 57 69 0 3488 4 2
> 0 94 6 79 0 30 260 33 881 2 46 58 0 2622 3
> 1 0 96 7 91 0 49 336 51 1267 3 56 68 0 2930
> 4 2 0 95 8 77 0 25 257 39 719 3 42 48 0
> 2280 2 1 0 96 9 87 0 40 271 37 878 2 50 56
> 0 3109 4 1 0 95 10 95 0 48 284 45 950 2 52
> 58 0 3392 4 2 0 95 11 81 0 27 233 33 696 2
> 39 47 0 2836 3 1 0 96 12 83 0 47 327 46 1237
> 3 56 67 0 3383 4 2 0 95 13 88 0 31 346 134
> 899 2 46 57 0 3022 4 1 0 95 14 117 0 47 410
> 152 1264 3 57 68 0 4247 5 2 0 93 15 76 0 27
> 692 508 968 3 47 75 0 2319 2 2 0 96
> ----------------------------------------------------------------------------------------
> Sonicle S.r.l.
> >
> http://www.sonicle.com
> Music:
> http://www.gabrielebulfon.com
> Quantum Mechanics :
> http://www.cdbaby.com/cd/gabrielebulfon
> ----------------------------------------------------------------------------------
> Da: Robert Mustacchi
> discuss@lists.illumos.org
> Data: 21 aprile 2016 20.09.28 CEST
> Oggetto: Re: [discuss] netstat -a hanging on domain sockets
> On 4/21/16 11:07 , Gabriele Bulfon wrote:
> Hi,
> I had a strange situation on a server starting to respond very slow on
> almost any deamon (tomcat/apache/ssh/cyrus).
> While prstat/top did not show any high cpu usage (under 5%) and prstat
> -Z didn't show any very high memory usage, and
> swap look there unused...I tried running netstat -a to see how many
> sockets where there serving.
> The result was a very fast output of all the normal sockets, and then
> hanging for 10-30 seconds before the output of active UNIX domain
> sockets.
> I repeated the command some times, but didn't think about saving the
> output somewhere, so I don't know what the domain sockets state was
> (which files, where).
> I can say the machine runs amavis / spamd / lmtpd /saslauthd / cyrus /
> postfix / apache / tomcat / postgres for more than 600 users.
> Is there anything I should look deeper about the domain socket usage?
> Is it possible that they were so much busy slowing down
> every socket (even ssh took 1 to 2 minutes to take me to the bash).
> In general, you should use the -n option to netstat whenever you're
> debugging so you're not generating a lot of dns resolutions. But when
> you're looking at that kind of thing, you should start with your CPU
> microstates for processes (usually the -mL options to prstat) and tools
> like mpstat and vmstat.
> Robert
>
In vmstat looking for lag reasons, for me often there is correlation with pi/po \
(swapping page-in/out activity), sr (scanrate - how often kernel looks for pages to \
evict) and cs (context switching - good/bad values depend on amount and performance \
of cores, but bad impact reflects that the system is overloaded with active \
processes).
Small free RAM (under 128-64-32mb depending on don't know what) is also a problem - \
processes can fail to fork, etc.
But I more often deal with small old servers, where 2-4gb ram is norm, 16gb is \
supersized ;)
Jim
--
Typos courtesy of K-9 Mail on my Samsung Android
-------------------------------------------
illumos-discuss
Archives: https://www.listbox.com/member/archive/182180/=now
RSS Feed: https://www.listbox.com/member/archive/rss/182180/25758063-6f7f4185
Modify Your Subscription: \
https://www.listbox.com/member/?member_id=25758063&id_secret=25758063-83fb4fd4 \
Powered by Listbox: http://www.listbox.com
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic