Unsolved
1 Rookie • 32 Posts
July 10th, 2019 06:00
Isilon Platform API service issue
We have an issue on our cluster (currently running OneFS 7.2.0.1) where the Platform API service (papi) has started to do some weird things. I started to receive errors such as "incomplete response from server" when commands were run on the command line. This happened on 8 of the 13 nodes in the cluster, which was a bit odd. The only difference between the 8 I got the error on and the 5 that were fine is that those 5 are currently suspended in the IP pools. At the same time, we see issues getting into the WebUI and with the InsightIQ connection to the cluster.
I've read the article about isi_papi_d being in a bad state. If I restart the service, it works for about 30-40 minutes and then starts to fail again.
I noticed that when it started to fail, the process was always at the top of the list when I ran top, usually chewing up quite a bit of CPU% and in either a ucond or a kqread state. From what I can find, neither of those is good, as they seem to indicate that the process is waiting on something, somewhere.
I've tried the usual Windows fix of rebooting all the nodes, since some had been up for well over 200 days, and that hasn't worked.
So I thought I would ask the community to see if anyone has any thoughts on what could be causing this and how to get around it. All thoughts gratefully received.
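To catch the symptom described above (isi_papi_d near the top of top, using a lot of CPU while in a ucond or kqread state), batch-mode top output could be parsed with something like the sketch below. It assumes the FreeBSD-style column layout shown in the comment, which may differ on a given node. The sample text is invented for illustration, and the names `find_process`, `matches_reported_symptom`, and the 50% threshold are hypothetical.

```python
# Assumed column layout (FreeBSD-style batch top output; may differ per system):
#   PID USERNAME THR PRI NICE SIZE RES STATE C TIME WCPU COMMAND
# The sample below is made up for illustration, not captured from a real cluster.
SAMPLE_TOP = """\
  PID USERNAME  THR PRI NICE   SIZE    RES STATE   C   TIME    WCPU COMMAND
 4312 root       12  20    0   512M   301M ucond   3  41:07  87.50% isi_papi_d
 1022 root        1  20    0    40M    12M select  1   0:42   0.10% sshd
"""

# The two wait states mentioned in the post.
WAITING_STATES = {"ucond", "kqread"}


def find_process(top_output, name):
    """Return pid, state and CPU% for the first row whose command equals name."""
    for line in top_output.splitlines():
        fields = line.split()
        # Skip the header and anything that is not a 12-column process row.
        if len(fields) < 12 or not fields[0].isdigit():
            continue
        if fields[11] == name:
            return {
                "pid": int(fields[0]),
                "state": fields[7],
                "cpu": float(fields[10].rstrip("%")),
            }
    return None


def matches_reported_symptom(row, cpu_threshold=50.0):
    """True if the row shows a waiting state together with high CPU usage."""
    return (
        row is not None
        and row["state"] in WAITING_STATES
        and row["cpu"] >= cpu_threshold
    )


row = find_process(SAMPLE_TOP, "isi_papi_d")
print(row, matches_reported_symptom(row))
```

Running this periodically against real top output and logging the result with a timestamp would show how long after a restart the process drifts into that state.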


Phil.Lam
3 Apprentice • 637 Posts
July 19th, 2019 17:00
I think you answered your own question:
"The only difference between the 8 I got the error on and the 5 that were fine is these 5 are currently suspended in the IP Pools. At the same time we see this we see issues with getting into the WebUI and also InsightIQ connection to the cluster."
Those nodes' IPs won't respond if they are in a static pool. Could you use an IP from a dynamic NFS pool instead? Or just use an IP that you know is reachable.
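One way to confirm which node IPs are actually reachable before pointing clients at them is a plain TCP connect test against the web/PAPI port. This is only a sketch: it assumes the service listens on port 8080 (the usual OneFS default, worth verifying for your cluster), and `port_open` and `check_nodes` are hypothetical helper names.

```python
import socket

# Assumption: the OneFS WebUI / Platform API listens on TCP 8080.
PAPI_PORT = 8080


def port_open(host, port=PAPI_PORT, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


def check_nodes(hosts, port=PAPI_PORT, timeout=3.0):
    """Map each host to the result of the TCP connect test."""
    return {host: port_open(host, port, timeout) for host in hosts}
```

For example, `check_nodes(["192.0.2.11", "192.0.2.12"])` (placeholder documentation addresses) returns a dict of host to True/False. A successful connect only shows that something accepted the connection; the service can still return an incomplete response afterwards.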
XXeeRRFF
1 Rookie • 1 Message
October 7th, 2024 13:22
I am using an IP from a dynamic pool for all requests. The failures connecting to the PAPI service have only just started to occur.
Is there any other way to quiet this report? Or any advice on troubleshooting? I am not seeing any bad drives, and all IPs are reachable.
DELL-Sam L
Moderator
7.9K Posts
October 7th, 2024 16:13
Hello XXeeRRFF,
What OneFS version are you currently running on your Isilon system?