Quantcast
Channel: THWACK: Unanswered Discussions - All Communities
Viewing all 19321 articles
Browse latest View live

Not able to add new server in DPA

$
0
0

Error detail:-

An unknown error has occurred. The provided message is "Could not get JDBC Connection; nested exception is java.sql.SQLNonTransientConnectionException: Could not connect to server:1433: Connection reset".


After upgrading Kiwi to 9.6.5 service crashes

$
0
0

After we upgraded our Kiwi Syslog to 9.6.5 we have had an ongoing issue where any rule or config change crashes the syslog service. This happens on both of our Kiwi servers.

 

We ruled out McAfee AntiVirus and HBSS (uninstalled)

 

The error I see in the Application events shows:

 

Framework Version: v4.0.30319

Description: The process was terminated due to an unhandled exception.

Exception Info: exception code c0000096, exception address 0686733C

 

I have re-opened our case with support but I wanted to reach out to the community to see if anyone else is seeing behavior. We might have to roll back to an earlier version if we don't get something resolved soon.

 

Thanks

APE Down

$
0
0

Hello

 

Could somebody help me in the below:

 

We are having one APE and another Main Server. Some nodes are being assigned to APE. In case the APE goes down do the Main Poller will poll automatically. If no, can we set an alert with action to change the APE.

WPM Crashes When Using Run-As

$
0
0

Hello,


Has anyone else observed the issue where running the WPM Recorder using the Ran As option crashes?

We've had a handful of people looking at this one from our end and even had a support engineer from Solarwinds on.
I am hoping someone has found the solution to this problem.

 

Thank you for any information.

Hi volume of audit success events by the SAM account - why?

$
0
0

Some of my monitored servers are posting a high volume of Success Audit logon/Logoff events in the Windows event log.   Is this normal?   Why would these appear close to 100 times per minute?   Node is set to be polled every 120sec.

 

Some of my servers are working pretty hard and I'm looking on whether this is an extra stress on them.

 

Thoughts?

 

Sky2830

Connection Issues with SWQL Studio

$
0
0

I have found a couple articles on this, but not fix. I am getting an Unable to connect to information service, an error occurred when verifying security for the message. Anyone have a fix for this.

 

I have tried:

1. local and domain account.

2. time is correct between servers.

3. imported SSL Cert

4. Run as admin or not admin

 

Any help would be appreciated.

 

Thanks,

 

Cody Bulwin

change time zone download error

$
0
0

Hi twack community

I need help please

 

Last saturday there was a change of time in my country, which makes the patching to all servers with the new time, but when I make a download of the settings of some device, it is downloaded with the old time, how can I solve it?

Thanks for the help.

 

Current Time

Download with another time

DPA bug finder reward?

$
0
0

Found a pretty significant bug with DPA-  sent it in to my account rep.   Is there an official bug finder reward program at solarwinds?    Its not an application to technical functionality bug- but more of revenue impacting bug.... those are the worst kinds.


Reboot monitor for Linux systems

$
0
0

Hello,

We are using this monitor to receive emails on reboot. The windows machines are working OK but every Linux server is sending lots of false positives. I can re-create this issue by re-starting the SNMP service for example. This also occurs on Linux servers with the SolarWinds agent used.

 

We are using the following if helpful:

Orion Platform 2018.2 HF6, IPAM 4.7.0, NPM 12.3, DPAIM 11.1.0, NTA 4.2.3, SAM 6.7.0, NetPath 1.1.3

How do you implement HA?

$
0
0

Hi,

I'm currently working on HA server implementation on two sites, Location_A and Location_B  which are on a different location and subnets

 

The current status of implementation.

1. Primary server, APE and SQL located in Location_A

2. Secondary server or the HA is in Location_B, which is on a different subnets

3. There are no HA on APE and SQL server which are both in Location_A

 

Question.

1.Is this enough to only add the secondary server in location location_B? I wonder if there's a circuit outage and the primary servers only in location_A

2. Should I also considering to add HA for APE and SQL which is currently in Location_A?

3. Or with all budgetary consideration on the two items mention above, is it advisable to install a separate instance in Location_ B.

 

 

Thanks much,

Node alert using HW component details

$
0
0

Hi All,

 

I've got a Node alert working for node up/down events and i'd like to extend it to show hardware component details such as power supply status.

 

I've created a node event that I know triggers:

 

 

The trigger action is to send an email with the message body:

 

An issue has been detected at ${N=Alerting;M=AlertTriggerTime;F=DateTime} on ${N=SwisEntity;M=MachineType} device named ${N=SwisEntity;M=Caption} (IP: ${N=SwisEntity;M=IP_Address}, DNS: ${N=SwisEntity;M=DNS})

 

 

View full device details here: ${N=SwisEntity;M=DetailsUrl}.

View full alert details here: ${N=Alerting;M=AlertDetailsUrl}

Click here to acknowledge the alert: ${N=Alerting;M=AcknowledgeUrl}

 

 

HW details:

Alert ${N=Alerting;M=AlertName} at ${N=Alerting;M=AlertTriggerTime;F=DateTime}

${N=SwisEntity;M=FullyQualifiedName} is ${N=SwisEntity;M=Status;F=Status}

 

 

This message was brought to you by the alert named: ${N=Alerting;M=AlertName}

The node is monitored by the polling engine ${N=SwisEntity;M=Engine.ServerName}

 

 

However, the actual result simulation (on a node with a failed PSU) just shows:

 

An issue has been detected at Thursday, 11 April 2019 10:15 on HP 3800 Switch Stack device named SW-203-03-013-01-01 (IP: 10.152.0.21, DNS: )

 

View full device details here: https://SWIN.FQDN:443/Orion/NetPerfMon/NodeDetails.aspx?NetObject=N:52.

View full alert details here:

Click here to acknowledge the alert:

 

HW details:

Alert TEST of ## Node is DOWN/UP - Switches SMS Day Hrs at Thursday, 11 April 2019 10:15

${N=SwisEntity;M=FullyQualifiedName} is Up

 

This message was brought to you by the alert named: TEST of ## Node is DOWN/UP - Switches SMS Day Hrs

The node is monitored by the polling engine SWIN

 

Can I only solve this with SWQL?  If so, does anyone know what I would look for?

 

I can workaround this with a specific hardware alert, but that will result in another duplicate set of actions to manage,  using the action manager to enable/disable alerts to the individual engineers.

 

Any ideas will help!

 

Thanks

 

Glen.

Full Health Report/Audit

$
0
0

Is there a way or has anyone been able to use the database from Orion to create a live health report?

 

IE:

 

Site1:

  • Network
    • 8 switches
    • 5 VG
  • IDF
    • UPS
    • Camera

 

Which ever OID Orion pulls a certain percentage has to be met. 0-65% red, 66-90% yellow, 91-100% green

A webpage then has the site name and devices. Below each device the color of the device percentage online is displayed. You can click on the color marker and it tells you what is good and bad by host and ip?

Management Addressing of a WAN and LANs?

$
0
0

Hi,

 

I've got three management addresses on each L3 device on my network. 

How could I simplify this?

What's the most elegant way of setting up management addressing?

This is what I'm doing now:

 

It is good practice to put your L3 management addresses on loopback interfaces - which are always up as long as that item of equipment is functioning.

 

To monitor our WAN, of about 400 sites, we have one ‘management address per site' which is separate from the edge-customer traffic, which can be polled by monitoring tools, which could easily represent the SLA compliance or otherwise of our WAN suppliers.

That consumes 2 x ‘C’ class address ranges and is good for 2 x 254 = 508 sites

Those addresses should appear on the Lo0  interfaces of the routers or L3-switches which serve as L3-gateways across our network.

Those addresses are e.g 10.253.0.xxx/24 and 10.254.0.xxx/24, so one particular site in the south of our WAN would have this address:

10.253.0.88  ANYTOWN-RTR1-Lo0  # Lo0  WAN management address on Cisco WS-3650-24PS

 

And those addresses aren't summarized in the routing table.

 

But we also need to monitor and manage all the network devices within each site.  Now, since the router or L3 switch is always on the LAN, we should be able to deprecate the addressing for ‘one management address per site’.  But we can’t. Because, if you give the first address in the subnet to your router but put it on a loopback interface, then it can’t act as the gateway to the rest of that subnet.  So you won’t be able to monitor and manage your physical L2 switches or the L2 wireless access points which are the rest of that subnet. So the L3 device has to have TWO management addresses.

e.g.

10.251.xxx.0/24 for LAN management in the south, 10.252.xxx.0/24 for LAN management in the north

 

Since our WAN provider are offering SLAs on those WAN links for us, they need a management address as well, fenced off by different security ACLs.  So that’s 3 @ IP addresses per router just for management.  Which is naturally confusing for every new hire we get.

e.g.

10.250.0.xxx/24 for exterior monitoring on Lo1 in the south, 10.251.0.xxx/24 for exterior provider monitoring on Lo1 in the North.

 

So ANYTOWN-RTR1 has

10.250.0.88  ANYTOWN-RTR1-Lo1  # Lo1 External Contractor management address on Cisco WS-3650-24PS

10.251.88.1  ANYTOWN-RTR1-v201  # Vlan201 LAN Management address on  Cisco WS-3650-24PS

10.253.0.88  ANYTOWN-RTR1-Lo0  # Lo0  WAN management address on Cisco WS-3650-24PS

 

And that's a lot of config and router table entries just for management.

 

Does anyone have a more elegant, economic way of doing management addressing?

Help for Backup for fortigate 100d in Kiwi Cattools

$
0
0

I would like a help to be able to configure a backup of our firewalls, the great majority are fortigate 100d, i have kiwi catools

 

Tks

Pulling in specific charts from WPM transactions

$
0
0

Hello,

 

What is the best way about pulling in SolarWinds WPM charts from 'Transaction Details - Summary'?  The way I am doing it now is through Custom HTML and using iframes, I feel like there is probably an easier way to do this.  This is the chart I want to pull into a WPM dashboard:

 

 

Here is my dashboard with the Custom HTML widgets:

 

Now this is all fine and dandy, but there are some pain points, I had to configure each chart with the 'Custom HTML', when trying to use these tabs with the 'NOC' view I get blank white screens instead:

 

Is there a better method of going about this?

 

Thank you!


How to report on a UnDP (universal device poller)

$
0
0

     I am trying to create a report that shows information from a UnDP. I am unable to find the field in report writer. Does anyone happen to have a walkthrough?

Compare two netflow sources

$
0
0

Hi Everyone,

 

Is it possible to create a view that will show two netflow sources "top 5 conversations" side by side? Ive been playing with the netflow navigator and am not finding it. Using NTA 4.2.3

Disk vs Volume (SAM or NPM)

$
0
0

Hi Everyone,

 

I have been tasked to setup IOP monitoring for our database servers running in SUSE. I have installed the agent as the functionality is built in. I am now able to see IOPs for the volumes. However, I would like to get IOPs data on the disk drives themselves instead of the volumes (or in addition). Is this possible? For example sda and sdb. That way, we would know if a particular drive is getting an abnormal amount of activity. I appreciate the help.

 

Thanks,

Mike

Top CPUs by Percent Load granularity

$
0
0

I'm trying to have a 1 min granularity in the Top CPUs by Percent Load graph under any node detail. I changed the Node Status Polling to 60 secs and the Collect Statistics Every 1 minute and then clicked the Top CPUs by Percent Load graph, clicked Edit but the minimum Sample Interval showing there is 5 mins. Is there any way this can be lowered to 1 min? I see the 1 min Sample Interval is available for the Min/Max/Average bits per second graph.

Windows Powershell Script produces Output Results : Not Defined when run from SAM

$
0
0

Hello,

 

I am attempting to run a Windows Powershell Component Monitor in order to monitor the DFS backlog on one of our file servers.  I am running into an issue with getting this created in SAM 6.7.  When I set the test node and run it against the target server I am seeing a "Down" result.  When I go to the script Body within the monitor and test the script itself for Output, I get a Output Results : Not Defined message.

 

When I run this same script from the Solarwinds server via Powershell it completes without error.

 

Attached is the powershell script I am trying to get working.  I have also imported a couple "DFS Backlog" templates from THWACK and am seeing the same output.

Viewing all 19321 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>