View Issue Details

IDProjectCategoryView StatusLast Update
0002177GNUnetstatistics servicepublic2012-06-02 19:15
ReporterChristian Grothoff Assigned ToBart Polot  
PrioritynormalSeverityminorReproducibilitysometimes
Status closedResolutionfixed 
Product Version0.9.2 
Target Version0.9.3Fixed in Version0.9.3 
Summary0002177: Stopping gnunet-arm fails sometimes
DescriptionAlso with an odd error message that should be improved to better state what the error was:

Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
Timeout communicating with ARM service.
Feb 24 13:47:04-748739 test_dht_topo-7614 WARNING test: FAILED!
FAIL: test_dht_2dtorus
TagsNo tags attached.

Relationships

has duplicate 0002174 closedBart Polot test_mesh_2dtorus fails with a strange error message 

Activities

Christian Grothoff

2012-02-25 02:24

manager   ~0005508

http://gnunet.org:8010/builders/fedora15-x86-grothoff/builds/1831/steps/tests%20dht/logs/stdio

Christian Grothoff

2012-02-26 21:44

manager   ~0005521

grothoff@spec:~/svn/gnunet/src/dht$ while true; do ./test_dht_2dtorus && echo ok || break; done
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
ok
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
ok
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
ok
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
ok
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
ok
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
ok
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
ok
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
ok
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
ok
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
ok
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
Timeout communicating with ARM service.
Timeout communicating with ARM service.
Feb 26 20:37:07-165484 test_dht_topo-10432 WARNING test: FAILED!

Bart Polot

2012-02-28 17:41

manager   ~0005532

Added extra logging to print cause of error.

Bart Polot

2012-02-28 23:06

manager   ~0005538

Problems seems to be in the testing library: the call to gnunet-arm times out and the testcase finished without killing some of the peers.

Bart Polot

2012-02-29 15:34

manager   ~0005543

[bart@saturn ~/svn/gnunet/src/dht]$ ./test_dht_2dtorus
Hostkeys created Progress: [0%...25%...50%...75%...100%]
Peers started Progress: [0%...25%...50%...75%...100%]
Peer connection Progress: [0%.......25%.......50%.......75%.......100%]
Feb 29 15:18:02-405554 test_dht_topo-5113 INFO FOUND 1!
Feb 29 15:18:02-405653 test_dht_topo-5113 INFO FOUND 2!
Feb 29 15:18:02-406074 test_dht_topo-5113 INFO FOUND FAR!
Feb 29 15:18:02-406174 test_dht_topo-5113 INFO disconnecting peers
Feb 29 15:18:02-406227 test_dht_topo-5113 INFO Ending test.
Timeout communicating with ARM service.
Timeout communicating with ARM service.
Feb 29 15:20:02-600987 test_dht_topo-5113 WARNING Shutdown of peers failed!
[bart@saturn ~/svn/gnunet/src/dht]$ ps aux | grep gnunet
bart 5143 0.0 0.0 20444 1396 pts/10 SN 15:17 0:00 gnunet-service-arm -c /tmp/test_dht_topo//3//gnunet-testing-configFtVWOy -L DEBUG
bart 5146 0.0 0.0 20444 1376 pts/10 SN 15:17 0:00 gnunet-service-arm -c /tmp/test_dht_topo//11//gnunet-testing-configcs8WTv -L DEBUG
bart 5158 0.0 0.0 41724 2276 pts/10 SN 15:17 0:00 gnunet-service-dht -c /tmp/test_dht_topo//11//gnunet-testing-configcs8WTv
bart 5160 0.0 0.0 41724 2292 pts/10 SN 15:17 0:00 gnunet-service-dht -c /tmp/test_dht_topo//3//gnunet-testing-configFtVWOy
bart 5296 0.0 0.0 10588 1092 pts/10 S+ 15:20 0:00 grep --color=auto gnunet
[bart@saturn ~/svn/gnunet/src/dht]$ ls /tmp/
akonadi-bart.BTmlh0 gnunet-pipe-4Fc1qZ gnunet-pipe-87nPpE gnunet-pipe-CsaEze gnunet-pipe-F9NQc0 gnunet-pipe-j0nVoe gnunet-pipe-Ly48i5 gnunet-pipe-oWNyr5 gnunet-pipe-RcE1X4 gnunet-pipe-tZzLZb gnunet-pipe-xG7TRT pulse-n7CcctT9RwKS
bartgnunet gnunet-pipe-4fgBQd gnunet-pipe-8rIpmg gnunet-pipe-d9my6p gnunet-pipe-FernHU gnunet-pipe-JreSoC gnunet-pipe-M1RRWn gnunet-pipe-PDmhBa gnunet-pipe-rHzMA5 gnunet-pipe-UaczPE gnunet-pipe-xPHMNv screens
cron.gsOzPP gnunet-pipe-4Xrrit gnunet-pipe-9tiUP4 gnunet-pipe-DHlPKe gnunet-pipe-fxvQVs gnunet-pipe-JtDCaJ gnunet-pipe-MIwQDN gnunet-pipe-pE66VC gnunet-pipe-RiYY5f gnunet-pipe-uJfUDa gnunet-pipe-y3uelO ssh-pGRQfiJw4333
gnunet-pipe-027jo0 gnunet-pipe-5r0JED gnunet-pipe-afuAxS gnunet-pipe-Dng9PE gnunet-pipe-fz6KLm gnunet-pipe-k1r08m gnunet-pipe-mMcUzT gnunet-pipe-PfzbJd gnunet-pipe-rjbAyQ gnunet-pipe-UR5dON gnunet-pipe-ySBTi1
gnunet-pipe-0sqbwv gnunet-pipe-6Wlg4l gnunet-pipe-aVciLh gnunet-pipe-DniKWw gnunet-pipe-GiRYVO gnunet-pipe-KA4YlO gnunet-pipe-n0yIyn gnunet-pipe-pG2hxl gnunet-pipe-Rjr0ch gnunet-pipe-vNJJmT gnunet-pipe-ySWjov
gnunet-pipe-2esTdZ gnunet-pipe-6x8AZu gnunet-pipe-bfDL11 gnunet-pipe-dq2IvL gnunet-pipe-GnGi5f gnunet-pipe-kBuZXQ gnunet-pipe-N1cA7l gnunet-pipe-q4DYW5 gnunet-pipe-rL0dQC gnunet-pipe-VNpQGK gpg-nTveNi
gnunet-pipe-2x7SGK gnunet-pipe-70kGMW gnunet-pipe-BFHVr7 gnunet-pipe-Dqja9W gnunet-pipe-He4TpE gnunet-pipe-kLRvdQ gnunet-pipe-NBRORd gnunet-pipe-qhF48F gnunet-pipe-rpvn4V gnunet-pipe-vqOxL2 kde-bart
gnunet-pipe-32GbyW gnunet-pipe-73Zrif gnunet-pipe-bqC1Dx gnunet-pipe-dwsQWv gnunet-pipe-Hkvkt2 gnunet-pipe-Ks0y7q gnunet-pipe-O6VBCm gnunet-pipe-qJhVtP gnunet-pipe-s9Q6c5 gnunet-pipe-wc5koq kde-kdm
gnunet-pipe-3OdEmw gnunet-pipe-7QCaOB gnunet-pipe-BuCSaN gnunet-pipe-dXu6cn gnunet-pipe-hoPPmP gnunet-pipe-L6RwhE gnunet-pipe-oOTQwn gnunet-pipe-R0xwmI gnunet-pipe-SqOMSr gnunet-pipe-WtlolO ksocket-bart
gnunet-pipe-40Tge6 gnunet-pipe-7ubPj8 gnunet-pipe-c5aOs0 gnunet-pipe-EFKj1v gnunet-pipe-i7VCYN gnunet-pipe-L7iU3m gnunet-pipe-ouzHRr gnunet-pipe-R2rPFv gnunet-pipe-sTjCaF gnunet-pipe-wzPLjN ksocket-kdm
gnunet-pipe-4A6bvr gnunet-pipe-84WipD gnunet-pipe-cj3EMT gnunet-pipe-EXmMdW gnunet-pipe-IVLrVC gnunet-pipe-LQewFD gnunet-pipe-OW0JF4 gnunet-pipe-r8i8e8 gnunet-pipe-tddmbe gnunet-pipe-xdFBjf plugtmp
[bart@saturn ~/svn/gnunet/src/dht]$

Two 'Timeout communicating with ARM service.' messages, two peers left running. Since the /tmp/test_dht_topo directory is deleted, cannot try to gnunet-arm -e them again.

Bart Polot

2012-02-29 15:40

manager   ~0005544

[bart@saturn ~]$ gnunet-arm -e -c /tmp/test_dht_topo/6/gnunet-testing-config3raBO6
^CFeb 29 15:39:10-571080 arm-api-6490 WARNING Failed to transmit shutdown request to client.
Error communicating with ARM service.
[bart@saturn ~]$

Bart Polot

2012-03-05 17:41

manager   ~0005565

New suspect: statistics-api. If sync_first = yes does not cancel scheduled reconnect task with big timeout, making dht to hang there, making arm hang there, making testing fail.

Bart Polot

2012-03-05 17:56

manager   ~0005566

Changing to GNUNET_STATISTICS_destroy (GDS_stats, GNUNET_NO) in dht service seems to make the bug go away, I'll try to fix the statistics api.

Bart Polot

2012-03-05 18:40

manager   ~0005568

Fixed in r20285.

Issue History

Date Modified Username Field Change
2012-02-24 22:50 Christian Grothoff New Issue
2012-02-24 22:50 Christian Grothoff Assigned To => Bart Polot
2012-02-24 22:50 Christian Grothoff Status new => assigned
2012-02-25 02:24 Christian Grothoff Note Added: 0005508
2012-02-26 21:44 Christian Grothoff Note Added: 0005521
2012-02-26 21:45 Christian Grothoff Relationship added related to 0002174
2012-02-28 17:41 Bart Polot Note Added: 0005532
2012-02-28 17:41 Bart Polot Status assigned => acknowledged
2012-02-28 23:06 Bart Polot Note Added: 0005538
2012-02-28 23:06 Bart Polot Category DHT service => testing library
2012-02-29 15:34 Bart Polot Note Added: 0005543
2012-02-29 15:40 Bart Polot Note Added: 0005544
2012-02-29 15:41 Bart Polot Category testing library => ARM service
2012-02-29 15:41 Bart Polot Product Version Git master => 0.9.2
2012-02-29 15:41 Bart Polot Target Version => 0.9.3
2012-02-29 15:41 Bart Polot Summary test_dht_2dtorus fails sometimes => Stopping gnunet-arm fails sometimes
2012-02-29 15:42 Bart Polot Relationship replaced has duplicate 0002174
2012-03-05 17:41 Bart Polot Note Added: 0005565
2012-03-05 17:41 Bart Polot Category ARM service => statistics service
2012-03-05 17:56 Bart Polot Note Added: 0005566
2012-03-05 18:40 Bart Polot Note Added: 0005568
2012-03-05 18:40 Bart Polot Status acknowledged => resolved
2012-03-05 18:40 Bart Polot Fixed in Version => 0.9.3
2012-03-05 18:40 Bart Polot Resolution open => fixed
2012-06-02 19:15 Christian Grothoff Status resolved => closed