-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathrunc2.log
More file actions
5895 lines (5895 loc) · 295 KB
/
runc2.log
File metadata and controls
5895 lines (5895 loc) · 295 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00001_1_clip_discriminator=10,dr_cc=0.8000,seed=1_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s1/08.10_20.40.13/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type2-infogsdr_ppo-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s1
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 2
ARG C_DATA: 2
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 10 Val 4 Test 11
Total data pairs: 9990, K 10, state dim 17, action dim 6, a min -0.7615941, a_max 0.7615942
Policy model is loaded from logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00001_1_clip_discriminator=10,dr_cc=0.8000,seed=1_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s1/08.10_20.40.13/models/ckpt_policy_T15000000.pt
Discr model is loaded from logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00001_1_clip_discriminator=10,dr_cc=0.8000,seed=1_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s1/08.10_20.40.13/models/ckpt_discr_T15000000.pt
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 980.7661333333334 63.380688749667456 1000.0 -39.2
LR 0.9823999999999999 0.08949994413406076 1.0 -0.2
VL-std-all 0.24629206468980377 0.0
*** budget = 10
VL10-1 0.8811929289514988 0.3469696893178786
RT-VL10-1 928.3866666666667 152.33209627506463 999.6 -39.2
VL10-2 0.19344818373625156 0.23200044732728056
RT-VL10-2 981.7466666666667 78.87170229063288 999.6 276.4
VL10-3 0.3149717544832664 0.21861619491143122
RT-VL10-3 958.6506666666666 73.59508022204263 1000.0 659.2
VL10-4 0.11451995448298002 0.11637223719726775
RT-VL10-4 988.7626666666669 25.568453340439316 999.2 842.4
VL10-5 0.5632536545068427 0.15507635050486565
RT-VL10-5 995.344 1.4095143371625105 998.8 988.8
VL10-all 0.4134772952321679 0.27881266645358627
RT-VL10-all 970.5781333333332 24.458723849329925 995.344 928.3866666666667
*** budget = 20
VL20-1 0.7292141045916122 0.2835625496111074
RT-VL20-1 873.6373333333333 194.89382735108765 999.6 -39.2
VL20-2 0.11637362469235231 0.04948518462954205
RT-VL20-2 979.344 97.26128416452939 999.6 276.4
VL20-3 0.18513454853757505 0.16287672268294057
RT-VL20-3 952.3040000000001 78.72744788615128 998.8 659.2
VL20-4 0.05487331544478271 0.06766963121630232
RT-VL20-4 984.7893333333335 33.26454598450963 999.2 842.4
VL20-5 0.48923232388938354 0.0870628415025192
RT-VL20-5 995.4933333333333 1.2587118635953038 998.8 992.8
VL20-all 0.3149655834311411 0.25530906199236586
RT-VL20-all 957.1136 44.10115685376068 995.4933333333333 873.6373333333333
*** budget = 30
VL30-1 0.6472089189138814 0.29961862196611805
RT-VL30-1 847.32 175.0273350079924 999.6 276.4
VL30-2 0.1031499077309361 0.04137710463010622
RT-VL30-2 980.7919999999999 100.67523993514989 999.6 276.4
VL30-3 0.1272511648398945 0.1290545344042711
RT-VL30-3 950.4239999999998 81.29769138173604 998.8 659.2
VL30-4 0.02938459799545712 0.035922111563027485
RT-VL30-4 988.24 24.798580604542675 999.2 870.4
VL30-5 0.4587436431879081 0.05613873604464932
RT-VL30-5 995.648 1.2487177423261058 998.8 993.2
VL30-all 0.27314764653361545 0.23831623858650108
RT-VL30-all 952.4848 54.784442661763 995.648 847.32
*** budget = 40
VL40-1 0.5533992271742876 0.2924138073745708
RT-VL40-1 801.081081081081 179.23833723276502 998.8 276.4
VL40-2 0.08932664999016506 0.028973630136598906
RT-VL40-2 975.3513513513514 116.53098792519738 999.6 276.4
VL40-3 0.10208817409012315 0.09744826179265968
RT-VL40-3 931.2540540540539 95.16193480019201 998.8 659.2
VL40-4 0.024275277703882717 0.028102601621917456
RT-VL40-4 980.7567567567568 38.865398635585805 999.2 845.6
VL40-5 0.4456243530650585 0.05312842972328913
RT-VL40-5 995.6432432432431 1.2504141023353859 998.8 993.2
VL40-all 0.24294273640470337 0.21387680454777983
RT-VL40-all 936.8172972972973 71.17768105796154 995.6432432432431 801.081081081081
*** budget = 50
VL50-1 0.5275151393284777 0.3071402935974114
RT-VL50-1 805.0933333333334 166.67752284643007 999.6 410.0
VL50-2 0.0857926794519933 0.030779036090712766
RT-VL50-2 970.88 128.99816639523732 999.6 276.4
VL50-3 0.07822511040468019 0.05954640016257102
RT-VL50-3 929.1466666666668 96.81392370016941 998.8 659.2
VL50-4 0.01534891745469095 0.012894031977274463
RT-VL50-4 986.3066666666668 29.33388181305404 998.0 870.4
VL50-5 0.4418987401918711 0.046948896827505096
RT-VL50-5 995.52 1.29573659874734 998.8 993.2
VL50-all 0.22975611736634263 0.2113397702658119
RT-VL50-all 937.3893333333333 69.94837786698291 995.52 805.0933333333334
logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00003_3_clip_discriminator=10,dr_cc=0.8000,seed=2_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s2/08.10_20.40.13/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type2-infogsdr_ppo-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s2
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 2
ARG C_DATA: 2
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 10 Val 4 Test 11
Total data pairs: 9990, K 10, state dim 17, action dim 6, a min -0.7615941, a_max 0.7615942
Policy model is loaded from logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00003_3_clip_discriminator=10,dr_cc=0.8000,seed=2_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s2/08.10_20.40.13/models/ckpt_policy_T15000000.pt
Discr model is loaded from logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00003_3_clip_discriminator=10,dr_cc=0.8000,seed=2_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s2/08.10_20.40.13/models/ckpt_discr_T15000000.pt
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 980.9658666666667 58.547404909032664 999.2 459.6
LR 0.9738666666666667 0.12202615384507627 1.0 -0.2
VL-std-all 0.3465737833420683 0.0
*** budget = 10
VL10-1 1.0217179487504113 0.09473234922100898
RT-VL10-1 981.2373333333334 73.30091909079691 999.2 520.4
VL10-2 0.22196465891694545 0.06603411552777039
RT-VL10-2 982.5386666666668 71.81068424374995 999.2 520.4
VL10-3 0.4484790600184554 0.2115810926449184
RT-VL10-3 952.3039999999999 88.79855470295298 998.4 636.4
VL10-4 0.14569840996301883 0.1747326946500799
RT-VL10-4 971.1626666666668 57.74759509759307 997.6 794.4
VL10-5 0.3965607038203758 0.30021839837094527
RT-VL10-5 992.5066666666667 13.831416734698182 997.2 894.8
VL10-all 0.4468841562938414 0.3079839335516945
RT-VL10-all 975.9498666666666 13.620143242190194 992.5066666666667 952.3039999999999
*** budget = 20
VL20-1 0.9781997827702598 0.10536032544508056
RT-VL20-1 966.9119999999997 101.65639505707449 999.2 520.4
VL20-2 0.18855989972509593 0.05877994173786592
RT-VL20-2 969.4986666666664 99.85920954802093 999.2 520.4
VL20-3 0.3124252408942266 0.18606619145773703
RT-VL20-3 930.144 106.66120599355702 998.4 636.4
VL20-4 0.07108707103017545 0.08740024147079957
RT-VL20-4 971.3973333333332 57.86477563730421 997.6 794.4
VL20-5 0.24352531728535776 0.07890595044041325
RT-VL20-5 994.5653333333332 1.4086867012299862 997.2 988.8
VL20-all 0.35875946234102307 0.3196421044463387
RT-VL20-all 966.5034666666664 20.70056632354665 994.5653333333332 930.144
*** budget = 30
VL30-1 0.9474909194212424 0.11299256006555167
RT-VL30-1 958.2080000000001 117.20609513160994 999.2 520.4
VL30-2 0.1680310948534963 0.05351215810995681
RT-VL30-2 962.0880000000001 115.137101995838 999.2 520.4
VL30-3 0.2561462469332724 0.1736482226849845
RT-VL30-3 932.072 104.40075677886632 998.4 686.8
VL30-4 0.04337516321965173 0.0358951445421966
RT-VL30-4 970.808 60.44435404568404 997.2 794.4
VL30-5 0.21465293528305643 0.04202000198972937
RT-VL30-5 994.8079999999999 1.2457672334750147 997.2 991.6
VL30-all 0.32593927194214384 0.3188603561060755
RT-VL30-all 963.5968 20.254723245702426 994.8079999999999 932.072
*** budget = 40
VL40-1 0.9238927691916328 0.11999522230650814
RT-VL40-1 945.0594594594594 133.78295342357652 998.8 520.4
VL40-2 0.15128038507399025 0.047769823432461306
RT-VL40-2 962.2594594594594 115.01630309693576 999.2 520.4
VL40-3 0.20343727494157654 0.13884414765772538
RT-VL40-3 926.8972972972972 110.86449115068721 997.6 687.2
VL40-4 0.03867803849035818 0.036546383969238934
RT-VL40-4 973.0486486486485 57.87795930965021 997.6 796.8
VL40-5 0.20339583882201592 0.034166236846192334
RT-VL40-5 994.7891891891891 1.0944043878783818 996.4 991.6
VL40-all 0.3041368613039147 0.3156643896929024
RT-VL40-all 960.4108108108106 23.254354101121265 994.7891891891891 926.8972972972972
*** budget = 50
VL50-1 0.905404367892183 0.1259731885598874
RT-VL50-1 933.2266666666668 146.06164665038602 999.2 520.4
VL50-2 0.14613841264165162 0.051082962651545244
RT-VL50-2 939.7066666666668 144.36294753925685 999.2 520.4
VL50-3 0.1692464080515594 0.12815500146827277
RT-VL50-3 912.7733333333334 109.55341751350748 997.6 687.2
VL50-4 0.031133487710850657 0.03273890245448125
RT-VL50-4 981.9066666666666 48.53025814433256 996.0 796.8
VL50-5 0.19591677952469977 0.02590095088544801
RT-VL50-5 994.7333333333335 1.159693446083442 996.4 992.0
VL50-all 0.2895678911641889 0.31300565882297204
RT-VL50-all 952.4693333333335 30.860137999259365 994.7333333333335 912.7733333333334
logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00005_5_clip_discriminator=10,dr_cc=0.8000,seed=3_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s3/08.10_20.40.14/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type2-infogsdr_ppo-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s3
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 2
ARG C_DATA: 2
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 10 Val 4 Test 11
Total data pairs: 9990, K 10, state dim 17, action dim 6, a min -0.7615941, a_max 0.7615942
Policy model is loaded from logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00005_5_clip_discriminator=10,dr_cc=0.8000,seed=3_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s3/08.10_20.40.14/models/ckpt_policy_T15000000.pt
Discr model is loaded from logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00005_5_clip_discriminator=10,dr_cc=0.8000,seed=3_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s3/08.10_20.40.14/models/ckpt_discr_T15000000.pt
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 979.8797333333334 50.47522999051669 999.6 537.6
LR 0.9685333333333334 0.12424377471549856 1.0 -0.2
VL-std-all 0.3651468388515224 0.0
*** budget = 10
VL10-1 0.6971990951224701 0.27364390341460065
RT-VL10-1 988.5786666666668 41.5553744404847 999.6 537.6
VL10-2 0.14169590414150607 0.1322136175388517
RT-VL10-2 992.224 37.33190892520767 999.6 537.6
VL10-3 0.3050205427648619 0.18984145902633548
RT-VL10-3 944.1519999999999 93.36688543589746 998.0 644.0
VL10-4 0.18267212895375928 0.14445714266778062
RT-VL10-4 950.6213333333334 54.797071195051615 997.6 779.6
VL10-5 0.7147079291158039 0.07576333637806826
RT-VL10-5 994.5466666666666 7.566440525255072 999.2 908.4
VL10-all 0.4082591200196803 0.24899832298227328
RT-VL10-all 974.0245333333335 21.928426111227328 994.5466666666666 944.1519999999999
*** budget = 20
VL20-1 0.5430458205577047 0.19684460798799222
RT-VL20-1 984.8266666666668 55.36120743705731 998.8 537.6
VL20-2 0.07753845651813589 0.07513867456271495
RT-VL20-2 989.3173333333332 52.558752834856676 999.6 537.6
VL20-3 0.2076031336017836 0.15404913906245962
RT-VL20-3 921.9253333333332 108.70987823049425 997.6 644.0
VL20-4 0.09900202298704266 0.08935811498784725
RT-VL20-4 941.1306666666669 52.74968808333268 997.6 846.4
VL20-5 0.6844729532740812 0.030614965034027646
RT-VL20-5 995.344 1.4483314537770677 998.8 989.6
VL20-all 0.3223324773877496 0.2460989764181617
RT-VL20-all 966.5088 29.390397556269207 995.344 921.9253333333332
*** budget = 30
VL30-1 0.47134448203264795 0.14874557951819659
RT-VL30-1 991.936 5.5421930677305 999.6 964.8
VL30-2 0.049563349073557986 0.04175520765436498
RT-VL30-2 995.752 2.028422046813729 999.6 990.4
VL30-3 0.15890059404897128 0.13554090068255337
RT-VL30-3 911.6800000000002 110.07656971399499 997.6 715.6
VL30-4 0.06871267716021472 0.06519033177125132
RT-VL30-4 942.04 50.27700866201171 997.2 856.4
VL30-5 0.6756245085244449 0.028025440096738866
RT-VL30-5 995.3599999999999 1.2572986916401352 997.6 990.8
VL30-all 0.28482912216796735 0.24715311426307204
RT-VL30-all 967.3536 34.45417256356616 995.752 911.6800000000002
*** budget = 40
VL40-1 0.43245130606683724 0.14132442285096466
RT-VL40-1 990.6918918918918 6.970919454588948 998.8 964.8
VL40-2 0.041063762404311435 0.037604061644834604
RT-VL40-2 995.4810810810811 1.998626263054655 999.6 990.4
VL40-3 0.12922377003431826 0.11340853043426473
RT-VL40-3 910.1081081081084 111.97121825738238 997.6 715.6
VL40-4 0.048852872311649434 0.04510283796890388
RT-VL40-4 944.4540540540542 51.979718469931086 997.2 866.8
VL40-5 0.6678184662662607 0.023963504711901007
RT-VL40-5 995.4702702702701 1.3637913894657792 997.6 990.8
VL40-all 0.26388203541667543 0.24722403256345243
RT-VL40-all 967.2410810810812 34.4318179927356 995.4810810810811 910.1081081081084
*** budget = 50
VL50-1 0.4149868282086089 0.1515129008236177
RT-VL50-1 991.0133333333334 6.38308876816093 998.8 964.8
VL50-2 0.033804258620103354 0.030997283335976473
RT-VL50-2 995.3866666666668 2.2048028382500733 999.6 990.4
VL50-3 0.11018559613118786 0.09064231766881127
RT-VL50-3 882.9733333333332 120.05201909542748 996.8 647.6
VL50-4 0.04005217275243778 0.04231445881282343
RT-VL50-4 944.1866666666666 47.35812308593134 997.2 866.8
VL50-5 0.6619968822237137 0.02047387805245155
RT-VL50-5 995.2666666666667 1.3782436488355607 997.6 990.8
VL50-all 0.2522051475872103 0.2479131006425265
RT-VL50-all 961.7653333333334 43.875659949452675 995.3866666666668 882.9733333333332
logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00007_7_clip_discriminator=10,dr_cc=0.8000,seed=4_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s4/08.10_20.40.15/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type2-infogsdr_ppo-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s4
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 2
ARG C_DATA: 2
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 10 Val 4 Test 11
Total data pairs: 9990, K 10, state dim 17, action dim 6, a min -0.7615941, a_max 0.7615942
Policy model is loaded from logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00007_7_clip_discriminator=10,dr_cc=0.8000,seed=4_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s4/08.10_20.40.15/models/ckpt_policy_T15000000.pt
Discr model is loaded from logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00007_7_clip_discriminator=10,dr_cc=0.8000,seed=4_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s4/08.10_20.40.15/models/ckpt_discr_T15000000.pt
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 980.6917333333334 54.83965680352941 1000.0 378.4
LR 0.9693333333333334 0.12228200558090667 1.0 -0.2
VL-std-all 0.30484864840457243 0.0
*** budget = 10
VL10-1 0.6411830272282689 0.3060902330198359
RT-VL10-1 979.0693333333332 70.7585188715033 1000.0 452.8
VL10-2 0.13057559590643986 0.12575788957213913
RT-VL10-2 978.7306666666665 59.721535977866104 999.6 593.6
VL10-3 0.20028899442775536 0.13950268896415244
RT-VL10-3 970.6453333333335 65.78382028702465 999.6 664.0
VL10-4 0.12775123137152186 0.12327046530388362
RT-VL10-4 981.9786666666666 35.468725353784876 999.2 820.8
VL10-5 0.8875833416828792 0.1988906006583092
RT-VL10-5 991.4720000000001 17.342929856284375 999.2 865.2
VL10-all 0.397476438123373 0.31063429082265825
RT-VL10-all 980.3792 6.7050738784561865 991.4720000000001 970.6453333333335
*** budget = 20
VL20-1 0.47053755838419914 0.24152191194699796
RT-VL20-1 984.3466666666667 60.36586526248826 1000.0 557.2
VL20-2 0.0689183447401131 0.07260245480720054
RT-VL20-2 982.352 58.815615522863766 999.2 593.6
VL20-3 0.11921226469990036 0.09557278116222602
RT-VL20-3 973.7386666666667 61.43192632138077 998.8 706.8
VL20-4 0.06187209417540617 0.05452106174769596
RT-VL20-4 977.8239999999998 37.11282919243246 999.2 857.6
VL20-5 0.779718401817452 0.09342367925897067
RT-VL20-5 994.6986666666668 6.287712744357488 998.0 950.4
VL20-all 0.3000517327634141 0.28354915857591084
RT-VL20-all 982.592 7.082699598317057 994.6986666666668 973.7386666666667
*** budget = 30
VL30-1 0.3739262332522953 0.177820958179605
RT-VL30-1 982.3200000000002 69.12527757629621 998.8 557.2
VL30-2 0.04132094993777269 0.04735393870134101
RT-VL30-2 979.76 66.43791688486327 999.2 593.6
VL30-3 0.0969255194415316 0.08383226494446823
RT-VL30-3 972.7280000000001 58.926540845361025 998.0 734.8
VL30-4 0.045516040147404305 0.04117979600570469
RT-VL30-4 977.712 38.67005373670949 999.2 857.6
VL30-5 0.7538191793837352 0.08448747639690689
RT-VL30-5 994.8560000000001 6.56000487804697 998.0 950.4
VL30-all 0.26230158443254786 0.2746733566530848
RT-VL30-all 981.4752000000001 7.3942760132416225 994.8560000000001 972.7280000000001
*** budget = 40
VL40-1 0.32514397582276955 0.16355899613238348
RT-VL40-1 963.156756756757 116.62726152157528 998.8 452.8
VL40-2 0.03579940745362549 0.04978600307666467
RT-VL40-2 985.2432432432432 42.74988566787645 999.6 804.4
VL40-3 0.08430211353222226 0.07708155418769179
RT-VL40-3 977.5567567567567 46.48731614595906 998.8 799.2
VL40-4 0.03236974839158919 0.032520778738464526
RT-VL40-4 980.162162162162 35.62646565103693 999.2 857.6
VL40-5 0.7238017567053705 0.059963735458219515
RT-VL40-5 995.9027027027025 1.7433249509449822 998.0 988.8
VL40-all 0.2402834003811154 0.2647123986748737
RT-VL40-all 980.4043243243243 10.67015219409451 995.9027027027025 963.156756756757
*** budget = 50
VL50-1 0.2917441647320284 0.14717824350679715
RT-VL50-1 974.0266666666666 88.22074938597055 998.8 557.2
VL50-2 0.02162540586866033 0.022331315399789508
RT-VL50-2 982.3333333333334 46.99822218859867 998.0 804.4
VL50-3 0.06179007672338674 0.05826750503626425
RT-VL50-3 970.7466666666666 64.9947471381769 998.8 734.8
VL50-4 0.028317800974679175 0.028252116971254758
RT-VL50-4 979.4666666666666 34.6456859587966 999.2 890.4
VL50-5 0.7195148288697749 0.063841864450355
RT-VL50-5 995.88 1.859462287867122 998.0 988.8
VL50-all 0.22459845543370588 0.2667138861458722
RT-VL50-all 980.4906666666666 8.694876754605456 995.88 970.7466666666666
logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00009_9_clip_discriminator=10,dr_cc=0.8000,seed=5_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s5/08.10_20.40.12/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type2-infogsdr_ppo-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s5
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 2
ARG C_DATA: 2
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 10 Val 4 Test 11
Total data pairs: 9990, K 10, state dim 17, action dim 6, a min -0.7615941, a_max 0.7615942
Policy model is loaded from logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00009_9_clip_discriminator=10,dr_cc=0.8000,seed=5_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s5/08.10_20.40.12/models/ckpt_policy_T15000000.pt
Discr model is loaded from logs_data/30hcd2/cgrew_pr/ablo/run_2024-08-10_20-39-21/run_2691b_00009_9_clip_discriminator=10,dr_cc=0.8000,seed=5_2024-08-10_20-39-21/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr10_no0_es=n_sr0011_rc0.80_reg00_ds0.001_zt=p_g0_s5/08.10_20.40.12/models/ckpt_discr_T15000000.pt
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 951.3973333333333 130.08783942483717 1000.0 2.4
LR 0.9178666666666668 0.2493206413881976 1.0 -0.6
VL-std-all 0.4510289031476744 0.0
*** budget = 10
VL10-1 0.8708126620185427 0.29774630973653454
RT-VL10-1 817.7333333333333 304.11637304748257 999.6 2.4
VL10-2 0.1915809095617582 0.09838441660902607
RT-VL10-2 925.872 174.57597433782232 999.6 386.0
VL10-3 0.35704185812983347 0.19886311339973797
RT-VL10-3 909.3786666666667 116.57957773507712 999.2 600.8
VL10-4 0.2508321142453616 0.255748969579776
RT-VL10-4 937.3333333333334 67.75747601228632 998.0 691.2
VL10-5 0.7336156885803157 0.4391687177558363
RT-VL10-5 977.1413333333334 46.83589248523922 998.0 691.2
VL10-all 0.4807766465071623 0.271246985100937
RT-VL10-all 913.4917333333335 52.838533214733026 977.1413333333334 817.7333333333333
*** budget = 20
VL20-1 0.7235932001885949 0.3434339499056319
RT-VL20-1 684.9866666666667 351.0730481759538 999.6 47.2
VL20-2 0.13602415278584024 0.08012001130221277
RT-VL20-2 889.5893333333333 204.74474226759088 999.6 398.0
VL20-3 0.25097293172400703 0.17791756865030584
RT-VL20-3 868.2026666666666 122.01047165259581 999.2 620.0
VL20-4 0.12430645226411348 0.11278942576717106
RT-VL20-4 924.0373333333333 67.16431348731425 996.4 810.0
VL20-5 0.5419358753070713 0.20099828526490335
RT-VL20-5 989.3013333333334 21.27008223355571 998.0 850.0
VL20-all 0.3553665224539254 0.23782214471018934
RT-VL20-all 871.2234666666667 101.74700297712297 989.3013333333334 684.9866666666667
*** budget = 30
VL30-1 0.5986252374117301 0.3368830613199224
RT-VL30-1 569.256 352.96135094936386 996.8 47.2
VL30-2 0.1079488352553091 0.07186545457910654
RT-VL30-2 860.424 225.81403194664412 998.0 404.8
VL30-3 0.19767245886179524 0.157838018408733
RT-VL30-3 851.672 119.59967063499798 998.4 620.0
VL30-4 0.08423320814895977 0.07832399791811369
RT-VL30-4 925.4960000000001 67.89746669795568 996.4 810.0
VL30-5 0.4548572507965426 0.10912349602812255
RT-VL30-5 993.824 2.4028782740704973 996.0 980.0
VL30-all 0.28866739809486736 0.2031873742311012
RT-VL30-all 840.1343999999999 144.7943283220721 993.824 569.256
*** budget = 40
VL40-1 0.5541479105255792 0.349645678432156
RT-VL40-1 513.6972972972973 333.9907803279108 995.6 49.6
VL40-2 0.0885637847952064 0.05743794685054608
RT-VL40-2 842.8000000000002 231.80217101331579 998.0 405.2
VL40-3 0.13856271071113746 0.1254919113957759
RT-VL40-3 833.7513513513513 117.48559801702118 995.6 620.0
VL40-4 0.07438430644725633 0.09861753533005746
RT-VL40-4 923.1891891891892 73.07139708747575 996.4 810.0
VL40-5 0.4202865447480297 0.0712536830816725
RT-VL40-5 994.1837837837838 1.3559704839156974 996.0 990.4
VL40-all 0.25518905144544185 0.19528933425586278
RT-VL40-all 821.5243243243243 164.62482608798604 994.1837837837838 513.6972972972973
*** budget = 50
VL50-1 0.489047186739197 0.3529462757451401
RT-VL50-1 463.33333333333337 342.0036698958393 996.8 49.6
VL50-2 0.0729238368118124 0.050535761715464936
RT-VL50-2 840.146666666667 232.85386509330027 996.8 405.2
VL50-3 0.12495531892364695 0.10952446822271995
RT-VL50-3 842.3733333333333 116.07799370346743 996.0 687.6
VL50-4 0.04519759725419085 0.05127490172919815
RT-VL50-4 901.3733333333334 69.95432287492238 996.4 810.0
VL50-5 0.4049064927310803 0.058388842509218594
RT-VL50-5 994.16 1.1247518244780361 996.0 992.4
VL50-all 0.2274060864919855 0.18304279190698752
RT-VL50-all 808.2773333333334 181.31948915779694 994.16 463.33333333333337
logs_data/30hcd2/cgrew_pr/ig/run_2024-05-31_08-48-46/run_1efe9_00009_9_dr_cc=0.8000,seed=5_2024-05-31_08-48-47/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s5/05.31_08.49.29/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type2-infogsdr_ppo-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s5
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 2
ARG C_DATA: 2
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 10 Val 4 Test 11
Total data pairs: 9990, K 10, state dim 17, action dim 6, a min -0.7615941, a_max 0.7615942
Policy model is loaded from logs_data/30hcd2/cgrew_pr/ig/run_2024-05-31_08-48-46/run_1efe9_00009_9_dr_cc=0.8000,seed=5_2024-05-31_08-48-47/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s5/05.31_08.49.29/models/ckpt_policy_T15000000.pt
Error(s) in loading state_dict for PairDiscriminator:
Missing key(s) in state_dict: "reward_offset".
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 990.432 26.615937881903267 1000.0 725.6
LR 0.9890666666666666 0.06995566850195598 1.0 0.2
VL-std-all 0.2373863624486816 0.0
*** budget = 10
VL10-1 0.8068609193327522 0.13925560587172822
RT-VL10-1 995.7306666666665 16.791140309368178 1000.0 791.6
VL10-2 0.07242870005960828 0.08975381469563005
RT-VL10-2 996.2373333333331 12.76128283345979 1000.0 841.6
VL10-3 0.3868302479408187 0.21660709388184402
RT-VL10-3 973.3039999999999 51.372738399531194 999.2 725.6
VL10-4 0.08661826815176378 0.09278202593265558
RT-VL10-4 987.1039999999998 24.21008847567477 998.0 857.6
VL10-5 0.8474279406256293 0.11737537398417869
RT-VL10-5 995.0 3.3427334124435784 998.8 958.8
VL10-all 0.44003321522211447 0.3356780561483939
RT-VL10-all 989.4751999999999 8.746553885451755 996.2373333333331 973.3039999999999
*** budget = 20
VL20-1 0.7315922702204493 0.0995746834000683
RT-VL20-1 994.0853333333334 23.590035711903635 999.2 791.6
VL20-2 0.036214345995620977 0.03452613346603725
RT-VL20-2 995.1573333333333 17.917329959815127 1000.0 841.6
VL20-3 0.25537488508135914 0.17372447019071738
RT-VL20-3 972.784 53.885268339315154 999.2 725.6
VL20-4 0.043203515656006046 0.05250279251858071
RT-VL20-4 988.6186666666667 20.391924501843594 998.0 898.0
VL20-5 0.7879326231036912 0.05932681196035859
RT-VL20-5 995.2693333333333 1.5681813954032506 998.8 986.4
VL20-all 0.3708635280114253 0.32764634620371236
RT-VL20-all 989.1829333333333 8.55587117039782 995.2693333333333 972.784
*** budget = 30
VL30-1 0.6984379049607391 0.09090434081337964
RT-VL30-1 992.5039999999999 28.753385609350417 999.2 791.6
VL30-2 0.026036005615847575 0.02404287964803267
RT-VL30-2 997.2 1.6218507946170728 999.6 990.8
VL30-3 0.1949390978652427 0.15970719324067123
RT-VL30-3 973.6000000000001 56.454271760425705 999.2 725.6
VL30-4 0.026784888903100025 0.03517674994193529
RT-VL30-4 989.176 16.983834196081883 998.0 924.8
VL30-5 0.7684183413208525 0.048356325337566834
RT-VL30-5 995.3919999999999 1.2780985877466522 998.8 992.8
VL30-all 0.3429232477331564 0.32548334547975033
RT-VL30-all 989.5744000000001 8.437025295683249 997.2 973.6000000000001
*** budget = 40
VL40-1 0.6792073344503108 0.09456556021377976
RT-VL40-1 990.9405405405404 33.277891853215436 1000.0 791.6
VL40-2 0.02124361997023371 0.01997849021192365
RT-VL40-2 997.1675675675676 1.6826706658639368 999.6 990.8
VL40-3 0.14690620371090926 0.10965669609264024
RT-VL40-3 971.783783783784 60.35689133503254 998.8 725.6
VL40-4 0.01829684605908632 0.020260588469263457
RT-VL40-4 987.9027027027028 18.96897162703583 998.0 924.8
VL40-5 0.7590723634715063 0.03586342630238167
RT-VL40-5 995.3729729729729 1.362162162162156 998.8 992.8
VL40-all 0.3249452735324093 0.32616983332489663
RT-VL40-all 988.6335135135134 9.033495918112498 997.1675675675676 971.783783783784
*** budget = 50
VL50-1 0.6644865933518651 0.09853435796284395
RT-VL50-1 989.4799999999999 36.79509387585977 999.2 791.6
VL50-2 0.016578223965209603 0.016499061690024394
RT-VL50-2 997.4533333333331 1.0301240486250054 999.6 994.8
VL50-3 0.13665269176134068 0.11392583238420832
RT-VL50-3 973.5733333333334 61.803295129700714 998.4 725.6
VL50-4 0.012448897373208704 0.009519174525738977
RT-VL50-4 987.3600000000001 18.23713427780437 997.2 924.8
VL50-5 0.7510283300254756 0.024664256709951978
RT-VL50-5 995.36 1.3169662106523357 998.8 993.2
VL50-all 0.31623894729541996 0.3239303619711848
RT-VL50-all 988.6453333333333 8.39263423087962 997.4533333333331 973.5733333333334
logs_data/30hcd2/cgrew_pr/ig/run_2024-05-31_08-48-46/run_1efe9_00007_7_dr_cc=0.8000,seed=4_2024-05-31_08-48-47/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s4/05.31_08.49.28/models/ckpt_policy_T15000000.pt
0_hccustom_v0_type2-infogsdr_ppo-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s4
v_data: 0
Inferred from ckptpath name:
il_method: infogsdr
rl_method: ppo
activation: relu
hidden_size: [100, 100]
norm_obs: 0
info_loss_type: None
encode_sampling: normal
normalize_code: 0
tl_emb: 0
TRAINED C_DATA: 2
ARG C_DATA: 2
Using (100, 100) relu networks.
State dim: 17, action dim: 6, action bound 1
args.encode_dim: 2
TRAJ is loaded from /home/vsreeramdass3/code/vild_code1/imitation_data/STRAT_h5/HCCustom_v0.h5 with traj_num 25.0, data_size 25000 steps, and average return 996.96
No of trajs: Train 10 Val 4 Test 11
Total data pairs: 9990, K 10, state dim 17, action dim 6, a min -0.7615941, a_max 0.7615942
Policy model is loaded from logs_data/30hcd2/cgrew_pr/ig/run_2024-05-31_08-48-46/run_1efe9_00007_7_dr_cc=0.8000,seed=4_2024-05-31_08-48-47/results_IL/HCCustom/INFOGSDR_PPO/0_HCCustom_v0_type2-INFOGSDR_PPO-100-100-relu_cr0_no0_es=n_sr0000_rc0.80_reg00_ds0.001_s4/05.31_08.49.28/models/ckpt_policy_T15000000.pt
Error(s) in loading state_dict for PairDiscriminator:
Missing key(s) in state_dict: "reward_offset".
***** Max Ep Steps: 1000 args.seed 1 test_seed 1 *****
*** budgets = [10, 20, 30, 40, 50] NPARALLEL = 50 n_test_episodes = 5 ***
***** num zs = 1500 *****
***** args.encode_sampling = normal *****
current_vel_mean [0.97, 1.78, 2.88, 3.65, 4.87]
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
State dim: 17, action dim: 6, action bound 1
zvs shape 1500
RT 990.2543999999999 36.05362710333964 1000.0 554.8
LR 0.9885333333333333 0.0743091440462672 1.0 0.2
VL-std-all 0.18819055842441343 0.0
*** budget = 10
VL10-1 0.7364655730675438 0.2034510701426069
RT-VL10-1 969.1919999999999 78.3546599507649 1000.0 554.8
VL10-2 0.07584669608490706 0.06684029806857397
RT-VL10-2 996.3066666666666 2.3877092694789184 1000.0 984.0
VL10-3 0.3996362282620626 0.20976530291811163
RT-VL10-3 970.632 70.71252253078423 1000.0 568.4
VL10-4 0.16637188934095823 0.18794393169034093
RT-VL10-4 991.5066666666669 17.884036332873954 999.6 870.8
VL10-5 0.8483789928475064 0.2432161596743707
RT-VL10-5 996.2986666666666 1.333215994836881 999.2 991.6
VL10-all 0.4453398759205956 0.30451005638671486
RT-VL10-all 984.7872 12.279602269346276 996.3066666666666 969.1919999999999
*** budget = 20
VL20-1 0.6084425340419566 0.19853349088645353
RT-VL20-1 941.9146666666666 103.8581862520021 999.6 554.8
VL20-2 0.03990153497711544 0.03805487903019847
RT-VL20-2 996.234666666667 2.9080803901466568 1000.0 984.0
VL20-3 0.2731517520153228 0.16676485260851184
RT-VL20-3 955.5520000000001 90.26451109193837 1000.0 568.4
VL20-4 0.08697045930100566 0.07214465357634665
RT-VL20-4 989.9626666666668 20.92069325386284 999.2 870.8
VL20-5 0.7545391225922524 0.06667742285384677
RT-VL20-5 996.1493333333335 1.414248760139312 999.2 991.6
VL20-all 0.3526010805855306 0.2834296623562406
RT-VL20-all 975.9626666666667 22.76100662487893 996.234666666667 941.9146666666666
*** budget = 30
VL30-1 0.5516723621472825 0.19791225218782463
RT-VL30-1 923.4639999999999 116.6760879700721 999.6 554.8
VL30-2 0.025944873336137007 0.023580682785077252
RT-VL30-2 996.4639999999999 2.1486516702341487 1000.0 989.2
VL30-3 0.2098908521449945 0.12295466907615382
RT-VL30-3 944.4079999999999 100.78347451839514 1000.0 568.4
VL30-4 0.05874230649950219 0.05327436228819789