Trial evaluation / evaluation hardware / scenario changes




This forum is dedicated to questions regarding Challenge 3

Trial evaluation / evaluation hardware / scenario changes

Postby markusachtelik » Fri 17. Oct 2014, 16:08

Dear Contestants,

In preparation of the final evaluation, we like to announce the changes that we are going to make in the dataset / simulation environment.

Trial evaluation run:
You have the option of uploading your contestant VM containing your solution once, and we will perform a test-evaluation on the current training datasets. We highly encourage you to make use of this, such that you can fix problems that prevent a successful final evaluation beforehand. The final evaluation is definite and there will be no interactions with teams in case of problems with their VM. Bugs appearing during the final evaluation cannot be corrected!

Log-in information for the upload will be sent to you by email separately. Please only upload the contestant VM (the one with your solution) in a zip archive and indicate in the filename that it is a test run. Uploaded test VMs will not be used automatically for the final evaluation. You can remove your source code from the VM, but make sure the launch-files for each subtask work.

Evaluation hardware for Task 1 and Task 2:
We want you to get good scores! Thus, in response to performance issues and non-deterministic timings that were observed on Core2 machines, we will use a host computer with a Core i7 processor. This also aligns with the hardware choice that was recently made for stage 2. With the reported issues, it looks like especially memory-bound algorithms perform much worse on the virtualized Core2 processor, and it makes indeed no sense anymore to optimize for old hardware.
The evaluation machine has a Q 820 Core i7 CPU @ 1.73GHz and 8 GB RAM. The settings for the contestant VM remain unchanged.

Example timings with libelas for reference:
Native:
Code: Select all
Processing: img/cones_left.pgm, img/cones_right.pgm
                    Descriptor   22.9 ms
               Support Matches   75.3 ms
        Delaunay Triangulation    6.4 ms
              Disparity Planes    7.7 ms
                          Grid    5.8 ms
                      Matching  142.1 ms
         L/R Consistency Check   21.2 ms
         Remove Small Segments   40.7 ms
             Gap Interpolation    8.9 ms
                 Adaptive Mean   71.1 ms
========================================
                    Total time  402.1 ms

In the VM:
Code: Select all
Processing: img/cones_left.pgm, img/cones_right.pgm
                    Descriptor   48.9 ms
               Support Matches   78.2 ms
        Delaunay Triangulation    8.3 ms
              Disparity Planes   10.7 ms
                          Grid   11.2 ms
                      Matching  184.2 ms
         L/R Consistency Check   34.7 ms
         Remove Small Segments   58.1 ms
             Gap Interpolation   18.7 ms
                 Adaptive Mean   86.9 ms
========================================
                    Total time  539.9 ms


Changes to the scenario in the final evaluation:

Task 1: The datasets are taken in the same environment, where we did another flight and roughly followed the path in the current training datasets.

Task 2: The dataset recording took place in the same room. The obstacle setup and the path flown are different. The scene is the same for all three subtasks. Note that we may start the evaluation of T2.3 first, such that no re-using of the map created in T2.1 is possible. The dataset for Task 2.3 does not contain moving objects.

Task 3 / 4 general:
- Model parameters, as announced in the technical annex, will change slightly in order to reflect some model uncertainty.
- Noise levels will stay the same. The initial seed for generating sensor noise will be chosen "randomly" once, and will be the same for each team to ensure comparability.

Task 3.2: The wind magnitude remains, but the direction will change.

Task 3.3: The wind magnitude remains, but the direction and the duration will change.

Task 4.1: The waypoints will change, but their distances remain.

Task 4.2: The waypoints will change, but their distances remain. Areas where sensors work, will change.

Task 4.3: The waypoints will change, but their distances remain.


Best, and we wish you much success with the final preparations!

The ETH Challenge hosts.
markusachtelik
 
Posts: 27
Joined: Mon 7. Jul 2014, 20:09

by Advertising » Fri 17. Oct 2014, 16:08

Advertising
 

Re: Trial evaluation / evaluation hardware / scenario change

Postby markusachtelik » Mon 20. Oct 2014, 14:37

As additional info, this is what /proc/cpuinfo on the standard simclient VM reports, when started on our evaluation machine:

Code: Select all
cat /proc/cpuinfo
processor    : 0
vendor_id    : GenuineIntel
cpu family    : 6
model        : 30
model name    : Intel(R) Core(TM) i7 CPU       Q 820  @ 1.73GHz
stepping    : 5
microcode    : 0x616
cpu MHz        : 1721.714
cache size    : 6144 KB
physical id    : 0
siblings    : 2
core id        : 0
cpu cores    : 2
apicid        : 0
initial apicid    : 0
fdiv_bug    : no
f00f_bug    : no
coma_bug    : no
fpu        : yes
fpu_exception    : yes
cpuid level    : 5
wp        : yes
flags        : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx rdtscp lm constant_tsc pni ssse3 lahf_lm
bogomips    : 3443.42
clflush size    : 64
cache_alignment    : 64
address sizes    : 36 bits physical, 48 bits virtual
power management:

processor    : 1
vendor_id    : GenuineIntel
cpu family    : 6
model        : 30
model name    : Intel(R) Core(TM) i7 CPU       Q 820  @ 1.73GHz
stepping    : 5
microcode    : 0x616
cpu MHz        : 1721.714
cache size    : 6144 KB
physical id    : 0
siblings    : 2
core id        : 1
cpu cores    : 2
apicid        : 1
initial apicid    : 1
fdiv_bug    : no
f00f_bug    : no
coma_bug    : no
fpu        : yes
fpu_exception    : yes
cpuid level    : 5
wp        : yes
flags        : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx rdtscp lm constant_tsc pni ssse3 lahf_lm
bogomips    : 3443.42
clflush size    : 64
cache_alignment    : 64
address sizes    : 36 bits physical, 48 bits virtual
power management:
markusachtelik
 
Posts: 27
Joined: Mon 7. Jul 2014, 20:09



Similar topics

Evaluation of log files
Forum: Challenge 3
Author: ceesietopc
Replies: 2
task_2_1 evaluation problem/question
Forum: Challenge 3
Author: rpretten
Replies: 2
About intermediate evaluation service
Forum: Stage 1 - Simulation
Author: michieletto
Replies: 22
Instructions for submission to final evaluation
Forum: Stage 1 - Simulation
Author: admin
Replies: 0
checksum for the final evaluation
Forum: Stage 1 - Simulation
Author: ghepard1
Replies: 1

TAGS

Return to Challenge 3

Who is online

No registered users

cron