AI Tournament

AI TOURNAMENT

Train you AI and compete in the tournament

Register Here

Leaderboard

Updated every Tuesday
Watch the live event on Twitch

New Submission


Rank Name Score
_
1 Kenshiro 9.53
2 rssalessio 2.37
3 Trrrrr 2.17
4
5
TO THE FINAL EVENT
6 Anu N/A
7 CharyMachine N/A
8 Chicago Junkman N/A
9 drlux N/A
10 Ema42 N/A
11 Furio19 N/A
12 galacticor N/A
13 HashArt N/A
14 JeyDi92 N/A
15 King Arthur N/A
16 muraatozbek N/A
17 Noob N/A
18 ralami N/A
19 Riju06 N/A
20 robocop N/A
41 wizardOfRobots N/A
22 sash N/A
23 Shivz N/A
24 Sumitnc12594 N/A
25 TeamWindNet N/A
26 The Trainer N/A
27 Valetudo N/A
28 VlaDiPooH N/A
29 Water007 N/A
30 wegfawefgawefg N/A
31 STARKBOT N/A
32 Maestrodigatto N/A
33 mohithtaker N/A
34 cblee N/A
35 nviada N/A
36 Djordje N/A
37 My Joy N/A
38 Kitadake N/A
39 Rao N/A
40 Amathlog N/A
41 Sakthi N/A
42 KingCob N/A
43 geronimo N/A
44 ankit7921 N/A
45 Quinton Starck N/A
46 Alwahsh N/A
47 Derezzed N/A
48 Adxenix N/A
49 syveqc N/A
50 pastellic N/A
51 ganman N/A
52 junhill N/A
53 helga N/A
54 Pierre N/A
...

Description

Train an AI agent to effectively play Dead Or Alive ++.

It will face the standard COM player, in the run for completing the game: seven stages plus the final boss.

The environment, with Tournament-specific settings, can be found in the dedicated section of DIAMBRA repository here. Inside it, a complete submission example is also provided, featuring a baseline random agent.

The repository also contains python scripts and notebooks showing how to easily interact with it, together with a working implementation of a Reinforcement Learning agent as a starting baseline for reference.

Public leaderboard will be updated once a week, with submissions evaluation performed live and streamed on our Twitch channel every Tuesday at 10 PM CET

After competition end, the best five agents will take part in the Final Event and evaluated live for the final standing definition. It will be broadcasted on our Twitch channel and on Reinforcement Learning Zurich meetup

Getting started

Three easy steps:

1) Get the environment from GitHub
Inside you find python notebooks with everything you need: from environment usage up to a working baseline PPO RL agent

2) Join the discord server
You'll meet other participants and the whole community, with a dedicated channel for support

3) Refer to "Getting started" video tutorials here
They provide additional info on coding aspects and implementation details

and you are ready to fight.

Timeline

Start Date April 20th, 2021
Competition Presentation Event April 27th, 2021 - RLZ Meetup
Weekly Submission Deadline Every Sunday 11:59 PM UTC
Weekly Submissions Evaluation Twitch Live every Tuesday 10 PM CET
Final Submission Deadline and End Date July 11th, 2021 11:59 PM UTC
Final Live Event July 13th, 2021 - Twitch | RLZ Meetup

Prizes

Kindly offered by Reinforcement Learning Zurich!

1st Place 1000 CHF
2nd Place 300 CHF
3rd Place 100 CHF

Submission

An example of a valid submission can be found in DIAMBRA Environment repository here. It is composed by:

1) An agent.py python file containing a python class with the proper constructor, a reset(self) method called when env.reset() is called, and a act(self, observation, info) method returning actions to be executed

2) A model.xxx file containing trained agent model parameters (e.g. policy network weights, if any)

3) A requirements.txt file containing python prerequisites to create the virtual environment (either virtualenv or conda) containing all modules to run the agent (if any)

4) A packages.txt file containing OS packages dependencies required to run the agent (if any)

Evaluation

The evaluation metric is the total cumulative reward collected by the agent in an episode. Thus, every contender will be able to estimate its own performance simply running the agent in local.

Environment default reward function will be used to evaluate submissions: it is directly proportional to characters health variation (positive when the agent hits the opponent and negative otherwise).

Agent performances, averaged on five runs, will be used to define leaderboard standings.

Technical Details

Observation space
A stack of the last four game frames (pixels) plus a fifth additional channel containing the following complementary info (numerical data): last 12 actions (one-hot encoding), own and opponent health, own and opponent side (Left/Right), stage number and selected character (one-hot encoding).

Action spaces
Four possible action spaces, depending on the choice of Discrete VS Multi Discrete and With VS Without Attack Buttons combinations, resulting in 12, 16, 36, 72 different possible actions.

Rules

Competition related
- Only one account per participant allowed
- Participants are not allowed to share code
- If participants want to team up, they are allowed to submit their agent from a single user profile belonging to one member of the team
- Maximum two members per team
- Only one submission per week allowed (the latest one will be considered)
- Only one final submission allowed (the latest one will be considered)

Game related
- Every available character can be used
- The agent can be character-specific

Software related
- Action space can be chosen by participants as they prefer, selecting it among previously described ones
- Observations can be modified by participants inside the agent act(self, observation) method, before feeding it to the agent policy, but it has to mantain an interface with the observation format described above
- The submitted agent must be compatible with Linux Ubuntu 18.04 OS
- Maximum GPU RAM usage for inference: 1.25 GB
- Maximum inference time (considering also observations modifications, if any): 50 ms on Nvidia 1050

Failing to comply with these rules will result in agent performance discarding (if violating RAM usage for example) and/or participant disqualification (if violating competition rules).

Competition-Specific Terms
Competition Name (the "Competition"): AI Tournament
Competition Organizer: DIAMBRA | Dueling AI Arena
Competition Sponsor: Reinforcement Learning Zurich (Nonprofit)
Competition Website: https://diambra.artificialtwin.com/aitournament/

Prizes: 1400 CHF
- First Prize: 1000 CHF
- Second Prize: 300 CHF
- Third Prize: 100 CHF

WINNER LICENSE TYPE: Non-Exclusive License

Competitions are open worldwide, except that if you are a resident of Crimea, Cuba, Iran, Syria, North Korea, Sudan, or are subject to Italian/EU export controls or sanctions, you may not enter the Competition. Other local rules and regulations may apply to you, so please check your local laws to ensure that you are eligible to participate in skills-based competitions. The Competition Sponsor reserves the right to award alternative prizes where needed to comply with local laws.

ENVIRONMENT DISCLAIMER
As specified on Terms of Use (Section 8) and General Competition Rules (Section 7D), Environments provided by Competition Organizer and/or Competition Sponsor, are a mere software interface to existing videogames, and they cannot work as a standalone application. As such, they require the User, or the Competition Participant, to own software elements protected by copyright, and to interface them with the Environment itself. It is the case, for example, of Game ROMS required to execute the correspondent Game-related Environment. In such cases, it is sole an only responsibility of the User, or Competition Participant, to comply with all the laws and regulations, and to make sure he has the right to use such copyright-protected material. Competition Organizer and Competition Sponsor will spend their maximum effort in avoiding illegal distribution of such material, and are by no mean responsible for copyright infringement.

Entry in this competition constitutes your acceptance of these official competition rules.

The Competition named above is a skills-based competition to promote and further the field of machine learning. You must register via the Competition Website to enter. Your competition submissions ("Submissions") must conform to the requirements stated on the Competition Website. Your Submissions will be scored based on the evaluation metric described on the Competition Website. Subject to compliance with the Competition Rules, Prizes described on the Competition Website, if any, will be awarded to participants with the best scores, based on the merits of the machine learning models submitted. See also General Competition Rules

AI TOURNAMENT

Train you AI and compete in the tournament

Register Here