Indexer Resourcing Census

All indexers of Kepler need to fill out the following form

This data is to help us understand the costs being incurred by each indexer so we can better understand the profitability and guide future consumers to decide on pricing.

---- Please copy and fill in below ----

  • What cloud provider are you using (or did you run your own hardware)

  • What dictionaries are you indexing (Polkadot, Kusama, Both)

  • If running both, did you run this in a single VM, or did you run all the services in seperate instances

  • Compute:

    • What type of compute are you using (e.g AWS ec2 t3a.large)
    • What has the average/max CPU been
    • What is your best estimate for your weekly compute cost per project (e.g. just Polkadot)
  • Storage/DB:

    • What type of storage are you using (SSD, HDD, RDS/Database service)?
    • What is your best estimate for your weekly storage cost per project (e.g. just Polkadot)
  • Data egress:

    • How much data egress do you see each week
    • What is your best estimate for your weekly egress cost per project (e.g. just Polkadot)
  • Total cost

    • What is your best estimate for your weekly total cost per project (e.g. just Polkadot)
  • What other docs or guides can we write to help you

  • What monitoring tools or data would you like us to provide

  • Is there anything else that we can do better?

1 Like
  • What cloud provider are you using (or did you run your own hardware)
    Hetzner
  • What dictionaries are you indexing (Polkadot, Kusama, Both)
    Both
  • If running both, did you run this in a single VM, or did you run all the services in seperate instances
    Separate docker containers (single dedicated server)
  • Compute:
    • What type of compute are you using (e.g AWS ec2 t3a.large)
      Dedicated bare metal server
    • What has the average/max CPU been
      1 cpu 8 cores
    • What is your best estimate for your weekly compute cost per project (e.g. just Polkadot)
      I don’t know
  • Storage/DB:
    • What type of storage are you using (SSD, HDD, RDS/Database service)?
      SSD NVMe RAID 1 drives
    • What is your best estimate for your weekly storage cost per project (e.g. just Polkadot)
      I don’t know
  • Data egress:
    • How much data egress do you see each week
      300Gb per week income traffic
    • What is your best estimate for your weekly egress cost per project (e.g. just Polkadot)
      I don’t know
  • Total cost
    • What is your best estimate for your weekly total cost per project (e.g. just Polkadot)
      100 euros
  • What other docs or guides can we write to help you
    N/A
  • What monitoring tools or data would you like us to provide
    Grafana monitoring
  • Is there anything else that we can do better?
    N/A

• What cloud provider are you using :Hetzner
• What dictionaries are you indexing : Both
• If running both, did you run this in a single VM, or did you run all the services in seperate instances : Single
• Compute:
o What type of compute are you using (e.g AWS ec2 t3a.large)
o What has the average/max CPU been 1 cpu 8 cores
o What is your best estimate for your weekly compute cost per project (e.g. just Polkadot)
• Storage/DB:
o What type of storage are you using (SSD, HDD, RDS/Database service)? NVME
o What is your best estimate for your weekly storage cost per project (e.g. just Polkadot) $25
• Data egress:
o How much data egress do you see each week 70G
o What is your best estimate for your weekly egress cost per project (e.g. just Polkadot) $3
• Total cost
o What is your best estimate for your weekly total cost per project (e.g. just Polkadot) $25
• What other docs or guides can we write to help you NA
• What monitoring tools or data would you like us to provide Prometheus/Grafana
• Is there anything else that we can do better? NA

  • What cloud provider are you using (or did you run your own hardware)
    -Hetzner
  • What dictionaries are you indexing (Polkadot, Kusama, Both)
    -Both
  • If running both, did you run this in a single VM, or did you run all the services in seperate instances
    Separate docker container on dedicated server
  • Compute:
    • What type of compute are you using (e.g AWS ec2 t3a.large)
      AMD Ryzen 5 3600 6-Core Processor
  • What has the average/max CPU been
    12% max -30%
  • What is your best estimate for your weekly compute cost per project (e.g. just Polkadot)
    Im here to validate, not to calculate :slight_smile:
  • Storage/DB:
    • What type of storage are you using (SSD, HDD, RDS/Database service)?
      NVME RAID 0
    • What is your best estimate for your weekly storage cost per project (e.g. just Polkadot)
      Im here to validate, not to calculate :slight_smile:
  • Data egress:
    • How much data egress do you see each week
      250-270GB income traff for 1 week
    • What is your best estimate
      Im here to validate, not to calculate :slight_smile:
  • Total cost
    • What is your best estimate for your weekly total cost per project (e.g. just Polkadot)
      500 buks ^)
  • What other docs or guides can we write to help you
    Only rules. can i make one more validator?
  • What monitoring tools or data would you like us to provide
    Yes, we need a system to keep track of whether our node is alive or not. Whether it’s doing well or not, how many blocks per second it can process and how many requests per second it can process. Is everything okay with her right now? I had cases when the traffic was 10Mbit and when it was 1.2Mbit
    and in both cases the node looks alive, but how should it be?
  • Is there anything else that we can do better?
  • What cloud provider are you using (or did you run your own hardware)
    Vultr

  • What dictionaries are you indexing (Polkadot, Kusama, Both)
    Both

  • If running both, did you run this in a single VM, or did you run all the services in seperate instances
    Single

  • Compute:

    • What type of compute are you using (e.g AWS ec2 t3a.large)
      Intel Hight Performance 4 CPU/8 GB
    • What has the average/max CPU been
      10-15%
    • What is your best estimate for your weekly compute cost per project (e.g. just Polkadot)
      $12/week
  • Storage/DB:

    • What type of storage are you using (SSD, HDD, RDS/Database service)?
      NVMe
    • What is your best estimate for your weekly storage cost per project (e.g. just Polkadot)
      $15/week
  • Data egress:

    • How much data egress do you see each week
      n/a
    • What is your best estimate for your weekly egress cost per project (e.g. just Polkadot)
      n/a
  • Total cost

    • What is your best estimate for your weekly total cost per project (e.g. just Polkadot)
      $27/week
  • What other docs or guides can we write to help you
    n/a

  • What monitoring tools or data would you like us to provide
    n/a

  • Is there anything else that we can do better?
    I would like to receive answers from the team more often (in Discord), especially on emerging problems

  • What cloud provider are you using (or did you run your own hardware)
    Contabo

  • What dictionaries are you indexing (Polkadot, Kusama, Both)
    Both

  • If running both, did you run this in a single VM, or did you run all the services in seperate instances
    run in single VM

  • Compute:

    • What type of compute are you using (e.g AWS ec2 t3a.large)
      Cloud VPS with SSD Storage VPS XL SSD: 10 cores 60 GB 1,600 GB (100% SSD) 1,000 Mbit/s
    • What has the average/max CPU been
      15%/100%
    • What is your best estimate for your weekly compute cost per project (e.g. just Polkadot)
      N/A
  • Storage/DB:

    • What type of storage are you using (SSD, HDD, RDS/Database service)?
      SSD
    • What is your best estimate for your weekly storage cost per project (e.g. just Polkadot)
      N/A
  • Data egress:

    • How much data egress do you see each week
      N/A
      I’m using 32 TB Out + Unlimited In
    • What is your best estimate for your weekly egress cost per project (e.g. just Polkadot)
      N/A
  • Total cost

    • What is your best estimate for your weekly total cost per project (e.g. just Polkadot)
      $15/week . 65$/month
      To meet 2 projects I use about 600GB SSD, VM has 6 cores, 16G Ram.
      If running 3 projects, the capacity will be about 800GB.

      I’m using the Monthly Plan, about $65/month, excluding Onfinality’s $50 API fee.

      The total cost of running Node Kepler costs about $115/month

  • What other docs or guides can we write to help you
    N/a

  • What monitoring tools or data would you like us to provide
    N/A

  • Is there anything else that we can do better?

  1. Multi Dictionary in Project, to reduce the load on indexers, and save money on buying RPC.
  2. Unstake in UI of https://kepler.subquery.network/
  3. Provide a way to move the indexer to another machine, in case Node fails or needs to switch to another service provider.
  • What cloud provider are you using (or did you run your own hardware)
    Hetzner
  • What dictionaries are you indexing (Polkadot, Kusama, Both)
    Both
  • If running both, did you run this in a single VM, or did you run all the services in seperate instances
    dedicated server, both containers on it
  • Compute:
    • What type of compute are you using (e.g AWS ec2 t3a.large)
      12core CPU, 128gb RAM, 2TB NVMe raid0
    • What has the average/max CPU been
      10%/30-40%. I’ve thought that would be 3 projects to index, not 2.
    • What is your best estimate for your weekly compute cost per project (e.g. just Polkadot)
      ~50 euros for both, so it’s 25 per project
  • Storage/DB:
    • What type of storage are you using (SSD, HDD, RDS/Database service)?
      2tb NVMe RAID0
    • What is your best estimate for your weekly storage cost per project (e.g. just Polkadot)
      Hard to calculate, N/A
  • Data egress:
    • How much data egress do you see each week
      Over 200GB AFAIK
    • What is your best estimate for your weekly egress cost per project (e.g. just Polkadot)
      50 euros that I’ve answered above - it’s about all costs, can’t calculate for each thread
  • Total cost
    • What is your best estimate for your weekly total cost per project (e.g. just Polkadot)
      Oh I see :slight_smile: It’s 50 euros
  • What other docs or guides can we write to help you
    I gues more accurate info about how networks works, how any numbers are calculating, also what we can expect about Kepler (new projects to index?), something like that
  • What monitoring tools or data would you like us to provide
    N/A
  • Is there anything else that we can do better?
    You’re doing great, I hope some kind of stress tests will come to us!
  • What cloud provider are you using (or did you run your own hardware)
    Contabo
  • What dictionaries are you indexing (Polkadot, Kusama, Both)
    Both
  • If running both, did you run this in a single VM, or did you run all the services in seperate instances
    single VM
  • Compute:
    • What type of compute are you using (e.g AWS ec2 t3a.large)
      VPS: 10 cores 60 GB RAM 1600GB (100% SSD) 1,000 Mbit/s
    • What has the average/max CPU been
      40%/100%
    • What is your best estimate for your weekly compute cost per project (e.g. just Polkadot)
      N/A
  • Storage/DB:
    • What type of storage are you using (SSD, HDD, RDS/Database service)?
      SSD
    • What is your best estimate for your weekly storage cost per project (e.g. just Polkadot)
      N/A
  • Data egress:
    • How much data egress do you see each week
      N/A
      32 TB Out + Unlimited In
    • What is your best estimate for your weekly egress cost per project (e.g. just Polkadot)
      N/A
  • Total cost
    • What is your best estimate for your weekly total cost per project (e.g. just Polkadot)
      ~30$ week both projects until I have enough disc space
  • What other docs or guides can we write to help you
    I need new SSL configuration guide. I have a technical background, but I haven’t worked with nginx and ssl. The current guide seems to be missing some important points or some points are meant to be obvious, so if you follow it step by step, you will not be able to set up the ssl.
  • What monitoring tools or data would you like us to provide
    N/A
  • Is there anything else that we can do better?

I see two problems:

  1. Documentation quality:
  • When I ran the indexer for the first time, the documentation suggested taking a preconfigured image, which I did, but nothing worked. I asked a question on the forum, and it turned out that this is an old image that has not been updated for a long time and should not be used.
  • Recently, a monitoring guide was published in the chat, in which there was a command that replaced the docker-compose file.
  • When the dictionary files were replaced, the old hash sums remained in the documentation.
  • The academy has a section for indexers and it indicates where to get information about all updates of the coordinator and proxy - the link leads to the Notion and if you click it you will see “access is denied”.

etc. I can go on with the list.
Good documentation is a separate big task. Those who already know what and how to do don’t need documentation, so we need to write it with the expectation that a person is doing it for the first time and it is necessary to describe in detail all the nuances and it is better to try this guide to make sure that there are no mistakes in it.

  1. The second problem is a continuation of the previous one. I see a lack of communication. If something went wrong and we encountered some problems, then we ask questions in the chat and on the forum. And some questions go unanswered for weeks. You ask a question, then someone else asks a question, the discussion moves on and no one returns to the old questions.

I propose the following solutions:

  1. Make a one-page guide for indexers that always contains the latest carefully tested working version.
  2. As an option for those instructions that most likely will not change over time (installing the SSL for example) make a video guide. If the author misses something important in the text, users will always be able to see it on the video and will not make mistakes.
  3. Implement a ticket system for indexers in Discord. So that no problem is left unattended. And accordingly, we need a team member who would regularly check tickets and answer questions.
1 Like
  • What cloud provider are you using (or did you run your own hardware)

I run two indexers on two separate vps. I use my own hardware at the moment (2x32core XEON, 128Gb RAM, 4x1Tb 15k hdd on raid 10).

  • What dictionaries are you indexing (Polkadot, Kusama, Both)

Both.

  • If running both, did you run this in a single VM, or did you run all the services in seperate instances

Single VM.

  • Compute:
    • What type of compute are you using (e.g AWS ec2 t3a.large)

VM 16 cpu, 16 Gb.

  • What has the average/max CPU been

8/48 cpu and 16/48 Gb. I’ve started indexing on 8cpu/16Gb and add cpu and memory. The system has entered a stable state with 16cpu/16Gb. Further addition of resources did not give any improvements.

  • What is your best estimate for your weekly compute cost per project (e.g. just Polkadot)

I suppose 70-80 eur monthly cost for server (17-20 eur per project).

  • Storage/DB:
    • What type of storage are you using (SSD, HDD, RDS/Database service)?

RAID10 hdd.

  • What is your best estimate for your weekly storage cost per project (e.g. just Polkadot)

I suppose 70-80 eur monthly cost for server (17-20 eur per project).

  • Data egress:
    • How much data egress do you see each week

  • What is your best estimate for your weekly egress cost per project (e.g. just Polkadot)

I use onfinality’s growth plan.

  • Total cost
    • What is your best estimate for your weekly total cost per project (e.g. just Polkadot)

I suppose 80eur monthly server cost and 50-60usd monthly api service cost.

  • What other docs or guides can we write to help you

I would like to see a guide for running indexers in a non-docker environment.

  • What monitoring tools or data would you like us to provide

n/a

  • Is there anything else that we can do better?
  • What cloud provider are you using (or did you run your own hardware)
    Hetzner

  • What dictionaries are you indexing (Polkadot, Kusama, Both)
    Both

  • If running both, did you run this in a single VM, or did you run all the services in seperate instances
    Single

  • Compute:

    • What type of compute are you using (e.g AWS ec2 t3a.large)
      AMD Ryzen 7 PRO 1700X 8-Core 64 GB
    • What has the average/max CPU been
      5% / 25%
  • Storage/DB:

    • What type of storage are you using (SSD, HDD, RDS/Database service)?
      1TB SSD
  • Data egress:

    • How much data egress do you see each week
      ~400k responses for both projects
  • Total cost

    • What is your best estimate for your weekly total cost per project (e.g. just Polkadot)
      30$ per project, 60$ for both.
  • Is there anything else that we can do better?

I have to agree with some of the previous statements that the documentation quality could be improved. I believe the instructions should be more detailed, and it seems that some of them were not adequately tested.

There is technically no Discord moderation, which leads to a lack of structure and communication. Many questions remain unanswered, and looking into channel history doesn’t always provide help. There is no definitive information on the current official version of the coordinator and proxy, resulting in confusion, and often upgrading leads to further mistakes. The last pinned message is dated 29.04, while it should be updated with the most common mistakes, the current official working version, and other relevant information.

It would be helpful to have an active moderator in the indexers channel.

  • What cloud provider are you using (or did you run your own hardware)
    Hetzner
  • What dictionaries are you indexing (Polkadot, Kusama, Both)
    Both+Noodle+Polygon
  • If running both, did you run this in a single VM, or did you run all the services in seperate instances
    Single
  • Compute:
    • What type of compute are you using (e.g AWS ec2 t3a.large)
      Dedicated bare metal server AX101 - AMDs Ryzen™ 9 5950X
    • What has the average/max CPU been
      Virtualize only 10 CPUs => 5%/50%(only during indexing)
  • Storage/DB:
    • What type of storage are you using (SSD, HDD, RDS/Database service)?
      7,68TB NVME SSD
  • Data egress:
    • How much data egress do you see each week
      just started tracking
  • Total cost
    • What is your best estimate for your weekly total cost per project (e.g. just Polkadot)
      30$ per week - Can run a lot of projects at the same time.
  • Is there anything else that we can do better?

A guided version recommendation for each project.
A clear policy on each version is recommended in my view.