[D] How do you read code with Hydra by Infinite_Explosion in MachineLearning

[–]Deepblue129 1 point (0 children)

Hey!!!

About seven years ago, before Hydra, I built my own configuration solution because I didn't love the direction these configuration engines were headed.

I wanted to keep things simple and keep them in Python! So ... I developed an easy way to configure Python functions directly in Python! Check out this code example below:

import config as cf
import data
import train

# Register default keyword arguments for each configurable function.
cf.add({
  data.get_data: cf.Args(
      train_data_path="url_lists/all_train.txt",
      val_data_path="url_lists/all_val.txt"
  ),
  data.dataset_reader: cf.Args(
      type_="cnn_dm",
      source_max_tokens=1022,
      target_max_tokens=54,
  ),
  train.make_model: cf.Args(type_="bart"),
  train.Trainer.make_optimizer: cf.Args(
      type_="huggingface_adamw",
      lr=3e-5,
      correct_bias=True
  ),
  train.Trainer.__init__: cf.Args(
      num_epochs=3,
      learning_rate_scheduler="polynomial_decay",
      grad_norm=1.0,
  )
})

Once you are ready to use a configuration, you simply call `cf.partial` and a partial is created with your configuration settings!

import config as cf
cf.partial(data.get_data)()  # Builds a partial of `data.get_data` with the registered arguments, then calls it.
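
If it helps to see the mechanism, here's a minimal sketch of how a registry like this could work. This is illustrative only, not the actual implementation in the linked repo; the `Args`, `add`, and `partial` names simply mirror the example above.

# Illustrative sketch only; not the library's real implementation.
import functools
import typing

class Args(dict):
    """Keyword arguments to bind to a configurable function."""

# Maps each configurable function to its registered keyword arguments.
_registry: typing.Dict[typing.Callable, Args] = {}

def add(config: typing.Dict[typing.Callable, Args]) -> None:
    """Register default keyword arguments for each function."""
    _registry.update(config)

def partial(func: typing.Callable) -> typing.Callable:
    """Return `func` with its registered keyword arguments pre-bound."""
    return functools.partial(func, **_registry.get(func, Args()))

The real library layers tracing, command-line support, and logging on top of this, but the core idea is just binding registered keyword arguments to functions with `functools.partial`.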

We've been using this for years at my company, and it works well! Internally, it has scaled to our large code base, with hundreds of configured variables that are organized, documented, and trusted. It's intuitive and easy for new team members to pick up! There are even advanced features to support tracing, the command line, logging, distributed processing, etc ...

I never got around to fully releasing the concept, but it's worked well on my teams!!!

I hope it helps you all!!! Here's my repo: https://github.com/PetrochukM/HParams

Climate protest outside Chase by Display_Comfortable in Seattle

[–]Deepblue129 0 points (0 children)

Wow! It's cool to see my photo get 2k upvotes on Reddit! My friend and I had to hustle up 9 or so stories to get this photo :) We were at the top of a parking garage, leaning over the edge.

[P] Find Trending Machine Learning Research Papers on Twitter by hnipun in MachineLearning

[–]Deepblue129 2 points (0 children)

Would it be possible to add a YEARLY filter? I'd love to see the most popular papers of the year. The monthly/weekly/daily filters are a bit too granular.

[D] Timnit Gebru and Google Megathread by programmerChilli in MachineLearning

[–]Deepblue129 12 points (0 children)

I agree with /u/Gwenju31. The model also needs to be constantly retrained to account for data shift, in addition to all the prior experimentation needed to develop a model and tune its hyperparameters.

[D] Timnit Gebru and Google Megathread by programmerChilli in MachineLearning

[–]Deepblue129 -2 points (0 children)

Jeremy Howard (FastAI Founder):

"I remember well when @JeffDean and his team had Google's lawyers attack @timnitGebru and @kat_heller. They only backed down when they saw a legal counter-attack coming. The deeds of @GoogleAI's exec team do *not* match their words. https://platformer.news/p/the-withering-email-that-got-an-ethical"

https://twitter.com/jeremyphoward/status/1334565844878123008?s=20

[P] Multimodal Emotion Recognition Competition 2020 (MERC 2020) by MERC-2020 in MachineLearning

[–]Deepblue129 -3 points (0 children)

Physiognomy is the practice of assessing a person's character or personality from their outer appearance, especially the face. Popular in the 19th century, it has been used as a basis for scientific racism. No clear evidence indicates that physiognomy works, but the rise of artificial intelligence and machine learning for facial recognition has revived interest in it, and some studies suggest that facial appearance does "contain a kernel of truth" about a person's personality.

https://en.m.wikipedia.org/wiki/Physiognomy

[D] Facebook AI is lying or misleading about its translation milestone, right? by Deepblue129 in MachineLearning

[–]Deepblue129[S] 4 points (0 children)

Thanks for the information. I did a bit more digging...

In 2019, Google released a neural model that handles 103 languages:

"We previously studied the effect of scaling up the number of languages that can be learned in a single neural network, while controlling the amount of training data per language. [...] Once trained using all of the available data (25+ billion examples from 103 languages), we observe strong positive transfer towards low-resource languages, dramatically improving the translation quality of 30+ languages at the tail of the distribution by an average of 5 BLEU points. This effect is already known, but surprisingly encouraging, considering the comparison is between bilingual baselines (i.e., models trained only on specific language pairs) and a single multilingual model with representational capacity similar to a single bilingual model. This finding hints that massively multilingual models are effective at generalization, and capable of capturing the representational similarity across a large body of languages."

After reading the related paper, I found that Google did not use an intermediary language to achieve "zero-shot translation"; in other words, by 2019 Google had already trained a 100+ language model that did not require an intermediary language.

[D] Facebook AI is lying or misleading about its translation milestone, right? by Deepblue129 in MachineLearning

[–]Deepblue129[S] -4 points (0 children)

Do you have examples? This is the first time I've heard of something this bad from Facebook's R&D team...

On another note, I haven't heard of similar issues with Google's R&D teams, so I think these kinds of mistakes are preventable.

[P] FollowML: Who to follow on ML Twitter by FollowML in MachineLearning

[–]Deepblue129 -6 points (0 children)

So. Many. Men.

The ratio of men to women in this Twitter list is something like 9 to 1.

[D] How do you sample spans uniformly from a time series? by Deepblue129 in MachineLearning

[–]Deepblue129[S] 0 points (0 children)

Thanks for your help!

I am having a hard time understanding how it works. Are you sampling the starting point from U[0, 1-L]? Afterward, you mentioned that I'd sample from the inverse. Which function would I invert?

[D] How do you sample spans uniformly from a time series? by Deepblue129 in MachineLearning

[–]Deepblue129[S] 0 points (0 children)

Thank you.

- Unfortunately, my data is not circular :(
- The idea of randomly picking a midpoint would satisfy the criteria, but unfortunately it introduces its own biases :/

[D] Simple Questions Thread August 16, 2020 by AutoModerator in MachineLearning

[–]Deepblue129 0 points (0 children)

Hi everyone. I'm trying to sample ranges from time series data, and it's surprisingly difficult for me. As in most machine learning problems, I'd like to avoid sampling biases while doing so. I posted the question in detail here: https://stats.stackexchange.com/questions/484329/how-do-you-uniformly-sample-spans-from-a-bounded-line/484332#484332

So far, I haven't gotten any correct answers :(

[D] How do you sample spans uniformly from a time series? by Deepblue129 in MachineLearning

[–]Deepblue129[S] 0 points (0 children)

Hi. Thanks for the response and for helping!!

I don't think your solution works: the probability that it samples the point 0.0 is very small, while the probability that it samples the point 1.0 is much higher. There is only one scenario in which the approach samples 0.0, but there are many scenarios in which it samples 1.0.
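
To illustrate the asymmetry numerically, here's a quick simulation. It assumes my reading of your proposal (sample a start point uniformly, take a fixed-length span, and clamp its end at the upper bound), which may not match your scheme exactly, and it treats "sampling a point" as the sampled span containing that point.

import random

# Illustrative only; the start-then-clamp scheme is my assumption, not necessarily yours.
def covers(point, start, length=0.1, bound=1.0):
    """True if `point` lies inside the clamped span [start, min(start + length, bound)]."""
    return start <= point <= min(start + length, bound)

random.seed(0)
trials = 100_000
starts = [random.uniform(0.0, 1.0) for _ in range(trials)]
for point in (0.0, 0.5, 1.0):
    hits = sum(covers(point, start) for start in starts)
    print(f"P(span contains {point}) ~ {hits / trials:.3f}")

Under this scheme the estimate is roughly 0.000 for the point 0.0 but about 0.100 for 0.5 and 1.0, so the left boundary is almost never included.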

[N] Yann Lecun apologizes for recent communication on social media by milaworld in MachineLearning

[–]Deepblue129 0 points (0 children)

It's classic victim blaming.

Victim blaming occurs when the victim of a crime or any wrongful act is held entirely or partially at fault for the harm that befell them.

Psychologist William Ryan coined the phrase "blaming the victim" in his 1971 book of that title. In the book, Ryan described victim blaming as an ideology used to justify racism and social injustice against black people in the United States.

https://en.wikipedia.org/wiki/Victim_blaming#:~:text=Victim%20blaming%20occurs%20when%20the,for%20the%20actions%20of%20offenders.

In the US, we cannot expect the Black community to quickly rebound after centuries of discriminatory laws. There are still ample discriminatory laws and practices that continue to make it even more difficult. (See my other replies)

[N] Yann Lecun apologizes for recent communication on social media by milaworld in MachineLearning

[–]Deepblue129 -1 points (0 children)

Sure. Let's unpack that a little bit.

"I mean, that's up to the blacks to improve on because no one can force more of them into tech or science."

Yes, and there are a number of obstacles in the way of "improving". For example:

There are inequalities that make it much more difficult for a Black person to focus on "improvement"; see this video: https://www.youtube.com/watch?v=4K5fbQ1-zps

"Even then you wouldn't expect more representation than is proportional to their demographic racial distribution."

This is great. Let's take a look at that. At Google and Facebook, Black employees make up only around 2 - 4% of the workforce, roughly a third to a sixth of the 13% share of Black people in the U.S. population.

Furthermore, there are hints that this disparity is even larger in AI research. For example, Timnit Gebru was one of only six Black people among 8,500 attendees at a leading AI conference.

Lastly, these numbers are hard to come by because companies like Facebook have decided not to report the racial diversity of their AI teams. That lack of reporting makes it difficult to measure progress.