Software Engineering Best Practices, System Design, High Scale, Algorithms, Math, Programming Languages, Statistics, Machine Learning, Databases, Front Ends, Frameworks, Low Level Machine Structure, Papers and Computing, Computer Science Book Reviews - Everything!
Latest in engineering and programming :)
We are a club of people who work with programming, new technologies, and startups, both professionally and out of passion. We meet once a month in Padova, in an open and informal setting where the participants themselves volunteer to give the talks. After the event, beer and pizza all together!
A video podcast for computer programmers in grades 3 and up. We learn about Scratch, Tynker, Alice, Python, Pygame, and Scala, and interview interesting programmers. From professional software developer and teacher Dave Briccetti, and many special guests.
The podcast for programmers who want to become software engineers. Software engineering analysis and reflection from Graham Lee, a software engineering educator, practitioner and researcher with two decades of field experience. Coming to you from https://www.sicpers.info.
Coders Startup Podcast
Podcast by Jeremy Nagel
Programmer's Notes is a podcast where programming tools, languages, and the like are thoroughly explained. By the end of each episode listeners should have a solid grounding in the episode topic and the knowledge required to start using the tool or language discussed. Please feel free to contact us with questions or episode requests at ProgrammersNotesPodcast@gmail.com; requests will help us get more content out there.
In a system design interview, you will be asked to design a complex system, such as a website, mobile app, or network infrastructure. The interviewer will evaluate your ability to decompose the problem into smaller subproblems, make trade-offs between different design decisions, and consider the scalability and performance of the final solution. Th…
In Apache Spark, you can cache a DataFrame in memory using the cache() or persist() method. The cache() method is shorthand for persist() with the default storage level, which is MEMORY_AND_DISK for DataFrames (MEMORY_ONLY is the default for RDDs). You can use the persist() method to specify a different storage level if desired, such as MEMORY_ONLY or MEMORY_ONLY_SER.…
Why develop apps when you can create "social apps" without coding as side projects?
The n-queens problem is a well-known problem in computer science and mathematics that involves placing n chess queens on an n x n chessboard such that no two queens can attack each other. This means that no two queens can be placed on the same row, column, or diagonal. The problem can be generalized to other chess pieces, such as the n-rooks…
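A minimal backtracking sketch of the constraint described above (one queen per row, with no shared column or diagonal). The function name `count_n_queens` is an illustrative choice, not something from the episode, and backtracking is only one of several ways to attack the problem.

```python
def count_n_queens(n: int) -> int:
    """Count the ways to place n non-attacking queens on an n x n board."""
    cols = set()       # columns that already hold a queen
    diag_down = set()  # row - col is constant along each "\" diagonal
    diag_up = set()    # row + col is constant along each "/" diagonal

    def place(row: int) -> int:
        # Every row has received exactly one queen: a complete placement.
        if row == n:
            return 1
        total = 0
        for col in range(n):
            if col in cols or (row - col) in diag_down or (row + col) in diag_up:
                continue  # this square is attacked by an earlier queen
            cols.add(col)
            diag_down.add(row - col)
            diag_up.add(row + col)
            total += place(row + 1)
            # Undo the move before trying the next column (backtracking).
            cols.remove(col)
            diag_down.remove(row - col)
            diag_up.remove(row + col)
        return total

    return place(0)
```

Placing one queen per row handles the row constraint implicitly, so only columns and the two diagonal directions need to be tracked.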
The time that it takes computers to process information.
Consistent Hashing is a distributed hashing scheme that operates independently of the number of servers or objects in a distributed hash table by assigning them a position on an abstract circle, or hash ring. This allows servers and objects to scale without affecting the overall system.
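A rough sketch of the hash ring idea, assuming MD5 as the position function and a single ring position per server (real deployments often add virtual nodes). The class name `HashRing` and the server names in the usage note are hypothetical.

```python
import bisect
import hashlib


def _position(key: str) -> int:
    """Map a string to a position on the ring (MD5 is an assumption here)."""
    return int(hashlib.md5(key.encode("utf-8")).hexdigest(), 16)


class HashRing:
    def __init__(self, servers=()):
        self._positions = []  # sorted ring positions of all servers
        self._owners = {}     # ring position -> server name
        for server in servers:
            self.add_server(server)

    def add_server(self, server: str) -> None:
        pos = _position(server)
        bisect.insort(self._positions, pos)
        self._owners[pos] = server

    def remove_server(self, server: str) -> None:
        pos = _position(server)
        self._positions.remove(pos)
        del self._owners[pos]

    def lookup(self, key: str) -> str:
        """Return the first server at or after the key's position, wrapping around."""
        if not self._positions:
            raise LookupError("the ring has no servers")
        idx = bisect.bisect_right(self._positions, _position(key))
        return self._owners[self._positions[idx % len(self._positions)]]
```

With `ring = HashRing(["server-a", "server-b", "server-c"])`, calling `ring.add_server("server-d")` only reassigns the keys that fall between the new server and its predecessor on the ring; every other key keeps its previous owner.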
A couple of mitigations for a couple of A/B testing challenges
Here are scenarios where we would want to use A/A testing before trying out an A/B test
HTTP/1 is a textual data protocol and is good in terms of readability, ease of reimplementing, and ease of debugging. However, binary protocols (used in HTTP/2)
In probability theory and statistics, Bayes' theorem, named after Thomas Bayes, describes the probability of an event, based on prior knowledge of conditions that might be related to the event.
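As a small worked example of the theorem, the sketch below updates a prior using a test result. The function name `posterior` and all the numbers in the usage note are made up for illustration.

```python
def posterior(prior: float, likelihood: float, false_positive_rate: float) -> float:
    """P(event | evidence) via Bayes' theorem.

    prior               = P(event)
    likelihood          = P(evidence | event)
    false_positive_rate = P(evidence | not event)
    """
    # Total probability of seeing the evidence, with or without the event.
    evidence = likelihood * prior + false_positive_rate * (1.0 - prior)
    return likelihood * prior / evidence
```

For a hypothetical condition with a 1% prior, a test that fires 90% of the time when the condition is present and 5% of the time when it is absent, `posterior(0.01, 0.9, 0.05)` comes out at roughly 0.15: a positive result raises the probability considerably but leaves it far from certain.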
The token bucket is an algorithm used in packet-switched and telecommunications networks. It can be used to check that data transmissions, in the form of packets, conform to defined limits on bandwidth and burstiness.
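A minimal single-threaded sketch of the algorithm: tokens refill at a fixed rate up to a capacity, and each packet spends tokens. The class name `TokenBucket`, its parameters, and the injectable `clock` are illustrative assumptions, not taken from the episode.

```python
import time


class TokenBucket:
    def __init__(self, rate: float, capacity: float, clock=time.monotonic):
        self.rate = rate          # tokens added per second (sustained limit)
        self.capacity = capacity  # maximum stored tokens (burst limit)
        self._clock = clock       # injectable so the bucket can be tested
        self._tokens = float(capacity)
        self._last = clock()

    def allow(self, cost: float = 1.0) -> bool:
        """Return True and spend `cost` tokens if the packet conforms."""
        now = self._clock()
        # Refill for the time elapsed since the last call, capped at capacity.
        self._tokens = min(self.capacity, self._tokens + (now - self._last) * self.rate)
        self._last = now
        if self._tokens >= cost:
            self._tokens -= cost
            return True
        return False
```

The `rate` bounds the long-run bandwidth while `capacity` bounds burstiness: `TokenBucket(rate=1.0, capacity=2.0)` admits two back-to-back packets and then one more per second.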
In software design and engineering, the observer pattern is a software design pattern in which an object, named the subject, maintains a list of its dependents, called observers, and notifies them automatically of any state changes, usually by calling one of their methods.
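The pattern can be sketched in a few lines; the names `Subject`, `RecordingObserver`, `attach`, `detach`, and `update` below follow common convention but are assumptions for this example.

```python
class Subject:
    """Keeps a list of observers and notifies them on every state change."""

    def __init__(self) -> None:
        self._observers = []
        self._state = None

    def attach(self, observer) -> None:
        self._observers.append(observer)

    def detach(self, observer) -> None:
        self._observers.remove(observer)

    def set_state(self, state) -> None:
        self._state = state
        # Iterate over a copy so observers may detach themselves while notified.
        for observer in list(self._observers):
            observer.update(state)


class RecordingObserver:
    """A trivial observer that remembers every state it was told about."""

    def __init__(self) -> None:
        self.seen = []

    def update(self, state) -> None:
        self.seen.append(state)
```

The subject never needs to know what its observers do with a notification; it only relies on each one exposing an `update` method.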
There are six (6) criteria that should be applied when evaluating any Web site: authority, accuracy, objectivity, currency, coverage, and appearance.
The dataclass() decorator examines the class to find fields. A field is defined as a class variable that has a type annotation. With two exceptions described below, nothing in dataclass() examines the type specified in the variable annotation.
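A short illustration of both points, using a hypothetical `Episode` class: only annotated class variables become fields, and the annotated type is not checked at runtime.

```python
from dataclasses import dataclass, fields


@dataclass
class Episode:
    title: str                # annotated -> becomes a field
    minutes: int = 30         # annotated with a default -> becomes a field
    category = "programming"  # no annotation -> plain class attribute, not a field


# Only the annotated variables are reported as fields.
field_names = [f.name for f in fields(Episode)]

# The annotation says `str`, but dataclass() does not enforce it.
odd = Episode(title=123)
```

Here `field_names` contains `title` and `minutes` only, and `Episode(title=123)` is accepted without complaint; a type checker or explicit validation would be needed to catch it.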
Python dependency management and packaging made easy. ... Poetry comes with all the tools you might need to manage your projects in a deterministic way
In computer networks, rate limiting is used to control the rate of requests sent or received by a network interface controller. It can be used to prevent DoS attacks and limit web scraping
dbt - Transform data in your warehouse. What is dbt? dbt is a development framework that combines modular SQL with software engineering best practices to make data transformation reliable, fast,
HashAggregateExec is a unary physical operator (i.e. with one child physical operator) for hash-based aggregation that is created (indirectly through AggUtils.
The Spark SQL shuffle is a mechanism for redistributing or re-partitioning data so that the data is grouped differently across partitions
We have two types of backups in DynamoDB: one is point-in-time recovery and the other is snapshots.
Giorgia Meloni is an Italian politician and journalist. A member of the Chamber of Deputies in Italy since 2006, she has led the Brothers of Italy political party since 2014, and has been the president of the European Conservatives and Reformists Party since 2020.
Apache Spark unit tests
📚 Book - The Psychology of Money - https://www.amazon.com/Psychology-Money-Timeless-lessons-happiness/dp/0857197681#?&_encoding=UTF8&tag=planetizer0c-20&linkCode=ur2&linkId=c18c78f3d241db79ce045de652b93722&camp=1789&creative=9325
How can we pivot in Spark?
Why a single file on your repository that describes the project could make life much easier for newcomer programmers
DataFrame.createOrReplaceTempView - Creates or replaces a local temporary view with this DataFrame. The lifetime of this temporary table is tied to the SparkSession that was used to create this DataFrame.
Spark SQL is a Spark module for structured data processing. It provides a programming abstraction called DataFrames and can also act as a distributed SQL query engine. It enables unmodified Hadoop Hive queries to run up to 100x faster on existing deployments and data.
We will describe how to use Apache Spark to get top 100 words from a file!
Apache Spark Paired RDDs are defined as RDDs containing key-value pairs (KVPs), each of which consists of two linked data items. In most cases, the key is an identifier and the value is the data corresponding to that key. Furthermore, Apache Spark operations work on RDDs that contain any objects.…
What is behind the name Delta Lake? Why "delta"? What is the core benefit of using it over a standard data lake? What are its disadvantages?
Comparing Hive, AWS Glue, and additional data catalogs
In this episode we will discuss the Go language and how it differs from other programming languages
What is AWS DynamoDB DAX and how does it relate to ElastiCache?
Java's ForkJoinPool uses a work-stealing algorithm to make better use of the threads that we have
In DynamoDB, one write capacity unit is one write per second for an item of up to 1 KB in size
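The arithmetic can be sketched as follows, assuming standard (non-transactional) writes and that item sizes are rounded up to the next 1 KB. The function name `write_capacity_units` is hypothetical.

```python
import math


def write_capacity_units(item_size_bytes: int, writes_per_second: int) -> int:
    """Estimate the WCUs needed for standard (non-transactional) writes."""
    # Each write consumes one unit per started 1 KB of item size.
    units_per_write = max(1, math.ceil(item_size_bytes / 1024))
    return units_per_write * writes_per_second
```

For example, writing 3,500-byte items 5 times per second needs 4 units per write, so 20 write capacity units in total.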
DynamoDB read capacity units and write capacity units
Why does the reserve currency matter so much in today's economy, and what are the alternatives and investment opportunities in this area?
Common table expressions are very useful in constructing SQL queries and are a great next milestone in learning SQL
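To give a feel for why a common table expression helps, here is a small sketch run through Python's built-in sqlite3 module. The `orders` table, its rows, and the CTE name `customer_totals` are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("ann", 10.0), ("ann", 25.0), ("bob", 5.0), ("cat", 40.0)],
)

# The CTE names an intermediate result once, and the main query then
# uses it twice: as the row source and inside the AVG subquery.
query = """
WITH customer_totals AS (
    SELECT customer, SUM(amount) AS total
    FROM orders
    GROUP BY customer
)
SELECT customer, total
FROM customer_totals
WHERE total > (SELECT AVG(total) FROM customer_totals)
ORDER BY customer
"""
big_spenders = conn.execute(query).fetchall()
```

Without the `WITH` clause the aggregation would have to be written out twice as nested subqueries; here `big_spenders` holds the customers whose total is above the average total.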
What is the core of doing SEO, has it changed over the years, and what would bring you to the top of the search engine results?
Here we discuss some of the most interesting product interview questions in the data world
An intro to blockchain technology
What does term frequency mean, and what is its relationship with inverse document frequency, which we use to identify categories of documents, to find sentiment, and in many more applications?
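A bare-bones sketch of how the two quantities combine, assuming whitespace tokenization and the plain unsmoothed variant (tf = count / document length, idf = ln(N / document frequency)). Libraries typically use smoothed or normalized variants, and the name `tf_idf` is illustrative.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: score} dict per document."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each term appear?
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        scores.append({
            # Term frequency times inverse document frequency.
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores
```

A term that appears in every document gets an idf of zero and so a score of zero, while a term concentrated in a few documents scores high there, which is what makes the measure useful for telling categories of documents apart.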
The idea that increased autonomy and privilege for software engineers can only come when we have better confidence that software engineers are working in the best interests of society. Mike Klimek: Modern Software Development. Gergely Orosz: What Silicon Valley “Gets” about … Written by Graham Lee
📚 Book - https://amzn.to/39ty0Tq - "Terraform: Up & Running: Writing Infrastructure as Code" Running terraform init downloads all the dependencies needed by the Terraform project, much as running almost any mvn command on a clean Maven project pulls down its dependencies. Tags in Terraform help you set the additional tag properties of …
An AWS Elastic IP is not dynamic in itself; it is actually a static public IPv4 address. So why is it called elastic? Because we assign it to hosts dynamically: if our host or EC2 instance goes down, we can programmatically reassign this static public IP to a new host, so it is used for resiliency…
With Terraform we can use variables, for example an integer, which is defined as a number, or a string, or a list of strings, and then reference these variables further down in the code in order to reuse them. You can also output variables to the screen, and we can ask the user to enter variable values interactively in the console. In t…
Terraform's main operations, such as init, plan, apply, and destroy, and the state file. Terraform is an infrastructure-as-code tool, a command-line tool that helps us describe infrastructure in standard JSON files and then run terraform apply in order to actually create this infrastructure in the AWS cloud, Google Cloud, Azure, or any other cloud pr…