Thoughts on Vlogs

I believe anyone who has spent time on YouTube or other video sites has noticed that vlogs are gradually gaining ground. Whether it is a lifestyle vlog, a tech review or some other genre, the format seems to suit what everybody wants in this day and age. Could it be the next generation of social media? Very likely.

But if one decides to shoot vlogs for a living, it could be extremely hard for the effort to pay off. Let’s set celebrities aside first, since they get attention wherever they go. For ordinary people, nobody will be interested in seeing how they live their lives unless those lives are very exotic, such as living at the North Pole. To succeed, they need a very specific theme, at least at the very beginning, to build a solid audience before they start to record their “boring” daily life.

Beyond the difficulty of getting an initial audience, it is personal character and video quality that eventually get vloggers the attention they want. One might initially attract lots of viewers with good looks, but that won’t last without a personality that clicks with the audience. That is probably a talent rather than a skill that can be acquired. Additionally, the expected video quality sets a high bar for vloggers’ editing skills, so famous vloggers often have a background in directing or a team that handles this.

Being famous is not easy, especially when there are over 7 billion people living on the earth and a large share of them are online. But I guess vloggers’ hard work will pay off in the form of a colorful life in the end, as long as they don’t shoot videos purely out of the desire for money. Please, do it with joy. #SaluteToLife

The Downfall of PUBG

I can still remember that around this time last year, in 2017, PUBG (PlayerUnknown’s Battlegrounds) was the hottest game in the gaming industry. It was in the running for Game of the Year at The Game Awards (TGA), but fell short to the legendary Zelda game from Nintendo. This morning, while casually browsing tweets, I found a chart of the most popular games streamed on Twitch, and PUBG had dropped out of the Top 5, losing to much older games like Dota 2.

What has happened in a year? Is Fortnite the one to blame? Indeed, Fortnite has a very similar setting to PUBG, and at the same time it gives gamers a much better experience. But I think it is PUBG itself that has caused the problems.

Firstly, the poor optimization of the game has driven tons of gamers away. In some ways, PUBG has set a new standard of computer specs for those who love online gaming. Its demanding graphics made most last-generation PCs out of date all at once, and if you want to play this game at comfortable settings, you’d better have a GTX 1060 or better graphics card. But is that really a good thing for the company? Fortnite can even run on the Nintendo Switch with playable graphics performance, and it has also enabled cross-platform play. The ability to reach a bigger user base is imperative for a game to thrive. Are you really going to pay $2000 for a computer just for one game?

This leads to my second point: I think the PC is not the future platform for most games, and that role should instead belong to game consoles. I think console gaming has been gradually gaining share, and whoever has tried it knows how good it feels to play good games in front of the TV. It makes connecting with friends and family easier, and allows more interaction both physically and mentally. PUBG’s weak presence on consoles will eventually slow it down.

Additionally, the popularity of the game has also fueled the cheating-software business, and Bluehole has done a poor job of containing it. Not long after the game took off among fans, more and more users noticed some unbelievable kills during matches, all of which pointed to cheating scripts. This is not a new problem for FPS games, because of the way they transmit game information to players’ machines, but the company behind PUBG does not seem keen on improving the situation for fans, at least on the surface. I used to play PUBG with friends a lot last year, but after repeatedly being killed in impossible ways right after landing, I told my friends in a rage that I wouldn’t play this game anymore.

PUBG could have been a 10/10 game, at least for us gamers, but now it is falling short of our expectations, and I am afraid it will never be as popular as before.

Deep Learning is severely overrated!

If I were not working in this field, and worked as a general tech guy in a tech company instead, I would have been overwhelmed by this trend as well, seriously. While the world (specifically, tech companies) is promoting AI, few people really understand the techniques at the center of it.

Machine learning, AKA modeling, has been around for a much longer time, and its foundation in mathematics and statistics has made it a very powerful tool for statisticians and engineers who train computers to help their business. Deep learning, which took off once computing power caught up with neural network models, has become the hottest topic in recent years, following the success story of AlphaGo, which beat one of the best Go players in human history.

If you look closely, though, or if you work as a data scientist like I do in one of those “big” tech corporations, you soon realize that deep learning can quite often give you worse results in reality. In other words, deep learning is not for everyone in every situation. Neural networks have been great in three specific fields. Firstly, they are an excellent tool for computer vision. New network structures such as the convolutional neural network have transformed the way machines see pictures, giving them pretty decent accuracy in tasks like object detection and image recognition. Furthermore, text analysis, such as machine translation and word prediction, has been enhanced by recurrent neural networks, whose structure can remember earlier items in a sequence. Lastly, reinforcement learning (which is basically a machine learning new things through exploration and exploitation) has seen its biggest advances with the help of deep learning. AlphaGo uses a more complex version of this setup to defeat top-notch human Go players.

However, if you are in a traditional field such as anti-fraud, and you have about 20 features with slightly over 100,000 observations, you would be amazed that a model as simple as logistic regression can serve you better. Theoretically, deep learning has the power to imitate any linear or non-linear model, but setting the hyperparameters just right is an art rather than a science. Quite often, at least in my experience, tree models (gradient boosting machines, random forests) or linear models (logistic regression, elastic net regression) have better predictive power and are easier to interpret. Does that mean I made mistakes in my deep learning experiments? I used to think so, until I realized the inherent drawback of deep learning: it can’t replace math and statistics in modeling! Especially when you are dealing with a highly imbalanced dataset, a deep learning model can easily overfit or end up less predictive than a statistical model, and I have seen this happen in my daily work.
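
To make the "simple baseline first" idea concrete, here is a minimal sketch of a class-weighted logistic regression on an imbalanced table, scored by ROC AUC instead of accuracy. Everything in it is an assumption for illustration: the data is synthetic (rare positives whose features are shifted by +1), the sizes are much smaller than the 20 features and 100,000 rows mentioned above so that it runs quickly, and the helper names (make_imbalanced_data, fit_logistic, predict_proba, roc_auc) are made up. It is not the fraud model from the post, and it does not by itself prove anything about deep learning.

```python
import math
import random


def make_imbalanced_data(n_rows=2000, n_features=5, positive_rate=0.05, seed=0):
    """Synthetic stand-in for a fraud table: rare positives, shifted features."""
    rng = random.Random(seed)
    rows, labels = [], []
    for _ in range(n_rows):
        y = 1 if rng.random() < positive_rate else 0
        # Positives are shifted by +1 on every feature (purely illustrative).
        rows.append([rng.gauss(float(y), 1.0) for _ in range(n_features)])
        labels.append(y)
    return rows, labels


def sigmoid(z):
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_proba(weights, bias, x):
    """Predicted probability that row x belongs to the positive class."""
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


def fit_logistic(rows, labels, epochs=100, lr=0.1):
    """Batch gradient descent on class-weighted log loss."""
    n, d = len(rows), len(rows[0])
    n_pos = sum(labels)
    # "Balanced" weights: each class contributes half of the total weight.
    w_pos = n / (2.0 * n_pos)
    w_neg = n / (2.0 * (n - n_pos))
    weights, bias = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for x, y in zip(rows, labels):
            err = (predict_proba(weights, bias, x) - y) * (w_pos if y == 1 else w_neg)
            grad_b += err
            for j in range(d):
                grad_w[j] += err * x[j]
        bias -= lr * grad_b / n
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
    return weights, bias


def roc_auc(scores, labels):
    """Chance that a random positive outranks a random negative (ties count half)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))
```

A typical use is to fit on one sample, score a held-out sample with predict_proba, and compare the resulting roc_auc against whatever fancier model is being considered. Accuracy is deliberately not used: on a 5% positive rate, always predicting "not fraud" already gets about 95% accuracy.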

So don’t be fooled by any crazy promotion of AI. It has changed some fundamental ways for machines to learn new things, but it can’t guarantee you good results when it comes to modeling. A lot of companies are using it as a trick to attract new funding, just like what Bitcoin has been to our world. You wouldn’t assume there’s a Swiss Army knife for modeling, would you? LOL.

Chinese Internet Companies Are Starving For Money

Not long ago, the world’s fourth biggest phone maker, Xiaomi, successfully held its IPO in Hong Kong, marking a new milestone in the journey of this rising Chinese giant. It is hard to categorize Xiaomi as an Internet company the way Google is, when its primary profits still come from its low-end smartphones.

I work for Xiaomi as of now, and I am proud to see its rapid and broad development in different areas, although I am not really convinced by its decision to go public. But I also understand that the board probably didn’t have perfect timing to choose from, either. China has tightened its control over the financial sector recently, and any visionary CEO should realize that missing the window now would probably mean missing the future. Therefore, it is not really a multiple-choice question. Instead, everyone would tend to go public as soon as possible, before things go wrong in the bigger context.

We all know that America has recently been shaking the world economy aggressively, with the latest example being Turkey, whose financial sector is getting crushed. While people rush to Istanbul to buy luxury goods, they need to realize that the same thing could happen anywhere around the globe, any time soon. Chinese Internet giants and startups could be facing an approaching storm in funding very soon. The giants can just hang in there for several years and make it to the next spring; for startups and unicorns it is a different story, since they still need tons of funding to support their great ideas and visions.

Where does the storm come from, anyway? I guess it can’t be separated from the real estate market, which is worth trillions of dollars today. It has become the centerpiece of the underlying problems: the ever-growing real estate market has absorbed investment that should have gone into other industries, creating probably the biggest bubble in financial history. On top of that, it has also bred a lot of corruption, because land is controlled by the state, which means that by selling more land, local governments can boost the revenue on their books. There are other problems as well, but none of them is quite comparable to this one, which has taken years to form. Internet companies, which have enjoyed rapid investment and funding, will inevitably be affected by the real estate market. Meanwhile, the employees of those Internet companies, many of them highly talented, are starting to feel huge pressure from apartment rents. Rents in the Beijing area have been rising at a growing rate, and many of my colleagues are under pressure given their limited pay. That, in turn, drives companies to seek more funding from whatever sources they can.

Although some of the Chinese tech giants originally copied their business models from foreign companies, I have to admit that in terms of localization they are far better than the original idea-holders. The CEO of Baidu recently said that if Google came back to China, he is confident Baidu would win again. That is not complete bullshit, because the vast majority of Chinese netizens are not what we believe they are.

Anyway, the economic outlook is still unclear, but it has reminded me to be careful with the decisions I make, especially financial ones. Obviously more Chinese Internet companies will go for an IPO soon, but we all know that is not a sign that the economy is doing well.

Value something, not somebody

People who have gone through a lot of ups and downs would like to convey this idea to the young: value something, not somebody. When I was young, I used to pin all my goals and plans on the girl I “loved”. For instance, I would choose to go wherever she was going, and I would start to learn the things she liked. And that is not all. When we are in a relationship, we tend to be overly caring because we are afraid of losing it, so some of us keep giving love without getting anything real back, only to find out that by staking everything we want, like and need on somebody, we have already risked losing all we had.

People change, and that is the truth. But it is not a bad thing. As intelligent creatures, we human beings have adapted well to the rule of nature: competition. Whether we are cooks, servers, coders, athletes or anything else, we compete against others to show we are valuable in this world. Some people have already found what they will love for a lifetime, but some haven’t. I was one of the latter when I was young, so I used to leave the decision making to my girlfriend’s wishes: whichever you choose, I will follow your path. While some stories of this type end in good romance, most of them don’t. When your decision maker leaves, your entire system breaks down. You feel like a zombie with no real purpose in life, crawling around searching for food.

Life is not like that, for sure. Some things are always true, and those are the things we should be after, the things we should value. By finding those valuable things, such as good habits, good manners or morals, you will find that your life can’t be broken by anyone else, because those things won’t break. Some of us want to be travel journalists because they want to show the world wonders that are still undiscovered; some of us want to be splendid cooks because they like the smile on customers’ faces when they take a bite of well-made food; some of us want to be coders because they love how a simple idea can change the whole world.

Those things don’t break, and we should value them instead of any individual. Think about the time when we fall in love with somebody: we love them as a whole, not just their body. The abstract part of love is what we often ignore when the hormones hit. If you love doing something, keep the motivation, and if you still haven’t found that something, go find it. Just don’t build your life’s blueprint on any individual, and if there has to be one, it has got to be yourself.

Be your own god, and love your own world first before you care about others’.


Let’s face it, the NFL is getting more and more boring nowadays. I am not saying football is getting there, but the NFL, specifically. Two years ago I started to get really into college football, and now I can say with conviction: college football is the most fun sport in the world, with no exception.

The NFL has embarrassed itself recently by announcing that a team whose players kneel during the national anthem would get a stunning 15-yard penalty, before the game even starts. This sounds very familiar to me, because I have been through quite a lot of similar penalties since I was young here in China. Yes, I am talking about coxxxnism. I wouldn’t say this is ineffective. On the contrary, it is probably the most impactful policy ever if you want to punish an individual: if you don’t listen to me, fine, because I am going to threaten your teammates or family.

I don’t understand why the NFL would choose this type of penalty. Even more, I don’t understand why there would be a penalty before the game starts. I’d rather attribute this joke to the NFL’s incompetence in recent years, because clearly basketball is gaining more and more ground.

This season, as I write, both the Eastern and Western Conference finals are tied 2-2, making this the most exciting year for basketball fans since the start of the Warriors dynasty. I just hope the NBA keeps getting better and more balanced between teams, and keeps treating players as partners instead of tools.

Before next season’s college football kicks off, let us enjoy the May/June Madness!

Python is reigning!

When I started to learn Spark back in 2016, Scala was the best option for writing Spark programs thanks to its simplicity and speed. Back then PySpark was also available to Python users, but to use it, people had to create a SparkContext at the very beginning and then go through some tedious steps to set things up.

Now that it is 2018, I find that the Spark community has officially created a new way for Pythonistas to interact with Spark core. The savior here is called SparkSession, which bundles up a lot of things that normal users may not care about at all. With SparkSession, Python users can dive right into data exploration, just as they like to do in machine learning. Though Scala still holds the crown for writing Spark programs, PySpark is now catching up.
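
For readers who have not seen the two styles side by side, here is a minimal sketch of the difference between the older SparkContext setup and the SparkSession entry point (part of PySpark since Spark 2.0). The function names, the "demo" app name and the local[*] master are assumptions for illustration only, and the PySpark imports sit inside the functions so the file can still be loaded on a machine without PySpark installed.

```python
def legacy_entry_point(app_name="demo"):
    """Older style: configure and create a SparkContext, then wrap it for SQL."""
    # Imported lazily so this module loads even where PySpark is missing.
    from pyspark import SparkConf, SparkContext
    from pyspark.sql import SQLContext

    conf = SparkConf().setAppName(app_name).setMaster("local[*]")
    sc = SparkContext(conf=conf)
    sql_context = SQLContext(sc)  # separate wrapper needed for DataFrames
    return sc, sql_context


def session_entry_point(app_name="demo"):
    """Newer style: one builder call returns a ready-to-use SparkSession."""
    from pyspark.sql import SparkSession

    return (
        SparkSession.builder
        .appName(app_name)
        .master("local[*]")  # assumption: run locally on all cores
        .getOrCreate()
    )
```

With the second function, something like spark = session_entry_point() followed by spark.createDataFrame([(1, "a"), (2, "b")], ["id", "label"]).show() is enough to start exploring data, and the underlying context is still reachable as spark.sparkContext when it is needed. Call spark.stop() when done.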

Similarly, when I brushed up on TensorFlow, I also found that Google has provided two new high-level APIs on top of the original low-level APIs. This has greatly lowered the cost in time and effort for Python users to get their hands on this great open-source deep learning framework.

Java and lower-level programming languages are probably still the best overall for those who care about performance. I don’t think Python would have gained such huge popularity if its community were not so active in the machine learning area, thanks to third-party packages such as scikit-learn, pandas, NumPy and others. There’s a good saying about this: when one rides with the wind, even a pig can fly.

Next Big Area for Data Analysis

Artificial Intelligence? No, since it is already very popular and everybody wants a piece of it.

Blockchain technology? Nope, since it is more of a security thing, and data analysts are not playing a main role in its development.

Then what are we talking about here? I want to define the “Big Area” in the title as something that we could easily tap into with our current techniques and that has not gone viral globally yet. The area I’d like to share my insights on is electronic gaming.

Yep, people might say that sports analytics is already maturing, especially in developed countries, and that top players have dedicated data analysts to improve their performance. However, they can afford them partly because of the global recognition of their sport, and esports isn’t at that stage yet: it is still struggling to get noticed by most people. That is to say, the future of data analytics in esports is closely tied to the industry itself, and I would say it has a prosperous future.

Additionally, esports is so well suited to data analysis that I can’t think of another sport that makes a better case for it. We have already adopted high-speed computers to do complex calculations for us, and esports happens to be played on computers. In other words, we’ll be able to analyze anything you care about in a game in order to improve performance, because in theory all of the data is available.

Esports data analysis has inherited the traits of traditional sports analysis: for example, you can’t be a good analyst without experience or knowledge of that specific game. It also has some merits that traditional sports can’t match, such as how easy it is to get observable data. As the esports industry continues to grow, with more competitive games such as Dota 2, League of Legends and Overwatch emerging, I believe there is quite a bright future for those brave pioneers.

Raspberry Pi 3, OMG

I never imagined that such a revolutionary product, a tiny little computer even smaller than a floppy disk, would come into the world while I am still in my 20s. I learned of its existence probably 3 or 4 years ago, but I had no interest in buying one until recently, when I realized I might need a 24/7 computer that can run Linux.

For $35 in 2017, you could buy a Raspberry Pi 3 with WiFi and Bluetooth capabilities, 4 USB ports, 1 RJ45 port, 1 HDMI port and an old-fashioned 3.5mm headphone jack. Its I/O is so well-rounded. It doesn’t have a great CPU or GPU, and its RAM is limited to 1 GB, but who needs that much, really?

I bought one because, like I mentioned at the beginning, I need a Linux machine that can run 24/7 to serve as a gateway for my local network. I came across this technique by accident when I was looking up ways to improve the connection of game consoles. Yeah, you are right, my purpose in doing all this is simply to be able to play online games, LOL. I have owned a Nintendo Switch for quite a long time now, and since I came back to China the Internet conditions here have been quite annoying, with strict NAT types everywhere and usually no public IP. All those factors have left me unable to connect to other players in my Switch games.

I then turned my research to those game proxies that claim to speed up the connection to the outside world. Basically they are just proxies or VPNs that can relay UDP traffic so as to give me a public IP. On PCs you can easily do this by installing some software, but it is hard on consoles since you can’t install software on them. Some people pay extra for a smart router, which is basically a router with a smarter system on which you can install some “software”, but in essence these are Linux-like systems. This made me wonder if I could just use a local machine as the router, and I soon found out there’s a technique called a transparent gateway: another machine in the local network that serves as the gateway and redirects all Internet traffic to an outside server. All I need to do on the Switch is change some IP settings in its Internet settings.

I have some experience with Unix-like systems, partly on my Mac but mainly on my VPS. However, this time I am dealing with Debian on the Raspberry Pi instead of CentOS on my VPS, and it behaves a bit differently, but not too much. The process is hard, and due to “fear” I don’t want to go into too much detail in this post. I simply installed some “software” that can redirect my traffic to an outside server that has a public IP. Firstly, the Raspberry Pi should be able to redirect both TCP and UDP packets, and this can be done by setting up some rules in iptables. Secondly, the server in the outside world should be able to handle UDP relay as well, and missing this was a mistake I made during the process. So in the end it is very simple, one step locally and one remotely, but the trial and error along the way could be daunting to a lot of people. Luckily that was not the case for me, because I wanted to solve this problem so badly that I spent hours after work on it until I couldn’t keep my eyes open. (shameful problem-solver personality)
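
Since the UDP side is the part that is easy to miss, a quick way to catch it early is to send a UDP datagram and see whether it comes back. The sketch below is only an illustration and not the setup used in the post: the helper names are made up, and the echo service runs on localhost so the example is self-contained. To test a real path, the echo part would run on the outside server (hypothetical host and port) and the probe would be sent from a device that sits behind the gateway.

```python
import socket
import threading


def run_udp_echo(sock, count=1):
    """Echo `count` datagrams back to whoever sent them."""
    for _ in range(count):
        data, addr = sock.recvfrom(2048)
        sock.sendto(data, addr)


def udp_round_trip(host, port, payload=b"ping", timeout=2.0):
    """Return True if a datagram sent to (host, port) comes back unchanged."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as client:
        client.settimeout(timeout)
        client.sendto(payload, (host, port))
        try:
            reply, _ = client.recvfrom(2048)
        except OSError:  # covers timeouts and "connection refused"
            return False
        return reply == payload


def demo_on_localhost():
    """Start a one-shot echo server on a free local port and probe it."""
    server = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    server.bind(("127.0.0.1", 0))  # port 0 lets the OS pick a free port
    port = server.getsockname()[1]
    worker = threading.Thread(target=run_udp_echo, args=(server,), daemon=True)
    worker.start()
    try:
        return udp_round_trip("127.0.0.1", port)
    finally:
        worker.join(timeout=2.0)
        server.close()
```

If udp_round_trip returns False while ordinary web browsing through the gateway works, that points to TCP being relayed but UDP being dropped somewhere along the path, which matches the kind of mistake described above.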

Now that there’s a 24/7 smart “router” in my house, I feel like there are more opportunities for a smarter home. I remember I brought an Apple TV 4 back from the States, but couldn’t make it work because back then I didn’t have a machine that could relay traffic, so my Apple TV has been terribly crippled by the Internet in China. I am proud to say I am going to get it working again when I am home for the upcoming Spring Festival, with my cutie Raspberry Pi, and believe it or not, it only cost me $35.

Please Hold On To Net Neutrality, America

It might sound weird at first to hear a Chinese guy shouting about American issues, but if you understand the current state of the Chinese Internet, or if you have ever lived here, you’ll realize right away what I am trying to say. IT WILL BE A STEP BACK, I PROMISE YOU.

I believe this is a seriously hot tech topic in the U.S. right now, but as you can imagine, there’s nearly no coverage of this news here, partly because net neutrality is already long gone here. ISPs should never be granted the right to treat their customers differently, and I’ll use examples to tell you what is going to happen.

To start with, what you are worried about is going to happen: charging heavier users more, bundling up websites to segment the market, and so on. In China there are as many types of packages as you can imagine for company networks. A foreign company that needs a more open Internet, in particular, gets charged more for a customized service. This would be totally unnecessary if net neutrality were still in play. The Internet was invented to connect people across the globe. Although ISPs could allocate resources more efficiently by shutting down net neutrality, doing so would violate a basic American ethic: everyone is born equal.

Secondly, if the government has the power to abolish any policy without consulting the majority of the tech community, what could happen in the future? ISPs might not only be able to bundle whatever services they like, but also be able to censor your data as they like. Moreover, the government might also step in and say: hey, now we are in charge, so your data will be sent to the U.S. government before it leaves U.S. territory. So basically this has the potential to grant the government too much authority on this topic, which might make a lot of us feel violated.

It reminds me of the death of Aaron Swartz, who challenged the copyright world with his programming skills and sincere motivations. It also reminds me of the people behind WikiLeaks, The Pirate Bay, Anonymous… The values and goals they promote to the world are strikingly similar: the Internet and knowledge should not be only for the rich, for celebrities, or for people in authority.

One final question to all of you: how could Donald Trump ever have become the POTUS if the poll only served those who are “louder”? Please don’t leave this core value behind, even if you are planning to turn your back on net neutrality.
